Tülin İnkaya’s research while affiliated with Bursa Uludağ Üni̇versi̇tesi̇ and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (22)


SATIŞ TAHMİNİ İÇİN DERİN ÖĞRENME YÖNTEMLERİNİN KARŞILAŞTIRILMASI
  • Article

July 2024

Uludağ University Journal of The Faculty of Engineering

Begüm Erol

·

Tülin İnkaya

Dijital dönüşüm ile tedarik zinciri yönetiminde büyük veri analitiğinin önemi gün geçtikçe artmaktadır. Özellikle müşteri taleplerinin hızlı ve doğru tahmin edilmesinde büyük verinin kullanımı firmalara rekabet avantajı sağlamaktadır. Bu doğrultuda, yapay zekâ tekniklerinden biri olan derin öğrenme modelleri büyük verideki karmaşık örüntülerin keşfedilmesinde öne çıkmaktadır. Son yıllarda literatürde çok sayıda derin öğrenme yöntemi önerilmiştir. Bu çalışmada, satış tahmini problemi için derin öğrenme yöntemlerinin performansları karşılaştırılmıştır. Bu kapsamda derin sinir ağı (DNN), derin otokodlayıcı (Deep AE), evrişimli sinir ağı (CNN), tekrarlayan sinir ağı (RNN), uzun kısa-süreli bellek (LSTM) ağı, çift yönlü LSTM (Bi-LSTM) ağı, kapılı tekrarlayan birim (GRU), CNN-LSTM ve evrişimli LSTM (ConvLSTM) yöntemleri uygulanmıştır. Çeşitli sektörlere ait satış verileri kullanılarak deneysel çalışmalar gerçekleştirilmiştir. Hiperparametre optimizasyonu ardından ele alınan yöntemlerin performansları tahmin doğruluğu ve eğitim süreleri açısından karşılaştırılarak sonuçların istatistiksel anlamlılığı değerlendirilmiştir. Sonuç olarak, LSTM ve GRU modellerinin tahmin doğruluğunda başarılı sonuçlar verdiği, CNN modelinin ise eğitim süresini kısalttığı görülmüştür.


LOF weighted KNN regression ensemble and its application to a die manufacturing company

December 2023

·

14 Reads

·

7 Citations

Sadhana

K-nearest neighbor (KNN) algorithm is a widely used machine learning technique for prediction problems due its simplicity, flexibility and interpretability. When predicting the output variable of a data point, it basically averages the output values of its k closest neighbors. However, the impact of the neighboring points on the estimation may differ. Even though there are weighted versions of KNN, the effect of outliers and density differences within the neighborhoods are not considered. In order to fill this gap, we propose a novel weighting scheme for KNN regression based on local outlier factor (LOF). In particular, we combine the inverse of the Euclidean distance and LOF value so that the weights of the neighbors are determined using not only distance and connectivity but also outlier and density information around the neighborhood. Also, bootstrap aggregation is used to leverage the stability and accuracy of the LOF weighted KNN regression. Using real-life benchmark datasets, extensive experiments and statistical tests were performed for evaluating the performance of the proposed approach. The experimental results indicate the superior performance of the proposed approach in small neighborhood sizes. Moreover, the proposed approach was implemented in a make-to-order manufacturing company, and die production times were estimated successfully.


Remaining Useful Life Estimation of Turbofan Engines with Deep Learning Using Change-Point Detection Based Labeling and Feature Engineering
  • Article
  • Full-text available

October 2023

·

99 Reads

·

12 Citations

Accurate remaining useful life (RUL) prediction is one of the most challenging problems in the prognostics of turbofan engines. Recently, RUL prediction methods for turbofan engines mainly involve data-driven models. Preprocessing the sensor data is essential for the performance of the prognostic models. Most studies on turbofan engines use piecewise linear (PwL) labeling, which starts with a constant initial RUL value in normal/healthy operating time. In this study, we designed a prognostic procedure that includes difference-based feature construction, change-point-detection-based PwL labeling, and a 1D-CNN-LSTM (one-dimensional–convolutional neural network–long short-term memory) hybrid neural network model for RUL prediction. The procedure was evaluated on the subset FD001 of the C-MAPSS dataset. The proposed procedure was compared with machine learning and deep learning models with and without the new difference feature. Also, the results were compared with the studies that used similar labeling approaches. Our analysis of the numerical results underscores the clear superiority of the proposed 1D-CNN-LSTM model with the difference feature in RUL prediction, with a score of 437.2 and an RMSE value of 16.1. This result illustrates the superior predictive capability of the 1D-CNN-LSTM model, which outperformed traditional machine learning methods and one of the earliest deep learning methods. These findings emphasize the superior predictive capability of the 1D-CNN-LSTM model and underline the potential of the feature engineering process for more accurate and robust RUL prediction in the context of turbofan engine prognostics.

Download

An Active Learning Approach Using Clustering-Based Initialization for Time Series Classification

October 2023

·

9 Reads

·

1 Citation

The increase of digitalization has enhanced the collection of time series data using sensors in various production and service systems such as manufacturing, energy, transportation, and healthcare systems. To manage these systems efficiently and effectively, artificial intelligence techniques are widely used in making predictions and inferences from time series data. Artificial intelligence methods require a sufficient amount of labeled data in the learning process. However, most of the data in real-life systems are unlabeled, and the annotation task is costly or difficult. For this purpose, active learning can be used as a solution approach. Active learning is one of the machine learning methods, in which the model interacts with the environment and requests the labels of the informative samples. In this study, we introduce an active learning-based approach for the time series classification problem. In the proposed approach, the k-medoids clustering method is first used to determine the representative samples in the dataset, and these cluster representatives are labeled during the initialization of active learning. Then, the k-nearest-neighbor (KNN) algorithm is used for the classification task. For the query selection, uncertainty sampling is applied so that the samples having the least certain labels are prioritized. The performance of the proposed approach was evaluated using sensor data from the production and healthcare systems. In the experimental study, the impacts of the initialization techniques, number of queries, and neighborhood size were analyzed. The experimental studies showed the promising performance of the proposed approach compared to the competing approaches.



The general framework of the proposed methodology
Average rank diagram
A novel LOF-based ensemble regression tree methodology

June 2023

·

36 Reads

·

1 Citation

Neural Computing and Applications

With the emergence of digitilization, numeric prediction has become a prominent problem in various fields including finance, engineering, industry, and medicine. Among several machine learning methods, regression tree is a widely preferred method due to its simplicity, interpretability and robustness. Motivated by this, we introduce a novel ensemble regression tree based methodology, namely LOF-BRT+OR. The proposed methodology is an integrated solution approach with outlier removal, regression tree and ensemble learning. First, irregular data points are removed using local outlier factor (LOF), which measures the degree of being an outlier for each point. Next, a novel regression tree with LOF weighted node model is introduced. In the proposed node model, the weights of the points in the nodes are determined according to their surrounding neighborhood, as a function of LOF values and neighbor ranks. Finally, in order to increase the prediction performance, ensemble learning is adopted. In particular, bootstrap aggregation is used to generate multiple regression trees with LOF weighted node model. The experimental study shows that the proposed methodology yields the best root mean squared error (RMSE) values in five out of nine data sets. Also, the non-parametric tests demonstrate the statistical significance of the proposed approach over the benchmark methods. The proposed methodology can be applicable to various prediction problems.


Satış tahmini için uzun kısa-süreli bellek ağı tabanlı derin transfer öğrenme yaklaşımıLong short-term memory network based deep transfer learning approach for sales forecasting

January 2023

·

3 Reads

·

3 Citations

Gazi Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi

Üretim ve hizmet sektörlerinde faaliyet gösteren firmalar, artan rekabet koşulları ile mücadele edebilmek için belirsizlik altında geleceğe yönelik çeşitli kararlar alırlar. Bu kritik kararlardan biri satış tahminidir. Dijital teknolojilerin yaygınlaşması ile derin öğrenme yaklaşımlarının satış tahmininde kullanımı artmaktadır. Derin öğrenme, başarılı sonuçlar vermesine rağmen büyük miktarda veri ile uzun eğitim sürelerine ihtiyaç duymaktadır. Bu duruma çözüm olarak problemler arası bilgi aktarımını sağlayan transfer öğrenme (TL) kullanılmaktadır. Transfer öğrenme, kaynak veriler ile modelin eğitimini ve hedef veriye aktarımını sağlamaktadır. Bu çalışmada, farklı ürünlerin satış tahmini modellerinden elde edilen bilginin gelecekteki tahmin modellerine aktarımını sağlamak üzere derin transfer öğrenme yaklaşımı önerilmiştir. Satış verisi tek değişkenli zaman serisi olarak ele alınmıştır. Kaynak veri seçiminde aktarılabilirlik ölçütü olarak hedef ve kaynak veri arasındaki gerçek cezalı düzenleme uzaklığı (ERP) kullanılmıştır. Seçilen kaynak veri ile zamansal bağımlılıkların modellenmesini sağlayan uzun kısa vadeli hafıza (LSTM) ağı eğitilmiştir. Ön eğitilen LSTM ağında parametre transferi yapılarak hedef veri için ERP-LSTM-TL tahmin modeli oluşturulmuştur. Çeşitli sektörlere ait satış veri kümelerinde yapılan deneysel çalışmalarda ERP-LSTM-TL, hedef veri ile eğitilen LSTM’e göre tahmin doğruluğunda ve eğitim süresinde iyileşme sağlamıştır. Önerilen yaklaşımın performansı klasik tahmin ve makine öğrenmesi yöntemlerinin performansları ile karşılaştırılmıştır. ERP-LSTM-TL karşılaştırılan yöntemlere göre istatistiksel olarak daha iyi sonuç vermiştir.



Consensus similarity graph construction for clustering

November 2022

·

59 Reads

·

2 Citations

Pattern Analysis and Applications

A similarity graph represents the local characteristics of a data set, and it is used as input to various clustering methods including spectral, graph-based, and hierarchical clustering. Several similarity graphs exist in the literature; however, there is not a single similarity graph that can handle all kinds of cluster shapes and structures. In this study, motivated by the successful applications of ensemble approaches to clustering, a generic method for consensus similarity graph construction is proposed. The proposed approach first constructs multiple similarity graphs using bootstrap aggregating (bagging). Then, these graphs are fused into a consensus similarity graph using the normalized co-association matrix. We use k-nearest neighbor, εε\varepsilon-neighborhood, fully connected graph, and proximity graphs as the base similarity graphs. Moreover, the proposed approach is coupled with various clustering algorithms including spectral, graph-based, and hierarchical clustering. The experimental results with various spatial and real data sets demonstrate the effectiveness of the consensus similarity graphs in clustering. The proposed approach is also robust to local noise.


Parameter-free surrounding neighborhood based regression methods

March 2022

·

14 Reads

·

4 Citations

Expert Systems with Applications

In machine learning, nearest neighbor (NN) regression is one of the most prominent methods for numeric prediction. It estimates the output variable of a new data point by averaging the output variables of the neighboring points. The selection of the neighborhood and its parameter(s) is crucial for the performance of NN regression, however this is still an open issue. This study contributes to the literature by adopting the parameter-free surrounding neighborhood (PSN) concept for NN regression. PSNs are based on proximity graphs, i.e. minimum spanning tree, relative neighborhood graph, and Gabriel graph. They yield a unique neighborhood for each point by combining proximity, connectivity and spatial distribution. The performances of the PSN regression methods are compared with k-nearest neighbors, k-nearest centroid neighbors, and support vector regression using real-world data sets. The statistical tests show that the PSN regression methods perform significantly better than most of the competing approaches. Also, the proposed approaches do not have any parameters to be set.


Citations (14)


... Hence, the distance between points is crucial, as the model gives more weight to nearer neighbours. The method is useful for a variety of data types, especially when the location of data points plays a role in predictions [57,58]. ...

Reference:

Advancing Iron Ore Grade Estimation: A Comparative Study of Machine Learning and Ordinary Kriging
LOF weighted KNN regression ensemble and its application to a die manufacturing company
  • Citing Article
  • December 2023

Sadhana

... The same data set was used in this study. The details of the data set are described in the literature [36]. ...

Remaining Useful Life Estimation of Turbofan Engines with Deep Learning Using Change-Point Detection Based Labeling and Feature Engineering

... This paper will show that the minimum norm point under constrained settings may not be the minimum norm point of the original convex hull that only consists of objective gradients, which will not mathematically guarantee the obtained direction will still be a common descent direction. However, many practical multi-objective optimization problems are constrained in nature, such as the multi-objective cluster ensemble problem (Aktaş et al. (2024)), the multi-objective sustainable Supply Chain management problem (Tautenhain et al. (2021)), multi-criteria portfolio problem (Petchrompo et al. (2022)), etc. Therefore, the gap between MGDA and constrained MDGA needs to be fulfilled. ...

Cluster ensemble selection and consensus clustering: A multi-objective optimization approach
  • Citing Article
  • October 2023

European Journal of Operational Research

... The first limitation lies in the difference in the output modes of the tasks. Most AL methods [35][36][37][38][39] in the field of computer science are aimed at many-to-one type tasks, while our TSRSI classification tasks are of the many-to-many type. The second limitation is that there is a significant difference between the TSRSI data and the data, such as text and audio data, to which these methods are applied. ...

An Active Learning Approach Using Clustering-Based Initialization for Time Series Classification
  • Citing Chapter
  • October 2023

... 1) Articles not written in English; 2) Articles without complete bibliometric information; 3) Duplicates of the same study; 4) Articles that are not fully relevant to the field of explainability in medicine research or are general-purpose papers without case study implementations. [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62], [63], [64], [65], [66], [67], [68], [69], [70], [71], [72], [73], [74], [75], [76], [77], [78], [79], [80], [81], [82], [83], [84], [85], [86], [87], [88], [89], [90], [91], [92], [93], [94], [95], [96], [97], [98], [99], [100], [101], [102], [103], [104], [105], [106], [107], [108] [109], [110], [111], [112], [113], [114], [115], [116], [117], [118], [119], [120], [121], [122], [123], [124], [125], [126], [127], [128], [129], [130], [131], [132], [133], [134], [135], [136], [137], [138], [139], [140], [141], [142], [143], [144], [145], [146], [147], [148], [149], [150], [151], [152], [153], [154], [155], [156], [157], [158], [159], [160], [161], [162], [163], [164], [165], [166], [167], [168], [169], [170], [171], [172], [173], [174], [175], [176], [177], [178], [179], [180], [181], [182], [183], [184] , [185], [186], [187], [188], [189], [190], [191], [192], [193], [194], [195], [196], [197], [198], [199], [200], [201], [202], [203], [204], [205], [206], [207], [208], [209], [210], [211], [212], [213], [214], [215], [216], [217], [218] [219], [220], [221], [222], [223], [224], [225] [226], [227], [228], [229], [230], [231], [232], [233], [234], [235] Forty more papers were excluded because it was not possible to access the full-text articles. At the end of this phase, the remaining 214 papers were identified as eligible reports. ...

A novel LOF-based ensemble regression tree methodology

Neural Computing and Applications

... Bu çalışmada, önerilen derin transfer öğrenme yaklaşımının performansını değerlendirmek için Tablo 3'te yer alan farklı sektörlere ait satış verileri kullanılmıştır [39,40]. Tüm veri kümeleri haftalık satış verilerini içerecek şekilde düzenlenmiştir. ...

Clustering-based Sales Forecasting in a Forklift Distributor

Uluslararası Muhendislik Arastirma ve Gelistirme Dergisi

... Several other studies on vertically differentiated products incorporating channel considerations into the determination of product variety with vertical differentiation (Shi et al. 2013, Chen et al. 2016, Wang et al. 2017, quality competition between two manufacturers (Chambers et al. 2006, Inkaya et al. 2018, dynamic pricing strategy (Akçay et al. 2010, Liu andZhang 2013), impact of production technology on firm's product variety strategies (Netessine and Taylor 2007), product development strategy for information goods Seshadri 2007, Wu andChen 2008), Jones and Mendelson 2011), issues we do not explore here. These articles take the point of view of manufacturers as it is assumed that any quality level can be picked by decisionmakers. ...

Product variety strategies for vertically differentiated products in a two-stage supply chain
  • Citing Article
  • April 2018

... The clustering approaches are classified as: hierarchical (agglomerative and divisive), partition clustering, density-based, neural networks-based, structure (graph) based, fuzzy-based, probabilistic, and meta-heuristic-based approaches. Among those, the nature-inspired meta-heuristic approaches provide greater flexibility to employ it to the clustering problems [4] . Until now, numerous meta-heuristic optimization algorithms have been proposed by researchers based on inspiration from nature. ...

Swarm Intelligence-Based Clustering Algorithms: A Survey
  • Citing Chapter
  • April 2016