Machine learning-based techniques for fault diagnosis in the semiconductor manufacturing process: a comparative study

dc.contributor.authorNuhu, Abubakar Abdussalam
dc.contributor.authorZeeshan, Qasim
dc.contributor.authorSafaei, Babak
dc.contributor.authorShahzad, Muhammad Atif
dc.date.accessioned2026-02-06T18:35:22Z
dc.date.issued2023
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractIndustries are going through the fourth industrial revolution (Industry 4.0), where technologies like the Industrial Internet of things, big data analytics, and machine learning (ML) are extensively utilized to improve the productivity and efficiency of manufacturing systems and processes. This work aims to further investigate the applicability and improve the effectiveness of ML prediction models for fault diagnosis in the smart manufacturing process. Hence, we propose several methodologies and ML models for fault diagnosis for smart manufacturing process applications. A case study has been conducted on a real dataset from a semiconductor manufacturing (SECOM) process. However, this dataset contains missing values, noisy features, and class imbalance problem. This imbalance problem makes it so difficult to accurately predict the minority class, due to the majority class size difference. In the literature, efforts have been made to alleviate the class imbalance problem using several synthetic data generation techniques (SDGT) on the UCI machine learning repository SECOM dataset. In this work, to handle the imbalance problem, we employed, compared, and evaluated the feasibility of three SDGT on this dataset. To handle issues related to the missing values and noisy features, we implemented two missing values imputation techniques and feature selection techniques, respectively. We then developed and compared the performance of ten predictive ML models against these proposed methodologies. The results obtained across several evaluation metrics of performance were significant. A comparative analysis shows the feasibility and validate the effectiveness of these SDGT and the proposed methodologies. Some among the proposed methodologies could produce an accuracy in the range of 99.5% to 100%. Furthermore, based on a comparative analysis with similar models from the literature, our proposed models outpaced those proposed in the literature.
dc.identifier.doi10.1007/s11227-022-04730-x
dc.identifier.endpage2081
dc.identifier.issn0920-8542
dc.identifier.issn1573-0484
dc.identifier.issue2
dc.identifier.orcid0000-0002-1675-4902
dc.identifier.orcid0000-0001-5488-8082
dc.identifier.orcid0000-0003-4699-4066
dc.identifier.scopus2-s2.0-85135634032
dc.identifier.scopusqualityQ1
dc.identifier.startpage2031
dc.identifier.urihttps://doi.org/10.1007/s11227-022-04730-x
dc.identifier.urihttps://hdl.handle.net/11129/11890
dc.identifier.volume79
dc.identifier.wosWOS:000836828400003
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofJournal of Supercomputing
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260204
dc.subjectSemiconductor manufacturing process
dc.subjectIndustry 4
dc.subject0
dc.subjectSynthetic data generation
dc.subjectMachine learning
dc.subjectFault diagnosis
dc.subjectPredictive maintenance
dc.titleMachine learning-based techniques for fault diagnosis in the semiconductor manufacturing process: a comparative study
dc.typeArticle

Files