{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,30]],"date-time":"2026-06-30T14:10:02Z","timestamp":1782828602396,"version":"3.54.5"},"reference-count":32,"publisher":"Emerald","issue":"3","license":[{"start":{"date-parts":[[2023,2,7]],"date-time":"2023-02-07T00:00:00Z","timestamp":1675728000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["DTA"],"published-print":{"date-parts":[[2023,6,14]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>The purpose of this paper is to identify the root cause of low yield problems in the semiconductor manufacturing process using sensor data continuously collected from manufacturing equipment and describe the process environment in the equipment.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>This paper proposes a sensor data mining process based on the sequential modeling of random forests for low yield diagnosis. The process consists of sequential steps: problem definition, data preparation, excursion time and critical sensor identification, data visualization and root cause identification.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>A case study is conducted using real-world data collected from a semiconductor manufacturer in South Korea to demonstrate the effectiveness of the diagnosis process. The proposed model successfully identified the excursion time and critical sensors previously identified by domain engineers using costly manual examination.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The proposed procedure helps domain engineers narrow down the excursion time and critical sensors from the massive sensor data. The procedure's outcome is highly interpretable, informative and easy to visualize.<\/jats:p><\/jats:sec>","DOI":"10.1108\/dta-08-2022-0341","type":"journal-article","created":{"date-parts":[[2023,2,7]],"date-time":"2023-02-07T01:13:41Z","timestamp":1675732421000},"page":"397-417","source":"Crossref","is-referenced-by-count":5,"title":["A sensor data mining process for identifying root causes associated with low yield in semiconductor manufacturing"],"prefix":"10.1108","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4857-521X","authenticated-orcid":false,"given":"Eunji","family":"Kim","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jinwon","family":"An","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hyun-Chang","family":"Cho","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sungzoon","family":"Cho","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Byeongeon","family":"Lee","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"140","published-online":{"date-parts":[[2023,2,7]]},"reference":[{"issue":"1","key":"key2023062207415354400_ref001","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","year":"2001","journal-title":"Machine Learning"},{"key":"key2023062207415354400_ref002","first-page":"731","article-title":"Big data analytic for multivariate fault detection and classification in semiconductor manufacturing","year":"2017"},{"issue":"5","key":"key2023062207415354400_ref003","doi-asserted-by":"crossref","first-page":"961","DOI":"10.1007\/s10845-013-0791-5","article-title":"An empirical study of design-of-experiment data mining for yield-loss diagnosis for semiconductor manufacturing","volume":"25","year":"2014","journal-title":"Journal of Intelligent Manufacturing"},{"issue":"4","key":"key2023062207415354400_ref004","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1109\/TSM.2014.2356555","article-title":"A framework for root cause detection of sub-batch processing system for semiconductor manufacturing big data analytics","volume":"27","year":"2014","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"17","key":"key2023062207415354400_ref005","first-page":"1","article-title":"Analysing semiconductor manufacturing big data for root cause detection of excursion for yield enhancement","volume":"55","year":"2015","journal-title":"International Journal of Production Research"},{"issue":"1","key":"key2023062207415354400_ref006","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1016\/j.eswa.2006.04.014","article-title":"Data mining for yield enhancement in semiconductor manufacturing and an empirical study","volume":"33","year":"2007","journal-title":"Expert Systems With Applications"},{"issue":"1","key":"key2023062207415354400_ref007","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1109\/TSM.2020.2965288","article-title":"Chamber-to-chamber discrepancy detection in semiconductor manufacturing","volume":"33","year":"2020","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"4","key":"key2023062207415354400_ref008","doi-asserted-by":"crossref","first-page":"1925","DOI":"10.1109\/TASE.2020.2983061","article-title":"Data-driven approach for fault detection and diagnostic in semiconductor manufacturing","volume":"17","year":"2020","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"issue":"4","key":"key2023062207415354400_ref009","doi-asserted-by":"crossref","first-page":"544","DOI":"10.1109\/TSM.2019.2929765","article-title":"Key parameter identification and defective wafer detection of semiconductor manufacturing processes using image processing techniques","volume":"32","year":"2019","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"key":"key2023062207415354400_ref010","volume-title":"Learning from Imbalanced Data Sets","year":"2018"},{"issue":"4","key":"key2023062207415354400_ref011","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1109\/TSM.2007.907607","article-title":"Fault detection using the k-nearest neighbor rule for semiconductor manufacturing processes","volume":"20","year":"2007","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"1","key":"key2023062207415354400_ref012","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1109\/TSM.2011.2175394","article-title":"Fault detection and classification in plasma etch equipment for semiconductor manufacturing e-diagnostics","volume":"25","year":"2012","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"key":"key2023062207415354400_ref013","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1007\/978-3-642-29977-3_24","article-title":"Main branch decision tree algorithm for yield enhancement with class imbalance","volume":"15","year":"2012","journal-title":"Intelligent Decision Technologies"},{"issue":"3","key":"key2023062207415354400_ref014","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1007\/s10845-020-01591-0","article-title":"Multiple time-series convolutional neural network for fault detection and diagnosis and empirical study in semiconductor manufacturing","volume":"32","year":"2021","journal-title":"Journal of Intelligent Manufacturing"},{"issue":"2","key":"key2023062207415354400_ref015","first-page":"319","article-title":"Joint modeling of classification and regression for improving faulty wafer detection in semiconductor manufacturing","volume":"31","year":"2018","journal-title":"Journal of Intelligent Manufacturing"},{"key":"key2023062207415354400_ref016","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.cie.2017.01.028","article-title":"Mining the relationship between production and customer service data for failure analysis of industrial products","volume":"106","year":"2017","journal-title":"Computers & Industrial Engineering"},{"key":"key2023062207415354400_ref017","doi-asserted-by":"crossref","first-page":"990","DOI":"10.1016\/j.asoc.2017.11.034","article-title":"Bayesian inference for mining semiconductor manufacturing big data for yield enhancement and smart production to empower industry 4.0","volume":"68","year":"2018","journal-title":"Applied Soft Computing"},{"key":"key2023062207415354400_ref018","first-page":"1","article-title":"DeepNAP: deep neural anomaly pre-detection in a semiconductor fab","volume":"457","year":"2018","journal-title":"Information Sciences"},{"issue":"3","key":"key2023062207415354400_ref019","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1109\/TSM.2019.2917521","article-title":"Fault detection and diagnosis using self-attentive convolutional neural networks for variable-length sensor data in semiconductor manufacturing","volume":"32","year":"2019","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"key":"key2023062207415354400_ref020","first-page":"79","article-title":"Data imbalance problem solving for smote based oversampling: study on fault detection prediction model in semiconductor manufacturing process","volume":"133","year":"2016","journal-title":"Advanced Science and Technology Letters"},{"issue":"1","key":"key2023062207415354400_ref021","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1109\/TSM.2016.2628865","article-title":"A deep learning model for robust wafer fault monitoring with sensor measurement noise","volume":"30","year":"2017","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"2","key":"key2023062207415354400_ref022","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1109\/TSM.2017.2676245","article-title":"A convolutional neural network for fault classification and diagnosis in semiconductor manufacturing processes","volume":"30","year":"2017","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"10","key":"key2023062207415354400_ref023","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1016\/j.jprocont.2009.07.011","article-title":"Fault detection and diagnosis in process data using one-class support vector machines","volume":"19","year":"2009","journal-title":"Journal of Process Control"},{"issue":"6","key":"key2023062207415354400_ref024","doi-asserted-by":"crossref","first-page":"2619","DOI":"10.1016\/j.eswa.2013.11.001","article-title":"Data based segmentation and summarization for sensor data in semiconductor manufacturing","volume":"41","year":"2014","journal-title":"Expert Systems With Applications"},{"issue":"5","key":"key2023062207415354400_ref025","doi-asserted-by":"crossref","first-page":"5711","DOI":"10.1016\/j.eswa.2010.10.062","article-title":"Spline regression based feature extraction for semiconductor process fault detection using support vector machine","volume":"38","year":"2011","journal-title":"Expert Systems With Applications"},{"issue":"2","key":"key2023062207415354400_ref026","doi-asserted-by":"crossref","first-page":"894","DOI":"10.1109\/TASE.2016.2545744","article-title":"Translation-invariant multiscale energy-based PCA for monitoring batch processes in semiconductor manufacturing","volume":"14","year":"2017","journal-title":"IEEE Transactions on Automation Science and Engineering"},{"issue":"2","key":"key2023062207415354400_ref027","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1109\/TSM.2018.2808703","article-title":"CDID: a system for identifying the root cause of a defect in semiconductor wafer fabrication","volume":"31","year":"2018","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"issue":"1","key":"key2023062207415354400_ref028","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1109\/TSM.2010.2065531","article-title":"Adaptive Mahalanobis distance and k-nearest neighbor rule for fault detection in semiconductor manufacturing","volume":"24","year":"2011","journal-title":"IEEE Transactions on Semiconductor Manufacturing"},{"key":"key2023062207415354400_ref029","first-page":"1450","article-title":"A novel Bayesian network-based fault prognostic method for semiconductor manufacturing process","year":"2017"},{"issue":"3","key":"key2023062207415354400_ref030","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1007\/s10489-009-0168-9","article-title":"Rule-based data mining for yield improvement in semiconductor manufacturing","volume":"33","year":"2010","journal-title":"Applied Intelligence"},{"key":"key2023062207415354400_ref031","doi-asserted-by":"crossref","first-page":"419","DOI":"10.1016\/j.cie.2017.09.036","article-title":"Ranking process parameter association with low yield wafers using spec-out event network analysis","volume":"113","year":"2017","journal-title":"Computers & Industrial Engineering"},{"issue":"1","key":"key2023062207415354400_ref032","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1109\/TSM.2014.2374339","article-title":"Fault detection using random projections and k-nearest neighbor rule for semiconductor manufacturing processes","volume":"28","year":"2015","journal-title":"IEEE Transactions on Semiconductor Manufacturing"}],"container-title":["Data Technologies and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-08-2022-0341\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-08-2022-0341\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:15:20Z","timestamp":1753398920000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/dta\/article\/57\/3\/397-417\/38850"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,7]]},"references-count":32,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,2,7]]},"published-print":{"date-parts":[[2023,6,14]]}},"alternative-id":["10.1108\/DTA-08-2022-0341"],"URL":"https:\/\/doi.org\/10.1108\/dta-08-2022-0341","relation":{},"ISSN":["2514-9288","2514-9288"],"issn-type":[{"value":"2514-9288","type":"print"},{"value":"2514-9288","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,7]]}}}