{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T04:18:45Z","timestamp":1741666725590,"version":"3.38.0"},"reference-count":38,"publisher":"SAGE Publications","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2018,6,27]]},"DOI":"10.3233\/ida-173500","type":"journal-article","created":{"date-parts":[[2018,7,3]],"date-time":"2018-07-03T19:06:15Z","timestamp":1530644775000},"page":"717-733","source":"Crossref","is-referenced-by-count":0,"title":["De-noising documents with a novelty detection method utilizing class vectors"],"prefix":"10.1177","volume":"22","author":[{"given":"Younghoon","family":"Lee","sequence":"first","affiliation":[{"name":"Department of Industrial Engineering and Institute for Industrial Systems Innovation, Seoul National University, Seoul 151-742, Korea"},{"name":"Data Driven User Experience Team, Mobile Communication Lab, LG Electronics, Seoul 153-802, Korea"}]},{"given":"Sungzoon","family":"Cho","sequence":"additional","affiliation":[{"name":"Department of Industrial Engineering and Institute for Industrial Systems Innovation, Seoul National University, Seoul 151-742, Korea"}]},{"given":"Jinhae","family":"Choi","sequence":"additional","affiliation":[{"name":"Data Driven User Experience Team, Mobile Communication Lab, LG Electronics, Seoul 153-802, Korea"}]}],"member":"179","reference":[{"issue":"4","key":"10.3233\/IDA-173500_ref1","first-page":"1","article-title":"The \u2018One Right Way\u2019 to gather the voice of the customer","volume":"25","author":"Katz","year":"2001","journal-title":"PDMA Visions Magazine"},{"issue":"1","key":"10.3233\/IDA-173500_ref3","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1287\/mksc.12.1.1","article-title":"The voice of the customer","volume":"12","author":"Griffin","year":"1993","journal-title":"Marketing science"},{"key":"10.3233\/IDA-173500_ref4","unstructured":"B.D. Temkin, B. Chatham and M. Amato, The Customer Experience Value Chain: An Enterprisewide Approach For Meeting Customer Needs, Forrester Research. March 15 (2005)."},{"issue":"4","key":"10.3233\/IDA-173500_ref5","doi-asserted-by":"crossref","first-page":"469","DOI":"10.3390\/a5040469","article-title":"Contextual anomaly detection in text data","volume":"5","author":"Mahapatra","year":"2012","journal-title":"Algorithms"},{"key":"10.3233\/IDA-173500_ref6","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1016\/j.sigpro.2013.12.026","article-title":"A review of novelty detection","volume":"99","author":"Pimentel","year":"2014","journal-title":"Signal Processing"},{"issue":"1","key":"10.3233\/IDA-173500_ref8","first-page":"307","article-title":"Outlier detection: applications and techniques","volume":"9","author":"Singh","year":"2012","journal-title":"International Journal of Computer Science Issues"},{"key":"10.3233\/IDA-173500_ref9","doi-asserted-by":"crossref","unstructured":"S. Ando, Clustering needles in a haystack: An information theoretic analysis of minority and outlier detection, in: Seventh IEEE International Conference on Data Mining (ICDM 2007), IEEE, 2007, pp. 13\u201322.","DOI":"10.1109\/ICDM.2007.53"},{"issue":"3","key":"10.3233\/IDA-173500_ref10","doi-asserted-by":"crossref","first-page":"405","DOI":"10.3233\/IDA-2009-0373","article-title":"Novelty detection with application to data streams","volume":"13","author":"Spinosa","year":"2009","journal-title":"Intelligent Data Analysis"},{"key":"10.3233\/IDA-173500_ref11","unstructured":"J. Zhang, Z. Ghahramani and Y. Yang, A probabilistic model for online document clustering with application to novelty detection, in: Advances in Neural Information Processing Systems, 2004, pp. 1617\u20131624."},{"key":"10.3233\/IDA-173500_ref12","unstructured":"L.D. Baker, T. Hofmann, A. McCallum and Y. Yang, A hierarchical probabilistic model for novelty detection in text, in: Proceedings of International Conference on Machine Learning, Citeseer, 1999."},{"key":"10.3233\/IDA-173500_ref13","unstructured":"L. Manevitz and M. Yousef, Learning from positive data for document classification using neural networks, in: Proceedings of the 2nd Bar-Ilan Workshop on Knowledge Discovery and Learning, 2000."},{"issue":"Dec","key":"10.3233\/IDA-173500_ref14","first-page":"139","article-title":"One-class SVMs for document classification","volume":"2","author":"Manevitz","year":"2001","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/IDA-173500_ref15","unstructured":"D. Guthrie, L. Guthrie, B. Allison and Y. Wilks, Unsupervised Anomaly Detection., in: IJCAI, 2007, pp. 1624\u20131628."},{"key":"10.3233\/IDA-173500_ref18","doi-asserted-by":"crossref","unstructured":"J. Heymann, O. Walter, R. Haeb-Umbach and B. Raj, Unsupervised word segmentation from noisy input, in: Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on, IEEE, 2013, pp. 458\u2013463.","DOI":"10.1109\/ASRU.2013.6707773"},{"key":"10.3233\/IDA-173500_ref19","unstructured":"S. Chatterji, D. Chatterjee and S. Sarkar, An Efficient Technique for De-Noising Sentences using Monolingual Corpus and Synonym Dictionary., in: COLING (Demos), Citeseer, 2012, pp. 59\u201366."},{"issue":"2\u20133","key":"10.3233\/IDA-173500_ref20","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","article-title":"Distributional structure","volume":"10","author":"Harris","year":"1954","journal-title":"Word"},{"issue":"1","key":"10.3233\/IDA-173500_ref21","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1613\/jair.2934","article-title":"From frequency to meaning: Vector space models of semantics","volume":"37","author":"Turney","year":"2010","journal-title":"Journal of Artificial Intelligence Research"},{"key":"10.3233\/IDA-173500_ref22","doi-asserted-by":"crossref","unstructured":"J. Camacho-Collados and R. Navigli, Find the word that does not belong: A Framework for an Intrinsic Evaluation of Word Vector Representations, in: ACL Workshop on Evaluating Vector Space Representations for NLP, 2016, pp. 43\u201350.","DOI":"10.18653\/v1\/W16-2508"},{"issue":"3","key":"10.3233\/IDA-173500_ref24","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1145\/1541880.1541882","article-title":"Anomaly detection: A survey","volume":"41","author":"Chandola","year":"2009","journal-title":"ACM Computing Surveys (CSUR)"},{"issue":"12","key":"10.3233\/IDA-173500_ref25","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1016\/j.sigpro.2003.07.018","article-title":"Novelty detection: a review-part 1: statistical approaches","volume":"83","author":"Markou","year":"2003","journal-title":"Signal Processing"},{"key":"10.3233\/IDA-173500_ref26","unstructured":"E. Eskin, Anomaly detection over noisy data using learned probability distributions, in: In Proceedings of the International Conference on Machine Learning, Citeseer, 2000."},{"key":"10.3233\/IDA-173500_ref27","doi-asserted-by":"crossref","unstructured":"A. Srivastava and B. Zane-Ulman, Discovering recurring anomalies in text reports regarding complex space systems, in: IEEE Aerospace Conference, 2005, p. 37.","DOI":"10.1109\/AERO.2005.1559692"},{"key":"10.3233\/IDA-173500_ref28","unstructured":"A. Srivastava, Enabling the discovery of recurring anomalies in aerospace problem reports using high-dimensional clustering techniques, in: 2006 IEEE Aerospace Conference, IEEE, 2006, p. 17."},{"issue":"4","key":"10.3233\/IDA-173500_ref29","doi-asserted-by":"crossref","first-page":"4075","DOI":"10.1016\/j.eswa.2011.09.088","article-title":"Machine learning-based novelty detection for faulty wafer detection in semiconductor manufacturing","volume":"39","author":"Kim","year":"2012","journal-title":"Expert Systems with Applications"},{"issue":"2\u20133","key":"10.3233\/IDA-173500_ref34","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1080\/01638539809545028","article-title":"An introduction to latent semantic analysis","volume":"25","author":"Landauer","year":"1998","journal-title":"Discourse Processes"},{"key":"10.3233\/IDA-173500_ref36","unstructured":"V. Su\u00e1rez-Paniagua, I. Segura-Bedmar and P. Mart\u00ednez, Word embedding clustering for disease named entity recognition, in: Proceedings of the Fifth BioCreative Challenge Evaluation Workshop, 2015, pp. 299\u2013304."},{"key":"10.3233\/IDA-173500_ref38","doi-asserted-by":"crossref","unstructured":"C. Orrite, M. Rodr\u00edguez, F. Mart\u00ednez and M. Fairhurst, Classifier ensemble generation for the majority vote rule, in: Iberoamerican Congress on Pattern Recognition, Springer, 2008, pp. 340\u2013347.","DOI":"10.1007\/978-3-540-85920-8_42"},{"key":"10.3233\/IDA-173500_ref39","first-page":"171","article-title":"Profiles and majority voting-based ensemble method for protein secondary structure prediction","volume":"7","author":"Bouziane","year":"2011","journal-title":"Evolutionary Bioinformatics Online"},{"key":"10.3233\/IDA-173500_ref40","doi-asserted-by":"crossref","unstructured":"A. Mucherino, P.J. Papajorgji and P.M. Pardalos, k-Nearest Neighbor Classification, in: Data Mining in Agriculture, Springer, 2009, pp. 83\u2013106.","DOI":"10.1007\/978-0-387-88615-2_4"},{"issue":"1","key":"10.3233\/IDA-173500_ref41","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1007\/BF00993481","article-title":"A weighted nearest neighbor algorithm for learning with symbolic features","volume":"10","author":"Cost","year":"1993","journal-title":"Machine Learning"},{"issue":"1","key":"10.3233\/IDA-173500_ref42","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","author":"Cover","year":"1967","journal-title":"IEEE Transactions on Information Theory"},{"issue":"1\u20132","key":"10.3233\/IDA-173500_ref46","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1093\/biomet\/54.1-2.167","article-title":"Estimation of the probability of an event as a function of several independent variables","volume":"54","author":"Walker","year":"1967","journal-title":"Biometrika"},{"key":"10.3233\/IDA-173500_ref47","unstructured":"P. Langley, W. Iba and K. Thompson, An analysis of Bayesian classifiers, in: Aaai, Vol.\u00a090, 1992, pp. 223\u2013228."},{"issue":"2\u20133","key":"10.3233\/IDA-173500_ref48","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1023\/A:1007413511361","article-title":"On the optimality of the simple Bayesian classifier under zero-one loss","volume":"29","author":"Domingos","year":"1997","journal-title":"Machine Learning"},{"key":"10.3233\/IDA-173500_ref49","doi-asserted-by":"crossref","unstructured":"D.D. Lewis, Naive (Bayes) at forty: The independence assumption in information retrieval, in: European Conference on Machine Learning, Springer, 1998, pp. 4\u201315.","DOI":"10.1007\/BFb0026666"},{"issue":"1\u20132","key":"10.3233\/IDA-173500_ref50","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1016\/S0092-8240(05)80006-0","article-title":"A logical calculus of the ideas immanent in nervous activity","volume":"52","author":"McCulloch","year":"1990","journal-title":"Bulletin of Mathematical Biology"},{"key":"10.3233\/IDA-173500_ref52","unstructured":"C.N. dos Santos and M. Gatti, Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts., in: COLING, 2014, pp. 69\u201378."},{"key":"10.3233\/IDA-173500_ref54","doi-asserted-by":"crossref","unstructured":"S. Lai, L. Xu, K. Liu and J. Zhao, Recurrent Convolutional Neural Networks for Text Classification., in: AAAI, 2015, pp. 2267\u20132273.","DOI":"10.1609\/aaai.v29i1.9513"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-173500","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,10]],"date-time":"2025-03-10T14:31:14Z","timestamp":1741617074000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDA-173500"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,27]]},"references-count":38,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.3233\/ida-173500","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"type":"print","value":"1088-467X"},{"type":"electronic","value":"1571-4128"}],"subject":[],"published":{"date-parts":[[2018,6,27]]}}}