{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T11:13:55Z","timestamp":1782213235397,"version":"3.54.5"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T00:00:00Z","timestamp":1630368000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T00:00:00Z","timestamp":1630368000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Wireless Pers Commun"],"published-print":{"date-parts":[[2022,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>A healthy life is essential for a happy society, however it is a fact that seemingly invisible diseases plague our families and people suffer. The thyroid disease falls in such a category. Thyroid disorders are long-term and with carefully handled illnesses, people with thyroid disorders may also live stable and normal lives. Thyroid diagnosis, particularly for an inexperienced clinician, is a difficult proposal. Many researchers have established various methods for the diagnosis of the disease and several models for disease prediction have been developed. As with several other domains, machine learning approaches to modelling health care problems is gaining popularity. This study aims at providing solutions towards such a thyroid disease prediction. Dimension reduction techniques are applied, and reduced dimension data input to classifiers. Also, data augmentation is applied so as to be able to generate sufficient data for deep neural network model. Classifier prediction is compared to other similar researches. Real life dataset for thyroid disease has been used, and experiments conducted in distributed environment. Our proposed two stage approach gives a maximum accuracy of 99.95% which is very good as compared to existing techniques. We have shown that dimension reduction and data augmentation can be used very efficiently for achieving high accuracy of disease prediction.<\/jats:p>","DOI":"10.1007\/s11277-021-08974-3","type":"journal-article","created":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T11:04:00Z","timestamp":1630407840000},"page":"1921-1938","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":57,"title":["Increasing the Prediction Accuracy for Thyroid Disease: A Step Towards Better Health for Society"],"prefix":"10.1007","volume":"122","author":[{"given":"Ritesh","family":"Jha","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0680-2691","authenticated-orcid":false,"given":"Vandana","family":"Bhattacharjee","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Abhijit","family":"Mustafi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,8,31]]},"reference":[{"key":"8974_CR1","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1007\/s11277-019-06273-6","volume":"106","author":"S Anwar","year":"2019","unstructured":"Anwar, S., Prasad, R., Chowdhary, B. S., et al. (2019). A telemedicine platform for disaster management and emergency care. Wireless Personal Communications, 106, 191\u2013204. https:\/\/doi.org\/10.1007\/s11277-019-06273-6","journal-title":"Wireless Personal Communications"},{"key":"8974_CR2","doi-asserted-by":"publisher","first-page":"711","DOI":"10.1007\/s11277-020-07595-6","volume":"115","author":"S Prasad","year":"2020","unstructured":"Prasad, S., & Prasad, R. (2020). Child temperature monitoring system. Wireless Personal Communications, 115, 711\u2013723. https:\/\/doi.org\/10.1007\/s11277-020-07595-6","journal-title":"Wireless Personal Communications"},{"key":"8974_CR3","doi-asserted-by":"publisher","first-page":"1567","DOI":"10.1007\/s11277-020-07299-x","volume":"113","author":"S Anwar","year":"2020","unstructured":"Anwar, S., & Prasad, R. (2020). Connections of chronic diseases and socio-dynamic cues for integrating ICT with care plan adherence. Wireless Personal Communications, 113, 1567\u20131578. https:\/\/doi.org\/10.1007\/s11277-020-07299-x","journal-title":"Wireless Personal Communications"},{"key":"8974_CR4","doi-asserted-by":"publisher","first-page":"1501","DOI":"10.1007\/s11277-020-07435-7","volume":"114","author":"A Koren","year":"2020","unstructured":"Koren, A., Jur\u010devi\u0107, M., & Prasad, R. (2020). Comparison of data-driven models for cleaning eHealth sensor data: Use case on ECG signal. Wireless Personal Communications, 114, 1501\u20131517. https:\/\/doi.org\/10.1007\/s11277-020-07435-7","journal-title":"Wireless Personal Communications"},{"key":"8974_CR5","doi-asserted-by":"publisher","first-page":"678","DOI":"10.1109\/ACCESS.2015.2437951","volume":"3","author":"SMR Islam","year":"2015","unstructured":"Islam, S. M. R., Kwak, D., Kabir, M. H., Hossain, M., & Kwak, K.-S. (2015). The internet of things for health care: A comprehensive survey. IEEE Access, 3, 678\u2013708.","journal-title":"IEEE Access"},{"issue":"1","key":"8974_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/2047-2501-2-3","volume":"2","author":"W Raghupathi","year":"2014","unstructured":"Raghupathi, W., & Raghupathi, V. (2014). Big data analytics in healthcare: Promise and potential. Health Information Science and Systems, 2(1), 1\u201310.","journal-title":"Health Information Science and Systems"},{"issue":"3","key":"8974_CR7","doi-asserted-by":"publisher","first-page":"701","DOI":"10.1109\/TKDE.2015.2499200","volume":"28","author":"Z Yu","year":"2016","unstructured":"Yu, Z., et al. (2016). Incremental semi-supervised clustering ensemble for high dimensional data clustering. IEEE Transactions on Knowledge and Data Engineering, 28(3), 701\u2013714.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"8974_CR8","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1016\/j.procs.2016.05.171","volume":"85","author":"S Rallapalli","year":"2016","unstructured":"Rallapalli, S., Gondkar, R. R., & Ketavarapu, U. P. K. (2016). Impact of processing and analyzing healthcare big data on cloud computing environment by implementing hadoop cluster. Procedia Computer Science, 85, 16\u201322.","journal-title":"Procedia Computer Science"},{"issue":"12","key":"8974_CR9","doi-asserted-by":"publisher","first-page":"3191","DOI":"10.1109\/TKDE.2016.2605687","volume":"28","author":"S Wang","year":"2016","unstructured":"Wang, S., Chang, X., Li, X., Long, G., Yao, L., & Sheng, Q. Z. (2016). Diagnosis code assignment using sparsity-based disease correlation embedding. IEEE Transactions on Knowledge and Data Engineering, 28(12), 3191\u20133202.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"1","key":"8974_CR10","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1016\/j.bdr.2015.02.002","volume":"2","author":"T Huang","year":"2015","unstructured":"Huang, T., Lan, L., Fang, X., An, P., Min, J., & Wang, F. (2015). Promises and challenges of big data computing in health sciences. Big Data Research, 2(1), 2\u201311.","journal-title":"Big Data Research"},{"issue":"1","key":"8974_CR11","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1145\/1327452.1327492","volume":"51","author":"J Dean","year":"2008","unstructured":"Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107\u2013113.","journal-title":"Communications of the ACM"},{"key":"8974_CR12","doi-asserted-by":"publisher","first-page":"981","DOI":"10.3390\/app8060981","volume":"8","author":"V Menger","year":"2018","unstructured":"Menger, V., Scheepers, F., & Spruit, M. (2018). Comparing deep learning and classical machine learning approaches, for predicting inpatient violence incidents from clinical text. Applied Sciences, 8, 981.","journal-title":"Applied Sciences"},{"key":"8974_CR13","doi-asserted-by":"publisher","unstructured":"Ozyilmaz, L., Yildirim T. (2002). Diagnosis of thyroid disease using artificial neural network methods. In Proceedings of the 9th international conference on neural information processing, 2002. ICONIP '02., Singapore, pp. 2033\u20132036 vol.4, doi: https:\/\/doi.org\/10.1109\/ICONIP.2002.1199031.","DOI":"10.1109\/ICONIP.2002.1199031"},{"issue":"6","key":"8974_CR14","doi-asserted-by":"publisher","first-page":"1227","DOI":"10.1007\/s13042-017-0756-7","volume":"10","author":"T Alqurashi","year":"2019","unstructured":"Alqurashi, T., & Wang, W. (2019). Clustering ensemble method. International Journal of Machine Learning and Cybernetics, 10(6), 1227\u20131246.","journal-title":"International Journal of Machine Learning and Cybernetics"},{"issue":"10","key":"8974_CR15","doi-asserted-by":"publisher","first-page":"264","DOI":"10.4236\/eng.2013.510B055","volume":"5","author":"A Akbas","year":"2013","unstructured":"Akbas, A., Turhal, U., Babur, S., & Avci, C. (2013). Performance improvement with combining multiple approaches to diagnosis of thyroid cancer. Engineering, 5(10), 264\u2013267.","journal-title":"Engineering"},{"key":"8974_CR16","doi-asserted-by":"crossref","unstructured":"Awasthi, A. K., Antony, A. (2018). An intelligent system for thyroid disease classification and diagnosis. In 2018 Second international conference on inventive communication and computational technologies(ICICCT). IEEE, pp 1261\u20131264.","DOI":"10.1109\/ICICCT.2018.8473349"},{"key":"8974_CR17","doi-asserted-by":"crossref","unstructured":"Azar, A. T., Hassanien, A. E., Kim, T. H. (2012). Expert system based on neural-fuzzy rules for thyroid diseases diagnosis. In Computer applications for bio-technology, multimedia, and Ubiquitous City. Springer, Berlin, Heidelberg pp. 94\u2013105.","DOI":"10.1007\/978-3-642-35521-9_13"},{"key":"8974_CR18","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1007\/s42454-020-00006-y","volume":"2","author":"DC Yadav","year":"2020","unstructured":"Yadav, D. C., & Pal, S. (2020). Prediction of thyroid disease using decision tree ensemble method. Human-Intelligent Systems Integration, 2, 89\u201395.","journal-title":"Human-Intelligent Systems Integration"},{"issue":"5","key":"8974_CR19","doi-asserted-by":"publisher","first-page":"3327","DOI":"10.1007\/s10916-012-9825-3","volume":"36","author":"L-N Li","year":"2012","unstructured":"Li, L.-N., Ouyang, J.-H., Chen, H.-L., & Liu, D.-Y. (2012). A computer aided diagnosis system for thyroid disease using extreme learning machine. Journal of Medical Systems, 36(5), 3327\u20133337.","journal-title":"Journal of Medical Systems"},{"key":"8974_CR20","doi-asserted-by":"publisher","first-page":"9786","DOI":"10.1109\/ACCESS.2016.2647619","volume":"4","author":"PK Sahoo","year":"2016","unstructured":"Sahoo, P. K., Mohapatra, S. K., & Wu, S.-L. (2016). Analyzing healthcare big data with prediction for future health condition. IEEE Access, 4, 9786\u20139799.","journal-title":"IEEE Access"},{"issue":"4","key":"8974_CR21","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1001\/archinte.160.4.526","volume":"160","author":"GJ Canaris","year":"2000","unstructured":"Canaris, G. J., Manowitz, N. R., Mayor, G., & Ridgway, E. C. (2000). The Colorado thyroid disease prevalence study. Archives of Internal Medicine, 160(4), 526\u2013534.","journal-title":"Archives of Internal Medicine"},{"issue":"3","key":"8974_CR22","doi-asserted-by":"publisher","first-page":"1179","DOI":"10.1007\/s00500-014-1581-5","volume":"20","author":"V Prasad","year":"2016","unstructured":"Prasad, V., Srinivasa Rao, T., & Surendra Prasad Babu, M. (2016). Thyroid disease diagnosis via hybrid architecture composing rough data sets theory and machine learning algorithms. Soft Computing, 20(3), 1179\u20131189.","journal-title":"Soft Computing"},{"key":"8974_CR23","doi-asserted-by":"publisher","first-page":"366","DOI":"10.1109\/TCYB.2017.2761908","volume":"49","author":"Z Yu","year":"2017","unstructured":"Yu, Z., Zhang, Y., You, J., Philip Chen, C. L., Wong, H.-S., Han, G., & Zhang, J. (2017). Adaptive semi-supervised classifier ensemble for high dimensional data classification. IEEE Transactions on Cybernetics, 49, 366\u2013379.","journal-title":"IEEE Transactions on Cybernetics"},{"issue":"7","key":"8974_CR24","doi-asserted-by":"publisher","first-page":"718","DOI":"10.1089\/thy.2005.15.718","volume":"15","author":"MJ Nyirenda","year":"2005","unstructured":"Nyirenda, M. J., Clark, D. N., Finlayson, A. R., Read, J., Elders, A., Bain, M., Fox, K. A. A., & Toft, A. D. (2005). Thyroid disease and increased cardiovascular risk. Thyroid, 15(7), 718\u2013724.","journal-title":"Thyroid"},{"issue":"6","key":"8974_CR25","first-page":"617","volume":"6","author":"MT Raghuraman","year":"2019","unstructured":"Raghuraman, M. T., Sailatha, E., Gunasekaran, S. (2019). Efficient thyroid disease prediction and comparative study using machine learning algorithms. International Journal of Information and Computing Science. 6(6), 617\u2013624.","journal-title":"International Journal of Information and Computing Science"},{"issue":"03","key":"8974_CR26","first-page":"229","volume":"11","author":"K Dharmarajan","year":"2020","unstructured":"Dharmarajan, K., Balasree, K., Arunachalam, A. S., & Abirmai, K. (2020). Thyroid disease classification using decision tree and SVM. Indian Journal of Public Health Research & Development, 11(03), 229\u2013234.","journal-title":"Indian Journal of Public Health Research & Development"},{"key":"8974_CR27","unstructured":"UCI Machine Learning Repository [http:\/\/archive.ics.uci.edu\/ml]. Irvine, CA: University of California, School of Information and Computer Science"},{"issue":"3","key":"8974_CR28","first-page":"115","volume":"7","author":"I Ioni\u0163\u0103","year":"2016","unstructured":"Ioni\u0163\u0103, I., & Ioni\u0163\u0103, L. (2016). Prediction of thyroid disease using data mining techniques. BRAIN. Broad Research in Artificial Intelligence and Neuroscience, 7(3), 115\u2013124.","journal-title":"BRAIN. Broad Research in Artificial Intelligence and Neuroscience"},{"key":"8974_CR29","doi-asserted-by":"crossref","unstructured":"Tyagi, A., Mehra, R., Saxena, A. (2018). Interactive thyroid disease prediction system using machine learning technique. In 2018 Fifth international conference on parallel, distributed and grid computing (PDGC). IEEE, pp 689\u2013693.","DOI":"10.1109\/PDGC.2018.8745910"},{"issue":"8","key":"8974_CR30","doi-asserted-by":"publisher","first-page":"181","DOI":"10.23956\/ijarcsse.v7i8.47","volume":"7","author":"A Sivasakthivel","year":"2017","unstructured":"Sivasakthivel, A., & Shrivakshan, G. T. (2017). A comparative study of diag-nosing thyroid diseases using classification algorithm. International Journals of Advanced Research in Computer Science and Software Engineering, 7(8), 181. ISSN: 2277-128X.","journal-title":"International Journals of Advanced Research in Computer Science and Software Engineering"}],"container-title":["Wireless Personal Communications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11277-021-08974-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11277-021-08974-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11277-021-08974-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,12,29]],"date-time":"2021-12-29T09:15:39Z","timestamp":1640769339000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11277-021-08974-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,31]]},"references-count":30,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,1]]}},"alternative-id":["8974"],"URL":"https:\/\/doi.org\/10.1007\/s11277-021-08974-3","relation":{},"ISSN":["0929-6212","1572-834X"],"issn-type":[{"value":"0929-6212","type":"print"},{"value":"1572-834X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,31]]},"assertion":[{"value":"9 August 2021","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 August 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}