{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:04:52Z","timestamp":1760241892476,"version":"build-2065373602"},"reference-count":11,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2018,10,20]],"date-time":"2018-10-20T00:00:00Z","timestamp":1539993600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key R&amp;D Program of China","award":["2016YFC0904902"],"award-info":[{"award-number":["2016YFC0904902"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61502032"],"award-info":[{"award-number":["61502032"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Chronic diseases represented by circulatory diseases have gradually become the main types of diseases affecting the health of our population. Establishing a circulatory system disease prediction model to predict the occurrence of diseases and controlling them is of great significance to the health of our population. This article is based on the prospective population cohort data of chronic diseases in China, based on the existing medical cohort studies, the Kaplan\u2013Meier method was used for feature selection, and the traditional medical analysis model represented by the Cox proportional hazards model was used and introduced. Support vector machine research methods in machine learning establish circulatory system disease prediction models. This paper also attempts to introduce the proportion of the explanation variation (PEV) and the shrinkage factor to improve the Cox proportional hazards model; and the use of Particle Swarm Optimization (PSO) algorithm to optimize the parameters of SVM model. Finally, the experimental verification of the above prediction models is carried out. This paper uses the model training time, Accuracy rate(ACC), the area under curve (AUC)of the Receiver Operator Characteristic curve (ROC) and other forecasting indicators. The experimental results show that the PSO-SVM-CSDPC disease prediction model and the S-Cox-CSDPC circulation system disease prediction model have the advantages of fast model solving speed, accurate prediction results and strong generalization ability, which are helpful for the intervention and control of chronic diseases.<\/jats:p>","DOI":"10.3390\/a11100162","type":"journal-article","created":{"date-parts":[[2018,10,23]],"date-time":"2018-10-23T08:43:36Z","timestamp":1540284216000},"page":"162","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort"],"prefix":"10.3390","volume":"11","author":[{"given":"Haijing","family":"Tang","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China"}]},{"given":"Guo","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China"}]},{"given":"Yu","family":"Kang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7983-6473","authenticated-orcid":false,"given":"Xu","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,10,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1243","DOI":"10.1093\/ije\/dyi174","article-title":"The Kadoorie Study of Chronic Disease in China (KSCDC)","volume":"34","author":"Chen","year":"2005","journal-title":"Int. J. Epidemiol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1652","DOI":"10.1093\/ije\/dyr120","article-title":"China Kadoorie Biobank of 0.5 million people: Survey methods, baseline characteristics and long-term follow-up","volume":"40","author":"Chen","year":"2011","journal-title":"Int. J. Epidemiol."},{"key":"ref_3","first-page":"249","article-title":"The China Kadoorie Biobank: Related methodology and baseline characteristics of the participants","volume":"33","author":"Li","year":"2012","journal-title":"Chin. J. Epidemiol."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1111\/j.2517-6161.1983.tb01258.x","article-title":"Regression, Prediction and Shrinkage","volume":"45","author":"Copas","year":"1983","journal-title":"J. R. Stat. Soc."},{"key":"ref_5","first-page":"2442","article-title":"Study on Optimization of SVR Parameters Selection Based on PSO","volume":"18","author":"Xiong","year":"2006","journal-title":"J. Syst. Simul."},{"key":"ref_6","unstructured":"Kennedy, J., and Eberhart, R. (December, January 27). Particle swarm optimization. Proceedings of the IEEE International Conference on Neural Networks, Perth, Australia."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1999","DOI":"10.1002\/(SICI)1097-0258(19961015)15:19<1999::AID-SIM353>3.0.CO;2-D","article-title":"Explained variation in survival analysis","volume":"15","author":"Michael","year":"1996","journal-title":"Stat. Med."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1093\/biomet\/75.3.525","article-title":"Measures of Dependence for Censored Survival Data","volume":"75","author":"Kent","year":"1988","journal-title":"Biometrika"},{"key":"ref_9","first-page":"169","article-title":"The Parameter Estimation and the Existent Problems Analysis of the Logistic Model","volume":"1","author":"Zhu","year":"2011","journal-title":"Sci. Inf."},{"key":"ref_10","first-page":"83","article-title":"Comparative Study of K Nearest Neighbor and Logistic Regression Classification Algorithms","volume":"26","author":"Wan","year":"2016","journal-title":"J. Luoyang Inst. Sci. Technol."},{"key":"ref_11","first-page":"90","article-title":"An improved genetic algorithm-support vector machines scheme for coronary artery disease diagnosis","volume":"4","author":"Lu","year":"2014","journal-title":"J. Biol."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/10\/162\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:26:55Z","timestamp":1760196415000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/10\/162"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,20]]},"references-count":11,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2018,10]]}},"alternative-id":["a11100162"],"URL":"https:\/\/doi.org\/10.3390\/a11100162","relation":{},"ISSN":["1999-4893"],"issn-type":[{"type":"electronic","value":"1999-4893"}],"subject":[],"published":{"date-parts":[[2018,10,20]]}}}