{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T15:20:31Z","timestamp":1753888831438,"version":"3.41.2"},"reference-count":49,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2024,10,20]],"date-time":"2024-10-20T00:00:00Z","timestamp":1729382400000},"content-version":"vor","delay-in-days":293,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Applied Computational Intelligence and Soft Computing"],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:p>Alzheimer\u2019s disease (AD) is a chronic, advanced brain sickness disease that slowly destroys memory and thinking skills and, in the end, the ability to perform routine tasks. This disease is caused by the abnormal clumping of proteins such as amyloids around the brain cells. The identification of proteins involved in Alzheimer\u2019s is essential to understand the disease and to discover and design the drugs. Experimental processes involving in\u2010vitro or in\u2010vivo experiments for this purpose are very time\u2010consuming, laborious, and highly costly. However, costly and tedious experimental procedures can be performed efficiently by targeting the most probable proteins involved in Alzheimer\u2019s predicted and ranked through a computational method with better generalization accuracy. In this study, we have proposed a machine learning (ML)\u2013based predictive model to identify proteins potentially involved in Alzheimer\u2019s. Through a series of simulation studies, we have shown that our proposed model by using protein sequence information only gives state\u2010of\u2010the\u2010art generalization performance with an area under the precision\u2010recall curve of 0.93 verified through various ML\u2010centric and biologically relevant techniques and metrics. Through data mining in this study, we have also performed feature analysis to identify the role of individual amino acids in such proteins. Python code for feature extraction, training, and evaluating our proposed models together with the dataset is available at the URL: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/sourceforge.net\/projects\/alzheimer-associated-proteins\/files\/\">https:\/\/sourceforge.net\/projects\/alzheimer-associated-proteins\/files\/<\/jats:ext-link>.<\/jats:p>","DOI":"10.1155\/2024\/7914178","type":"journal-article","created":{"date-parts":[[2024,10,21]],"date-time":"2024-10-21T04:33:11Z","timestamp":1729485191000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Extreme Gradient Boosting Beats In\u2010Silico Identification of Proteins Potentially Associated With Alzheimer\u2019s"],"prefix":"10.1155","volume":"2024","author":[{"given":"Sadia","family":"Khalil","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7691-5715","authenticated-orcid":false,"given":"Wajid Arshad","family":"Abbasi","sequence":"additional","affiliation":[]},{"given":"Syed Ali","family":"Abbas","sequence":"additional","affiliation":[]},{"given":"Maryum","family":"Bibi","sequence":"additional","affiliation":[]},{"given":"Saiqa","family":"Andleeb","sequence":"additional","affiliation":[]},{"given":"Amsa","family":"Shabir","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2024,10,20]]},"reference":[{"key":"e_1_2_10_1_2","doi-asserted-by":"publisher","DOI":"10.1136\/bmj.b158"},{"key":"e_1_2_10_2_2","doi-asserted-by":"publisher","DOI":"10.1177\/1179573520907397"},{"key":"e_1_2_10_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jalz.2012.11.007"},{"key":"e_1_2_10_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/S2468-2667(18)30118-X"},{"key":"e_1_2_10_5_2","doi-asserted-by":"publisher","DOI":"10.1002\/trc2.12295"},{"key":"e_1_2_10_6_2","doi-asserted-by":"publisher","DOI":"10.1021\/cn300091a"},{"key":"e_1_2_10_7_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-59745-188-8_21"},{"key":"e_1_2_10_8_2","doi-asserted-by":"publisher","DOI":"10.3791\/56739"},{"key":"e_1_2_10_9_2","doi-asserted-by":"publisher","DOI":"10.4103\/1947-2714.100998"},{"key":"e_1_2_10_10_2","doi-asserted-by":"publisher","DOI":"10.4103\/0975-7406.100281"},{"key":"e_1_2_10_11_2","doi-asserted-by":"publisher","DOI":"10.1002\/wcms.1225"},{"key":"e_1_2_10_12_2","doi-asserted-by":"publisher","DOI":"10.1177\/155005941104200304"},{"key":"e_1_2_10_13_2","doi-asserted-by":"publisher","DOI":"10.3390\/bs8010016"},{"key":"e_1_2_10_14_2","doi-asserted-by":"publisher","DOI":"10.1002\/prot.21989"},{"key":"e_1_2_10_15_2","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-12-389"},{"key":"e_1_2_10_16_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts504"},{"key":"e_1_2_10_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/TST.2015.7297749"},{"key":"e_1_2_10_18_2","doi-asserted-by":"publisher","DOI":"10.1186\/s12883-017-1010-3"},{"key":"e_1_2_10_19_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41374-019-0202-4"},{"key":"e_1_2_10_20_2","doi-asserted-by":"publisher","DOI":"10.1038\/ng.3259"},{"key":"e_1_2_10_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.112873"},{"key":"e_1_2_10_22_2","doi-asserted-by":"publisher","DOI":"10.3390\/genes13081406"},{"key":"e_1_2_10_23_2","doi-asserted-by":"publisher","DOI":"10.7717\/peerj.6543"},{"key":"e_1_2_10_24_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq003"},{"key":"e_1_2_10_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3076448"},{"key":"e_1_2_10_26_2","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkx950"},{"key":"e_1_2_10_27_2","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gky1049"},{"key":"e_1_2_10_28_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022627411411"},{"key":"e_1_2_10_29_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_10_30_2","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1013203451"},{"key":"e_1_2_10_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2024.3367588"},{"key":"e_1_2_10_32_2","first-page":"785","volume-title":"Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, in KDD\u201916","author":"Chen T.","year":"2016"},{"key":"e_1_2_10_33_2","unstructured":"AbbasiW. A. HassanF. U. YaseenA. andMinhasF. U. A. A. ISLAND: In-Silico Prediction of Proteins Binding Affinity Using Sequence Descriptors 2017 https:\/\/128.84.21.199\/abs\/1711.10540."},{"key":"e_1_2_10_34_2","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-15-291"},{"key":"e_1_2_10_35_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq112"},{"key":"e_1_2_10_36_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr513"},{"key":"e_1_2_10_37_2","first-page":"564","article-title":"The Spectrum Kernel: A String Kernel for SVM Protein Classification","volume":"7","author":"Leslie C.","year":"2002","journal-title":"Pacific Symposium on Biocomputing"},{"key":"e_1_2_10_38_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btm208"},{"key":"e_1_2_10_39_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720016500116"},{"key":"e_1_2_10_40_2","doi-asserted-by":"publisher","DOI":"10.1038\/nbt0804-1035"},{"key":"e_1_2_10_41_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt072"},{"key":"e_1_2_10_42_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720022500196"},{"key":"e_1_2_10_43_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btv345"},{"key":"e_1_2_10_44_2","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-10-48"},{"key":"e_1_2_10_45_2","first-page":"1137","volume-title":"Proceedings of the 14th International Joint Conference on Artificial Intelligence-Volume 2","author":"Kohavi R.","year":"1995"},{"key":"e_1_2_10_46_2","doi-asserted-by":"publisher","DOI":"10.1038\/75556"},{"key":"e_1_2_10_47_2","doi-asserted-by":"publisher","DOI":"10.1038\/234034a0"},{"key":"e_1_2_10_48_2","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1145\/1143844.1143874","volume-title":"Proceedings of the 23rd International Conference on Machine Learning","author":"Davis J.","year":"2006"},{"key":"e_1_2_10_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3606367"}],"container-title":["Applied Computational Intelligence and Soft Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2024\/7914178","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,21]],"date-time":"2024-10-21T04:33:16Z","timestamp":1729485196000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2024\/7914178"}},"subtitle":[],"editor":[{"given":"Ahmad","family":"Al-Omari","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,1]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,1]]}},"alternative-id":["10.1155\/2024\/7914178"],"URL":"https:\/\/doi.org\/10.1155\/2024\/7914178","archive":["Portico"],"relation":{},"ISSN":["1687-9724","1687-9732"],"issn-type":[{"type":"print","value":"1687-9724"},{"type":"electronic","value":"1687-9732"}],"subject":[],"published":{"date-parts":[[2024,1]]},"assertion":[{"value":"2024-04-17","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-04","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-10-20","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"7914178"}}