{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T23:51:57Z","timestamp":1772063517878,"version":"3.50.1"},"reference-count":35,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04","funder":[{"name":"Higher Education Commission of Pakistan","award":["213-58990-2PS2-046"],"award-info":[{"award-number":["213-58990-2PS2-046"]}]},{"name":"Information Technology Endowment Fund PIEAS"},{"DOI":"10.13039\/501100004681","name":"Higher Education Commission, Pakistan","doi-asserted-by":"publisher","award":["NRPU Project 6085"],"award-info":[{"award-number":["NRPU Project 6085"]}],"id":[{"id":"10.13039\/501100004681","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004681","name":"Higher Education Commission, Pakistan","doi-asserted-by":"publisher","award":["315-12753-2EG3-197"],"award-info":[{"award-number":["315-12753-2EG3-197"]}],"id":[{"id":"10.13039\/501100004681","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Pakistan Institute of Engineering and Applied Sciences","award":["PIEAS MS Fellowship"],"award-info":[{"award-number":["PIEAS MS Fellowship"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2018,8]]},"abstract":"<jats:p> Detection of protein\u2013protein interactions (PPIs) plays a vital role in molecular biology. Particularly, pathogenic infections are caused by interactions of host and pathogen proteins. It is important to identify host\u2013pathogen interactions (HPIs) to discover new drugs to counter infectious diseases. Conventional wet lab PPI detection techniques have limitations in terms of cost and large-scale application. Hence, computational approaches are developed to predict PPIs. This study aims to develop machine learning models to predict inter-species PPIs with a special interest in HPIs. 
Specifically, we focus on seeking answers to three questions that arise while developing an HPI predictor: (1) How should negative training examples be selected? (2) Does assigning sample weights to individual negative examples based on their similarity to positive examples improve generalization performance? and (3) What should be the size of negative samples as compared to the positive samples during training and evaluation? We compare two available methods for negative sampling, random versus DeNovo sampling, and our experiments show that DeNovo sampling offers better accuracy. However, our experiments also show that generalization performance can be improved further by using a soft DeNovo approach that assigns sample weights to negative examples inversely proportional to their similarity to known positive examples during training. Based on our findings, we have also developed an HPI predictor called HOPITOR (Host-Pathogen Interaction Predictor) that can predict interactions between human and viral proteins. The HOPITOR web server can be accessed at the URL: http:\/\/faculty.pieas.edu.pk\/fayyaz\/software.html#HoPItor . 
<\/jats:p>","DOI":"10.1142\/s0219720018500142","type":"journal-article","created":{"date-parts":[[2018,5,30]],"date-time":"2018-05-30T06:20:17Z","timestamp":1527661217000},"page":"1850014","source":"Crossref","is-referenced-by-count":27,"title":["Training host-pathogen protein\u2013protein interaction predictors"],"prefix":"10.1142","volume":"16","author":[{"given":"Abdul Hannan","family":"Basit","sequence":"first","affiliation":[{"name":"Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan"},{"name":"Department of Electrical Engineering, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan"}]},{"given":"Wajid Arshad","family":"Abbasi","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan"}]},{"given":"Amina","family":"Asif","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan"}]},{"given":"Sadaf","family":"Gull","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9129-1189","authenticated-orcid":false,"given":"Fayyaz Ul Amir Afsar","family":"Minhas","sequence":"additional","affiliation":[{"name":"Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, 
Pakistan"}]}],"member":"219","published-online":{"date-parts":[[2018,10,23]]},"reference":[{"key":"S0219720018500142BIB001","doi-asserted-by":"publisher","DOI":"10.1201\/b10456"},{"key":"S0219720018500142BIB002","doi-asserted-by":"publisher","DOI":"10.1002\/pmic.201100563"},{"key":"S0219720018500142BIB003","volume-title":"From Protein Structure to Function with Bioinformatics","author":"Rigden DJ","year":"2008"},{"key":"S0219720018500142BIB004","doi-asserted-by":"publisher","DOI":"10.1002\/pmic.200700131"},{"key":"S0219720018500142BIB005","first-page":"325","volume-title":"Advances in Protein Chemistry","author":"Waugh DF","year":"1954"},{"key":"S0219720018500142BIB006","doi-asserted-by":"publisher","DOI":"10.1002\/9783527648207"},{"key":"S0219720018500142BIB007","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btm208"},{"key":"S0219720018500142BIB010","doi-asserted-by":"publisher","DOI":"10.1007\/10_2007_089"},{"key":"S0219720018500142BIB011","doi-asserted-by":"publisher","DOI":"10.3389\/fmicb.2015.00094"},{"key":"S0219720018500142BIB012","doi-asserted-by":"publisher","DOI":"10.1016\/j.meegid.2011.02.022"},{"key":"S0219720018500142BIB013","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btv737"},{"key":"S0219720018500142BIB014","doi-asserted-by":"publisher","DOI":"10.1110\/ps.073228407"},{"key":"S0219720018500142BIB015","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkl971"},{"key":"S0219720018500142BIB016","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts375"},{"key":"S0219720018500142BIB017","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0607879104"},{"key":"S0219720018500142BIB018","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-13-S7-S5"},{"key":"S0219720018500142BIB019","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720016500116"},{"key":"S0219720018500142BIB020","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-7-S1-S2"},{"key":"S0219720018500142BIB021","doi-asserted-by":"publisher","DOI":"10
.1023\/A:1010933404324"},{"key":"S0219720018500142BIB022","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"S0219720018500142BIB023","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btr514"},{"key":"S0219720018500142BIB024","doi-asserted-by":"publisher","DOI":"10.1080\/10618600.2012.680866"},{"key":"S0219720018500142BIB026","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001407005703"},{"key":"S0219720018500142BIB027","doi-asserted-by":"publisher","DOI":"10.1063\/1.3615722"},{"key":"S0219720018500142BIB028","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gku830"},{"key":"S0219720018500142BIB029","first-page":"2825","volume":"12","author":"Pedregosa F","year":"2011","journal-title":"J Mach Learn Res"},{"key":"S0219720018500142BIB031","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-60327-241-4_13"},{"key":"S0219720018500142BIB032","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1013203451"},{"key":"S0219720018500142BIB035","doi-asserted-by":"publisher","DOI":"10.1164\/ajrccm.163.supplement_1.2011109"},{"key":"S0219720018500142BIB036","doi-asserted-by":"publisher","DOI":"10.1513\/pats.200502-014AW"},{"key":"S0219720018500142BIB037","doi-asserted-by":"publisher","DOI":"10.1128\/JVI.02303-06"},{"key":"S0219720018500142BIB038","doi-asserted-by":"publisher","DOI":"10.1128\/JVI.79.14.9315-9319.2005"},{"key":"S0219720018500142BIB039","doi-asserted-by":"publisher","DOI":"10.1016\/j.virol.2013.06.019"},{"key":"S0219720018500142BIB040","doi-asserted-by":"publisher","DOI":"10.1016\/j.virol.2007.06.037"},{"key":"S0219720018500142BIB041","doi-asserted-by":"publisher","DOI":"10.1128\/JVI.79.22.14411-14420.2005"}],"container-title":["Journal of Bioinformatics and Computational 
Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720018500142","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,5,19]],"date-time":"2020-05-19T16:01:34Z","timestamp":1589904094000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720018500142"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8]]},"references-count":35,"journal-issue":{"issue":"04","published-online":{"date-parts":[[2018,10,23]]},"published-print":{"date-parts":[[2018,8]]}},"alternative-id":["10.1142\/S0219720018500142"],"URL":"https:\/\/doi.org\/10.1142\/s0219720018500142","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,8]]}}}