{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T05:54:33Z","timestamp":1768974873504,"version":"3.49.0"},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T00:00:00Z","timestamp":1651708800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T00:00:00Z","timestamp":1651708800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>To reduce drug side effects and enhance their therapeutic effect compared with single drugs, drug combination research, combining two or more drugs, is highly important. Conducting in-vivo and in-vitro experiments on a vast number of drug combinations incurs astronomical time and cost. To reduce the number of combinations, researchers classify whether drug combinations are synergistic through in-silico methods. Since unstructured data, such as biomedical documents, include experimental types, methods, and results, it can be beneficial extracting features from documents to predict anti-cancer drug combination synergy. However, few studies predict anti-cancer drug combination synergy using document-extracted features.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We present a novel approach for anti-cancer drug combination synergy prediction using document-based feature extraction. Our approach is divided into two steps. First, we extracted documents containing validated anti-cancer drug combinations and cell lines. Drug and cell line synonyms in the extracted documents were converted into representative words, and the documents were preprocessed by tokenization, lemmatization, and stopword removal. Second, the drug and cell line features were extracted from the preprocessed documents, and training data were constructed by feature concatenation. A prediction model based on deep and machine learning was created using the training data. The use of our features yielded higher results compared to the majority of published studies.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Using our prediction model, researchers can save time and cost on new anti-cancer drug combination discoveries. Additionally, since our feature extraction method does not require structuring of unstructured data, new data can be immediately applied without any data scalability issues.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-04698-8","type":"journal-article","created":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T10:07:48Z","timestamp":1651745268000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction"],"prefix":"10.1186","volume":"23","author":[{"given":"Yongsun","family":"Shim","sequence":"first","affiliation":[]},{"given":"Munhwan","family":"Lee","sequence":"additional","affiliation":[]},{"given":"Pil-Jong","family":"Kim","sequence":"additional","affiliation":[]},{"given":"Hong-Gee","family":"Kim","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,5,5]]},"reference":[{"key":"4698_CR1","doi-asserted-by":"publisher","first-page":"7","DOI":"10.3322\/caac.21551","volume":"69","author":"RL Siegel","year":"2019","unstructured":"Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. 2019;69:7\u201334.","journal-title":"CA Cancer J Clin"},{"key":"4698_CR2","first-page":"48","volume":"31","author":"R Tonkens","year":"2005","unstructured":"Tonkens R. An overview of the drug development process. Phys Exec. 2005;31:48\u201352.","journal-title":"Phys Exec"},{"key":"4698_CR3","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1093\/bib\/bbv020","volume":"17","author":"J Li","year":"2016","unstructured":"Li J, Zheng S, Chen B, Butte AJ, Swamidass SJ, Lu Z. A survey of current trends in computational drug repositioning. Brief Bioinform. 2016;17:2\u201312.","journal-title":"Brief Bioinform"},{"issue":"7","key":"4698_CR4","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1016\/j.drudis.2016.05.015","volume":"21","author":"W Sun","year":"2016","unstructured":"Sun W, Sanderson PE, Zheng W. Drug combination therapy increases successful drug repositioning. Drug Discov Today. 2016;21(7):1189\u201395.","journal-title":"Drug Discov Today"},{"key":"4698_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-018-07882-8","volume":"10","author":"F Cheng","year":"2019","unstructured":"Cheng F, Kov\u00e1cs IA, Barab\u00e1si A-L. Network-based prediction of drug combinations. Nat Commun. 2019;10:1\u201311.","journal-title":"Nat Commun"},{"key":"4698_CR6","first-page":"143","volume-title":"Toxicity and drug testing","author":"H-G Breitinger","year":"2012","unstructured":"Breitinger H-G. Drug synergy-mechanisms and methods of analysis. In: Acree B, editor. Toxicity and drug testing. InTech; 2012. p. 143\u201366."},{"key":"4698_CR7","doi-asserted-by":"publisher","first-page":"1480","DOI":"10.1126\/science.1254721","volume":"346","author":"AS Crystal","year":"2014","unstructured":"Crystal AS, Shaw AT, Sequist LV, Friboulet L, Niederst MJ, Lockerman EL, Frias RL, Gainor JF, Amzallag A, Greninger P, Dana Lee AK, Gomez-Caraballo M, Elamine L, Howe E, Hur W, Lifshits E, Robinson HE, Katayama R, Faber AC, Awad MM, Ramaswamy S, Mino-Kenudson M, Iafrate AJ, Benes CH, Engelman JA. Patient-derived models of acquired resistance can identify effective drug combinations for cancer. Science. 2014;346:1480\u20136.","journal-title":"Science"},{"key":"4698_CR8","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1038\/sj.bjp.0707305","volume":"152","author":"S Ekins","year":"2007","unstructured":"Ekins S, Mestres J, Testa B. In silico pharmacology for drug discovery: methods for virtual ligand screening and profiling. Br J Pharmacol. 2007;152:9\u201320.","journal-title":"Br J Pharmacol"},{"key":"4698_CR9","doi-asserted-by":"publisher","first-page":"1538","DOI":"10.1093\/bioinformatics\/btx806","volume":"34","author":"K Preuer","year":"2018","unstructured":"Preuer K, Lewis RPI, Hochreiter S, Bender A, Bulusu KC, Klambauer G. Deepsynergy: predicting anti-cancer drug synergy with deep learning. Bioinformatics. 2018;34:1538\u201346.","journal-title":"Bioinformatics"},{"key":"4698_CR10","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/s12859-018-2509-3","volume":"19","author":"F Xia","year":"2018","unstructured":"Xia F, Shukla M, Brettin T, Garcia-Cardona C, Cohn J, Allen JE, Maslov S, Holbeck SL, Doroshow JH, Evrard YA, Stahlberg EA, Stevens RL. Predicting tumor cell line response to drug pairs with deep learning. BMC Bioinform. 2018;19:71\u20139.","journal-title":"BMC Bioinform"},{"key":"4698_CR11","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1093\/jamia\/ocaa212","volume":"28","author":"Y Kim","year":"2021","unstructured":"Kim Y, Zheng S, Tang J, Zheng WJ, Li Z, Jiang X. Anticancer drug synergy prediction in understudied tissues using transfer learning. J Am Med Inform Assoc. 2021;28:42\u201351.","journal-title":"J Am Med Inform Assoc"},{"key":"4698_CR12","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1007\/978-1-0716-0849-4_12","volume-title":"Translational bioinformatics for therapeutic development","author":"T Zhang","year":"2021","unstructured":"Zhang T, Zhang L, Payne PRO, Li F. Synergistic drug combination prediction by integrating multiomics data in deep learning models. In: Markowitz J, editor. Translational bioinformatics for therapeutic development. Springer; 2021. p. 223\u201338."},{"key":"4698_CR13","doi-asserted-by":"crossref","unstructured":"Janizek JD, Celik S, Lee S-I. Explainable machine learning prediction of synergistic drug combinations for precision cancer medicine. bioRxiv. 2018.","DOI":"10.1101\/331769"},{"key":"4698_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-019-45236-6","volume":"9","author":"R Celebi","year":"2019","unstructured":"Celebi R, Bear Don\u2019t Walk O, Movva R, Alpsoy S, Dumontier M. In-silico prediction of synergistic anti-cancer drug combinations using multi-omics data. Sci Rep. 2019;9:1\u201310.","journal-title":"Sci Rep"},{"key":"4698_CR15","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1186\/s12918-018-0546-1","volume":"12","author":"M Jeon","year":"2018","unstructured":"Jeon M, Kim S, Park S, Lee H, Kang J. In silico drug combination discovery for personalized cancer therapy. BMC Syst Biol. 2018;12:59\u201367.","journal-title":"BMC Syst Biol"},{"key":"4698_CR16","doi-asserted-by":"publisher","first-page":"278","DOI":"10.3390\/biology9090278","volume":"9","author":"J Li","year":"2020","unstructured":"Li J, Huo Y, Wu X, Liu E, Zeng Z, Tian Z, Fan K, Stover D, Cheng L, Li L. Essentiality and transcriptome-enriched pathway scores predict drug-combination synergy. Biology. 2020;9:278.","journal-title":"Biology"},{"key":"4698_CR17","doi-asserted-by":"publisher","first-page":"200","DOI":"10.1016\/j.jbi.2012.10.007","volume":"46","author":"F Zhu","year":"2013","unstructured":"Zhu F, Patumcharoenpol P, Zhang C, Chan YYJ, Meechai A, Vongsangnak W, Shen B. Biomedical text mining and its applications in cancer research. J Biomed Inform. 2013;46:200\u201311.","journal-title":"J Biomed Inform"},{"key":"4698_CR18","doi-asserted-by":"publisher","first-page":"3564","DOI":"10.1158\/0008-5472.CAN-17-0489","volume":"77","author":"SL Holbeck","year":"2017","unstructured":"Holbeck SL, Camalier R, Crowell JA, Govindharajulu JP, Hollingshead M, Anderson LW, Polley E, Rubinstein L, Srivastava A, Wilsker D, Collins JM, Doroshow JH. The national cancer institute almanac: a comprehensive screening resource for the detection of anticancer drug pairs with enhanced therapeutic activity. Can Res. 2017;77:3564\u201376.","journal-title":"Can Res"},{"key":"4698_CR19","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1111\/j.1744-7348.1939.tb06990.x","volume":"26","author":"CI Bliss","year":"1939","unstructured":"Bliss CI. The toxicity of poisons applied jointly 1. Ann Appl Biol. 1939;26:585\u2013615.","journal-title":"Ann Appl Biol"},{"key":"4698_CR20","doi-asserted-by":"publisher","first-page":"1388","DOI":"10.1093\/nar\/gkaa971","volume":"49","author":"S Kim","year":"2021","unstructured":"Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B, Zaslavsky L, Zhang J, Bolton EE. Pubchem in 2021: new data content and improved web interfaces. Nucleic Acids Res. 2021;49:1388\u201395.","journal-title":"Nucleic Acids Res"},{"key":"4698_CR21","doi-asserted-by":"publisher","first-page":"25","DOI":"10.7171\/jbt.18-2902-002","volume":"29","author":"A Bairoch","year":"2018","unstructured":"Bairoch A. The cellosaurus, a cell-line knowledge resource. J Biomol Tech. 2018;29:25\u201338.","journal-title":"J Biomol Tech"},{"key":"4698_CR22","unstructured":"Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)."},{"key":"4698_CR23","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","volume":"61","author":"J Schmidhuber","year":"2015","unstructured":"Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw. 2015;61:85\u2013117.","journal-title":"Neural Netw"},{"key":"4698_CR24","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin CE. Xgboost: a scalable tree boosting system. In: Krishnapuram B, editor. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 2016. p. 785\u201394.","DOI":"10.1145\/2939672.2939785"},{"key":"4698_CR25","first-page":"3","volume-title":"Machine learning","author":"P Geurts","year":"2006","unstructured":"Geurts P, Ernst D, Wehenkel L. Extremely randomized trees. In: F\u00fcrnkranz J, editor. Machine learning. Springer; 2006. p. 3\u201342."},{"key":"4698_CR26","doi-asserted-by":"publisher","first-page":"352","DOI":"10.1016\/S1532-0464(03)00034-0","volume":"35","author":"S Dreiseitl","year":"2002","unstructured":"Dreiseitl S, Ohno-Machado L. Logistic regression and artificial neural network classification models: a methodology review. J Biomed Inform. 2002;35:352\u20139.","journal-title":"J Biomed Inform"},{"key":"4698_CR27","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/bib\/bbaa398","volume":"22","author":"K Fan","year":"2021","unstructured":"Fan K, Cheng L, Li L. Artificial intelligence and machine learning methods in predicting anti-cancer drug combination effects. Brief Bioinform. 2021;22:1\u201312.","journal-title":"Brief Bioinform"},{"key":"4698_CR28","doi-asserted-by":"publisher","first-page":"1083","DOI":"10.1093\/nar\/gkaa968","volume":"49","author":"A Luna","year":"2021","unstructured":"Luna A, Elloumi F, Varma S, Wang Y, Rajapakse VN, Aladjem MI, Robert J, Sander C, Pommier Y, Reinhold WC. Cellminer cross-database (CellMinerCDB) version 12: Exploration of patient-derived cancer cell line pharmacogenomics. Nucleic Acids Res. 2021;49:1083\u201393.","journal-title":"Nucleic Acids Res"},{"key":"4698_CR29","doi-asserted-by":"publisher","first-page":"901","DOI":"10.1093\/nar\/gkm958","volume":"36","author":"DS Wishart","year":"2008","unstructured":"Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M. Drugbank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res. 2008;36:901\u20136.","journal-title":"Nucleic Acids Res"},{"key":"4698_CR30","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1093\/nar\/gkz337","volume":"47","author":"B Zagidullin","year":"2019","unstructured":"Zagidullin B, Aldahdooh J, Zheng S, Wang W, Wang Y, Saad J, Malyutina A, Jafari M, Tanoli Z, Pessia A, Tang J. Drugcomb: an integrative cancer drug combination data portal. Nucleic Acids Res. 2019;47:43\u201351.","journal-title":"Nucleic Acids Res"},{"key":"4698_CR31","doi-asserted-by":"crossref","unstructured":"Menden MP, Wang D, Mason MJ, Szalai B, Bulusu KC, Guan Y, Yu T, Kang J, Jeon M, Wolfinger R, Nguyen T, Zaslavskiy M, Consortium A-SDCD, Jang IS, Ghazoui Z, Ahsen ME, Vogel R, Neto EC, Norman T, Tang EKY, Garnett MJ, Veroli GYD, Fawell S, Stolovitzky G, Guinney J, Dry JR, Saez-Rodriguez J. Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen. Nature Communications 2019;10:1\u201317.","DOI":"10.1038\/s41467-019-09799-2"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04698-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04698-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04698-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T21:03:53Z","timestamp":1651784633000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04698-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,5,5]]},"references-count":31,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4698"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04698-8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,5]]},"assertion":[{"value":"16 December 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 April 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 May 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"163"}}