{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,12]],"date-time":"2025-11-12T20:15:00Z","timestamp":1762978500662},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,12,1]],"date-time":"2012-12-01T00:00:00Z","timestamp":1354320000000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>For selection and evaluation of potential biomarkers, inclusion of already published information is of utmost importance. In spite of significant advancements in text- and data-mining techniques, the vast knowledge space of biomarkers in biomedical text has remained unexplored. Existing named entity recognition approaches are not sufficiently selective for the retrieval of biomarker information from the literature. The purpose of this study was to identify textual features that enhance the effectiveness of biomarker information retrieval for different indication areas and diverse end user perspectives.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Methods<\/jats:title>\n            <jats:p>A biomarker terminology was created and further organized into six concept classes. Performance of this terminology was optimized towards balanced selectivity and specificity. The information retrieval performance using the biomarker terminology was evaluated based on various combinations of the terminology's six classes. Further validation of these results was performed on two independent corpora representing two different neurodegenerative diseases.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The current state of the biomarker terminology contains 119 entity classes supported by 1890 different synonyms. The result of information retrieval shows improved retrieval rate of informative abstracts, which is achieved by including clinical management terms and evidence of gene\/protein alterations (e.g. gene\/protein expression status or certain polymorphisms) in combination with disease and gene name recognition. When additional filtering through other classes (e.g. diagnostic or prognostic methods) is applied, the typical high number of unspecific search results is significantly reduced. The evaluation results suggest that this approach enables the automated identification of biomarker information in the literature. A demo version of the search engine SCAIView, including the biomarker retrieval, is made available to the public through <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/www.scaiview.com\/scaiview-academia.html\" ext-link-type=\"uri\">http:\/\/www.scaiview.com\/scaiview-academia.html<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The approach presented in this paper demonstrates that using a dedicated biomarker terminology for automated analysis of the scientific literature maybe helpful as an aid to finding biomarker information in text. Successful extraction of candidate biomarkers information from published resources can be considered as the first step towards developing novel hypotheses. These hypotheses will be valuable for the early decision-making in the drug discovery and development process.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1472-6947-12-148","type":"journal-article","created":{"date-parts":[[2012,12,19]],"date-time":"2012-12-19T00:15:23Z","timestamp":1355876123000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":30,"title":["Mining biomarker information in biomedical literature"],"prefix":"10.1186","volume":"12","author":[{"given":"Erfan","family":"Younesi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luca","family":"Toldo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bernd","family":"M\u00fcller","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christoph M","family":"Friedrich","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Natalia","family":"Novac","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Scheer","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Martin","family":"Hofmann-Apitius","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Juliane","family":"Fluck","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2012,12,18]]},"reference":[{"key":"597_CR1","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/j.ygeno.2008.07.006","volume":"93","author":"D Ghosh","year":"2009","unstructured":"Ghosh D, Poisson LM: Omics data and levels of evidence for biomarker discovery. Genomics. 2009, 93: 13-16. 10.1016\/j.ygeno.2008.07.006.","journal-title":"Genomics"},{"key":"597_CR2","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1067\/mcp.2001.113989","volume":"69","author":"BDW Group","year":"2001","unstructured":"Group BDW: Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther. 2001, 69: 89-95.","journal-title":"Clin Pharmacol Ther"},{"key":"597_CR3","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1093\/carcin\/21.3.517","volume":"21","author":"FP Perera","year":"2000","unstructured":"Perera FP, Weinstein IB: Molecular epidemiology: recent advances and future directions. Carcinogenesis. 2000, 21: 517-524. 10.1093\/carcin\/21.3.517.","journal-title":"Carcinogenesis"},{"key":"597_CR4","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1602\/neurorx.1.2.182","volume":"1","author":"R Mayeux","year":"2004","unstructured":"Mayeux R: Biomarkers: potential uses and limitations. NeuroRx. 2004, 1: 182-188. 10.1602\/neurorx.1.2.182.","journal-title":"NeuroRx"},{"issue":"Suppl 1","key":"597_CR5","doi-asserted-by":"publisher","first-page":"S315","DOI":"10.1016\/j.toxlet.2006.07.320","volume":"164","author":"J Timbrell","year":"2006","unstructured":"Timbrell J: Types of biomarker and challenges for new biomarkers. Toxicol Lett. 2006, 164 (Suppl 1): S315-","journal-title":"Toxicol Lett"},{"key":"597_CR6","doi-asserted-by":"publisher","first-page":"361","DOI":"10.1038\/sj.clpt.6100471","volume":"83","author":"CA Altar","year":"2008","unstructured":"Altar CA: The biomarkers consortium: on the critical path of drug discovery. Clin Pharmacol Ther. 2008, 83: 361-364. 10.1038\/sj.clpt.6100471.","journal-title":"Clin Pharmacol Ther"},{"key":"597_CR7","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1146\/annurev.pharmtox.48.113006.094611","volume":"48","author":"JA Wagner","year":"2008","unstructured":"Wagner JA: Strategic approach to fit-for-purpose biomarkers in drug development. Annu Rev Pharmacol Toxicol. 2008, 48: 631-651. 10.1146\/annurev.pharmtox.48.113006.094611.","journal-title":"Annu Rev Pharmacol Toxicol"},{"key":"597_CR8","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1016\/j.taap.2009.12.015","volume":"243","author":"E Marrer","year":"2010","unstructured":"Marrer E, Dieterle F: Impact of biomarker development on drug safety assessment. Toxicol Appl Pharmacol. 2010, 243: 167-179. 10.1016\/j.taap.2009.12.015.","journal-title":"Toxicol Appl Pharmacol"},{"key":"597_CR9","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.vascn.2007.10.002","volume":"57","author":"R Bakhtiar","year":"2008","unstructured":"Bakhtiar R: Biomarkers in drug discovery and development. J Pharmacol Toxicol Methods. 2008, 57: 85-91. 10.1016\/j.vascn.2007.10.002.","journal-title":"J Pharmacol Toxicol Methods"},{"key":"597_CR10","doi-asserted-by":"publisher","first-page":"253","DOI":"10.1038\/nrd3417","volume":"10","author":"H Hurko","year":"2011","unstructured":"Hurko H, Jones GK: Valuation of biomarkers. Nat Rev Drug Discov. 2011, 10: 253-254. 10.1038\/nrd3417.","journal-title":"Nat Rev Drug Discov"},{"issue":"Suppl 5","key":"597_CR11","doi-asserted-by":"publisher","first-page":"O5","DOI":"10.1186\/1471-2105-11-S5-O5","volume":"11","author":"M Ongenaert","year":"2010","unstructured":"Ongenaert M, Dehaspe L: Integrating automated literature searches and text mining in biomarker discovery. BMC Bioinforma. 2010, 11 (Suppl 5): O5-10.1186\/1471-2105-11-S5-O5.","journal-title":"BMC Bioinforma"},{"key":"597_CR12","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1186\/gb-2005-6-7-224","volume":"6","author":"M Krallinger","year":"2005","unstructured":"Krallinger M, Valencia A: Text-mining and information-retrieval services for molecular biology. Genome Biol. 2005, 6: 224-10.1186\/gb-2005-6-7-224.","journal-title":"Genome Biol"},{"key":"597_CR13","doi-asserted-by":"publisher","first-page":"e1000046","DOI":"10.1371\/journal.pmed.1000046","volume":"6","author":"HC Harsha","year":"2009","unstructured":"Harsha HC, Kandasamy K, Ranganathan P, Rani S, Ramabadran S, Gollapudi S, Balakrishnan L, Dwivedi SB, Telikicherla D, Selvan LDN, Goel R, Mathivanan S, Marimuthu R, DeCaprio JA, Srivastava S, Hanash SM, Htuban RH, Pandey A: A compendium of potential biomarkers of pancreatic cancer. PLoS Med. 2009, 6: e1000046-10.1371\/journal.pmed.1000046.","journal-title":"PLoS Med"},{"key":"597_CR14","unstructured":"BioCreAtIvE workshop. [http:\/\/www.biocreative.org]"},{"issue":"Suppl 1","key":"597_CR15","first-page":"S1","volume":"6","author":"C Blaschke","year":"2004","unstructured":"Blaschke C, Hirschman L, Valencia A, Yeh A: A critical assessment of text mining methods in molecular biology. BMC Bioinforma. 2004, 6 (Suppl 1): S1-S23.","journal-title":"BMC Bioinforma"},{"issue":"Suppl 2","key":"597_CR16","doi-asserted-by":"publisher","first-page":"S1","DOI":"10.1186\/gb-2008-9-s2-s1","volume":"9","author":"L Hirschman","year":"2008","unstructured":"Hirschman L, Krallinger M, Wilbur J, Valencia A: The BioCreAtIvE II - critical assessment for information extraction in biology challenge. Genome Biol. 2008, 9 (Suppl 2): S1-S14. 10.1186\/gb-2008-9-s2-s1.","journal-title":"Genome Biol"},{"key":"597_CR17","doi-asserted-by":"publisher","first-page":"e8010","DOI":"10.1371\/journal.pone.0008010","volume":"4","author":"JL Pennings","year":"2009","unstructured":"Pennings JL, Koster MP, Rodenburg W, Schielen PC, de Vries A: Discovery of novel serum biomarkers for prenatal down syndrome screening by integrative data mining. PLoS One. 2009, 4: e8010-10.1371\/journal.pone.0008010.","journal-title":"PLoS One"},{"key":"597_CR18","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1016\/j.compbiolchem.2006.09.002","volume":"30","author":"X Deng","year":"2006","unstructured":"Deng X, Geng H, Bastola DR, Ali HH: Link test\u2013a statistical method for finding prostate cancer biomarkers. Comput Biol Chem. 2006, 30: 425-433. 10.1016\/j.compbiolchem.2006.09.002.","journal-title":"Comput Biol Chem"},{"key":"597_CR19","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1186\/1471-2105-9-207","volume":"9","author":"M Bundschus","year":"2008","unstructured":"Bundschus M, Dejori M, Stetter M, Tresp V, Kriegel HP: Extraction of semantic biomedical relations from text using conditional random fields. BMC Bioinforma. 2008, 9: 207-10.1186\/1471-2105-9-207.","journal-title":"BMC Bioinforma"},{"issue":"Suppl 2","key":"597_CR20","doi-asserted-by":"publisher","first-page":"S9","DOI":"10.1186\/1471-2105-10-S2-S9","volume":"10","author":"PL Elkin","year":"2009","unstructured":"Elkin PL, Tuttle MS, Trusko BE, Brown HB: BioProspecting: novel marker discovery obtained by mining the bibleome. BMC Bioinforma. 2009, 10 (Suppl 2): S9-10.1186\/1471-2105-10-S2-S9.","journal-title":"BMC Bioinforma"},{"key":"597_CR21","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1145\/1741906.1741927","volume-title":"Proceedings of the International Conference and Workshop on Emerging Trends in Technology: 26\u201327 February 2010","author":"MT Islam","year":"2010","unstructured":"Islam MT, Shaikh M, Nayak A, Ranganathan S: Biomarker Information Extraction Tool (BIET) development using natural language processing and machine learning. Proceedings of the International Conference and Workshop on Emerging Trends in Technology: 26\u201327 February 2010. Edited by: Mishra BK, Kekre HB, Thampi GT, Gharpure P, Mukherji A, Lohani RB. 2010, ICWET, Mumbai, 121-126."},{"key":"597_CR22","first-page":"165","volume-title":"Proceedings of HealthGrid 2008; 2\u20134 June 2008","author":"CM Friedrich","year":"2008","unstructured":"Friedrich CM, Dach H, Gattermayer T, Engelbrecht G, Benkner S, Hofmann-Apitius M: @neuLink: a service-oriented application for biomedical knowledge discovery. Proceedings of HealthGrid 2008; 2\u20134 June 2008. Edited by: Solomonides T, Silverstein JC, Saltz J, Legre Y, Kratz M, Foster I, Breton V, Beck JR. 2008, IOS Press, Chicago, 165-172."},{"key":"597_CR23","doi-asserted-by":"publisher","first-page":"1365","DOI":"10.1109\/TITB.2010.2049268","volume":"14","author":"S Benkner","year":"2010","unstructured":"Benkner S, Arbona A, Berti G, Chiarini A, Dunlop R, Engelbrecht G, Frangi AF, Friedrich CM, Hanser S, Hasselmeyer P, Hose RD, Iavindrasana J, K\u00f6hler M, Iacono LL, Lonsdale G, Meyer R, Moore B, Rajasekaran H, Summers PE, W\u00f6hrer A, Woods S: @neurIST: infrastructure for advanced disease management through integration of heterogeneous data, computing, and complex processing services. IEEE Trans Inf Technol Biomed. 2010, 14: 1365-1377.","journal-title":"IEEE Trans Inf Technol Biomed"},{"key":"597_CR24","first-page":"403","volume":"8","author":"D Hanisch","year":"2003","unstructured":"Hanisch D, Fluck J, Mevissen HT, Zimmer R: Playing biology's name game: identifying protein names in scientific text. Pac Symp Biocomput. 2003, 8: 403-14.","journal-title":"Pac Symp Biocomput"},{"issue":"Suppl 1","key":"597_CR25","doi-asserted-by":"publisher","first-page":"S14","DOI":"10.1186\/1471-2105-6-S1-S14","volume":"6","author":"D Hanisch","year":"2005","unstructured":"Hanisch D, Fundel K, Mevissen HT, Zimmer R, Fluck J: ProMiner: rule based protein and gene entity recognition. BMC Bioinforma. 2005, 6 (Suppl 1): S14-10.1186\/1471-2105-6-S1-S14.","journal-title":"BMC Bioinforma"},{"key":"597_CR26","doi-asserted-by":"publisher","first-page":"S3","DOI":"10.1186\/gb-2008-9-s2-s3","volume":"9","author":"AA Morgan","year":"2008","unstructured":"Morgan AA, Lu Z, Wang X, Cohen AM, Fluck J, Ruch P, Divoli A, Fundel K, Leaman R, Hakenberg J, Sun C, Liu HH, Torres R, Krauthammer M, Lau WW, Liu H, Hsu CN, Schuemie M, Cohen KB, Hirschman L: Overview of BioCreAtIvE II gene normalization. Genome Biol. 2008, 9: S3-","journal-title":"Genome Biol"},{"key":"597_CR27","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1214\/aoms\/1177729694","volume":"22","author":"S Kullback","year":"1951","unstructured":"Kullback S, Leibler R: On information and sufficiency. Ann Math Stat. 1951, 22: 79-86. 10.1214\/aoms\/1177729694.","journal-title":"Ann Math Stat"},{"key":"597_CR28","unstructured":"B\u00fcttcher S, Clarke CLA, Cormack GV: Information retrieval: implementing and evaluating search engines. Cambridge, Mass. MIT Press. 296-298."},{"key":"597_CR29","first-page":"795","volume":"41","author":"NC Smeeton","year":"1985","unstructured":"Smeeton NC: Early history of the kappa statistic. Biometrics. 1985, 41: 795-","journal-title":"Biometrics"},{"key":"597_CR30","unstructured":"BIOBASE BKL Proteome database. [http:\/\/www.biobaseinternational.com\/index.php?id=proteomedatabases]"},{"key":"597_CR31","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Gloub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005, 102: 15545-15550. 10.1073\/pnas.0506580102.","journal-title":"Proc Natl Acad Sci"},{"key":"597_CR32","doi-asserted-by":"publisher","first-page":"345","DOI":"10.1007\/978-3-540-31865-1_25","volume":"3408","author":"C Goutte","year":"2005","unstructured":"Goutte C, Gaussier E: A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. Advances in Information Retrieval. Lecture Notes in Computer Science. 2005, 3408: 345-59.","journal-title":"Advances in Information Retrieval. Lecture Notes in Computer Science"},{"key":"597_CR33","first-page":"251","volume-title":"Lung Cancer, Methods in Molecular Medicine","author":"E Szabo","year":"2003","unstructured":"Szabo E: MUC1 expression in lung cancer. Lung Cancer, Methods in Molecular Medicine. Edited by: Driscoll B. 2003, Humana Press, New Jersey, 251-258. Volume 74, 3","edition":"3"},{"key":"597_CR34","doi-asserted-by":"publisher","first-page":"3237","DOI":"10.1158\/1078-0432.CCR-03-0503","volume":"10","author":"RD Petty","year":"2004","unstructured":"Petty RD, Nicolson MC, Kerr KM, Collie-Duguid E, Murray GI: Gene expression profiling in non-small cell lung cancer, from molecular mechanisms to clinical application. Clin Cancer Res. 2004, 10: 3237-10.1158\/1078-0432.CCR-03-0503.","journal-title":"Clin Cancer Res"}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1472-6947-12-148.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1472-6947-12-148\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1472-6947-12-148","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1472-6947-12-148.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T21:29:50Z","timestamp":1630531790000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/1472-6947-12-148"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,12]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["597"],"URL":"https:\/\/doi.org\/10.1186\/1472-6947-12-148","relation":{},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,12]]},"assertion":[{"value":"3 February 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 December 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 December 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"148"}}