{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T02:35:30Z","timestamp":1768876530672,"version":"3.49.0"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,11,15]],"date-time":"2016-11-15T00:00:00Z","timestamp":1479168000000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003246","name":"Nederlandse Organisatie voor Wetenschappelijk Onderzoek","doi-asserted-by":"publisher","award":["2010-2016"],"award-info":[{"award-number":["2010-2016"]}],"id":[{"id":"10.13039\/501100003246","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2016,12]]},"DOI":"10.1186\/s40537-016-0057-0","type":"journal-article","created":{"date-parts":[[2016,11,15]],"date-time":"2016-11-15T11:41:27Z","timestamp":1479210087000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["Understanding big data themes from scientific biomedical literature through topic modeling"],"prefix":"10.1186","volume":"3","author":[{"given":"Allard J.","family":"van Altena","sequence":"first","affiliation":[]},{"given":"Perry D.","family":"Moerland","sequence":"additional","affiliation":[]},{"given":"Aeilko H.","family":"Zwinderman","sequence":"additional","affiliation":[]},{"given":"S\u00edlvia D.","family":"Olabarriaga","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,11,15]]},"reference":[{"key":"57_CR1","volume-title":"Hype cycle for emerging technologies, 2011","author":"J Fenn","year":"2011","unstructured":"Fenn J, LeHong H. Hype cycle for emerging technologies, 2011. Stamford: Gartner; 2011."},{"key":"57_CR2","unstructured":"Google: Google Trends. https:\/\/www.google.com\/trends\/explore#q=big+data . Accessed 28 Mar 2016."},{"issue":"4","key":"57_CR3","doi-asserted-by":"publisher","first-page":"1193","DOI":"10.1109\/JBHI.2015.2450362","volume":"19","author":"J Andreu-Perez","year":"2015","unstructured":"Andreu-Perez J, Poon CC, Merrifield RD, Wong ST, Yang G-Z. Big data for health. IEEE J Biomed Health Inform. 2015;19(4):1193\u2013208. doi: 10.1109\/JBHI.2015.2450362 .","journal-title":"IEEE J Biomed Health Inform"},{"key":"57_CR4","doi-asserted-by":"crossref","unstructured":"Gartner: Gartner Acquisitions. http:\/\/www.gartner.com\/technology\/about\/acquisition_history.jsp . Accessed 27 Mar 2016.","DOI":"10.1007\/978-3-319-40893-4_3"},{"key":"57_CR5","first-page":"70","volume":"6","author":"D Laney","year":"2001","unstructured":"Laney D. 3D data management: controlling data volume, velocity and variety. META Group Res Note. 2001;6:70.","journal-title":"META Group Res Note"},{"key":"57_CR6","volume-title":"Oracle: Big data for the enterprise","author":"JP Dijcks","year":"2012","unstructured":"Dijcks JP. Oracle: Big data for the enterprise. Redwood City: Oracle; 2012."},{"key":"57_CR7","unstructured":"IBM: IBM - What Is big data? Accessed through Google cache. https:\/\/www.ibm.com\/software\/data\/bigdata\/what-is-big-data.html . Accessed 17 Dec 2015."},{"key":"57_CR8","unstructured":"Dutcher J. What is big data? https:\/\/datascience.berkeley.edu\/what-is-big-data\/ . Accessed 12 Sept 2016."},{"issue":"8","key":"57_CR9","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1145\/1536616.1536632","volume":"52","author":"A Jacobs","year":"2009","unstructured":"Jacobs A. The pathologies of big data. Commun ACM. 2009;52(8):36\u201344. doi: 10.1145\/1536616.1536632 .","journal-title":"Commun ACM"},{"issue":"9","key":"57_CR10","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1177\/0022034515587863","volume":"94","author":"T DeRouen","year":"2015","unstructured":"DeRouen T. Promises and pitfalls in the use of \u201cBig Data\u201d for clinical research. J Dent Res. 2015;94(9):107\u20139. doi: 10.1177\/0022034515587863 .","journal-title":"J Dent Res"},{"key":"57_CR11","volume-title":"Understanding Big data: analytics for enterprise class hadoop and streaming data","author":"P Zikopoulos","year":"2011","unstructured":"Zikopoulos P, Eaton C. Understanding Big data: analytics for enterprise class hadoop and streaming data. New York: McGraw-Hill Osborne Media; 2011."},{"key":"57_CR12","unstructured":"Levi M. Kleren van de keizer [The emperor\u2019s clothes]. Medisch Contact; 2015."},{"issue":"2","key":"57_CR13","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1007\/s11036-013-0489-0","volume":"19","author":"M Chen","year":"2014","unstructured":"Chen M, Mao S, Liu Y. Big data: a survey. Mobile Netw Appl. 2014;19(2):171\u2013209. doi: 10.1007\/s11036-013-0489-0 .","journal-title":"Mobile Netw Appl"},{"issue":"3","key":"57_CR14","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1108\/LR-06-2015-0061","volume":"65","author":"A Mauro De","year":"2016","unstructured":"De Mauro A, Greco M, Grimaldi M. A formal definition of big data based on its essential features. Lib Rev. 2016;65(3):122\u201335. doi: 10.1108\/LR-06-2015-0061 .","journal-title":"Lib Rev"},{"key":"57_CR15","unstructured":"Ward JS, Barker A. Undefined by data: a survey of big data definitions; 2013."},{"key":"57_CR16","doi-asserted-by":"crossref","unstructured":"Hansmann T, Niemeyer P. Big data - characterizing an emerging research field using topic models. In: Proceedings of the 2014 IEEE\/WIC\/ACM International joint conferences on web intelligence (WI) and Intelligent Agent Technologies (IAT). Vol 1. WI-IAT \u201914. Washington, DC: IEEE Computer Society; 2014. p. 43\u201351. doi:10.1109\/WI-IAT.2014.15","DOI":"10.1109\/WI-IAT.2014.15"},{"key":"57_CR17","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3:993\u20131022.","journal-title":"J Mach Learn Res"},{"issue":"4","key":"57_CR18","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1145\/2133806.2133826","volume":"55","author":"DM Blei","year":"2012","unstructured":"Blei DM. Probabilistic topic models. Commun ACM. 2012;55(4):77\u201384. doi: 10.1145\/2133806.2133826 .","journal-title":"Commun ACM"},{"issue":"7","key":"57_CR19","first-page":"424","volume":"427","author":"M Steyvers","year":"2007","unstructured":"Steyvers M, Griffiths T. Probabilistic topic models. Handbook Latent Semant Anal. 2007;427(7):424\u201340.","journal-title":"Handbook Latent Semant Anal"},{"key":"57_CR20","unstructured":"R Core Team R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2015. https:\/\/www.R-project.org\/ ."},{"issue":"5","key":"57_CR21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v025.i05","volume":"25","author":"I Feinerer","year":"2008","unstructured":"Feinerer I, Hornik K, Meyer D. Text mining infrastructure in r. J Stat Softw. 2008;25(5):1\u201354.","journal-title":"J Stat Softw"},{"key":"57_CR22","doi-asserted-by":"crossref","unstructured":"Benoit K, Nulty P. Quanteda: quantitative analysis of textual data. 2015. R package version 0.8.5-10. http:\/\/github.com\/kbenoit\/quanteda .","DOI":"10.32614\/CRAN.package.quanteda"},{"key":"57_CR23","first-page":"361","volume":"5","author":"DD Lewis","year":"2004","unstructured":"Lewis DD, Yang Y, Rose TG, Li F. Rcv1: A new benchmark collection for text categorization research. J Mach Learn Res. 2004;5:361\u201397.","journal-title":"J Mach Learn Res"},{"key":"57_CR24","volume-title":"The SMART retrieval system-experiments in automatic document processing","author":"G Salton","year":"1971","unstructured":"Salton G. The SMART retrieval system-experiments in automatic document processing. Upper Saddle River: Prentice-Hall Inc; 1971."},{"key":"57_CR25","unstructured":"Lewis DD, Yang Y, Rose TG, Li F. http:\/\/jmlr.csail.mit.edu\/papers\/volume5\/lewis04a\/a11-smart-stop-list\/english.stop . Accessed 2015-11-20"},{"issue":"40","key":"57_CR26","first-page":"1","volume":"13","author":"B Gr\u00fcn","year":"2011","unstructured":"Gr\u00fcn B, Hornik K. Topicmodels: an R package for fitting topic models. J Stat Softw. 2011;13(40):1\u201330.","journal-title":"J Stat Softw"},{"key":"57_CR27","unstructured":"Chuang J, Gupta S, Manning C, Heer J. Topic model diagnostics: assessing domain relevance via topical alignment. In: Proceedings of the 30th International Conference on machine learning (ICML-13); 2013. p. 612\u201320."},{"issue":"2","key":"57_CR28","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6(2):461\u20134. doi: 10.1214\/aos\/1176344136 .","journal-title":"Ann Stat"},{"key":"57_CR29","doi-asserted-by":"crossref","unstructured":"Akaike H. In: Parzen E, Tanabe K, Kitagawa G, editors. Information theory and an extension of the maximum likelihood principle. New York: Springer; 1998. p. 199\u2013213. doi:10.1007\/978-1-4612-1694-0_15","DOI":"10.1007\/978-1-4612-1694-0_15"},{"key":"57_CR30","doi-asserted-by":"crossref","unstructured":"Sievert C, Shirley KE. LDAvis: a method for visualizing and interpreting topics. In: Proceedings of the Workshop on interactive language learning, visualization, and interfaces; 2014. p. 63\u201370.","DOI":"10.3115\/v1\/W14-3110"},{"key":"57_CR31","volume-title":"Human behavior and the principle of least effort: an introduction to human ecology","author":"GK Zipf","year":"1949","unstructured":"Zipf GK. Human behavior and the principle of least effort: an introduction to human ecology. Indianapolis: Addison-Wesley Press; 1949."},{"key":"57_CR32","unstructured":"Schroeck M, Shockley R, Smart J, Romero-Morales D, Tufano P. Analytics: the real-world use of big data. IBM Global Business Services. 2012: 1\u201320."},{"issue":"4","key":"57_CR33","doi-asserted-by":"publisher","first-page":"70","DOI":"10.1145\/2627534.2627557","volume":"41","author":"S Suthaharan","year":"2014","unstructured":"Suthaharan S. Big data classification: problems and challenges in network intrusion prediction with machine learning. SIGMETRICS Perform Eval Rev. 2014;41(4):70\u20133. doi: 10.1145\/2627534.2627557 .","journal-title":"SIGMETRICS Perform Eval Rev"},{"key":"57_CR34","doi-asserted-by":"crossref","unstructured":"Chang L. NIST big data interoperability framework. vol 1. Definitions. doi:10.6028\/NIST.SP.1500-1","DOI":"10.6028\/NIST.SP.1500-1"},{"issue":"3","key":"57_CR35","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1145\/2168931.2168943","volume":"19","author":"D Fisher","year":"2012","unstructured":"Fisher D, DeLine R, Czerwinski M, Drucker S. Interactions with big data analytics. Interactions. 2012;19(3):50\u20139. doi: 10.1145\/2168931.2168943 .","journal-title":"Interactions"},{"issue":"4","key":"57_CR36","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.2307\/41703503","volume":"36","author":"H Chen","year":"2012","unstructured":"Chen H, Chiang RH, Storey VC. Business intelligence and analytics: from big data to big impact. MIS Q. 2012;36(4):1165\u201388.","journal-title":"MIS Q"},{"issue":"1","key":"57_CR37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1089\/big.2012.1503","volume":"1","author":"E Dumbill","year":"2013","unstructured":"Dumbill E. Making sense of big data. Big Data. 2013;1(1):1\u20132.","journal-title":"Big Data"},{"issue":"5","key":"57_CR38","doi-asserted-by":"publisher","first-page":"662","DOI":"10.1080\/1369118X.2012.678878","volume":"15","author":"D Boyd","year":"2012","unstructured":"Boyd D, Crawford K. Critical questions for big data: provocations for a cultural, technological and scholarly phenomenon. Inf Commun Soc. 2012;15(5):662\u201379. doi: 10.1080\/1369118X.2012.678878 .","journal-title":"Inf Commun Soc"},{"key":"57_CR39","unstructured":"Center I. Big data analytics. Intel IT Center; 2012."},{"key":"57_CR40","unstructured":"Microsoft: the big bang: how the big data explosion is changing the world. https:\/\/news.microsoft.com\/2013\/02\/11\/the-big-bang-how-the-big-data-explosion-is-changing-the-world\/ . Accessed 11 Feb 2013."},{"key":"57_CR41","doi-asserted-by":"crossref","unstructured":"Shneiderman B. Extreme visualization: Squeezing a billion records into a million pixels. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data. SIGMOD \u201908. New York: ACM. p. 3\u201312; 2008. doi:10.1145\/1376616.1376618","DOI":"10.1145\/1376616.1376618"},{"key":"57_CR42","volume-title":"Big data: a revolution that will transform how we live","author":"V Mayer-Sch\u00f6nberger","year":"2013","unstructured":"Mayer-Sch\u00f6nberger V, Cukier K. Big data: a revolution that will transform how we live. London: John Murray Publishers; 2013."},{"key":"57_CR43","unstructured":"Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers AH. Big data: the next frontier for innovation, competition, and productivity. 2011."},{"issue":"1","key":"57_CR44","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/s40537-015-0030-3","volume":"2","author":"C-W Tsai","year":"2015","unstructured":"Tsai C-W, Lai C-F, Chao H-C, Vasilakos AV. Big data analytics: a survey. J Big Data. 2015;2(1):21. doi: 10.1186\/s40537-015-0030-3 .","journal-title":"J Big Data"},{"key":"57_CR45","unstructured":"Alchemy API: Alchemy. http:\/\/www.alchemyapi.com . Accessed 15 Dec 2015."},{"key":"57_CR46","doi-asserted-by":"crossref","unstructured":"Wallach HM, Murray I, Salakhutdinov R, Mimno D. Evaluation methods for topic models. In: Proceedings of the 26th Annual international conference on machine learning. ICML \u201909. New York: ACM; 2009. p. 1105\u20131112. doi:10.1145\/1553374.1553515.","DOI":"10.1145\/1553374.1553515"},{"key":"57_CR47","unstructured":"Sievert C. Finding structure in xkcd comics with latent dirichlet allocation. https:\/\/cpsievert.github.io\/xkcd\/ . Accessed 20 Nov 2015."},{"key":"57_CR48","first-page":"288","volume-title":"Advances in neural information processing systems 22","author":"J Chang","year":"2009","unstructured":"Chang J, Gerrish S, Wang C, Boyd-Graber JL, Blei DM. Reading tea leaves: how humans interpret topic models. In: Bengio Y, Schuurmans D, Lafferty J, Williams C, Culotta A, editors. Advances in neural information processing systems 22. Red Hook: Curran Associates Inc; 2009. p. 288\u201396."},{"key":"57_CR49","unstructured":"Lau JH, Grieser K, Newman D, Baldwin T. Automatic labelling of topic models. Proceedings of the 49th Annual Meeting of the association for computational linguistics: human language technologies, vol 1. HLT \u201911. Stroudsburg: Association for Computational Linguistics; 2011. p. 1536\u201345."},{"key":"57_CR50","doi-asserted-by":"crossref","unstructured":"Mei Q, Shen X, Zhai C. Automatic labeling of multinomial topic models. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. KDD \u201907. New York: ACM; 2007. p. 490\u2013499. doi:10.1145\/1281192.1281246","DOI":"10.1145\/1281192.1281246"},{"key":"57_CR51","unstructured":"Amazon: Amazon Mechanical Turk. https:\/\/www.mturk.com . Accessed 27 Feb 2016."},{"key":"57_CR52","doi-asserted-by":"crossref","unstructured":"Zhao WX, Jiang J, Weng J, He J, Lim EP, Yan H, Li X. Comparing twitter and traditional media using topic models. In: Proceedings of the 33rd European Conference on advances in information retrieval. ECIR\u201911. Berlin: Springer; 2011. p. 338\u2013349. http:\/\/dl.acm.org\/citation.cfm?id=1996889.1996934 .","DOI":"10.1007\/978-3-642-20161-5_34"},{"issue":"1","key":"57_CR53","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-016-0039-2","volume":"3","author":"JL Hurtado","year":"2016","unstructured":"Hurtado JL, Agarwal A, Zhu X. Topic discovery and future trend forecasting for texts. J Big Data. 2016;3(1):1\u201321. doi: 10.1186\/s40537-016-0039-2 .","journal-title":"J Big Data"},{"key":"57_CR54","unstructured":"Altena, van AJ. AMCeScience\/R-topicmodelling at Submission. https:\/\/github.com\/AMCeScience\/R-topicmodelling\/releases\/tag\/Submission ."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-016-0057-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s40537-016-0057-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-016-0057-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,20]],"date-time":"2024-06-20T16:13:14Z","timestamp":1718899994000},"score":1,"resource":{"primary":{"URL":"http:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-016-0057-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,11,15]]},"references-count":54,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,12]]}},"alternative-id":["57"],"URL":"https:\/\/doi.org\/10.1186\/s40537-016-0057-0","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,11,15]]},"article-number":"23"}}