{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T16:33:28Z","timestamp":1781109208396,"version":"3.54.1"},"reference-count":56,"publisher":"IGI Global Scientific Publishing","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,10,1]]},"abstract":"<p>In this article, the authors explore the potential of a big data analytics approach to unstructured text analytics of cancer blogs. The application is developed using Cloudera platform's Hadoop MapReduce framework. It uses several text analytics algorithms, including word count, word association, clustering, and classification, to identify and analyze the patterns and keywords in cancer blog postings. This article establishes an exploratory approach to involving big data analytics methods in developing text analytics applications for the analysis of cancer blogs. Additional insights are extracted through various means, including the development of categories or keywords contained in the blogs, the development of a taxonomy, and the examination of relationships among the categories. The application has the potential for generalizability and implementation with health content in other blogs and social media. It can provide insight and decision support for cancer management and facilitate efficient and relevant searches for information related to cancer.<\/p>","DOI":"10.4018\/ijhisi.2019100101","type":"journal-article","created":{"date-parts":[[2019,9,20]],"date-time":"2019-09-20T10:04:36Z","timestamp":1568973876000},"page":"1-20","source":"Crossref","is-referenced-by-count":5,"title":["Exploring Big Data Analytic Approaches to Cancer Blog Text Analysis"],"prefix":"10.4018","volume":"14","author":[{"given":"Viju","family":"Raghupathi","sequence":"first","affiliation":[{"name":"Koppelman School of Business, Brooklyn College of the City University of New York, Brooklyn, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yilu","family":"Zhou","sequence":"additional","affiliation":[{"name":"Gabelli School of Business, Fordham University, New York, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wullianallur","family":"Raghupathi","sequence":"additional","affiliation":[{"name":"Gabelli School of Business, Fordham University, New York, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"2432","reference":[{"key":"IJHISI.2019100101-0","doi-asserted-by":"crossref","unstructured":"Agarwal, V., Zhang, L., Zhu, J., Fang, S., Cheng, T., Hong, C., & Shah, N. H. (2016). Impact of predicting health care utilization via web search behavior: a data-driven analysis. Journal of medical Internet research, 18(9), 1-13.","DOI":"10.2196\/jmir.6240"},{"key":"IJHISI.2019100101-1","doi-asserted-by":"crossref","unstructured":"Bian, J., Topaloglu, U., & F. Yu. (2012). Towards Large-scale Twitter Mining for Drug-related Adverse Events. In SHB\u201912, Maui, Hawaii, October 29.","DOI":"10.1145\/2389707.2389713"},{"key":"IJHISI.2019100101-2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2012.10.007"},{"key":"IJHISI.2019100101-3","author":"D.Bollier","year":"2010","journal-title":"The Promise and Peril of Big Data"},{"key":"IJHISI.2019100101-4","doi-asserted-by":"publisher","DOI":"10.1145\/2331042.2331057"},{"key":"IJHISI.2019100101-5","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.2180"},{"key":"IJHISI.2019100101-6","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.4517"},{"key":"IJHISI.2019100101-7","doi-asserted-by":"publisher","DOI":"10.1089\/big.2015.29001.mcr"},{"key":"IJHISI.2019100101-8","doi-asserted-by":"crossref","unstructured":"Dhar, V. (2014). Big data and predictive analytics in health care. Big Data, 2(3), 113-116.","DOI":"10.1089\/big.2014.1525"},{"key":"IJHISI.2019100101-9","unstructured":"Explorys. (n.d.). Unlocking the Power of Big Data to Improve Healthcare for Everyone. Retrieved from https:\/\/www.explorys.com\/docs\/data-sheets\/explorys-overview.pdf"},{"key":"IJHISI.2019100101-10","unstructured":"Frost & Sullivan. (2012). Drowning in Big Data? Reducing Information Technology Complexities and Costs for Healthcare Organizations. Retrieved from http:\/\/www.emc.com\/collateral\/analyst-reports\/frost-sullivan-reducing-information-technology-complexities-ar.pdf"},{"key":"IJHISI.2019100101-11","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.2721"},{"key":"IJHISI.2019100101-12","unstructured":"Hardy, Q. (2012, July 25). McKinsey says social media could add $1.3 trillion to the economy. The New York Times. Retrieved from https:\/\/bits.blogs.nytimes.com\/2012\/07\/25\/mckinsey-says-social-media-adds-1-3-trillion-to-the-economy"},{"key":"IJHISI.2019100101-13","unstructured":"IBM. (2013). Data Driven Healthcare Organizations Use Big Data Analytics for Big Gains. Retrieved from http:\/\/www.ibmbigdatahub.com\/whitepaper\/data-driven-healthcare-organizations-use-big-data-analytics-big-gains"},{"key":"IJHISI.2019100101-14","unstructured":"IBM. (2012). How Big Data Analytics Reduced Medicaid Re-admissions. A jStart Case Study. Retrieved from http:\/\/www-01.ibm.com\/software\/ebusiness\/jstart\/portfolio\/uncMedicaidCaseStudy.pdf"},{"key":"IJHISI.2019100101-15","unstructured":"Intel. (2012). Leveraging Big Data and Analytics in Healthcare and Life Sciences: Enabling Personalized Medicine for High-Quality Care, Better Outcomes. Retrieved from http:\/\/www.intel.com\/content\/dam\/www\/public\/us\/en\/documents\/white-papers\/healthcare-leveraging-big-data-paper.pdf"},{"key":"IJHISI.2019100101-16","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.3646"},{"key":"IJHISI.2019100101-17","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.5144"},{"key":"IJHISI.2019100101-18","doi-asserted-by":"publisher","DOI":"10.1080\/15456870.2010.505904"},{"key":"IJHISI.2019100101-19","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.4220"},{"key":"IJHISI.2019100101-20","doi-asserted-by":"publisher","DOI":"10.3163\/1536-5050.97.4.009"},{"key":"IJHISI.2019100101-21","unstructured":"Knowledgent. (n.d.). Big Data and Healthcare Payers. Retrieved from https:\/\/knowledgent.com\/whitepaper\/big-data-and-healthcare-payers\/"},{"key":"IJHISI.2019100101-22","unstructured":"Konkel, F. (2013, January 25). Predictive analytics allows Feds to track outbreaks in real time. FCW. Retrieved from https:\/\/fcw.com\/articles\/2013\/01\/25\/flu-social-media.aspx"},{"key":"IJHISI.2019100101-23","unstructured":"Kotenko, J. (2013, April 18). The doctor will see you now: how the Internet and social media are changing healthcare. Digitaltrends. Retrieved from http:\/\/www.digitaltrends.com\/social-media\/the-internet-and-healthcare"},{"key":"IJHISI.2019100101-24","doi-asserted-by":"publisher","DOI":"10.1145\/1035134.1035162"},{"key":"IJHISI.2019100101-25","doi-asserted-by":"publisher","DOI":"10.1007\/11581062_40"},{"key":"IJHISI.2019100101-26","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.4304"},{"key":"IJHISI.2019100101-27","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-12-112"},{"key":"IJHISI.2019100101-28","doi-asserted-by":"publisher","DOI":"10.1147\/sj.433.0490"},{"key":"IJHISI.2019100101-29","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.5521"},{"key":"IJHISI.2019100101-30","unstructured":"McDonald, C. (2017, April 7). \u201cTransforming healthcare through big data. Retrieved from https:\/\/www.healthitoutcomes.com\/doc\/transforming-healthcare-through-big-data-0001"},{"key":"IJHISI.2019100101-31","first-page":"128","article-title":"Extracting information from textual documentations in the electronic health record: a review of recent research.","author":"S. M.Meystre","year":"2008","journal-title":"IMIA Yearbook of Medical Informatics"},{"key":"IJHISI.2019100101-32","doi-asserted-by":"publisher","DOI":"10.2105\/AJPH.2009.175125"},{"key":"IJHISI.2019100101-33","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2005.08.003"},{"key":"IJHISI.2019100101-34","doi-asserted-by":"publisher","DOI":"10.1002\/9781119205005"},{"issue":"3","key":"IJHISI.2019100101-35","first-page":"14","article-title":"HADOOP: Scalable, flexible, data storage and analysis.","volume":"1","author":"M.Olson","year":"2010","journal-title":"IQT Quarterly"},{"key":"IJHISI.2019100101-36","doi-asserted-by":"publisher","DOI":"10.1200\/JOP.2015.010504"},{"key":"IJHISI.2019100101-37","doi-asserted-by":"publisher","DOI":"10.1145\/1089815.1089824"},{"issue":"3","key":"IJHISI.2019100101-38","article-title":"An Overview of Health Analytics.","volume":"4","author":"W.Raghupathi","year":"2013","journal-title":"Journal of Medical & Health Informatics"},{"issue":"3","key":"IJHISI.2019100101-39","first-page":"1","article-title":"Big data analytics in healthcare: Promise and potential.","volume":"2","author":"W.Raghupathi","year":"2014","journal-title":"Health Information Science and Systems"},{"key":"IJHISI.2019100101-40","unstructured":"Rauber, C. (2004, October 31). Raising Kaiser\u2019s role. Retrieved from https:\/\/www.bizjournals.com\/sanfrancisco\/stories\/2004\/11\/01\/story6.html"},{"key":"IJHISI.2019100101-41","doi-asserted-by":"publisher","DOI":"10.1145\/2347736.2347741"},{"key":"IJHISI.2019100101-42","author":"S.Spangler","year":"2007","journal-title":"Mining the Talk: Unlocking the business value in unstructured information"},{"key":"IJHISI.2019100101-43","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2014.06.009"},{"key":"IJHISI.2019100101-44","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.6045"},{"key":"IJHISI.2019100101-45","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.3875"},{"key":"IJHISI.2019100101-46","doi-asserted-by":"publisher","DOI":"10.2196\/medinform.2913"},{"key":"IJHISI.2019100101-47","unstructured":"White, J. (2015, October 9). Why hospitals need social media, online presence. Retrieved from http:\/\/www.healthcarebusinesstech.com\/hospitals-social-media\/"},{"key":"IJHISI.2019100101-48","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2012-001473"},{"key":"IJHISI.2019100101-49","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.4955"},{"key":"IJHISI.2019100101-50","doi-asserted-by":"crossref","unstructured":"Wright, A., Chen, E. S., & F. L. Maloney. (2010). An automated technique for identifying associations between medications, laboratory results and problems. Journal of Biomedical Informatics, 43, 891.901.","DOI":"10.1016\/j.jbi.2010.09.009"},{"key":"IJHISI.2019100101-51","unstructured":"Yom-Tov. E. (2017). Predicting drug recalls from Internet search engine queries. IEEE Journal of Translational Engineering in Health and Medicine, 5."},{"key":"IJHISI.2019100101-52","doi-asserted-by":"publisher","DOI":"10.2196\/jmir.3156"},{"key":"IJHISI.2019100101-53","unstructured":"Zenger, B. (2012, February). Can Big Data Solve Healthcare\u2019s Big Problems? EquityHealthcare. Retrieved from https:\/\/www.equityhealthcare.com\/docs\/librariesprovider2\/news-item-documents\/eh-blog-on-analytics.pdf"},{"key":"IJHISI.2019100101-54","author":"P. C.Zikopoulos","year":"2013","journal-title":"Harness the Power of Big Data \u2013 The IBM Big Data Platform"},{"key":"IJHISI.2019100101-55","author":"P. C.Zikopoulos","year":"2012","journal-title":"Understanding Big Data \u2013 Analytics for Enterprise Class Hadoop and Streaming Data"}],"container-title":["International Journal of Healthcare Information Systems and Informatics"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=238043","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,16]],"date-time":"2023-01-16T16:41:52Z","timestamp":1673887312000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJHISI.2019100101"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2019,10,1]]},"references-count":56,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2019,10]]}},"URL":"https:\/\/doi.org\/10.4018\/ijhisi.2019100101","relation":{},"ISSN":["1555-3396","1555-340X"],"issn-type":[{"value":"1555-3396","type":"print"},{"value":"1555-340X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,10,1]]}}}