{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T18:46:50Z","timestamp":1773773210926,"version":"3.50.1"},"reference-count":25,"publisher":"World Scientific Pub Co Pte Ltd","issue":"03n04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2020,6]]},"abstract":"<jats:p> Social media contain rich information that can be used to help understand human mind and behavior. Social media data, however, are mostly unstructured (e.g., text and image) and a large number of features may be needed to represent them (e.g., we may need millions of unigrams to represent social media texts). Moreover, accurately assessing human behavior is often difficult (e.g., assessing addiction may require medical diagnosis). As a result, the ground truth data needed to train a supervised human behavior model are often difficult to obtain at a large scale. To avoid overfitting, many state-of-the-art behavior models employ sophisticated unsupervised or self-supervised machine learning methods to leverage a large amount of unsupervised data for both feature learning and dimension reduction. Unfortunately, despite their high performance, these advanced machine learning models often rely on latent features that are hard to explain. Since understanding the knowledge captured in these models is important to behavior scientists and public health providers, we explore new methods to build machine learning models that are not only accurate but also interpretable. We evaluate the effectiveness of the proposed methods in predicting Substance Use Disorders (SUD). We believe the methods we proposed are general and applicable to a wide range of data-driven human trait and behavior analysis applications. <\/jats:p>","DOI":"10.1142\/s021821302060009x","type":"journal-article","created":{"date-parts":[[2020,6,17]],"date-time":"2020-06-17T11:01:59Z","timestamp":1592391719000},"page":"2060009","source":"Crossref","is-referenced-by-count":3,"title":["Building High Performance Explainable Machine Learning Models for Social Media-based Substance Use Prediction"],"prefix":"10.1142","volume":"29","author":[{"given":"Tao","family":"Ding","sequence":"first","affiliation":[{"name":"Department of Information Systems, University of Maryland, Baltimore County, Baltimore, 21250, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9722-4243","authenticated-orcid":false,"given":"Fatema","family":"Hasan","sequence":"additional","affiliation":[{"name":"Department of Information Systems, University of Maryland, Baltimore County, Baltimore, 21250, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Warren K.","family":"Bickel","sequence":"additional","affiliation":[{"name":"Addiction Recovery Research Center, Virginia Tech Carilion School of Medicine and Research Institute, Roanoke, VA 24016, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shimei","family":"Pan","sequence":"additional","affiliation":[{"name":"Department of Information Systems, University of Maryland, Baltimore County, Baltimore, 21250, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2020,6,18]]},"reference":[{"key":"p_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1218772110"},{"key":"p_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0073791"},{"key":"p_5","first-page":"14","author":"Benton A.","year":"2016","journal-title":"Short Papers) ("},{"key":"p_8","doi-asserted-by":"publisher","DOI":"10.1137\/S0895479896305696"},{"key":"p_11","doi-asserted-by":"publisher","DOI":"10.1214\/09-SS057"},{"key":"p_14","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1418680112"},{"key":"p_18","first-page":"729","author":"Preoiuc-Pietro D.","year":"2017","journal-title":"Long Papers) ("},{"key":"p_19","first-page":"1","volume":"13","author":"De Choudhury M.","year":"2013","journal-title":"ICWSM"},{"key":"p_21","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-017-0102-z"},{"issue":"9","key":"p_24","doi-asserted-by":"crossref","first-page":"e0138717","DOI":"10.1371\/journal.pone.0138717","volume":"10","author":"Preoiuc-Pietro D.","year":"2015","journal-title":"PloS One"},{"key":"p_26","doi-asserted-by":"publisher","DOI":"10.1162\/0899766042321814"},{"key":"p_32","first-page":"2016","author":"Ramprasaath R.","year":"2016","journal-title":"CVPR"},{"key":"p_34","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/70.1.41"},{"key":"p_39","first-page":"294","volume":"1","author":"Spirtes P.","year":"1995","journal-title":"KDD"},{"key":"p_42","doi-asserted-by":"publisher","DOI":"10.1037\/a0039210"},{"key":"p_43","doi-asserted-by":"publisher","DOI":"10.1111\/j.1360-0443.2011.03742.x"},{"key":"p_44","doi-asserted-by":"publisher","DOI":"10.1038\/sj.npp.1300030"},{"key":"p_45","doi-asserted-by":"publisher","DOI":"10.1097\/01.ALC.0000156453.05028.F5"},{"key":"p_46","doi-asserted-by":"publisher","DOI":"10.1046\/j.1360-0443.2000.951116919.x"},{"key":"p_47","first-page":"993","volume":"3","author":"Blei D.","year":"2003","journal-title":"JMLR"},{"key":"p_48","first-page":"1188","volume":"14","author":"Le Q. V.","year":"2014","journal-title":"ICML"},{"key":"p_49","doi-asserted-by":"publisher","DOI":"10.1126\/science.1127647"},{"key":"p_50","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-9473(01)00065-2"},{"key":"p_51","first-page":"1643","volume":"11","author":"Spirtes P.","year":"2010","journal-title":"Journal of Machine Learning Research"},{"key":"p_54","doi-asserted-by":"publisher","DOI":"10.1214\/11-AOS940"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S021821302060009X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,6,17]],"date-time":"2020-06-17T11:02:28Z","timestamp":1592391748000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S021821302060009X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6]]},"references-count":25,"journal-issue":{"issue":"03n04","published-print":{"date-parts":[[2020,6]]}},"alternative-id":["10.1142\/S021821302060009X"],"URL":"https:\/\/doi.org\/10.1142\/s021821302060009x","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6]]}}}