{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:25:04Z","timestamp":1777703104747,"version":"3.51.4"},"reference-count":44,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2019,5,14]],"date-time":"2019-05-14T00:00:00Z","timestamp":1557792000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,5,14]]},"abstract":"<jats:p>\u00a0Author Profiling (AP) aims at predicting specific characteristics from a group of authors by analyzing their written documents. Many research has been focused on determining suitable features for modeling writing patterns from authors. Reported results indicate that content-based features continue to be the most relevant and discriminant features for solving this task. Thus, in this paper, we present a thorough analysis regarding the appropriateness of different distributional term representations (DTR) for the AP task. In this regard, we introduce a novel framework for supervised AP using these representations and, supported on it. We approach a comparative analysis of representations such as DOR, TCOR, SSR, and word2vec in the AP problem. We also compare the performance of the DTRs against classic approaches including popular topic-based methods. The obtained results indicate that DTRs are suitable for solving the AP task in social media domains as they achieve competitive results while providing meaningful interpretability.<\/jats:p>","DOI":"10.3233\/jifs-179033","type":"journal-article","created":{"date-parts":[[2019,5,14]],"date-time":"2019-05-14T12:07:26Z","timestamp":1557835646000},"page":"4857-4868","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":4,"title":["A comparative analysis of distributional term representations for author profiling in social media"],"prefix":"10.1177","volume":"36","author":[{"given":"Miguel A.","family":"\u00c1lvarez-Carmona","sequence":"first","affiliation":[{"name":"Computational Sciences Department, Instituto Nacional de Astrof\u00edsica, \u00d3ptica y Electr\u00f3nica, Luis Enrique Erro 1, Puebla M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Esa\u00fa","family":"Villatoro-Tello","sequence":"additional","affiliation":[{"name":"Information Technologies Department, Universidad Aut\u00f3noma Metropolitana, Unidad Cuajimalpa (UAM-C), Ciudad de M\u00e9xico, M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manuel","family":"Montes-Y-G\u00f3mez","sequence":"additional","affiliation":[{"name":"Computational Sciences Department, Instituto Nacional de Astrof\u00edsica, \u00d3ptica y Electr\u00f3nica, Luis Enrique Erro 1, Puebla M\u00e9xico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luis","family":"Villase\u00f1or-Pineda","sequence":"additional","affiliation":[{"name":"Computational Sciences Department, Instituto Nacional de Astrof\u00edsica, \u00d3ptica y Electr\u00f3nica, Luis Enrique Erro 1, Puebla M\u00e9xico"},{"name":"Centre de Recherche GRAMMATICA (EA 4521), Universit\u00e9 d\u2019Artois, France"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2019,5,14]]},"reference":[{"key":"e_1_3_3_2_2","article-title":"Inaoe\u2019s participation at pan15: Author profiling task","volume":"1391","author":"\u00c1lvarez-Carmona M.A.","year":"2015","unstructured":"\u00c1lvarez-CarmonaM.A., L\u00f3pez-MonroyA.P., Montes-y G\u00f3mezM., Villase\u00f1or-PinedaL. and EscalanteH.J., Inaoe\u2019s participation at pan15: Author profiling task, Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, 1391, 2015.","journal-title":"Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum"},{"key":"e_1_3_3_3_2","first-page":"151","volume-title":"Ibero-American Conference on Artificial Intelligence","author":"\u00c1lvarez-Carmona M.A.","year":"2016","unstructured":"\u00c1lvarez-CarmonaM.A., L\u00f3pez-MonroyA.P., Montes-y G\u00f3mezM., Villase\u00f1or-PinedaL. and MezaI., Evaluating topicbased representations for author profiling in social media, In Ibero-American Conference on Artificial Intelligence, Springer, 2016, pp. 151\u2013162."},{"key":"e_1_3_3_4_2","volume-title":"Proceedings of the 2005 Joint Annual Meeting of the Interface and the Classification Society of North America","author":"Argamon S.","year":"2005","unstructured":"ArgamonS., DhawleS., KoppelM. and PennebakerJ.W., Lexical predictors of personality type, In Proceedings of the 2005 Joint Annual Meeting of the Interface and the Classification Society of North America, 2005."},{"key":"e_1_3_3_5_2","article-title":"N-gram: New groningen author-profiling model","author":"Basile A.","year":"2017","unstructured":"BasileA., DwyerG., MedvedevaM., RaweeJ., HaagsmaH. and NissimM., N-gram: New groningen author-profiling model, arXiv preprint arXiv:1707.03764 (2017).","journal-title":"arXiv preprint arXiv:1707.03764"},{"key":"e_1_3_3_6_2","first-page":"815","article-title":"Author profiling using svms and word embedding averages","author":"Bayot R.K.","year":"2016","unstructured":"BayotR.K. and Gon\u00e7alvesT., Author profiling using svms and word embedding averages, In CLEF (Working Notes), 2016, pp. 815\u2013823.","journal-title":"CLEF (Working Notes)"},{"key":"e_1_3_3_7_2","first-page":"327","volume-title":"Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Bergsma S.","year":"2012","unstructured":"BergsmaS., PostM. and YarowskyD., Stylometric analysis of scientific articles, In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2012, pp. 327\u2013337. Association for Computational Linguistics."},{"key":"e_1_3_3_8_2","first-page":"824","article-title":"Caps: A cross-genre author profiling system","author":"Bilan I.","year":"2016","unstructured":"BilanI. and ZhekovaD., Caps: A cross-genre author profiling system, In CLEF (Working Notes), 2016, pp. 824\u2013835.","journal-title":"CLEF (Working Notes)"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2671188.2749381"},{"key":"e_1_3_3_10_2","article-title":"Subword-based deep averaging networks for author profiling in social media","author":"Franco-Salvador M.","year":"2017","unstructured":"Franco-SalvadorM., PlotnikovaN., PawarN. and BenajibaY., Subword-based deep averaging networks for author profiling in social media, Cappellato et al.[13] (2017).","journal-title":"Cappellato et al.[13]"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24027-5_3"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1155\/2016\/1638936"},{"key":"e_1_3_3_13_2","volume-title":"CLEF 2015 Evaluation Labs and Workshop \u2013 Working Notes Papers","author":"Kiprov Y.","year":"2015","unstructured":"KiprovY., HardalovM., NakovP. and KoychevI., SUPAN\u2019 2015: Experiments in Author Profiling\u2014Notebook for PAN at CLEF 2015. In CappellatoL., FerroN., JonesG. and San JuanE., editors, CLEF 2015 Evaluation Labs and Workshop \u2013 Working Notes Papers, Toulouse, France, CEUR-WS.org, 2015."},{"key":"e_1_3_3_14_2","article-title":"Author profiling with bidirectional rnns using attention with grus","author":"Kodiyan D.","year":"2017","unstructured":"KodiyanD., HardeggerF., NeuhausS. and CieliebakM., Author profiling with bidirectional rnns using attention with grus, Cappellato et al.[13] (2017).","journal-title":"Cappellato et al.[13]"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/17.4.401"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1093\/applin\/16.3.307"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031284"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/1031171.1031284"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2010.11.001"},{"key":"e_1_3_3_20_2","article-title":"Using intra-profile information for author profiling","author":"L\u00f3pez-Monroy A.P.","year":"2014","unstructured":"L\u00f3pez-MonroyA.P., Montes-y-G\u00f3mezM., EscalanteH.J. and Villase\u00f1or-PinedaL., Using intra-profile information for author profiling, In CLEF 2014 Working Notes, 2014.","journal-title":"CLEF 2014 Working Notes"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.06.024"},{"key":"e_1_3_3_22_2","volume-title":"Experimental IR Meets Multilinguality, Multimodality, and Interaction Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018)","author":"L\u00f3pez-Santill\u00e1n R.","year":"2018","unstructured":"L\u00f3pez-Santill\u00e1nR., Gonz\u00e1lez-GurrolaL.C. and Ram\u00edrez-AlonsoG., Custom document embeddings via the centroids method: Gender classification in an author profiling task, In Experimental IR Meets Multilinguality, Multimodality, and Interaction Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018), 2018."},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1540-4781.2011.01232_1.x"},{"key":"e_1_3_3_24_2","first-page":"1121","article-title":"A simple approach to author profiling in mapreduce","author":"Maharjan S.","year":"2014","unstructured":"MaharjanS., ShresthaP. and SolorioT., A simple approach to author profiling in mapreduce, In CLEF (Working Notes), 2014, pp. 1121\u20131128.","journal-title":"CLEF (Working Notes)"},{"key":"e_1_3_3_25_2","first-page":"947","article-title":"Adapting cross-genre author profiling to language and corpus","author":"Markov I.","year":"2016","unstructured":"MarkovI., G\u00f3mez-AdornoH., SidorovG. and GelbukhA.F., Adapting cross-genre author profiling to language and corpus, In CLEF (Working Notes), 2016, pp. 947\u2013955.","journal-title":"CLEF (Working Notes)"},{"key":"e_1_3_3_26_2","unstructured":"McCollisterC. Predicting Author Traits Through Topic Modeling of Multilingual Social Media Text PhD thesis University of Kansas 2016."},{"key":"e_1_3_3_27_2","article-title":"Ensemble-based classification for author profiling using various features","author":"Meina M.","year":"2013","unstructured":"MeinaM., BrodzinskaK., CelmerB., CzokowM., PateraM., PezackiJ. and WilkM., Ensemble-based classification for author profiling using various features, Notebook Papers of CLEF (2013).","journal-title":"Notebook Papers of CLEF"},{"key":"e_1_3_3_28_2","article-title":"Efficient estimation of word representations in vector space","author":"Mikolov T.","year":"2013","unstructured":"MikolovT., ChenK., CorradoG. and DeanJ., Efficient estimation of word representations in vector space, arXiv preprint arXiv:1301.3781 (2013).","journal-title":"arXiv preprint arXiv:1301.3781"},{"key":"e_1_3_3_29_2","first-page":"3111","article-title":"Distributed representations of words and phrases and their compositionality","author":"Mikolov T.","year":"2013","unstructured":"MikolovT., SutskeverI., ChenK., CorradoG.S. and DeanJ., Distributed representations of words and phrases and their compositionality, In Advances in Neural Information Processing Systems, 2013, pp. 3111\u20133119.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_3_30_2","article-title":"Author profiling with word+ character neural attention network","author":"Miura Y.","unstructured":"MiuraY., TaniguchiT., TaniguchiM. and OhkumaT., Author profiling with word+ character neural attention network, Cappellato et al.[13].","journal-title":"Cappellato et al.[13]"},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-74628-7_22"},{"key":"e_1_3_3_32_2","volume-title":"Proceedings of CLEF","author":"Poulston A.","year":"2015","unstructured":"PoulstonA., StevensonM. and BontchevaK., Topic models and n\u2013gram language models for author profiling, In Proceedings of CLEF, 2015."},{"key":"e_1_3_3_33_2","article-title":"Using tf-idf ngram and word embedding cluster ensembles for author profiling","author":"Poulston A.","year":"2017","unstructured":"PoulstonA., WaseemZ. and StevensonM., Using tf-idf ngram and word embedding cluster ensembles for author profiling, Cappellato et al.[13] (2017).","journal-title":"Cappellato et al.[13]"},{"key":"e_1_3_3_34_2","first-page":"898","article-title":"Overview of the author profiling task at PAN 2014","author":"Rangel F.","year":"2014","unstructured":"RangelF., RossoP., ChugurI., PotthastM., TrenkmannM., SteinB., VerhoevenB. and DaelemansW., Overview of the author profiling task at PAN 2014, In CLEF (Online Working Notes\/Labs\/Workshop), 2014, pp. 898\u2013927.","journal-title":"CLEF (Online Working Notes\/Labs\/Workshop)"},{"key":"e_1_3_3_35_2","article-title":"Overview of the 6th author profiling task at pan 2018: Multimodal gender identification in twitter","author":"Rangel F.","year":"2018","unstructured":"RangelF., RossoP., Montes-y G\u00f3mezM., PotthastM. and SteinB., Overview of the 6th author profiling task at pan 2018: Multimodal gender identification in twitter, Working Notes Papers of the CLEF (2018).","journal-title":"Working Notes Papers of the CLEF"},{"key":"e_1_3_3_36_2","first-page":"199","volume-title":"Proceedings of 2006 AAAI Spring Symposium on Computational Approaches for Analyzing Weblogs","author":"Schler J.","year":"2006","unstructured":"SchlerJ., KoppelM., ArgamonS. and PennebakerJ., Effects of age and gender on blogging, In Proceedings of 2006 AAAI Spring Symposium on Computational Approaches for Analyzing Weblogs, 2006, pp. 199\u2013205."},{"key":"e_1_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0073791"},{"key":"e_1_3_3_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"e_1_3_3_39_2","unstructured":"SierraS. and Gonz\u00e1lezF.A. Combining textual and visual representations for multimodal author profiling 2018."},{"key":"e_1_3_3_40_2","volume-title":"Experimental IR Meets Multilinguality, Multimodality, and Interaction Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018)","author":"Takahashi T.","year":"2018","unstructured":"TakahashiT., TaharaT., NagataniK., MiuraY., TaniguchiT. and OhkumaT., Text and image synergy with feature cross technique for gender identification, In Experimental IR Meets Multilinguality, Multimodality, and Interaction Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018), 2018."},{"key":"e_1_3_3_41_2","doi-asserted-by":"crossref","unstructured":"TellezF.P. PintoD. CardiffJ. and RossoP. Defining and evaluating blog characteristics In Artificial Intelligence 2009. MICAI 2009. Eighth Mexican International Conference on 2009 pp. 97\u2013102. IEEE.","DOI":"10.1109\/MICAI.2009.21"},{"key":"e_1_3_3_42_2","unstructured":"Villena Rom\u00e1nJ. and Gonz\u00e1lez Crist\u00f3balJ.C. Daedalus at pan: Guessing tweet author\u2019s gender and age (2014)."},{"issue":"3","key":"e_1_3_3_43_2","first-page":"266","article-title":"Examining multiple features for author profiling","volume":"5","author":"Weren E.R.","year":"2014","unstructured":"WerenE.R., KauerA.U., MizusakiL., MoreiraV.P., de OliveiraJ.P.M. and WivesL.K., Examining multiple features for author profiling, Journal of Information and Data Management5(3) (2014), 266.","journal-title":"Journal of Information and Data Management"},{"key":"e_1_3_3_44_2","first-page":"1164","article-title":"Exploring information retrieval features for author profiling","author":"Weren E.R.","year":"2014","unstructured":"WerenE.R., MoreiraV.P. and de OliveiraJ.P.M., Exploring information retrieval features for author profiling, In CLEF (Working Notes), 2014, pp. 1164\u20131171.","journal-title":"CLEF (Working Notes)"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1631\/FITEE.1601883"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179033","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179033","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179033","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:37:41Z","timestamp":1777455461000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179033"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,5,14]]},"references-count":44,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,5,14]]}},"alternative-id":["10.3233\/JIFS-179033"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179033","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,5,14]]}}}