{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T15:09:18Z","timestamp":1773414558650,"version":"3.50.1"},"reference-count":27,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2019,3,23]],"date-time":"2019-03-23T00:00:00Z","timestamp":1553299200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2019,5,14]]},"abstract":"<jats:p>The process of automatic identification of an author\u2019s demographic traits like gender, age, native language, geographical location, personality type and others from his\/her written text is termed as author profiling (AP). Currently, it has engaged the research community due to its promising uses in security, marketing, forensic, bogus account identification on public networks. A variety of benchmark corpora (English text) released by PAN shared task is used to perform our experiments. This study presents a Content-based approach for detection of author\u2019s traits (age group and gender) for same-genre author profiles. In our proposed method, we used a different set of features including syntactic n-grams of part-of-speech tags, traditional n-grams of part-of-speech tags, the combination of word n-grams and combination of character n-grams. We tried a range of classifier for several profile sizes. We used the word uni-grams and character tri-grams as our baseline approaches. We achieved best accuracy of 0.496 and 0.734 for both traits, i.e., age group and gender respectively, by applying the combination of word n-grams of various sizes. Experimental results signify that the combination of word n-grams can produce good results on benchmark corpora.<\/jats:p>","DOI":"10.3233\/jifs-179031","type":"journal-article","created":{"date-parts":[[2019,3,26]],"date-time":"2019-03-26T18:56:58Z","timestamp":1553626618000},"page":"4833-4843","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":14,"title":["Author profiling for age and gender using combinations of features of various types"],"prefix":"10.1177","volume":"36","author":[{"given":"Iqra","family":"Ameer","sequence":"first","affiliation":[{"name":"Center for Computing Research (CIC), Instituto Poit\u00e9cnico Nacional (IPN), Av. Juan de Dios B\u00e1tiz, Zacatenco, Mexico City, Mexico"}]},{"given":"Grigori","family":"Sidorov","sequence":"additional","affiliation":[{"name":"Center for Computing Research (CIC), Instituto Poit\u00e9cnico Nacional (IPN), Av. Juan de Dios B\u00e1tiz, Zacatenco, Mexico City, Mexico"}]},{"given":"Rao Muhammad Adeel","family":"Nawab","sequence":"additional","affiliation":[{"name":"Department of Computer Science, COMSATS University Islamabad Lahore Campus, Pakistan"}]}],"member":"179","published-online":{"date-parts":[[2019,3,23]]},"reference":[{"key":"e_1_3_3_2_2","article-title":"Inaoe\u2019s participation at PAN 13: Author profiling task","author":"Lopez-Monroy A.P.","year":"2013","unstructured":"A.P.Lopez-Monroy, M.Montes-y Gomez, H.J.Escalante, L.Villasenor-Pineda and E.Villatoro-Tello, Inaoe\u2019s participation at PAN 13: Author profiling task. In CLEF 2013 Evaluation Labs and Workshop, 2013.","journal-title":"In CLEF 2013 Evaluation Labs and Workshop"},{"issue":"3","key":"e_1_3_3_3_2","first-page":"757","article-title":"Enhancing deep learning gender identification with gated recurrent units architecture in social","volume":"22","author":"Bsir B.","year":"2018","unstructured":"B.Bsir and M.Zrigui, Enhancing deep learning gender identification with gated recurrent units architecture in social, Computac\u00f3on y Sistemas22(3) (2018), 757\u2013766.","journal-title":"Computac\u00f3on y Sistemas"},{"key":"e_1_3_3_4_2","article-title":"Predicting gender from blog posts","author":"Zhang C.","year":"2010","unstructured":"C.Zhang and P.Zhang, Predicting gender from blog posts. Technical report, University of Massachusetts Amherst, USA, 2010.","journal-title":"Technical report"},{"key":"e_1_3_3_5_2","first-page":"37","article-title":"Predicting age and gender in online social networks","author":"Peersman C.","year":"2011","unstructured":"C.Peersman, W.Daelemans and L.V.Vaeenbergh, Predicting age and gender in online social networks. In Proceedings of the 3rd international Workshop on Search and Mining User-Generated Contents, 2011, pp. 37\u201344. ACM.","journal-title":"In Proceedings of the 3rd international Workshop on Search and Mining User-Generated Contents"},{"key":"e_1_3_3_6_2","article-title":"How old do you think i am?","author":"Nguyen D.-P.","year":"2013","unstructured":"D.-P.Nguyen, R.Gravel, R.B.Trieschnigg and T.Meder, How old do you think i am?a study of language and age in Twitter. 2013.","journal-title":"a study of language and age in Twitter"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21001"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"e_1_3_3_9_2","article-title":"Overview of the 3rd Author Profiling Task at PAN 2015","author":"Rangel F.","year":"2015","unstructured":"F.Rangel, F.Celli, P.Rosso, M.Potthast, B.Stein and W.Daelemans, Overview of the 3rd Author Profiling Task at PAN 2015. In Cappellato,L.Ferro,N.Jones,G. and Juan,E.S.editors, CLEF 2015 Evaluation Labs and Workshop \u2014 Working Notes Papers, Toulouse, France, CEUR-WS.org, 2015.","journal-title":"CLEF 2015 Evaluation Labs and Workshop \u2014 Working Notes Papers"},{"key":"e_1_3_3_10_2","article-title":"Overview of the 2nd Author Profiling Task at PAN 2014","author":"Rangel F.","year":"2014","unstructured":"F.Rangel, P.Rosso, I.Chugur, M.Potthast, M.Trenkmann, B.Stein, B.Verhoeven and W.Daelemans, Overview of the 2nd Author Profiling Task at PAN 2014. In Cappellato,L.Ferro,N.HalveyM.Kraaij,W.editors, CLEF 2014 Evaluation Labs and Workshop - Working Notes Papers, Sheffield, UK. CEUR-WS.org, 2014.","journal-title":"CLEF 2014 Evaluation Labs and Workshop - Working Notes Papers"},{"key":"e_1_3_3_11_2","first-page":"898","article-title":"Overview of the 2nd author profiling task at PAN 2014","author":"Rangel F.","year":"2014","unstructured":"F.Rangel, P.Rosso, I.Chugur, M.Potthast, M.Trenkmann, B.Stein, B.Verhoeven and W.Daelemans, Overview of the 2nd author profiling task at PAN 2014. In CLEF evaluation labs and workshop, 2014, pp. 898\u2013927.","journal-title":"CLEF evaluation labs and workshop"},{"key":"e_1_3_3_12_2","first-page":"23","article-title":"Overview of the author profiling task at PAN 2013","author":"Rangel F.","year":"2013","unstructured":"F.Rangel, P.Rosso, M.Koppel, E.Stamatatos and G.Inches, Overview of the author profiling task at PAN 2013, Notebook Papers of CLEF, 2013, pp. 23\u201326.","journal-title":"Notebook Papers of CLEF"},{"key":"e_1_3_3_13_2","article-title":"Overview of the 3rd author profiling task at PAN 2015","author":"Rangel F.","year":"2015","unstructured":"F.Rangel, P.Rosso, M.Potthast, B.Stein and W.Daelemans, Overview of the 3rd author profiling task at PAN 2015. In CLEF, 2015.","journal-title":"In CLEF"},{"issue":"3","key":"e_1_3_3_14_2","first-page":"757","article-title":"Houssem abdellaoui and mounir zrigui using tweets and emojis to build TEAD: An Arabic dataset for sentiment analysis","volume":"22","author":"Abdellaoui H.","year":"2018","unstructured":"H.Abdellaoui and M.Zrigui, Houssem abdellaoui and mounir zrigui using tweets and emojis to build TEAD: An Arabic dataset for sentiment analysis, Computation y Sistemas22(3) (2018), 757\u2013766.","journal-title":"Computation y Sistemas"},{"key":"e_1_3_3_15_2","article-title":"Data Mining: Practical machine learning tools and techniques","author":"Witten I.H.","year":"2016","unstructured":"I.H.Witten, E.Frank, M.A.Hall and C.J.Pal, Data Mining: Practical machine learning tools and techniques, Morgan Kaufmann, 2016.","journal-title":"Morgan Kaufmann"},{"key":"e_1_3_3_16_2","first-page":"947","article-title":"Adapting cross-genre author profiling to language and corpus","author":"Markov I.","year":"2016","unstructured":"I.Markov, H.G\u00f2mez-Adorno, G.Sidorov and A.F.Gel-bukh, Adapting cross-genre author profiling to language and corpus. In CLEF (Working Notes), 2016, pp. 947\u2013955.","journal-title":"In CLEF (Working Notes)"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.psych.54.101601.145041"},{"key":"e_1_3_3_18_2","unstructured":"J.Holmes and M.Meyerhoff. The handbook of language and gender volume 25. John Wiley & Sons 2008."},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/1121949.1121951"},{"key":"e_1_3_3_20_2","first-page":"1301","article-title":"Discriminating gender on twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Burger J.D.","year":"2011","unstructured":"J.D.Burger, J.Henderson, G.Kim and G.Zarrella, Discriminating gender on twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2011, pp. 1301\u20131309.","journal-title":"Association for Computational Linguistics"},{"key":"e_1_3_3_21_2","first-page":"199","article-title":"Effects of age and gender on blogging","volume":"6","author":"Schler J.","year":"2006","unstructured":"J.Schler, M.Koppel, S.Argamon and J.W.Pennebaker, Effects of age and gender on blogging. In AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs, volume 6, 2006, pp. 199\u2013205.","journal-title":"In AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs"},{"key":"e_1_3_3_22_2","first-page":"173","article-title":"In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1","author":"Toutanova K.","year":"2003","unstructured":"K.Toutanova, D.Klein, C.D.Manning and Y.Singer, Feature-rich part-of-speech tagging with a cyclic dependency network. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, Association for Computational Linguistics, 2003, pp. 173\u2013180.","journal-title":"Association for Computational Linguistics"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/17.4.401"},{"issue":"3","key":"e_1_3_3_24_2","first-page":"321","article-title":"Gender, genre, and writing style in formal written texts. Text & Talk An Interdisciplinary Journal of Language","volume":"23","author":"Argamon S.","year":"2003","unstructured":"S.Argamon, M.Koppel, J.Fine and A.R.Shimoni, Gender, genre, and writing style in formal written texts. Text & Talk An Interdisciplinary Journal of Language, Discourse & Communication Studies23(3) (2003), 321\u2013346.","journal-title":"Discourse & Communication Studies"},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/1461928.1461959"},{"key":"e_1_3_3_26_2","article-title":"Stylometric analysis of blogger\u2019s age and gender","author":"Goswami S.","year":"2009","unstructured":"S.Goswami, S.Sarkar and M.Rustagi, Stylometric analysis of blogger\u2019s age and gender, In Third International AAAI Conference on Weblogs and Social Media, 2009.","journal-title":"In Third International AAAI Conference on Weblogs and Social Media"},{"issue":"4","key":"e_1_3_3_27_2","first-page":"210","article-title":"Profiling the author of a written text in Russian","volume":"5","author":"Litvinova T.","year":"2014","unstructured":"T.Litvinova, Profiling the author of a written text in Russian. Journal of Language and Literature5(4) (2014), 210\u2013216.","journal-title":"Journal of Language and Literature"},{"key":"e_1_3_3_28_2","article-title":"Predicting author age from weibo microblog posts","author":"Zhang W.","year":"2016","unstructured":"W.Zhang, A.Caines, D.Alikaniotis and P.Buttery, Predicting author age from weibo microblog posts. In LREC2016.","journal-title":"LREC"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179031","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179031","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179031","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T19:29:15Z","timestamp":1770233355000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179031"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3,23]]},"references-count":27,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,5,14]]}},"alternative-id":["10.3233\/JIFS-179031"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179031","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,3,23]]}}}