{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,5]],"date-time":"2026-04-05T09:46:01Z","timestamp":1775382361766,"version":"3.50.1"},"reference-count":94,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2018,5,18]],"date-time":"2018-05-18T00:00:00Z","timestamp":1526601600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>We are what we do, like, and say. Numerous research efforts have been pushed towards the automatic assessment of personality dimensions relying on a set of information gathered from social media platforms such as list of friends, interests of musics and movies, endorsements and likes an individual has ever performed. Turning this information into signals and giving them as inputs to supervised learning approaches has resulted in being particularly effective and accurate in computing personality traits and types. Despite the demonstrated accuracy of these approaches, the sheer amount of information needed to put in place such a methodology and access restrictions make them unfeasible to be used in a real usage scenario. In this paper, we propose a supervised learning approach to compute personality traits by only relying on what an individual tweets about publicly. The approach segments tweets in tokens, then it learns word vector representations as embeddings that are then used to feed a supervised learner classifier. We demonstrate the effectiveness of the approach by measuring the mean squared error of the learned model using an international benchmark of Facebook status updates. We also test the transfer learning predictive power of this model with an in-house built benchmark created by twenty four panelists who performed a state-of-the-art psychological survey and we observe a good conversion of the model while analyzing their Twitter posts towards the personality traits extracted from the survey.<\/jats:p>","DOI":"10.3390\/info9050127","type":"journal-article","created":{"date-parts":[[2018,5,21]],"date-time":"2018-05-21T04:07:30Z","timestamp":1526875650000},"page":"127","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":48,"title":["TwitPersonality: Computing Personality Traits from Tweets Using Word Embeddings and Supervised Learning"],"prefix":"10.3390","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6731-8609","authenticated-orcid":false,"given":"Giulio","family":"Carducci","sequence":"first","affiliation":[{"name":"Istituto Superiore Mario Boella (ISMB); Via Pier Carlo Boggio, 61, 10138 Turin, Italy"},{"name":"Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Turin, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0083-813X","authenticated-orcid":false,"given":"Giuseppe","family":"Rizzo","sequence":"additional","affiliation":[{"name":"Istituto Superiore Mario Boella (ISMB); Via Pier Carlo Boggio, 61, 10138 Turin, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3821-5379","authenticated-orcid":false,"given":"Diego","family":"Monti","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Turin, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3898-7480","authenticated-orcid":false,"given":"Enrico","family":"Palumbo","sequence":"additional","affiliation":[{"name":"Istituto Superiore Mario Boella (ISMB); Via Pier Carlo Boggio, 61, 10138 Turin, Italy"},{"name":"Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Turin, Italy"},{"name":"EURECOM, Sophia Antipolis, Campus SophiaTech, 450 Route des Chappes, 06410 Biot, France"}]},{"given":"Maurizio","family":"Morisio","sequence":"additional","affiliation":[{"name":"Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Turin, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2018,5,18]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5802","DOI":"10.1073\/pnas.1218772110","article-title":"Private traits and attributes are predictable from digital records of human behavior","volume":"110","author":"Kosinski","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Gottschalk, L.A., and Gleser, G.C. (1969). The Measurement of Psychological States through the Content Analysis of Verbal Behavior, University of California Press.","DOI":"10.1525\/9780520376762"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1097\/00006842-195811000-00002","article-title":"Experimental investigation of the specificity of attitude hypothesis in psychosomatic disease","volume":"20","author":"Graham","year":"1958","journal-title":"Psychosom. Med."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1306","DOI":"10.1037\/0022-006X.64.6.1306","article-title":"Emotion-abstraction patterns in verbatim protocols: A new way of describing psychotherapeutic processes","volume":"64","author":"Mergenthaler","year":"1996","journal-title":"J. Consult. Clin. Psychol."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1296","DOI":"10.1037\/0022-3514.77.6.1296","article-title":"Linguistic Styles: Language Use as an Individual Difference","volume":"77","author":"Pennebaker","year":"1999","journal-title":"Personal. Soc. Psychol."},{"key":"ref_6","unstructured":"Argamon, S., Dhawle, S., Koppel, M., and Pennebaker, J. (2005, January 24\u201328). Lexical predictors of personality type. Proceedings of the 2005 Joint Annual Meeting of the Interface and the Classification Society of North America, Cincinnati, OH, USA."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.1744-6570.1991.tb00688.x","article-title":"The Big Five personality dimensions and job performance: A meta-analysis","volume":"44","author":"Barrick","year":"1991","journal-title":"Pers. Psychol."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1055","DOI":"10.1016\/j.cpr.2002.09.001","article-title":"The five-factor model and personality disorder empirical literature: A meta-analytic review","volume":"23","author":"Saulsman","year":"2004","journal-title":"Clin. Psychol. Rev."},{"key":"ref_9","unstructured":"Huang, Y., Wei, L., and Chen, Y. (arXiv, 2017). Detection of the Prodromal Phase of Bipolar Disorder from Psychological and Phonological Aspects in Social Media, arXiv."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1177\/0146167292185003","article-title":"Attachment styles and the \u201cBig Five\u201d personality traits: Their connections with each other and with romantic relationship outcomes","volume":"18","author":"Shaver","year":"1992","journal-title":"Personal. Soc. Psychol. Bull."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1037\/0022-3514.84.6.1236","article-title":"The do re mi\u2019s of everyday life: The structure and personality correlates of music preferences","volume":"84","author":"Rentfrow","year":"2003","journal-title":"J. Personal. Soc. Psychol."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1177\/030573569302100105","article-title":"Research note: Personality and music preference: Extraversion and excitement seeking or openness to experience?","volume":"21","author":"Dollinger","year":"1993","journal-title":"Psychol. Music"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1080\/08838159109364129","article-title":"Constructing personality and social reality through music: Individual differences among fans of punk and heavy metal music","volume":"35","author":"Hansen","year":"1991","journal-title":"J. Broadcast. Electron. Media"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1177\/0305735697252003","article-title":"Music preference and the five-factor model of the NEO Personality Inventory","volume":"25","author":"Rawlings","year":"1997","journal-title":"Psychol. Music"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1017\/S1742058X09090109","article-title":"Personality and ideology as determinants of candidate preferences and Obama conversion in the 2008 US presidential election","volume":"6","author":"Jost","year":"2009","journal-title":"Du Bois Rev."},{"key":"ref_16","unstructured":"Cantador, I., Fernandez-Tobias, I., Bellog\u00edn, A., Kosinski, M., and Stillwell, D. (2013, January 10\u201314). Relating Personality Types with User Preferences Multiple Entertainment Domains. Proceedings of the 21st Conference on User Modeling, Adaptation, and Personalization (UMAP 2013), Rome, Italy."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Celli, F., Lepri, B., Biel, J., Gatica-Perez, D., and Riccardi, G. (2014, January 3). The workshop on computational personality recognition 2014. Proceedings of the 22nd ACM International Conference on Multimedia (MM \u201914), Orlando, FL, USA.","DOI":"10.1145\/2647868.2647870"},{"key":"ref_18","unstructured":"Tkal\u010di\u010d, M., de Carolis, B., de Gemmis, M., Odi\u0107, A., and Ko\u0161ir, A. (, January 7\u201311). Preface: EMPIRE 2014-2nd Workshop Emotions and Personality in Personalized Services. Proceedings of the 22st Conference on User Modeling, Adaptation, and Personalization (UMAP 2014), Aalborg, Denmark."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1016\/j.chb.2011.11.001","article-title":"A tale of two sites: Twitter vs. Facebook and the personality predictors of social media usage","volume":"28","author":"Hughes","year":"2011","journal-title":"Comput. Hum. Behav."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Bachrach, Y., Kosinski, M., Graepel, T., Kohli, P., and Stillwell, D. (2012, January 22\u201324). Personality and patterns of Facebook usage. Proceedings of the 4th Annual ACM Web Science Conference 2012 (WebSci\u201912), Evanston, IL, USA.","DOI":"10.1145\/2380718.2380722"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1089\/cyber.2010.0087","article-title":"Manifestations of Personality in Online Social Networks: Self-Reported Facebook-Related Behaviors and Observable Profile Information","volume":"14","author":"Gosling","year":"2011","journal-title":"Cyberpsychol. Behav. Soc. Netw."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Quercia, D., Kosinski, M., Stillwell, D., and Crowcroft, J. (2011, January 9\u201311). Our twitter profiles, our selves: Predicting personality with twitter. Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), Boston, MA, USA.","DOI":"10.1109\/PASSAT\/SocialCom.2011.26"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Jusupova, A., Batista, F., and Ribeiro, R. (2016, January 22\u201324). Characterizing the Personality of Twitter Users based on their Timeline Information. Proceedings of the Atas da 16 Confer\u00eancia da Associacao Portuguesa de Sistemas de Informa\u00e7\u00e3o, Porto, Portugal.","DOI":"10.18803\/capsi.v16.292-299"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Liu, F., Perez, J., and Nowson, S. (arXiv, 2016). A Language-independent and Compositional Model for Personality Trait Recognition from Short Texts, arXiv.","DOI":"10.18653\/v1\/E17-1071"},{"key":"ref_25","first-page":"419","article-title":"Personality perception based on LinkedIn profiles","volume":"32","author":"Bogaert","year":"2017","journal-title":"J. Manag. Psychol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1036","DOI":"10.1073\/pnas.1418680112","article-title":"Computer-based personality judgments are more accurate than those made by humans","volume":"112","author":"YouYou","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_27","unstructured":"Nowson, S., and Oberlander, J. (2006, January 16\u201320). The Identity of Bloggers: Openness and gender in personal weblogs. Proceedings of the AAAI Spring Symposium, Computational Approaches to Analysing Weblogs, Boston, MA, USA."},{"key":"ref_28","first-page":"56","article-title":"A neural network approach to personality prediction based on the bigfive model","volume":"2","author":"Kalghatgi","year":"2015","journal-title":"Int. J. Innov. Res. Adv. Eng."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1109\/TASLP.2016.2531286","article-title":"Exploiting turn-taking temporal evolution for personality trait perception in dyadic conversations","volume":"24","author":"Su","year":"2016","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1109\/MIS.2017.23","article-title":"Deep Learning-Based Document Modeling for Personality Detection from Text","volume":"32","author":"Majumder","year":"2017","journal-title":"IEEE Intell. Syst."},{"key":"ref_31","unstructured":"Mikolov, T., Chen, K., Corrado, G.S., and Dean, J. (arXiv, 2013). Efficient Estimation of Word Representations in Vector Space, arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1613\/jair.2349","article-title":"Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text","volume":"30","author":"Mairesse","year":"2007","journal-title":"J. Artif. Intell. Res."},{"key":"ref_33","unstructured":"Turian, J., Ratinov, L., and Bengio, Y. (2010, January 11\u201316). Word representations: A simple and general method for semi-supervised learning. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL \u201910), Uppsala, Swede."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Pasca, M., Lin, D., Bigham, J., Lifchits, A., and Jain, A. (2006, January 17\u201318). Names and similarities on the web: Fact extraction in the fast lane. Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, (ACL-44), Sydney, Australia.","DOI":"10.3115\/1220175.1220277"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Manning, C., Raghavan, P., and Schtze, H. (2008). Introduction to Information Retrieval, Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Shutze, H. (1995, January 27\u201331). Distributional part-of-speech tagging. Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics (EACL \u201995), Dublin, Ireland.","DOI":"10.3115\/976973.976994"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Ratinov, L., and Roth, D. (2009, January 4\u20135). Design challenges and misconceptions in named entity recognition. Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL \u201909), Boulder, CO, USA.","DOI":"10.3115\/1596374.1596399"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Kuang, S., and Davison, B. (2017). Learning Word Embeddings with Chi-Square Weights for Healthcare Tweet Classification. Appl. Sci., 7.","DOI":"10.3390\/app7080846"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C. (2014, January 25\u201329). Glove: Global Vectors for Word Representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_40","unstructured":"Lebret, R., Legrand, J., and Collobert, R. (2013, January 5\u201310). Is deep learning really necessary for word embeddings?. Proceedings of the NIPS 2013 Deep Learning Workshop, Lake Tahoe, CA, USA."},{"key":"ref_41","unstructured":"Dhillon, P.S., Foster, D., and Ungar, L. (2011). Multi-view learning of word embeddings via cca. Advances in Neural Information Processing Systems 24 (NIPS 2011), MIT Press Ltd."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., and Qin, B. (2014, January 22-27). Learning sentiment-specific word embedding for twitter sentiment classification. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, MD, USA.","DOI":"10.3115\/v1\/P14-1146"},{"key":"ref_43","first-page":"2493","article-title":"Kavukcuoglu, K.; Kuksa, P. Natural language processing (almost) from scratch","volume":"12","author":"Collobert","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Golbeck, J., Robles, C., and Turner, K. (2011, January 7\u201312). Predicting personality with social media. Proceedings of the CHI \u201911 Extended Abstracts on Human Factors in Computing Systems (CHI EA \u201911), Vancouver, BC, Canada.","DOI":"10.1145\/1979742.1979614"},{"key":"ref_45","unstructured":"Jiang, L., Yu, M., Zhou, M., Liu, X., and Zhao, T. (2011, January 19\u201324). Target-dependent twitter sentiment classification. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT \u201911), Portland, Oregon."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Hu, X., Tang, J., Gao, H., and Liu, H. (2013, January 13\u201317). Unsupervised sentiment analysis with emotional signals. Proceedings of the 22nd International Conference on World Wide Web (WWW \u201913), Rio de Janeiro, Brazil.","DOI":"10.1145\/2488388.2488442"},{"key":"ref_47","unstructured":"Mohammad, S.M., Kiritchenko, S., and Zhu, X. (2013, January 13\u201315). Nrc-Canada: Building the state-of-the-art in sentiment analysis of tweets. Proceedings of the Seventh International Workshop on Semantic Evaluation Exercises (SemEval-2013), Atlanta, GA, USA."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Kanavos, A., Nodarakis, N., Sioutas, S., Tsakalidis, A., Tsolis, D., and Tzimas, G. (2017). Large Scale Implementations for Twitter Sentiment Classification. Algorithms, 10.","DOI":"10.3390\/a10010033"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Dai, H., Touray, M., Jonnagaddala, J., and Shabbir, S.A. (2016). Feature Engineering for Recognizing Adverse Drug Reactions from Twitter Posts. Information, 7.","DOI":"10.3390\/info7020027"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Chamberlain, B.P., Humby, C., and Deisenroth, M.P. (2017, January 18\u201322). Probabilistic inference of twitter users\u2019 age based on what they follow. Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Skopje, Macedonia.","DOI":"10.1007\/978-3-319-71273-4_16"},{"key":"ref_51","unstructured":"Zhang, J., Hu, X., Zhang, Y., and Liu, H. (2016, January 17\u201320). Your Age Is No Secret: Inferring Microbloggers\u2019 Ages via Content and Interaction Analysis. Proceedings of the Tenth International Conference on Web and Social Media, Cologne, Germany."},{"key":"ref_52","unstructured":"Burger, J.D., Henderson, J., Kim, G., and Zarrella, G. (2011, January 27\u201331). Discriminating gender on Twitter. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP \u201911), Edinburgh, UK."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Conover, M.D., Gonalves, B., Ratkiewicz, J., Flammini, A., and Menczer, F. (2011, January 9\u201311). Predicting the political alignment of Twitter users. Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), Boston, MA, USA.","DOI":"10.1109\/PASSAT\/SocialCom.2011.34"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Cheng, Z., Caverlee, J., and Lee, K. (2010, January 26\u201330). You are where you tweet: A content-based approach to geo-locating Twitter users. Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM \u201910), Toronto, ON, Canada.","DOI":"10.1145\/1871437.1871535"},{"key":"ref_55","unstructured":"Pennacchiotti, M., and Popescu, A.M. (2011, January 17\u201321). A machine learning approach to twitter user classification. Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM-11), Catalonia, Spain."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1111\/j.1467-6494.1992.tb00970.x","article-title":"An Introduction to the Five-Factor Model and Its Applications","volume":"60","author":"McCrae","year":"1992","journal-title":"J. Personal."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1111\/j.1467-6494.1992.tb00973.x","article-title":"Recurrent Personality Factors Based on Trait Ratings","volume":"60","author":"Tupes","year":"1992","journal-title":"J. Personal."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1146\/annurev.ps.41.020190.002221","article-title":"Personality Structure: Emergence of the FiveFactor Model","volume":"41","author":"Digman","year":"1990","journal-title":"Ann. Rev. Psychol."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1037\/0003-066X.48.1.26","article-title":"The Structure of Phenotypic Personality Traits","volume":"48","author":"Goldberg","year":"1993","journal-title":"Am. Psychol."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"675","DOI":"10.1037\/0022-3514.55.4.675","article-title":"Number of factors in the personality sphere: Does increase in factors increase predictability of real-life criteria?","volume":"55","author":"Mershon","year":"1988","journal-title":"J. Personal. Soc. Psychol."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1037\/0022-3514.81.3.524","article-title":"Big Five factors and facets and the prediction of behavior","volume":"81","author":"Paunonen","year":"2001","journal-title":"J. Personal. Soc. Psychol."},{"key":"ref_62","first-page":"430","article-title":"Evaluating comprehensiveness in personality systems: The California Q-Set and the five-factor model","volume":"54","author":"McCrae","year":"1986","journal-title":"J. Psychol."},{"key":"ref_63","unstructured":"Costa, P.T., and McCrae, R.R. (1992). Revised NEO Personality Inventory (NEO Pl-R) and NEO Five-Factor Inventory (NEO-FFI) Professional Manual, Psychological Assessment Resources."},{"key":"ref_64","unstructured":"(2018, May 18). International Personality Item Pool. Available online: http:\/\/ipip.ori.org."},{"key":"ref_65","first-page":"7","article-title":"A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models","volume":"7","author":"Goldberg","year":"1999","journal-title":"Personal. Psychol. Eur."},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.jrp.2014.05.003","article-title":"Measuring thirty facets of the Five Factor Model with a 120-item public domain inventory: Development of the IPIP-NEO-120","volume":"51","author":"Johnson","year":"2014","journal-title":"J. Res. Personal."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1146\/annurev.psych.54.101601.145041","article-title":"Psychological Aspects of Natural Language Use: Our words, Our Selves","volume":"54","author":"Pennebaker","year":"2003","journal-title":"Ann. Rev. Psychol."},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1207\/s15326950dp4203_1","article-title":"Language with character: A stratified corpus comparison of individual differences in e-mail communication","volume":"42","author":"Oberlander","year":"2006","journal-title":"Discour. Process."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Chang, C., Saravia, E., and Chen, Y. (2016, January 18\u201321). Subconscious Crowdsourcing: A feasible data collection mechanism for mental disorder detection on social media. Proceedings of the 2016 IEEE\/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), San Francisco, CA, USA.","DOI":"10.1109\/ASONAM.2016.7752261"},{"key":"ref_70","unstructured":"Chin, D.N., and Wright, W.R. (2014, January 7\u201311). Social Media Sources for Personality Profiling. Proceedings of the 22nd Conference on User Modeling, Adaptation, and Personalization, Aalborg, Denmark."},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1037\/0022-3514.81.1.116","article-title":"Who attains social status? Effects of personality and physical attractiveness in social groups","volume":"81","author":"Anderson","year":"2001","journal-title":"J. Personal. Soc. Psychol."},{"key":"ref_72","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1006\/jrpe.1999.2271","article-title":"Affect and personality as predictors of conflict and closeness in young adults\u2019 friendships","volume":"34","author":"Berry","year":"2000","journal-title":"J. Res. Personal."},{"key":"ref_73","unstructured":"Rosen, P., and Kluemper, D. (2008, January 14\u201317). The impact of the Big Five personality traits on the acceptance of social networking website. Proceedings of the Americas Conference on Information Systems (AMCIS 2008), Toronto, ON, Canada."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Schrammel, J., K\u00f6ffel, C., and Tscheligi, M. (2009, January 1\u20135). Personality traits, usage patterns and information disclosure in online communities. Proceedings of the 23rd British HCI Group Annual Conference on People and Computers: Celebrating People and Technology (BCS-HCI), Cambridge, UK.","DOI":"10.14236\/ewic\/HCI2009.19"},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1146\/annurev.soc.27.1.415","article-title":"Birds of a Feather: Homophily in Social Networks","volume":"27","author":"McPherson","year":"2001","journal-title":"Ann. Rev. Sociol."},{"key":"ref_76","doi-asserted-by":"crossref","first-page":"1303","DOI":"10.1177\/0146167208320061","article-title":"Narcissism and social networking Web sites","volume":"34","author":"Buffardi","year":"2008","journal-title":"Personal. Soc. Psychol. Bull."},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1089\/cyber.2009.0257","article-title":"Self-presentation 2.0: Narcissism and self-esteem on Facebook","volume":"13","author":"Mehdizadeh","year":"2010","journal-title":"Cyberpsychol. Behav. Soc. Netw."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1089\/cpb.2008.0214","article-title":"The influence of shyness on the use of Facebook in an undergraduate sample","volume":"12","author":"Orr","year":"2009","journal-title":"Cyberpsychol. Behav."},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1016\/j.chb.2008.12.024","article-title":"Personality and motivations associated with Facebook use","volume":"25","author":"Ross","year":"2009","journal-title":"Comput. Hum. Behav."},{"key":"ref_80","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1027\/1864-1105.20.2.67","article-title":"The relationship between unwillingness-to-communicate and students\u2019 Facebook use","volume":"20","author":"Sheldon","year":"2008","journal-title":"J. Media Psychol."},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1016\/j.jrp.2005.08.007","article-title":"The international personality item pool and the future of public-domain personality measures","volume":"40","author":"Goldberg","year":"2006","journal-title":"J. Res. Personal."},{"key":"ref_82","unstructured":"Celli, F., Pianesi, F., Stillwell, D., and Kosinski, M. (2013). Workshop on Computational Personality Recognition: Shared Task, AAAI."},{"key":"ref_83","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1140\/epjb\/e2004-00111-4","article-title":"Betweenness centrality in large complex networks","volume":"38","author":"Barthelemy","year":"2004","journal-title":"Eur. Phys. J. B"},{"key":"ref_84","unstructured":"(2018, May 18). List of Stopwords Used by Scikit-Learn. Available online: https:\/\/github.com\/scikit-learn\/scikit-learn\/blob\/master\/sklearn\/feature_extraction\/stop_words.py."},{"key":"ref_85","unstructured":"John, O.P., Robins, R.W., and Pervin, L.A. (2008). Paradigm shift to the integrative Big Five trait taxonomy. Handbook of Personality: Theory and Research, Guilford Press. History, Measurement, and Conceptual Issues."},{"key":"ref_86","doi-asserted-by":"crossref","unstructured":"Li, Q., Shah, S., Fang, R., Liu, X., and Nourbakhsh, A. (arXiv, 2017). Data Sets: Word Embeddings Learned from Tweets and General Data, arXiv.","DOI":"10.1609\/icwsm.v11i1.14859"},{"key":"ref_87","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1037\/0033-295X.104.2.211","article-title":"A solution to Plato\u2019s problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge","volume":"104","author":"Landauer","year":"1997","journal-title":"Psychol. Rev."},{"key":"ref_88","unstructured":"Drucker, H., Burges, C.J.C., Kaufman, L., Smola, A.J., and Vapnik, V.N. (1996). Support vector regression machines. Advances in Neural Information Processing Systems 9, NIPS, MIT Press."},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Boser, B.E., Guyon, I.M., and Vapnik, V.N. (1992, January 27\u201329). A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory (COLT \u201992), Pittsburgh, PA, USA.","DOI":"10.1145\/130385.130401"},{"key":"ref_90","first-page":"252","article-title":"Linear regression and correlation","volume":"15","author":"Kenney","year":"1962","journal-title":"Math. Stat."},{"key":"ref_91","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1111\/j.1467-9868.2011.00771.x","article-title":"Regression shrinkage selection via the LASSO","volume":"73","author":"Tibshirani","year":"2011","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"ref_92","unstructured":"(2018, May 18). fastText English Word Vectors. Available online: https:\/\/fasttext.cc\/docs\/en\/english-vectors.html."},{"key":"ref_93","doi-asserted-by":"crossref","first-page":"372","DOI":"10.1177\/0956797609360756","article-title":"Facebook Profiles Reflect Actual Personality, Not Self-Idealization","volume":"21","author":"Back","year":"2010","journal-title":"Psychol. Sci."},{"key":"ref_94","unstructured":"(2018, May 18). TwitPersonality. Available online: https:\/\/github.com\/D2KLab\/twitpersonality."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/9\/5\/127\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:04:57Z","timestamp":1760195097000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/9\/5\/127"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,5,18]]},"references-count":94,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2018,5]]}},"alternative-id":["info9050127"],"URL":"https:\/\/doi.org\/10.3390\/info9050127","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,5,18]]}}}