{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,29]],"date-time":"2025-12-29T18:47:23Z","timestamp":1767034043842,"version":"3.41.2"},"reference-count":80,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2020,9,11]],"date-time":"2020-09-11T00:00:00Z","timestamp":1599782400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["ITP"],"published-print":{"date-parts":[[2022,3,28]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>This study aims to predict popular contributors through text representations of user-generated content in open crowds.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>Three text representation approaches \u2013 count vector, Tf-Idf vector, word embedding and supervised machine learning techniques \u2013 are used to generate popular contributor predictions.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The results of the experiments demonstrate that popular contributor predictions are considered successful. The <jats:italic>F<\/jats:italic>1 scores are all higher than the baseline model. Popular contributors in open crowds can be predicted through user-generated content.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Research limitations\/implications<\/jats:title><jats:p>This research presents brand new empirical evidence drawn from text representations of user-generated content that reveals why some contributors' ideas are more viral than others in open crowds.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title><jats:p>This research suggests that companies can learn from popular contributors in ways that help them improve customer agility and better satisfy customers' needs. In addition to boosting customer engagement and triggering discussion, popular contributors' ideas provide insights into the latest trends and customer preferences. The results of this study will benefit marketing strategy, new product development, customer agility and management of information systems.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The paper provides new empirical evidence for popular contributor prediction in an innovation crowd through text representation approaches.<\/jats:p><\/jats:sec>","DOI":"10.1108\/itp-04-2019-0171","type":"journal-article","created":{"date-parts":[[2020,9,10]],"date-time":"2020-09-10T05:23:56Z","timestamp":1599715436000},"page":"494-509","source":"Crossref","is-referenced-by-count":7,"title":["Predicting popular contributors in innovation crowds: the case of My Starbucks Ideas"],"prefix":"10.1108","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1979-8369","authenticated-orcid":false,"given":"Chien-Yi","family":"Hsiang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Julia Taylor","family":"Rayz","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","published-online":{"date-parts":[[2020,9,11]]},"reference":[{"issue":"3","key":"key2022032513083545300_ref001","doi-asserted-by":"crossref","first-page":"355","DOI":"10.5465\/amr.2010.0146","article-title":"Crowdsourcing as a solution to distant search","volume":"37","year":"2012","journal-title":"Academy of Management Review"},{"key":"key2022032513083545300_ref002","doi-asserted-by":"crossref","unstructured":"Afuah, A. (2018), \u201cCrowdsourcing: a primer and research framework\u201d, in Tucci, C.L., Afuah, A. and Viscusi, G. (Eds), Creating and Capturing Value through Crowdsourcing, pp. 39-57.","DOI":"10.1093\/oso\/9780198816225.003.0002"},{"issue":"2","key":"key2022032513083545300_ref200","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1057\/jit.2010.7","article-title":"Exploring the impact of socio-technical core-periphery structures in open source software development","volume":"25","year":"2010","journal-title":"Journal of Information Technology"},{"issue":"1","key":"key2022032513083545300_ref003","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1287\/mnsc.1120.1599","article-title":"Crowdsourcing new product ideas over time: an analysis of the Dell IdeaStorm community","volume":"59","year":"2013","journal-title":"Management Science"},{"issue":"4","key":"key2022032513083545300_ref004","first-page":"199","article-title":"Crowdsourcing: how to benefit from (too) many great ideas","volume":"12","year":"2013","journal-title":"MIS Quarterly Executive"},{"key":"key2022032513083545300_ref005","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1016\/j.jbusres.2016.08.006","article-title":"Resource management in big data initiatives: processes and dynamic capabilities","volume":"70","year":"2017","journal-title":"Journal of Business Research"},{"year":"2015","key":"key2022032513083545300_ref006","article-title":"A survey of predictive modelling under imbalanced distributions"},{"journal-title":"Machine Learning Mastery","article-title":"Imbalanced classification with Python: better metrics, balance skewed classes, cost-sensitive learning","year":"2020","key":"key2022032513083545300_ref007"},{"year":"2013","key":"key2022032513083545300_ref008","article-title":"API design for machine learning software: experiences from the scikit-learn project"},{"issue":"9-10","key":"key2022032513083545300_ref009","doi-asserted-by":"crossref","first-page":"1033","DOI":"10.1016\/j.jbusres.2008.08.009","article-title":"Understanding consumer-to-consumer interactions in virtual communities: the salience of reciprocity","volume":"63","year":"2010","journal-title":"Journal of Business Research"},{"issue":"1","key":"key2022032513083545300_ref010","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1016\/j.dss.2012.01.015","article-title":"What drives consumers to spread electronic word of mouth in online consumer-opinion platforms","volume":"53","year":"2012","journal-title":"Decision Support Systems"},{"key":"key2022032513083545300_ref011","first-page":"89","article-title":"Political polarization on twitter","volume":"133","year":"2011","journal-title":"Icwsm"},{"issue":"1","key":"key2022032513083545300_ref012","doi-asserted-by":"crossref","first-page":"32","DOI":"10.2307\/41166370","article-title":"Using social network analysis to improve communities of practice","volume":"49","year":"2006","journal-title":"California Management Review"},{"first-page":"118a","article-title":"Core and periphery in free\/libre and open source software team communications","year":"2006","key":"key2022032513083545300_ref201"},{"volume-title":"Getting Results from Crowds","year":"2012","key":"key2022032513083545300_ref013"},{"key":"key2022032513083545300_ref014","unstructured":"Duan, W., Cao, Q. and Gan, Q. (2010), \u201cInvestigating determinants of voting for the \u201chelpfulness\u201d of online consumer reviews: a text mining approach\u201d, paper presented at the, AMCIS."},{"issue":"3","key":"key2022032513083545300_ref202","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1016\/j.jesp.2004.05.009","article-title":"Cognitive and social comparison processes in brainstorming","volume":"41","year":"2005","journal-title":"Journal of Experimental Social Psychology"},{"issue":"2","key":"key2022032513083545300_ref015","doi-asserted-by":"publisher","first-page":"189","DOI":"10.1177\/0165551512437638","article-title":"Towards an integrated crowdsourcing definition","volume":"38","year":"2012","journal-title":"Journal of Information Science"},{"issue":"1","key":"key2022032513083545300_ref016","doi-asserted-by":"crossref","first-page":"63","DOI":"10.25300\/MISQ\/2018\/13211","article-title":"Top persuader prediction for social networks","volume":"42","year":"2018","journal-title":"MIS Quarterly"},{"issue":"6","key":"key2022032513083545300_ref017","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1287\/orsc.1100.0600","article-title":"Network exchange patterns in online communities","volume":"22","year":"2011","journal-title":"Organization Science"},{"key":"key2022032513083545300_ref018","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-3-319-98074-4_1","volume-title":"Learning from Imbalanced Data Sets","year":"2018"},{"issue":"8","key":"key2022032513083545300_ref203","doi-asserted-by":"crossref","first-page":"1765","DOI":"10.1002\/smj.2409","article-title":"The direct and indirect effects of core and peripheral social capital on organizational performance","volume":"37","year":"2016","journal-title":"Strategic Management Journal"},{"key":"key2022032513083545300_ref204","first-page":"1","article-title":"User roles and team structures in a crowdsourcing community for international development\u2013a social network perspective","year":"2017","journal-title":"Information Technology for Development"},{"issue":"1","key":"key2022032513083545300_ref205","doi-asserted-by":"crossref","first-page":"273","DOI":"10.2753\/MIS0742-1222310111","article-title":"User roles and contributions in innovation-contest communities","volume":"31","year":"2014","journal-title":"Journal of Management Information Systems"},{"issue":"1","key":"key2022032513083545300_ref206","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1016\/j.jbusres.2006.09.019","article-title":"Innovation creation by online basketball communities","volume":"60","year":"2007","journal-title":"Journal of Business Research"},{"issue":"4","key":"key2022032513083545300_ref019","first-page":"463","article-title":"A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches","volume":"42","year":"2011","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)"},{"year":"2011","key":"key2022032513083545300_ref020","article-title":"Crowdsourcing information systems: a systems theory perspective"},{"issue":"10","key":"key2022032513083545300_ref021","doi-asserted-by":"crossref","first-page":"1498","DOI":"10.1109\/TKDE.2010.188","article-title":"Estimating the helpfulness and economic impact of product reviews: mining text and reviewer characteristics","volume":"23","year":"2011","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"1","key":"key2022032513083545300_ref207","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.websem.2007.11.011","article-title":"Collective knowledge systems: where the social web meets the semantic web","volume":"6","year":"2008","journal-title":"Web Semantics: Science, Services and Agents on the World Wide Web"},{"issue":"2-3","key":"key2022032513083545300_ref022","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","article-title":"Distributional structure","volume":"10","year":"1954","journal-title":"Word"},{"volume-title":"Imbalanced Learning: Foundations, Algorithms, and Applications","year":"2013","key":"key2022032513083545300_ref023"},{"issue":"1","key":"key2022032513083545300_ref024","article-title":"Spacy 2: natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing","volume":"7","year":"2017","journal-title":"To appear"},{"issue":"5","key":"key2022032513083545300_ref025","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1111\/jpim.12396","article-title":"Identifying new product ideas: waiting for the wisdom of the crowd or screening ideas in real time","volume":"34","year":"2017","journal-title":"Journal of Product Innovation Management"},{"issue":"2","key":"key2022032513083545300_ref026","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1016\/j.aebj.2015.09.001","article-title":"Generating ideas on online platforms: a case study of \u2018My Starbucks Idea\u2019","volume":"10","year":"2015","journal-title":"Arab Economic and Business Journal"},{"issue":"9","key":"key2022032513083545300_ref027","doi-asserted-by":"crossref","first-page":"2138","DOI":"10.1287\/mnsc.2013.1879","article-title":"Crowdsourcing new product ideas under consumer learning","volume":"60","year":"2014","journal-title":"Management Science"},{"key":"key2022032513083545300_ref028","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/j.chb.2015.01.010","article-title":"A study of factors that contribute to online review helpfulness","volume":"48","year":"2015","journal-title":"Computers in Human Behavior"},{"issue":"2","key":"key2022032513083545300_ref029","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1287\/mksc.1100.0566","article-title":"Opinion leadership and social contagion in new product diffusion","volume":"30","year":"2011","journal-title":"Marketing Science"},{"key":"key2022032513083545300_ref030","first-page":"187","article-title":"Assessment metrics for imbalanced learning","year":"2013","journal-title":"Imbalanced Learning: Foundations, Algorithms, and Applications"},{"issue":"998","key":"key2022032513083545300_ref031","first-page":"3482","article-title":"Natural language processing","volume":"212","year":"2012","journal-title":"Instructor"},{"volume-title":"Speech and Language Processing","year":"2014","key":"key2022032513083545300_ref032"},{"journal-title":"SSRN Electronic Journal","article-title":"Crowdfunding creative ideas: the dynamics of project backers in Kickstarter","year":"2015","key":"key2022032513083545300_ref208"},{"issue":"6","key":"key2022032513083545300_ref033","doi-asserted-by":"crossref","first-page":"987","DOI":"10.1037\/0021-9010.85.6.987","article-title":"A field experiment testing frontline opinion leaders as change agents","volume":"85","year":"2000","journal-title":"Journal of Applied Psychology"},{"year":"2014","key":"key2022032513083545300_ref034","article-title":"Distributed representations of sentences and documents"},{"volume-title":"Collective Intelligence","year":"1997","key":"key2022032513083545300_ref209"},{"issue":"1","key":"key2022032513083545300_ref035","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/j.dss.2010.12.007","article-title":"Who is talking? An ontology-based opinion leader identification framework for word-of-mouth marketing in online social blogs","volume":"51","year":"2011","journal-title":"Decision Support Systems"},{"issue":"4","key":"key2022032513083545300_ref036","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1525\/cmr.2014.56.4.103","article-title":"Managing crowds in innovation challenges","volume":"56","year":"2014","journal-title":"California Management Review"},{"issue":"3","key":"key2022032513083545300_ref210","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1287\/mksc.2014.0890","article-title":"Social dollars: the economic impact of customer participation in a firm-sponsored online customer community","volume":"34","year":"2015","journal-title":"Marketing Science"},{"key":"key2022032513083545300_ref037","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/j.indmarman.2016.03.012","article-title":"Value co-creation practices and capabilities: sustained purposeful engagement across B2B systems","volume":"56","year":"2016","journal-title":"Industrial Marketing Management"},{"year":"2010","key":"key2022032513083545300_ref038","article-title":"Recurrent neural network based language model"},{"year":"2013","key":"key2022032513083545300_ref039","article-title":"Efficient estimation of word representations in vector space"},{"year":"2013","key":"key2022032513083545300_ref040","article-title":"Distributed representations of words and phrases and their compositionality"},{"year":"2013","key":"key2022032513083545300_ref041","article-title":"Linguistic regularities in continuous space word representations"},{"key":"key2022032513083545300_ref042","first-page":"185","article-title":"Research note: what makes a helpful online review? A study of customer reviews on Amazon. com","year":"2010","journal-title":"MIS Quarterly"},{"issue":"3","key":"key2022032513083545300_ref043","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1177\/1075547008328797","article-title":"A two-step flow of influence? Opinion-leader campaigns on climate change","volume":"30","year":"2009","journal-title":"Science Communication"},{"year":"2005","key":"key2022032513083545300_ref044","article-title":"Social computing and weighting to identify member roles in online communities"},{"issue":"4","key":"key2022032513083545300_ref045","doi-asserted-by":"crossref","first-page":"1641","DOI":"10.1016\/j.chb.2013.01.044","article-title":"Does Twitter motivate involvement in politics? Tweeting, opinion leadership, and political engagement","volume":"29","year":"2013","journal-title":"Computers in Human Behavior"},{"issue":"Oct","key":"key2022032513083545300_ref046","first-page":"2825","article-title":"Scikit-learn: machine learning in Python","volume":"12","year":"2011","journal-title":"Journal of Machine Learning Research"},{"year":"2014","key":"key2022032513083545300_ref047","article-title":"Glove: global vectors for word representation"},{"year":"2000","key":"key2022032513083545300_ref048","article-title":"Machine learning from imbalanced data sets 101"},{"issue":"10","key":"key2022032513083545300_ref049","first-page":"100","article-title":"Building the co-creative enterprise","volume":"88","year":"2010","journal-title":"Harvard Business Review"},{"year":"2017","key":"key2022032513083545300_ref050","article-title":"A machine learning approach for classifying textual data in crowdsourcing"},{"issue":"4","key":"key2022032513083545300_ref051","doi-asserted-by":"crossref","first-page":"231","DOI":"10.2753\/MIS0742-1222280409","article-title":"Leveraging information technology infrastructure to facilitate a firm's customer agility and competitive activity: an empirical investigation","volume":"28","year":"2012","journal-title":"Journal of Management Information Systems"},{"year":"2014","key":"key2022032513083545300_ref211","article-title":"The semantic evolution of online communities"},{"issue":"4","key":"key2022032513083545300_ref212","doi-asserted-by":"crossref","first-page":"941","DOI":"10.1016\/j.respol.2012.10.008","article-title":"The periphery on stage: the intra-organizational dynamics in online communities of creation","volume":"42","year":"2013","journal-title":"Research Policy"},{"year":"2016","key":"key2022032513083545300_ref052","article-title":"Large scale needs-based open innovation via automated semantic textual similarity analysis"},{"issue":"6","key":"key2022032513083545300_ref053","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/j.respol.2016.02.003","article-title":"Crowdsourcing ideas: involving ordinary users in the ideation phase of new product development","volume":"45","year":"2016","journal-title":"Research Policy"},{"key":"key2022032513083545300_ref054","doi-asserted-by":"crossref","unstructured":"Schneider, C. and von Briel, F. (2013), \u201cCrowdsourcing large-scale ecological monitoring: identifying design principles to motivate contributors\u201d, Building Sustainable Information Systems, Springer, pp. 509-518.","DOI":"10.1007\/978-1-4614-7540-8_39"},{"issue":"04","key":"key2022032513083545300_ref055","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1142\/S0218001409007326","article-title":"Classification of imbalanced data: a review","volume":"23","year":"2009","journal-title":"International Journal of Pattern Recognition and Artificial Intelligence"},{"issue":"1","key":"key2022032513083545300_ref056","first-page":"23","article-title":"Social networks and the diffusion of user-generated content: evidence from YouTube","volume":"23","year":"2012"},{"issue":"1","key":"key2022032513083545300_ref213","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1287\/isre.1100.0311","article-title":"How peripheral developers contribute to open-source software development","volume":"23","year":"2012","journal-title":"Information Systems Research"},{"issue":"1","key":"key2022032513083545300_ref057","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1109\/TSMCB.2008.2002909","article-title":"SVMs modeling for highly imbalanced classification","volume":"39","year":"2009","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)"},{"issue":"3","key":"key2022032513083545300_ref058","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.infsof.2009.10.007","article-title":"Analysis of virtual communities supporting OSS projects using social network analysis","volume":"52","year":"2010","journal-title":"Information and Software Technology"},{"key":"key2022032513083545300_ref059","doi-asserted-by":"crossref","unstructured":"Tucci, C.L., Afuah, A. and Viscusi, G. (2018), \u201cIntroduction to creating and capturing value through crowdsourcing\u201d, Creating and Capturing Value through Crowdsourcing, Oxford University Press.","DOI":"10.1093\/oso\/9780198816225.003.0001"},{"key":"key2022032513083545300_ref060","doi-asserted-by":"crossref","unstructured":"Viscusi, G. and Tucci, C.L. (2018), \u201cThree's a crowd?\u201d, Creating and Capturing Value through Crowdsourcing, Oxford University Press.","DOI":"10.1093\/oso\/9780198816225.003.0003"},{"year":"2019","key":"key2022032513083545300_ref061","article-title":"Towards computational assessment of idea novelty"},{"journal-title":"Pattern Recognition Letters","article-title":"Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost","year":"2020","key":"key2022032513083545300_ref062"},{"issue":"1","key":"key2022032513083545300_ref063","doi-asserted-by":"crossref","first-page":"35","DOI":"10.2307\/25148667","article-title":"Why should I share? Examining social capital and knowledge contribution in electronic networks of practice","volume":"29","year":"2005","journal-title":"MIS Quarterly"},{"issue":"6","key":"key2022032513083545300_ref064","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1509\/jm.15.0413","article-title":"Marketing analytics for data-rich environments","volume":"80","year":"2016","journal-title":"Journal of Marketing"},{"key":"key2022032513083545300_ref065","doi-asserted-by":"crossref","unstructured":"West, J. and Sims, J. (2018), \u201cHow firms leverage crowds and communities for open innovation\u201d, in Afuah, A., Tucci, C. and Viscusi, G. (Eds), Creating Capturing Value through Crowdsourcing, Oxford University Press.","DOI":"10.1093\/oso\/9780198816225.003.0004"},{"key":"key2022032513083545300_ref066","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/j.knosys.2014.06.004","article-title":"Fores texter: an efficient random forest algorithm for imbalanced text categorization","volume":"67","year":"2014","journal-title":"Knowledge-Based Systems"}],"container-title":["Information Technology &amp; People"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/ITP-04-2019-0171\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/ITP-04-2019-0171\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T21:54:37Z","timestamp":1753394077000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/itp\/article\/35\/2\/494-509\/183099"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,11]]},"references-count":80,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2020,9,11]]},"published-print":{"date-parts":[[2022,3,28]]}},"alternative-id":["10.1108\/ITP-04-2019-0171"],"URL":"https:\/\/doi.org\/10.1108\/itp-04-2019-0171","relation":{},"ISSN":["0959-3845"],"issn-type":[{"type":"print","value":"0959-3845"}],"subject":[],"published":{"date-parts":[[2020,9,11]]}}}