{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T04:59:49Z","timestamp":1774933189824,"version":"3.50.1"},"publisher-location":"New York, New York, USA","reference-count":64,"publisher":"ACM Press","license":[{"start":{"date-parts":[[2016,1,1]],"date-time":"2016-01-01T00:00:00Z","timestamp":1451606400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"JISC","award":["36"],"award-info":[{"award-number":["36"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016]]},"DOI":"10.1145\/2883851.2883950","type":"proceedings-article","created":{"date-parts":[[2016,4,22]],"date-time":"2016-04-22T13:54:08Z","timestamp":1461333248000},"page":"15-24","source":"Crossref","is-referenced-by-count":76,"title":["Towards automated content analysis of discussion transcripts"],"prefix":"10.1145","author":[{"given":"Vitomir","family":"Kovanovi\u0107","sequence":"first","affiliation":[{"name":"The University of Edinburgh, Edinburgh, UK"}]},{"given":"Sre\u0107ko","family":"Joksimovi\u0107","sequence":"additional","affiliation":[{"name":"The University of Edinburgh, Edinburgh, UK"}]},{"given":"Zak","family":"Waters","sequence":"additional","affiliation":[{"name":"Queensland University of Technology, Brisbane, Australia"}]},{"given":"Dragan","family":"Ga\u0161evi\u0107","sequence":"additional","affiliation":[{"name":"The University of Edinburgh, Edinburgh, UK"}]},{"given":"Kirsty","family":"Kitto","sequence":"additional","affiliation":[{"name":"Queensland University of Technology, Brisbane, Australia"}]},{"given":"Marek","family":"Hatala","sequence":"additional","affiliation":[{"name":"Simon Fraser University, Burnaby, Canada"}]},{"given":"George","family":"Siemens","sequence":"additional","affiliation":[{"name":"University of Texas at Arlington, Arlington"}]}],"member":"320","reference":[{"key":"key-10.1145\/2883851.2883950-1","unstructured":"Z. Akyol, J. B. Arbaugh, M. Cleveland-Innes, D. R. Garrison, P. Ice, J. C. Richardson, and K. Swan. A response to the review of the community of inquiry framework.Journal of distance education, 23(2), 2009. URL: http:\/\/www.ijede.ca\/index.php\/jde\/article\/view\/630\/884."},{"key":"key-10.1145\/2883851.2883950-2","doi-asserted-by":"crossref","unstructured":"T. Anderson and J. Dron. Three generations of distance education pedagogy.The international review of research in open and distance learning, 12(3):80--97, 2010. URL: http:\/\/www.irrodl.org\/index.php\/irrodl\/article\/view\/890\/.","DOI":"10.19173\/irrodl.v12i3.890"},{"key":"key-10.1145\/2883851.2883950-3","unstructured":"T. Anderson, L. Rourke, D. R. Garrison, and W. Archer. Assessing teaching presence in a computer conferencing context.Journal of asynchronous learning networks, 5:1--17, 2001. URL: http:\/\/auspace.athabascau.ca\/handle\/2149\/725."},{"key":"key-10.1145\/2883851.2883950-4","unstructured":"J. B. Arbaugh, A. Bangert, and M. Cleveland-Innes. Subject matter effects and the community of inquiry (coi) framework: an exploratory study.The internet and higher education, 13(1):37--44, 2010."},{"key":"key-10.1145\/2883851.2883950-5","unstructured":"J. Arbaugh, M. Cleveland-Innes, S. R. Diaz, D. R. Garrison, P. Ice, J. C. Richardson, and K. P. Swan. Developing a community of inquiry instrument: testing a measure of the community of inquiry framework using a multi-institutional sample.The internet and higher education, 11(3--4):133--136, 2008."},{"key":"key-10.1145\/2883851.2883950-6","doi-asserted-by":"crossref","unstructured":"L. Breiman. Random Forests.Machine learning, 45(1):5--32, 2001.","DOI":"10.1023\/A:1010933404324"},{"key":"key-10.1145\/2883851.2883950-7","unstructured":"D. L. Butler and P. H. Winne. Feedback and self-regulated learning: a theoretical synthesis.Review of educational research, 65(3):245--281, 1995."},{"key":"key-10.1145\/2883851.2883950-8","unstructured":"N. V. Chawla, N. Japkowicz, and A. Kotcz. Editorial: special issue on learning from imbalanced data sets.ACM SIGKDD explorations newsletter, 6(1):1--6, 2004."},{"key":"key-10.1145\/2883851.2883950-9","doi-asserted-by":"crossref","unstructured":"N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. Smote: synthetic minority over-sampling technique.Journal of artificial intelligence research:321--357, 2002. URL: https:\/\/www.jair.org\/media\/953\/live-953-2037-jair.pdf.","DOI":"10.1613\/jair.953"},{"key":"key-10.1145\/2883851.2883950-10","unstructured":"Coh-Metrix 3.0 indicies. http:\/\/cohmetrix.com\/documentation_indices.html."},{"key":"key-10.1145\/2883851.2883950-11","unstructured":"S. Corich, K. Hunt, and L. Hunt. Computerised content analysis for measuring critical thinking within discussion forums.Journal of e-learning and knowledge society, 2(1), 2012. URL: http:\/\/www.jelks.org\/ojs\/index.php\/Je-LKS_EN\/article\/view\/700."},{"key":"key-10.1145\/2883851.2883950-12","unstructured":"B. De Wever, T. Schellens, M. Valcke, and H. Van Keer. Content analysis schemes to analyze transcripts of online asynchronous discussion groups: a review.Computers &#38; education, 46(1):6--28, 2006."},{"key":"key-10.1145\/2883851.2883950-13","doi-asserted-by":"crossref","unstructured":"S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman. Indexing by latent semantic analysis.Journal of the american society for information science, 41(6):391--407, 1990.","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"key-10.1145\/2883851.2883950-14","unstructured":"J. Dewey. My pedagogical creed.School journal, 54(3):77--80, 1897."},{"key":"key-10.1145\/2883851.2883950-15","doi-asserted-by":"crossref","unstructured":"P. D&#246;nmez, C. Ros&#233;, K. Stegmann, A. Weinberger, and F. Fischer. Supporting CSCL with automatic corpus analysis technology. InProceedings of th 2005 conference on computer support for collaborative learning: learning 2005: the next 10 years!, 2005, 125--134. URL: https:\/\/telearn.archives-ouvertes.fr\/hal-00190638.","DOI":"10.3115\/1149293.1149310"},{"key":"key-10.1145\/2883851.2883950-16","doi-asserted-by":"crossref","unstructured":"R. Donnelly and J. Gardner. Content analysis of computer conferencing transcripts.Interactive learning environments, 19(4):303--315, 2011. URL: http:\/\/eprints.teachingandlearning.ie\/3930\/.","DOI":"10.1080\/10494820903075722"},{"key":"key-10.1145\/2883851.2883950-17","unstructured":"N. Dowell, O. Skrypnyk, S. Joksimovi&#263;, A. C. Graesser, S. Dawson, D. Ga&#353;evi&#263;, P. d. Vries, T. Hennis, and V. Kovanovi&#263;. Modeling Learners' Social Centrality and Performance through Language and Discourse. InProceedings of the 8th International Conference on Educational Data Mining (EDM 2015), 2015. URL: http:\/\/www.educationaldatamining.org\/EDM2015\/proceedings\/full250-257.pdf."},{"key":"key-10.1145\/2883851.2883950-18","unstructured":"M. Fern&#225;ndez-Delgado, E. Cernadas, S. Barro, and D. Amorim. Do we need hundreds of classifiers to solve real world classification problems?The journal of machine learning research, 15(1):3133--3181, 2014. URL: http:\/\/jmlr.org\/papers\/v15\/delgado14a.html."},{"key":"key-10.1145\/2883851.2883950-19","doi-asserted-by":"crossref","unstructured":"P. Ferragina and U. Scaiella. Fast and accurate annotation of short texts with wikipedia pages.Software, ieee, 29(1):70--75, 2012.","DOI":"10.1109\/MS.2011.122"},{"key":"key-10.1145\/2883851.2883950-20","doi-asserted-by":"crossref","unstructured":"P. W. Foltz, W. Kintsch, and T. K. Landauer. The measurement of textual coherence with latent semantic analysis.Discourse processes, 25:285--307, 1998. URL: http:\/\/eric.ed.gov\/?id=EJ589329.","DOI":"10.1080\/01638539809545029"},{"key":"key-10.1145\/2883851.2883950-21","unstructured":"E. Gabrilovich and S. Markovitch. Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis. InProceedings of the 20th International Joint Conference on Artifical Intelligence. Morgan Kaufmann Publishers Inc., 2007, pp. 1606--1611. URL: http:\/\/dl.acm.org\/citation.cfm?id=1625275.1625535."},{"key":"key-10.1145\/2883851.2883950-22","doi-asserted-by":"crossref","unstructured":"D. Ga&#353;evi&#263;, O. Adesope, S. Joksimovi&#263;, and V. Kovanovi&#263;. Externally-facilitated regulation scaffolding and role assignment to develop cognitive presence in asynchronous online discussions.The internet and higher education, 24:53--65, 2015.","DOI":"10.1016\/j.iheduc.2014.09.006"},{"key":"key-10.1145\/2883851.2883950-23","unstructured":"D. R. Garrison, T. Anderson, and W. Archer. Critical inquiry in a text-based environment: computer conferencing in higher education.The internet and higher education, 2(2-3):87--105, 1999."},{"key":"key-10.1145\/2883851.2883950-24","doi-asserted-by":"crossref","unstructured":"D. R. Garrison, T. Anderson, and W. Archer. Critical thinking, cognitive presence, and computer conferencing in distance education.American journal of distance education, 15(1):7--23, 2001.","DOI":"10.1080\/08923640109527071"},{"key":"key-10.1145\/2883851.2883950-25","unstructured":"D. R. Garrison, T. Anderson, and W. Archer. The first decade of the community of inquiry framework: a retrospective.The internet and higher education, 13(1--2):5--9, 2010."},{"key":"key-10.1145\/2883851.2883950-26","unstructured":"R. Garrison, M. Cleveland-Innes, and T. S. Fung. Exploring causal relationships among teaching, cognitive and social presence: student perceptions of the community of inquiry framework.The internet and higher education, 13(1--2):31--36, 2010."},{"key":"key-10.1145\/2883851.2883950-27","doi-asserted-by":"crossref","unstructured":"L. Getoor.Introduction to Statistical Relational Learning. MIT Press, 2007. ISBN: 978-0-262-07288-5.","DOI":"10.7551\/mitpress\/7432.001.0001"},{"key":"key-10.1145\/2883851.2883950-28","doi-asserted-by":"crossref","unstructured":"P. Gorsky, A. Caspi, I. Blau, Y. Vine, and A. Billet. Toward a coi population parameter: the impact of unit (sentence vs. message) on the results of quantitative content analysis.The international review of research in open and distributed learning, 13(1):17--37, 2011. URL: http:\/\/www.irrodl.org\/index.php\/irrodl\/article\/view\/1073.","DOI":"10.19173\/irrodl.v13i1.1073"},{"key":"key-10.1145\/2883851.2883950-29","doi-asserted-by":"crossref","unstructured":"A. C. Graesser, D. S. McNamara, and J. M. Kulikowich. Coh-Metrix Providing Multilevel Analyses of Text Characteristics.Educational researcher, 40(5):223--234, 2011.","DOI":"10.3102\/0013189X11413260"},{"key":"key-10.1145\/2883851.2883950-30","unstructured":"O. R. Holsti.Content analysis for the social sciences and humanities. Addison-Wesley Reading, MA, 1969."},{"key":"key-10.1145\/2883851.2883950-31","unstructured":"M. K. C. f. Jed Wing, S. Weston, A. Williams, C. Keefer, A. Engelhardt, T. Cooper, Z. Mayer, B. Kenkel, t. R Core Team, M. Benesty, R. Lescarbeau, A. Ziem, L. Scrucca, Y. Tang, and C. Candan.Caret: classification and regression training. R package version 6.0-58, 2015. URL: http:\/\/CRAN.R-project.org\/package=caret."},{"key":"key-10.1145\/2883851.2883950-32","unstructured":"S. Joksimovi&#263;, N. Dowell, O. Skrypnyk, V. Kovanovi&#263;, D. Ga&#353;evi&#263;, S. Dawson, and A. C. Graesser. Exploring the Accumulation of Social Capital in cMOOC Through Language and Discourse.Submitted, 2015."},{"key":"key-10.1145\/2883851.2883950-33","unstructured":"S. Joksimovi&#263;, D. Ga&#353;evi&#263;, V. Kovanovi&#263;, O. Adesope, and M. Hatala. Psychological characteristics in cognitive presence of communities of inquiry: A linguistic analysis of online discussions.The internet and higher education, 22:1--10, 2014."},{"key":"key-10.1145\/2883851.2883950-34","doi-asserted-by":"crossref","unstructured":"S. Joksimovi&#263;, V. Kovanovi&#263;, J. Jovanovi&#263;, A. Zouaq, D. Ga&#353;evi&#263;, and M. Hatala. What Do cMOOC Participants Talk About in Social Media?: A Topic Analysis of Discourse in a cMOOC. InProceedings of the Fifth International Conference on Learning Analytics And Knowledge, 2015, pp. 156--165.","DOI":"10.1145\/2723576.2723609"},{"key":"key-10.1145\/2883851.2883950-35","unstructured":"V. Kovanovi&#263;, S. Joksimovi&#263;, D. Ga&#353;evi&#263;, and M. Hatala. Automated Content Analysis of Online Discussion Transcripts. InProceedings of the Workshops at the LAK 2014 Conference co-located with 4th International Conference on Learning Analytics and Knowledge (LAK 2014), 2014. URL: http:\/\/ceur-ws.org\/Vol-1137\/."},{"key":"key-10.1145\/2883851.2883950-36","unstructured":"V. Kovanovi&#263;, S. Joksimovi&#263;, D. Ga&#353;evi&#263;, M. Hatala, and G. Siemens. Content Analytics: the definition, scope, and an overview of published research. In,Handbook of Learning Analyitcs, 2015."},{"key":"key-10.1145\/2883851.2883950-37","unstructured":"K. H. Krippendorff.Content analysis: an introduction to its methodology. Sage Publications, 2003."},{"key":"key-10.1145\/2883851.2883950-38","unstructured":"J. Lafferty, A. McCallum, and F. C. Pereira. Conditional random fields: probabilistic models for segmenting and labeling sequence data. InProceedings of the eighteenth international conference on machine learning (ICML '01), 2001. URL: http:\/\/dl.acm.org\/citation.cfm?id=655813."},{"key":"key-10.1145\/2883851.2883950-39","doi-asserted-by":"crossref","unstructured":"J. R. Landis and G. G. Koch. The measurement of observer agreement for categorical data.Biometrics, 33(1):159--174, 1977.","DOI":"10.2307\/2529310"},{"key":"key-10.1145\/2883851.2883950-40","unstructured":"A. Liaw and M. Wiener. Classification and regression by random-forest.R news, 2(3):18--22, 2002. URL: http:\/\/CRAN.R-project.org\/doc\/Rnews\/."},{"key":"key-10.1145\/2883851.2883950-41","unstructured":"G. Louppe, L. Wehenkel, A. Sutera, and P. Geurts. Understanding variable importances in forests of randomized trees. InAdvances in neural information processing systems 26, 2013, pp. 431--439. URL: http:\/\/media.nips.cc\/nipsbooks\/nipspapers\/paper_files\/nips26\/281.pdf."},{"key":"key-10.1145\/2883851.2883950-42","doi-asserted-by":"crossref","unstructured":"R. Luppicini. Review of computer mediated communication research for education.Instructional science, 35(2):141--185, 2007.","DOI":"10.1007\/s11251-006-9001-6"},{"key":"key-10.1145\/2883851.2883950-43","doi-asserted-by":"crossref","unstructured":"E. Mayfield and C. Penstein-Ros&#233;. Using feature construction to avoid large feature spaces in text classification. InProceedings of the 12th annual conference on genetic and evolutionary computation, 2010, 1299--1306.","DOI":"10.1145\/1830483.1830714"},{"key":"key-10.1145\/2883851.2883950-44","unstructured":"T. McKlin. Analyzing Cognitive Presence in Online Courses Using an Artificial Neural Network. PhD thesis. Georgia State University, College of Education, 2004."},{"key":"key-10.1145\/2883851.2883950-45","doi-asserted-by":"crossref","unstructured":"D. S. McNamara, A. C. Graesser, P. M. McCarthy, and Z. Cai.Automated Evaluation of Text and Discourse with Coh-Metrix. Cambridge University Press, 2014.","DOI":"10.1017\/CBO9780511894664"},{"key":"key-10.1145\/2883851.2883950-46","doi-asserted-by":"crossref","unstructured":"P. N. Mendes, M. Jakob, A. Garc&#237;a-Silva, and C. Bizer. DBpedia spotlight: shedding light on the web of documents. InProceedings of the 7th international conference on semantic systems, 2011, 1--8.","DOI":"10.1145\/2063518.2063519"},{"key":"key-10.1145\/2883851.2883950-47","unstructured":"J. Mu, K. Stegmann, E. Mayfield, C. Ros&#233;, and F. Fischer. The ACODEA framework: developing segmentation and classification schemes for fully automatic analysis of online discussions.International journal of computer-supported collaborative learning, 7(2):285--305, 2012."},{"key":"key-10.1145\/2883851.2883950-48","unstructured":"E. B. Page and N. S. Petersen. The computer moves into essay grading: Updating the ancient test.Phi delta kappan, 76(7):561, 1995. URL: http:\/\/search.proquest.com\/docview\/218533317\/abstract."},{"key":"key-10.1145\/2883851.2883950-49","unstructured":"C. L. Park. Replicating the Use of a Cognitive Presence Measurement Tool.Journal of interactive online learning, 8:140--155, 2, 2009. URL: http:\/\/www.ncolr.org\/issues\/jiol\/v8\/n2\/replicating-the-use-of-a-cognitive-presence-measurement-tool#.VrVSebKUFhE."},{"key":"key-10.1145\/2883851.2883950-50","unstructured":"L. Rourke, T. Anderson, D. R. Garrison, and W. Archer. Assessing social presence in asynchronous text-based computer conferencing.The journal of distance education\/ revue de l'&#233;ducation &#224; distance, 14(2):50--71, 2007. URL: http:\/\/eric.ed.gov\/?id=EJ616753."},{"key":"key-10.1145\/2883851.2883950-51","unstructured":"L. Rourke, T. Anderson, D. R. Garrison, and W. Archer. Methodological issues in the content analysis of computer conference transcripts.International journal of artificial intelligence in education (IJAIED), 12:8--22, 2001."},{"key":"key-10.1145\/2883851.2883950-52","unstructured":"P. J. Stone, D. C. Dunphy, and M. S. Smith.The general inquirer: a computer approach to content analysis. MIT press, 1966."},{"key":"key-10.1145\/2883851.2883950-53","doi-asserted-by":"crossref","unstructured":"J.-W. Strijbos. Assessment of (computer-supported) collaborative learning.IEEE transactions on learning technologies, 4(1):59--73, 2011.","DOI":"10.1109\/TLT.2010.37"},{"key":"key-10.1145\/2883851.2883950-54","doi-asserted-by":"crossref","unstructured":"J.-W. Strijbos, R. L. Martens, F. J. Prins, and W. M. G. Jochems. Content analysis: what are they talking about?Computers &#38; education, 46(1):29--48, 2006.","DOI":"10.1016\/j.compedu.2005.04.002"},{"key":"key-10.1145\/2883851.2883950-55","unstructured":"M. Strube and S. P. Ponzetto. WikiRelate! Computing Semantic Relatedness Using Wikipedia. InProceedings of the 21st National Conference on Artificial Intelligence - Volume 2. AAAI Press, 2006, pp. 1419--1424. ISBN: 978-1-57735-281-5. URL: http:\/\/dl.acm.org\/citation.cfm?id=1597348.1597414."},{"key":"key-10.1145\/2883851.2883950-56","unstructured":"P.-N. Tan, V. Kumar, and M. Steinbach.Introduction to Data Mining. Addison-Wesley Longman Publishing Co., Inc., 2005. ISBN: 0-321-32136-7."},{"key":"key-10.1145\/2883851.2883950-57","unstructured":"Y. R. Tausczik and J. W. Pennebaker. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods.Journal of language and social psychology, 29(1):24--54, 2010."},{"key":"key-10.1145\/2883851.2883950-58","unstructured":"Y. R. Tausczik and J. W. Pennebaker. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods.Journal of language and social psychology, 29(1):24--54, 2010."},{"key":"key-10.1145\/2883851.2883950-59","unstructured":"V. N. Vapnik.Statistical learning theory. Wiley-Interscience, 1998."},{"key":"key-10.1145\/2883851.2883950-60","doi-asserted-by":"crossref","unstructured":"J. Vassileva. Toward social learning environments.IEEE transactions on learning technologies, 1(4):199--214, 2008.","DOI":"10.1109\/TLT.2009.4"},{"key":"key-10.1145\/2883851.2883950-61","doi-asserted-by":"crossref","unstructured":"N. Vaughan and D. R. Garrison. Creating cognitive presence in a blended faculty development community.The internet and higher education, 8(1):1--12, 2005.","DOI":"10.1016\/j.iheduc.2004.11.001"},{"key":"key-10.1145\/2883851.2883950-62","unstructured":"Z. Waters, V. Kovanovi&#263;, K. Kitto, and D. Ga&#353;evi&#263;. Structure matters: Adoption of structured classification approach in the context of cognitive presence classification. InProceedings of the 11th Asia Information Retrieval Societies Conference, AIRS 2015, 2015."},{"key":"key-10.1145\/2883851.2883950-63","doi-asserted-by":"crossref","unstructured":"I. H. Witten, E. Frank, and M. A. Hall.Data mining: practical machine learning tools and techniques. Morgan Kaufmann, 3rd ed., 2011.","DOI":"10.1016\/B978-0-12-374856-0.00001-8"},{"key":"key-10.1145\/2883851.2883950-64","doi-asserted-by":"crossref","unstructured":"A. Zouaq and R. Nkambou. Building domain ontologies from text for educational purposes.IEEE transactions on learning technologies, 1(1):49--62, 2008.","DOI":"10.1109\/TLT.2008.12"}],"event":{"name":"the Sixth International Conference","location":"Edinburgh, United Kingdom","acronym":"LAK '16","number":"6","start":{"date-parts":[[2016,4,25]]},"end":{"date-parts":[[2016,4,29]]}},"container-title":["Proceedings of the Sixth International Conference on Learning Analytics &amp; Knowledge - LAK '16"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2883851.2883950","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/dl.acm.org\/ft_gateway.cfm?id=2883950&amp;ftid=1708821&amp;dwn=1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:54:08Z","timestamp":1750222448000},"score":1,"resource":{"primary":{"URL":"http:\/\/dl.acm.org\/citation.cfm?doid=2883851.2883950"}},"subtitle":["a cognitive presence case"],"proceedings-subject":"Learning Analytics & Knowledge","short-title":[],"issued":{"date-parts":[[2016]]},"references-count":64,"URL":"https:\/\/doi.org\/10.1145\/2883851.2883950","relation":{},"subject":[],"published":{"date-parts":[[2016]]}}}