{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:54:33Z","timestamp":1754157273376,"version":"3.41.2"},"reference-count":27,"publisher":"Emerald","issue":"4","license":[{"start":{"date-parts":[[2009,8,7]],"date-time":"2009-08-07T00:00:00Z","timestamp":1249603200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,8,7]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-heading\">Purpose<\/jats:title><jats:p>With the significant growth in electronic education materials such as syllabus documents and lecture notes, available on the internet and intranets, there is a need for robust central repositories of such materials to allow both educators and learners to conveniently share, search and access them. The purpose of this paper is to report on the work to develop a national repository for course syllabi in Ireland.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title><jats:p>The paper describes a prototype syllabus repository system for higher education in Ireland, which has been developed by utilising a number of information extraction and document classification techniques, including a new fully unsupervised document classification method that uses a web search engine for automatic collection of training set for the classification algorithm.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Findings<\/jats:title><jats:p>Preliminary experimental results for evaluating the performance of the system and its various units, particularly the information extractor and the classifier, are presented and discussed.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title><jats:p>In this paper, three major obstacles associated with creating a large\u2010scale syllabus repository are identified, and a comprehensive review of published research work related to addressing these problems is provided. Two different types of syllabus documents are identified and describe a rule\u2010based information extraction system capable of extracting structured information from unstructured syllabus documents is described. Finally, the importance of classifying resources in a syllabus digital library is highlighted, a number of standard education classification schemes are introduced, and the unsupervised automated document classification system, which classifies syllabus documents based on an extended version of the <jats:italic>International Standard Classification of Education<\/jats:italic>, is described.<\/jats:p><\/jats:sec>","DOI":"10.1108\/02640470910979598","type":"journal-article","created":{"date-parts":[[2009,10,5]],"date-time":"2009-10-05T10:27:21Z","timestamp":1254738441000},"page":"640-658","source":"Crossref","is-referenced-by-count":2,"title":["An automated syllabus digital library system for higher education in Ireland"],"prefix":"10.1108","volume":"27","author":[{"given":"Arash","family":"Joorabchi","sequence":"first","affiliation":[]},{"given":"Abdulhussain E.","family":"Mahdi","sequence":"additional","affiliation":[]}],"member":"140","reference":[{"key":"key2022032120043307800_b1","unstructured":"Appelt, D.E. and Israel, D. (1999), \u201cIntroduction to information extraction technology\u201d, Proceeding of the 16th International Joint Conference on Artificial Intelligence (IJCAI\u201099), Stockholm, Sweden, available at: www.ai.sri.com\/\u223cappelt\/ie\u2010tutorial\/IJCAI99.pdf (accessed July 2008)."},{"key":"key2022032120043307800_b2","doi-asserted-by":"crossref","unstructured":"Assis, G.d., Laender, A., Gon\u00e7alves, M. and Silva, A.d. (2007), \u201cExploiting genre in focused crawling\u201d, String Processing and Information Retrieval, Springer, Berlin, pp. 62\u201073.","DOI":"10.1007\/978-3-540-75530-2_6"},{"key":"key2022032120043307800_b3","doi-asserted-by":"crossref","unstructured":"Cebeci, Z., Budak, F. and Tekdal, M. (2006), \u201cWorking with XML for flexible management of online course syllabi\u201d, Information Technology Journal, Vol. 5 No. 2, pp. 322\u20108.","DOI":"10.3923\/itj.2006.322.328"},{"key":"key2022032120043307800_b4","doi-asserted-by":"crossref","unstructured":"Cohen, D.J. (2006), \u201cFrom babel to knowledge: data mining large digital collections\u201d, D\u2010Lib Magazine, Vol. 12 No. 3.","DOI":"10.1045\/march2006-cohen"},{"key":"key2022032120043307800_b5","unstructured":"Cunningham, H., Maynard, D., Bontcheva, K. and Tablan, V. (2002), \u201cGATE: a framework and graphical development environment for robust NLP tools and applications\u201d, Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL'02), Philadelphia, available at: http:\/\/gate.ac.uk\/sale\/acl02\/acl\u2010main.pdf (accessed July 2008)."},{"key":"key2022032120043307800_b6","unstructured":"DCMI education community (1999), \u201cDC\u2010Ed., Dublin Core Metadata Initiative\u201d, available at: http:\/\/dublincore.org\/groups\/education\/ (accessed July 2008)."},{"key":"key2022032120043307800_b7","doi-asserted-by":"crossref","unstructured":"Embley, D.W., Hurst, M., Lopresti, D. and Nagy, G. (2006), \u201cTable\u2010processing paradigms: a research survey\u201d, International Journal on Document Analysis and Recognition, Vol. 8 Nos 2\/3, pp. 66\u201086.","DOI":"10.1007\/s10032-006-0017-x"},{"key":"key2022032120043307800_b8","doi-asserted-by":"crossref","unstructured":"Ida, M., Nozawa, T., Yoshikane, F., Miyazaki, K. and Kita, H. (2005), \u201cSyllabus database and web service on higher education\u201d, Proceedings of the 7th International Conference on Advanced Communication Technology (IEEE\u2010ICACT 2005), Republic of Korea, Vol. 1, pp. 415\u20108.","DOI":"10.1109\/ICACT.2005.245891"},{"key":"key2022032120043307800_b9","unstructured":"IDEAS (2007), \u201cIndividualised digitised educational advisory system\u201d, Enterprise Research Center, University of Limerick, Limerick, available at: www.ideas.ie\/ (accessed July 2008)."},{"key":"key2022032120043307800_b10","unstructured":"IEEE\u2010LTSC\u2010WG12 (2002), \u201cThe learning object metadata standard\u201d, IEEE Learning Technology Standards Committee, available at: www.ieeeltsc.org\/working\u2010groups\/wg12LOM\/ (accessed July 2008)."},{"key":"key2022032120043307800_b11","unstructured":"ISCED (1997), International Standard Classification of Education \u2013 1997 Version (ISCED97), UNESCO, available at: www.uis.unesco.org\/ev.php?ID=3813_201&ID2=DO_TOPIC (accessed July 2008)."},{"key":"key2022032120043307800_b12","unstructured":"JACS (2007), Joint Academic Coding System v 1.7, HESA \u2013 Higher Education Statistics Agency, available at: www.hesa.ac.uk\/index.php?option=com_content&task=view&id=158&Itemid=233 (July 2008)."},{"key":"key2022032120043307800_b13","unstructured":"Joachims, T. (1997), Fourteenth International Conference on Machine Learning, Morgan Kaufmann Publishers, Nashville, TN, pp. 143\u201051."},{"key":"key2022032120043307800_b16","doi-asserted-by":"crossref","unstructured":"McCallum, A. (2005), \u201cInformation extraction: distilling structured data from unstructured text\u201d, Queue, Vol. 3 No. 9, pp. 48\u201057.","DOI":"10.1145\/1105664.1105679"},{"key":"key2022032120043307800_b14","doi-asserted-by":"crossref","unstructured":"Marcis, J.G. and Carr, D. (2003), \u201cA note on student views regarding the course syllabus\u201d, Atlantic Economic Journal, Vol. 31 No. 1, p. 115.","DOI":"10.1007\/BF02298467"},{"key":"key2022032120043307800_b15","unstructured":"Matsunaga, Y., Yamada, S., Ito, E. and Hirokawa, S. (2003), \u201cA web syllabus crawler and its efficiency evaluation\u201d, International Symposium on Information Science and Electrical Engineering 2003 (ISEE 2003), Fukuoka, pp. 565\u20108."},{"key":"key2022032120043307800_b17","unstructured":"Mitchell, T. (1997), Machine Learning, McGraw\u2010Hill, New York, NY."},{"key":"key2022032120043307800_b18","unstructured":"OpenOffice.org 2.0 (2007), \u201cSponsored by Sun Microsystems, released under the open source LGPL licence\u201d, available at: www.openoffice.org\/ (accessed July 2008)."},{"key":"key2022032120043307800_b19","doi-asserted-by":"crossref","unstructured":"Sebastiani, F. (2002), \u201cMachine learning in automated text categorization\u201d, ACM Computing Surveys (CSUR), Vol. 34 No. 1, pp. 1\u201047.","DOI":"10.1145\/505282.505283"},{"key":"key2022032120043307800_b20","unstructured":"Steward, S. (2004), \u201cPdftk 1.12 \u2013 the PDF Toolkit\u201d, sponsored by AccessPDF, Released under the open source GPL licence, available at: www.accesspdf.com\/pdftk\/index.html (accessed July 2008)."},{"key":"key2022032120043307800_b21","unstructured":"Thompson, C.A., Smarr, J., Nguyen, H. and Manning, C. (2003), \u201cFinding educational resources on the web: exploiting automatic extraction of metadata\u201d, paper presented at ECML Workshop on Adaptive Text Extraction and Mining, available at: http:\/\/nlp.stanford.edu\/pubs\/edutellaTR.pdf (accessed July 2008)."},{"key":"key2022032120043307800_b22","unstructured":"Trewin, D. (2001), Australian Standard Classification of Education (ASCED), Australian Bureau of Statistics, available at: www.abs.gov.au\/AUSSTATS\/abs@.nsf\/DetailsPage\/1272.02001?OpenDocument."},{"key":"key2022032120043307800_b23","unstructured":"Xiaoyan, Y., Manas, T., Weiguo, F., Manuel, P.\u2010Q., Edward, A.F., William, C., GuoFang, T. and Lillian, C. (2007), \u201cAutomatic syllabus classification\u201d, paper presented to the ACM IEEE Joint Conference on Digital Libraries, Vancouver."},{"key":"key2022032120043307800_b24","unstructured":"Xpdf 3.02 (2007), \u201cGlyph & Cog, LLC., Released under the open source GPL licence\u201d, available at: www.foolabs.com\/xpdf\/ (accessed July 2008)."},{"key":"key2022032120043307800_b25","unstructured":"Yahoo!\u2010API (2007), Yahoo Search Web Services Software Development Kit, Yahoo!, available at: http:\/\/developer.yahoo.com\/search\/ (accessed July 2008)."},{"key":"key2022032120043307800_b26","unstructured":"Yu, X, Tungare, M., Fan, W., Yuan, Y., P\u00e9rez\u2010Qui\u00f1ones, M., Fox, E.A., Cameron, W. and Cassel, L. (2007), \u201cUsing automatic metadata extraction to build a structured syllabus repository\u201d, Proceedings of the 10th International Conference on Asian Digital Libraries (ICADL 2007) Ha Noi, Vietnam, available at: http:\/\/manas.tungare.name\/publications\/yu_2007_using (accessed July 2008)."},{"key":"key2022032120043307800_b27","unstructured":"Zhu, X. (2005), \u201cSemi\u2010supervised learning literature survey\u201d, Report Number 1530, Department of Computer Sciences, University of Wisconsin, Madison, available at: http:\/\/pages.cs.wisc.edu\/\u223cjerryzhu\/pub\/ssl_survey.pdf (accessed July 2008)."}],"container-title":["The Electronic Library"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/02640470910979598","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/02640470910979598\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/02640470910979598\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:42:28Z","timestamp":1753400548000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/el\/article\/27\/4\/640-658\/91310"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,8,7]]},"references-count":27,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2009,8,7]]}},"alternative-id":["10.1108\/02640470910979598"],"URL":"https:\/\/doi.org\/10.1108\/02640470910979598","relation":{},"ISSN":["0264-0473"],"issn-type":[{"type":"print","value":"0264-0473"}],"subject":[],"published":{"date-parts":[[2009,8,7]]}}}