{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T07:42:26Z","timestamp":1772523746654,"version":"3.50.1"},"reference-count":64,"publisher":"Emerald","issue":"4","license":[{"start":{"date-parts":[[2011,9,27]],"date-time":"2011-09-27T00:00:00Z","timestamp":1317081600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,9,27]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-heading\">Purpose<\/jats:title><jats:p>The aim of this paper is to develop a system for automatic extraction of metadata from scientific papers in PDF format for the information system for monitoring the scientific research activity of the University of Novi Sad (CRIS UNS).<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Design\/methodology\/approach<\/jats:title><jats:p>The system is based on machine learning and performs automatic extraction and classification of metadata in eight pre\u2010defined categories. The extraction task is realised as a classification process. For the purpose of classification each row of text is represented with a vector that comprises different features: formatting, position, characteristics related to the words, etc. Experiments were performed with standard classification models. Both a single classifier with all eight categories and eight individual classifiers were tested. Classifiers were evaluated using the five\u2010fold cross validation, on a manually annotated corpus comprising 100 scientific papers in PDF format, collected from various conferences, journals and authors' personal web pages.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Findings<\/jats:title><jats:p>Based on the performances obtained on classification experiments, eight separate support vector machines (SVM) models (each of which recognises its corresponding category) were chosen. All eight models were established to have a good performance. The<jats:italic>F<\/jats:italic>\u2010measure was over 85 per cent for almost all of the classifiers and over 90 per cent for most of them.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Research limitations\/implications<\/jats:title><jats:p>Automatically extracted metadata cannot be directly entered into CRIS UNS but requires control of the curators.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Practical implications<\/jats:title><jats:p>The proposed system for automatic metadata extraction using support vector machines model was integrated into the software system, CRIS UNS. Metadata extraction has been tested on the publications of researchers from the Department of Mathematics and Informatics of the Faculty of Sciences in Novi Sad. Analysis of extracted metadata from these publications showed that the performance of the system for the previously unseen data is in accordance with that obtained by the cross\u2010validation from eight separate SVM classifiers. This system will help in the process of synchronising metadata from CRIS UNS with other institutional repositories.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-heading\">Originality\/value<\/jats:title><jats:p>The paper documents a fully automated system for metadata extraction from scientific papers that was developed. The system is based on the SVM classifier and open source tools, and is capable of extracting eight types of metadata from scientific articles of any format that can be converted to PDF. Although developed as part of CRIS UNS, the proposed system can be integrated into other CRIS systems, as well as institutional repositories and library management systems.<\/jats:p><\/jats:sec>","DOI":"10.1108\/00330331111182094","type":"journal-article","created":{"date-parts":[[2011,10,1]],"date-time":"2011-10-01T07:25:22Z","timestamp":1317453922000},"page":"376-396","source":"Crossref","is-referenced-by-count":20,"title":["Automatic extraction of metadata from scientific publications for CRIS systems"],"prefix":"10.1108","volume":"45","author":[{"given":"Aleksandar","family":"Kova\u010devi\u0107","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dragan","family":"Ivanovi\u0107","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Branko","family":"Milosavljevi\u0107","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zora","family":"Konjovi\u0107","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Du\u0161an","family":"Surla","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","reference":[{"key":"key2022022020372799300_b1","doi-asserted-by":"crossref","unstructured":"Abugessaisa, I. (2010), \u201cGeospatial metadata extraction from product description document applying methods from ontology engineering\u201d, International Journal of Metadata, Semantics and Ontologies, Vol. 5 No. 4, pp. 321\u201032.","DOI":"10.1504\/IJMSO.2010.035554"},{"key":"key2022022020372799300_b2","doi-asserted-by":"crossref","unstructured":"Adams, J. (2009), \u201cThe use of bibliometrics to measure research quality in UK higher education institutions\u201d, Archivum Immunologiae et Therapiae Experimentalis, Vol. 57 No. 1, pp. 19\u201032.","DOI":"10.1007\/s00005-009-0003-3"},{"key":"key2022022020372799300_b3","doi-asserted-by":"crossref","unstructured":"Anderson, J.D. and P\u00e9rez\u2010Carballo, J. (2001), \u201cThe nature of indexing: how humans and machines analyze messages and texts for retrieval. Part II: machine indexing, and the allocation of human versus machine effort\u201d, Information Processing & Management, Vol. 37 No. 2, pp. 255\u201077.","DOI":"10.1016\/S0306-4573(00)00046-7"},{"key":"key2022022020372799300_b4","unstructured":"Asserson, A., Jeffery, K. and Lopatenko, A. (2002), \u201cCERIF: past, present and future: an overview\u201d, Proceedings of the 6th International Conference on Current Research Information Systems, University of Kassel, 29\u201031 August, pp. 30\u201040."},{"key":"key2022022020372799300_b5","doi-asserted-by":"crossref","unstructured":"Beli\u0107, K. and Surla, D. (2008a), \u201cUser\u2010friendly web application for bibliographic material processing\u201d, The Electronic Library, Vol. 26 No. 3, pp. 400\u201010.","DOI":"10.1108\/02640470810879536"},{"key":"key2022022020372799300_b6","doi-asserted-by":"crossref","unstructured":"Beli\u0107, K. and Surla, D. (2008b), \u201cModel of a user friendly system for library cataloguing\u201d, Computer Science and Information Systems, Vol. 5 No. 1, pp. 61\u201085.","DOI":"10.2298\/CSIS0801061B"},{"key":"key2022022020372799300_b7","unstructured":"Bergmark, D. (2000), Automatic Extraction of Reference Linking Information from Online Documents, Cornell University, Ithaca, NY."},{"key":"key2022022020372799300_b8","doi-asserted-by":"crossref","unstructured":"Boberi\u0107, D. and Surla, D. (2009), \u201cXML editor for search and retrieval of bibliographic records in the Z39.50 standard\u201d, The Electronic Library, Vol. 27 No. 3, pp. 474\u201095.","DOI":"10.1108\/02640470910966916"},{"key":"key2022022020372799300_b9","unstructured":"Cardie, C. (1993), \u201cA case\u2010based approach to knowledge acquisition for domain\u2010specific sentence analysis\u201d, Proceedings of the 11th National Conference on Artificial Intelligence. AAAI'93, July 11\u201015, AAAI Press, Menlo Park, CA, pp. 798\u2010803."},{"key":"key2022022020372799300_b10","doi-asserted-by":"crossref","unstructured":"Chen, H.\u2010Y. (2009), \u201cMethod of web information extraction based on decision tree\u201d, Proceedings of the 2009 International Forum on Information Technology and Applications, Chengdu, May 15\u201017, Vol. 1, IEEE Computer Society, Piscataway, NJ, pp. 664\u20106.","DOI":"10.1109\/IFITA.2009.394"},{"key":"key2022022020372799300_b11","unstructured":"Chieu, H.L. and Ng, H.T. (2002), \u201cA maximum entropy approach to information extraction from semi\u2010structured and free text\u201d, Proceedings of the 18th National Conference on Artificial Intelligence, Menlo Park, CA, USA, July 28\u2010August 1, American Association for Artificial Intelligence, Menlo Park, CA, pp. 786\u201091."},{"key":"key2022022020372799300_b12","doi-asserted-by":"crossref","unstructured":"Cristianini, N. (2000), An Introduction to Support Vector Machines: And Other Kernel\u2010based Learning Methods, Cambridge University Press, Cambridge.","DOI":"10.1017\/CBO9780511801389"},{"key":"key2022022020372799300_b13","unstructured":"Crystal, A. and Land, P. (2003), \u201cMetadata and Search: Global Corporate Circle DCMI 2003 Workshop\u201d, available at: www.dublincore.org\/groups\/corporate\/Seattle\/ (accessed 14 January 2011)."},{"key":"key2022022020372799300_b14","doi-asserted-by":"crossref","unstructured":"Cui, B. (2009), \u201cScientific literature metadata extraction based on HMM\u201d, Proceedings of the 6th International Conference on Cooperative Design, Visualization, and Engineering. CDVE'09, Luxembourg, Luxembourg, September 20\u201023, Springer, New York, NY, pp. 64\u20108.","DOI":"10.1007\/978-3-642-04265-2_9"},{"key":"key2022022020372799300_b15","doi-asserted-by":"crossref","unstructured":"Cui, B. and Chen, X. (2010), \u201cAn improved hidden Markov model for literature metadata extraction\u201d, Proceedings of the 6th International Conference on Advanced Intelligent Computing Theories and Applications: Intelligent Computing. ICIC'10, Changsha, China, August 18\u201021, Springer, New York, NY, pp. 205\u201012.","DOI":"10.1007\/978-3-642-14922-1_26"},{"key":"key2022022020372799300_b16","doi-asserted-by":"crossref","unstructured":"Devignes, M., Franiatte, P., Messai, N., Bresso, E., Napoli, A. and Sma\u00efl\u2010Tabonne, M. (2010), \u201cBioRegistry: automatic extraction of metadata for biological database retrieval and discovery\u201d, International Journal of Metadata, Semantics and Ontologies, Vol. 5 No. 3, pp. 184\u201093.","DOI":"10.1504\/IJMSO.2010.034043"},{"key":"key2022022020372799300_b18","doi-asserted-by":"crossref","unstructured":"Dimi\u0107, B. and Surla, D. (2009), \u201cXML editor for UNIMARC and MARC 21 cataloguing\u201d, The Electronic Library, Vol. 27 No. 3, pp. 509\u201028.","DOI":"10.1108\/02640470910966934"},{"key":"key2022022020372799300_b17","doi-asserted-by":"crossref","unstructured":"Dimi\u0107, B., Milosavljevi\u0107, B. and Surla, D. (2010), \u201cXML schema for UNIMARC and MARC 21\u201d, The Electronic Library, Vol. 28 No. 2, pp. 245\u201062.","DOI":"10.1108\/02640471011033611"},{"key":"key2022022020372799300_b19","doi-asserted-by":"crossref","unstructured":"Domingos, P. and Pazzani, M. (1997), \u201cOn the optimality of the simple Bayesian classifier under zero\u2010one loss\u201d, Machine Learning, Vol. 29, pp. 103\u201030.","DOI":"10.1023\/A:1007413511361"},{"key":"key2022022020372799300_b20","doi-asserted-by":"crossref","unstructured":"Finkel, J.R., Grenager, T. and Manning, C. (2005), \u201cIncorporating non\u2010local information into information extraction systems by Gibbs sampling\u201d, Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, University of Michigan, 25\u201030 June, pp. 363\u201070.","DOI":"10.3115\/1219840.1219885"},{"key":"key2022022020372799300_b21","doi-asserted-by":"crossref","unstructured":"Flynn, P., Zhou, L., Maly, K., Zeil, S. and Zubair, M. (2007), \u201ctemplate\u2010based metadata extraction architecture\u201d, Proceedings of the 10th International Conference on Asian Digital Libraries: Looking Back 10 Years and Forging New Frontiers. ICADL'07, Hanoi, Vietnam, December 10\u201013, Springer, New York, NY, pp. 327\u201036.","DOI":"10.1007\/978-3-540-77094-7_42"},{"key":"key2022022020372799300_b22","doi-asserted-by":"crossref","unstructured":"Giuffrida, G., Shek, E.C. and Yang, J. (2000), \u201cKnowledge\u2010based metadata extraction from PostScript files\u201d, Proceedings of the Fifth ACM Conference on Digital libraries. DL '00, San Antonio, TX, USA, June 02\u201007, ACM, New York, NY, pp. 77\u201084.","DOI":"10.1145\/336597.336639"},{"key":"key2022022020372799300_b23","doi-asserted-by":"crossref","unstructured":"Greenberg, J., Spurgin, K. and Crystal, A. (2006), \u201cFunctionalities for automatic metadata generation applications: a survey of metadata experts' opinions\u201d, International Journal of Metadata, Semantics and Ontologies, Vol. 1 No. 1, pp. 3\u201020.","DOI":"10.1504\/IJMSO.2006.008766"},{"key":"key2022022020372799300_b24","unstructured":"Groza, T., Handschuh, S. and Hulpus, I. (2009), A document engineering approach to automatic extraction of shallow metadata from scientific publications, DERI Galway, National University of Ireland, Galway, available at: http:\/\/cogprints.org\/5859\/1\/Thesis\u2010David\u2010Nadeau.pdf (accessed 4 February 2011)."},{"key":"key2022022020372799300_b25","unstructured":"Han, H., Giles, C.L., Manavoglu, E., Zha, H., Zhang, Z. and Fox, E.A. (2003), \u201cAutomatic document metadata extraction using support vector machines\u201d, Proceedings of the 3rd ACM\/IEEE\u2010CS Joint Conference on Digital Libraries. JCDL '03, Houston, TX, USA, May 27\u201031, IEEE Computer Society, Piscataway, NJ, pp. 37\u201048."},{"key":"key2022022020372799300_b27","unstructured":"Heidorn, P.B. and Wei, Q. (2008), \u201cAutomatic metadata extraction from museum specimen labels\u201d, Proceedings of the 2008 International Conference on Dublin Core and Metadata Applications, Berlin, September 22\u201026, Dublin Core Metadata Initiative, Singapore, pp. 57\u201068."},{"key":"key2022022020372799300_b28","doi-asserted-by":"crossref","unstructured":"Hu, Y., Li, H., Cao, Y., Teng, L., Meyeron, D. and Zheng, Q. (2006), \u201cAutomatic extraction of titles from general documents using machine learning\u201d, Information Processing & Management, Vol. 42 No. 5, pp. 1276\u201093.","DOI":"10.1016\/j.ipm.2005.12.001"},{"key":"key2022022020372799300_b29","doi-asserted-by":"crossref","unstructured":"Ivanovi\u0107, D., Surla, D. and Rackovi\u0107, M. (2011a), \u201cA CERIF data model extension for evaluation and quantitative expression of scientific research results\u201d, Scientometrics, Vol. 86 No. 1, pp. 155\u201072.","DOI":"10.1007\/s11192-010-0228-2"},{"key":"key2022022020372799300_b30","doi-asserted-by":"crossref","unstructured":"Ivanovi\u0107, D., Surla, D. and Konjovi\u0107, Z. (2011b), \u201cCERIF compatible data model based on MARC21 format\u201d, The Electronic Library, Vol. 29 No. 1, pp. 52\u201070.","DOI":"10.1108\/02640471111111433"},{"key":"key2022022020372799300_b31","doi-asserted-by":"crossref","unstructured":"Ivanovi\u0107, D., Milosavljevi\u0107, G., Milosavljevi\u0107, B. and Surla, D. (2010), \u201cA CERIF\u2010compatible research management system based on the MARC21 format\u201d, Program: electronic library and information systems, Vol. 44 No. 3, pp. 229\u201051.","DOI":"10.1108\/00330331011064249"},{"key":"key2022022020372799300_b32","unstructured":"J\u00f6rg, B., Krast, O., Jeffery, K. and Grootel, G. (2009a), \u201cCERIF 2008 \u2013 1.0 Full Data Model (FDM) introduction and specification\u201d, available at: www.eurocris.org\/fileadmin\/cerif\u20102008\/CERIF2008_1.0_FDM.pdf (accessed 4 September 2010)."},{"key":"key2022022020372799300_b33","unstructured":"J\u00f6rg, B., Krast, O., Jeffery, K. and Grootel, G. (2009b), \u201cCERIF 2008 \u2013 1.0 XML data exchange format specification\u201d, available at: www.eurocris.org\/fileadmin\/cerif\u20102008\/CERIF2008_1.0_XML.pdf (accessed 4 September 2010)."},{"key":"key2022022020372799300_b34","unstructured":"Klink, S., Dengel, A. and Kieninger, T. (2000), \u201cDocument structure analysis based on layout and textual features\u201d, Proceedings of the International Workshop on Document Analysis Systems, DAS2000, Boston, MA, 9\u201011 June, pp. 99\u2010111."},{"key":"key2022022020372799300_b35","doi-asserted-by":"crossref","unstructured":"Larkey, L.S. and Croft, W.B. (1996), \u201cCombining classifiers in text categorization\u201d, Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR '96, Zurich, Switzerland, August 18\u201022, ACM, New York, NY, pp. 289\u201097.","DOI":"10.1145\/243199.243276"},{"key":"key2022022020372799300_b36","unstructured":"Lattner, A.D. and Herzog, O. (2003), \u201cInstance\u2010based learning and information extraction for the generation of metadata\u201d, Proceedings of the 3rd International Conference on Knowledge Management. I\u2010KNOW 03, Gaithersburg, MD, 29 November\u20102 December, pp. 472\u20109."},{"key":"key2022022020372799300_b37","doi-asserted-by":"crossref","unstructured":"Lawrence, S., Giles, L. and Bollacker, K. (2002), \u201cDigital libraries and autonomous citation indexing\u201d, Computer, Vol. 32 No. 6, pp. 67\u201071.","DOI":"10.1109\/2.769447"},{"key":"key2022022020372799300_b38","doi-asserted-by":"crossref","unstructured":"Liddy, E.D., Allen, E., Harwell, S., Corieri, S., Yilmazel, O., Ozgencil, N.E., Diekema, A.R., McCracken, N. and Silverstein, J. (2002), \u201cAutomatic metadata generation & evaluation\u201d, Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, 11\u201015 August, pp. 401\u20102.","DOI":"10.1145\/564376.564464"},{"key":"key2022022020372799300_b39","unstructured":"Lin, S., Ng, J.\u2010P., Pradhan, S., Shah, J., Pietrobon, R. and Kan, M.\u2010Y. (2010), \u201cExtracting formulaic and free text clinical research articles metadata using conditional random fields\u201d, Proceedings of the The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics. NAACL HLT 2010, Second Louhi Workshop on Text and Data Mining of Health Documents, Los Angeles, CA, 1\u20106 June, pp. 90\u20105."},{"key":"key2022022020372799300_b42","doi-asserted-by":"crossref","unstructured":"McCallum, A., Nigam, K., Rennie, J. and Seymore, K. (2000), \u201cAutomating the construction of internet portals with machine learning\u201d, Information Retrieval, Vol. 3, pp. 127\u201063.","DOI":"10.1023\/A:1009953814988"},{"key":"key2022022020372799300_b40","unstructured":"Mao, S., Kim, J.W. and Thoma, G.R. (2004), \u201cA dynamic feature generation system for automated metadata extraction in preservation of digital materials\u201d, Proceedings of the First International Workshop on Document Image Analysis for Libraries. DIAL'04, Palo Alto, California, USA, January 23\u201024, IEEE Computer Society, New York, NY, pp. 225\u201032."},{"key":"key2022022020372799300_b41","doi-asserted-by":"crossref","unstructured":"Marinai, S. (2009), \u201cMetadata extraction from PDF papers for digital library ingest\u201d, Proceedings of the 10th International Conference on Document Analysis and Recognition, Barcelona, 26\u201029 July, pp. 251\u20105.","DOI":"10.1109\/ICDAR.2009.232"},{"key":"key2022022020372799300_b44","doi-asserted-by":"crossref","unstructured":"Milosavljevi\u0107, B. and Te\u0161endi\u0107, D. (2010), \u201cSoftware architecture of distributed client\/server library circulation system\u201d, The Electronic Library, Vol. 28 No. 2, pp. 286\u201099.","DOI":"10.1108\/02640471011033648"},{"key":"key2022022020372799300_b45","doi-asserted-by":"crossref","unstructured":"Milosavljevi\u0107, B., Boberi\u0107, D. and Surla, D. (2010), \u201cRetrieval of bibliographic records using Apache Lucene\u201d, The Electronic Library, Vol. 28 No. 4, pp. 525\u201039.","DOI":"10.1108\/02640471011065355"},{"key":"key2022022020372799300_b46","doi-asserted-by":"crossref","unstructured":"Milosavljevi\u0107, G., Ivanovi\u0107, D., Surla, D. and Milosavljevi\u0107, B. (2011), \u201cAutomated construction of the user interface for a CERIF\u2010compliant research management system\u201d, The Electronic Library, Vol. 29 No. 5.","DOI":"10.1108\/02640471111177035"},{"key":"key2022022020372799300_b47","doi-asserted-by":"crossref","unstructured":"Nadeau, D. and Sekine, S. (2007), \u201cA survey of named entity recognition and classification\u201d, Lingvisticae Investigationes, Vol. 30 No. 1, pp. 3\u201026.","DOI":"10.1075\/li.30.1.03nad"},{"key":"key2022022020372799300_b49","doi-asserted-by":"crossref","unstructured":"Ojokoh, A.B., Adewale, S.O. and Falaki, O.S. (2009), \u201cAutomated document metadata extraction\u201d, Journal of Information Science, Vol. 35 No. 5, pp. 563\u201070.","DOI":"10.1177\/0165551509105195"},{"key":"key2022022020372799300_b48","unstructured":"Peng, F. and McCallum, A. (2004), \u201cAccurate information extraction from research papers using conditional random fields\u201d, Proceedings of Human Language Technology Conference and North American Chapter of the Association for Computational Linguistics. HLTNAACL, Boston, MA, 2\u20107 May, pp. 329\u201036."},{"key":"key2022022020372799300_b50","doi-asserted-by":"crossref","unstructured":"Quinlan, J.R. (1986), \u201cInduction of decision trees\u201d, Machine Learning, Vol. 1, pp. 81\u2010106.","DOI":"10.1007\/BF00116251"},{"key":"key2022022020372799300_b51","doi-asserted-by":"crossref","unstructured":"Quinlan, J.R. (1987), \u201cSimplifying decision trees\u201d, International Journal of Man\u2010Machine Studies, Vol. 27, pp. 221\u201034.","DOI":"10.1016\/S0020-7373(87)80053-6"},{"key":"key2022022020372799300_b52","doi-asserted-by":"crossref","unstructured":"Ra\u0111enovi\u0107, J., Milosavljevi\u0107, B. and Surla, D. (2009), \u201cModelling and implementation of catalogue cards using FreeMarker\u201d, Program: electronic library and information systems, Vol. 43 No. 1, pp. 62\u201076.","DOI":"10.1108\/00330330910934110"},{"key":"key2022022020372799300_b53","unstructured":"Ramshaw, L.A. and Marcus, M.P. (1995), \u201cText chunking using transformation\u2010based learning\u201d, Proceedings of the Third ACL Workshop on Very Large Corpora, Boston, MA, 30 June, pp. 82\u201094."},{"key":"key2022022020372799300_b54","unstructured":"Schwartz, C. (2001), Sorting out the Web: Approaches to Subject Access, Ablex, Westport, CT."},{"key":"key2022022020372799300_b55","unstructured":"Sekine, S. and Grishman, R. (1998), \u201cA decision tree method for finding and classifying names in Japanese texts\u201d, Proceedings of the Sixth Workshop on Very Large Corpora, Montreal, Quebec, 15\u201016 August."},{"key":"key2022022020372799300_b56","unstructured":"Takasu, A. (2003), \u201cBibliographic attribute extraction from erroneous references based on a statistical model\u201d, Proceedings of the 3rd ACM\/IEEE\u2010CS Joint Conference on DigitalLlibraries. JCDL '03, Houston, TX, USA, May 27\u201031, IEEE Computer Society, New York, NY, pp. 49\u201060."},{"key":"key2022022020372799300_b57","doi-asserted-by":"crossref","unstructured":"Takeuchi, K. and Collier, N. (2002), \u201cUse of support vector machines in extended named entity recognition\u201d, Proceedings of the 6th Conference on Natural Language Learning, COLING\u201002, Taipei, 27 August, pp. 1\u20107.","DOI":"10.3115\/1118853.1118882"},{"key":"key2022022020372799300_b58","doi-asserted-by":"crossref","unstructured":"Te\u0161endi\u0107, D., Milosavljevi\u0107, B. and Surla, D. (2009), \u201cA library circulation system for city and special libraries\u201d, The Electronic Library, Vol. 27 No. 1, pp. 162\u201086.","DOI":"10.1108\/02640470910934669"},{"key":"key2022022020372799300_b59","unstructured":"Vapnik, V. (1998), Statistical Learning Theory, Wiley, New York, NY."},{"key":"key2022022020372799300_b60","doi-asserted-by":"crossref","unstructured":"Vidakovi\u0107, M., Milosavljevi\u0107, B., Konjovi\u0107, Z. and Sladi\u0107, G. (2009), \u201cExtensible Java EE\u2010based agent framework and its application on distributed library catalogues\u201d, Computer Science and Information Systems, Vol. 6 No. 2, pp. 1\u201028.","DOI":"10.2298\/CSIS0902001V"},{"key":"key2022022020372799300_b61","doi-asserted-by":"crossref","unstructured":"Yang, Y. (2001), \u201cA study of thresholding strategies for text categorization\u201d, Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '01,New Orleans, LA, USA, September 9\u201013, ACM, New York, NY, pp. 137\u201045.","DOI":"10.1145\/383952.383975"},{"key":"key2022022020372799300_b62","doi-asserted-by":"crossref","unstructured":"Yilmazel, O., Finneran, C.M. and Liddy, E.D. (2004), \u201cMetaextract: an NLP system to automatically assign metadata\u201d, Proceedings of the 4th ACM\/IEEE\u2010CS Joint Conference on Digital Libraries, JCDL '04, Tucson, AZ, USA, June 7\u201011, ACM, New York, NY, pp. 241\u20102.","DOI":"10.1145\/996350.996405"},{"key":"key2022022020372799300_b63","unstructured":"Yin, P., Zhang, M., Deng, Z.H. and Yang, D.Q. (2005), \u201cMetadata extraction from bibliographies using bigram HMM\u201d, in Chen, Z., Chen, H., Miao, Q., Fu, Y., Fox, E. and Lim, E.\u2010P. (Eds), Digital Libraries: International Collaboration and Cross\u2010Fertilization, Vol. 3334, Lecture Notes in Computer Science, pp. 1\u201014."},{"key":"key2022022020372799300_b64","unstructured":"Zimmerman, E. (2002), \u201cCris\u2010cross: current research information systems at a crossroads\u201d, in Adamczak, W. and Nase, A. (Eds), Gaining Insight from Research Information. Proceedings of the 6th International Conference on Current Research Information Systems, University of Kassel, Kassel, 29\u201031 August, pp. 29\u201031."},{"key":"key2022022020372799300_frg26","doi-asserted-by":"crossref","unstructured":"Han, H., Manavoglu, E., Zha, H., Tsioutsiouliklis, K., Giles, C.L. and Zhang, X. (2005), \u201cRule\u2010based word clustering for document metadata extraction\u201d, Proceedings of the 2005 ACM Symposium on Applied Computing. SAC '05, Santa Fe, New Mexico, March 13\u201017, ACM, New York, NY, pp. 1049\u201053.","DOI":"10.1145\/1066677.1066917"},{"key":"key2022022020372799300_frg43","unstructured":"McCallum, A., Freitag, D. and Pereira, F. (2000), \u201cMaximum entropy Markov models for information extraction and segmentation\u201d, Proceedings of the 17th International Conference on Machine Learning, Stanford University, Stanford, CA, 29 June\u20102 July, pp. 591\u20108."}],"container-title":["Program"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.emeraldinsight.com\/doi\/full-xml\/10.1108\/00330331111182094","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/00330331111182094\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/00330331111182094\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:55:54Z","timestamp":1753401354000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/dta\/article\/45\/4\/376-396\/330447"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,9,27]]},"references-count":64,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,9,27]]}},"alternative-id":["10.1108\/00330331111182094"],"URL":"https:\/\/doi.org\/10.1108\/00330331111182094","relation":{},"ISSN":["0033-0337"],"issn-type":[{"value":"0033-0337","type":"print"}],"subject":[],"published":{"date-parts":[[2011,9,27]]}}}