{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T18:15:25Z","timestamp":1754158525336,"version":"3.41.2"},"reference-count":53,"publisher":"Emerald","issue":"4","license":[{"start":{"date-parts":[[2019,9,2]],"date-time":"2019-09-02T00:00:00Z","timestamp":1567382400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["EL"],"published-print":{"date-parts":[[2019,9,6]]},"abstract":"<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title>\n<jats:p>This paper aims to describe the structure of an aligned Serbian-German literary corpus (SrpNemKor) contained in a digital library Bibli\u0161a. The goal of the research was to create a benchmark Serbian-German annotated corpus searchable with various query expansions.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title>\n<jats:p>The presented research is particularly focused on the enhancement of bilingual search queries in a full-text search of aligned SrpNemKor collection. The enhancement is based on using existing lexical resources such as Serbian morphological electronic dictionaries and the bilingual lexical database Termi.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Findings<\/jats:title>\n<jats:p>For the purpose of this research, the lexical database Termi is enriched with a bilingual list of German-Serbian translated pairs of lexical units. The list of correct translation pairs was extracted from SrpNemKor, evaluated and integrated into Termi. Also, Serbian morphological e-dictionaries are updated with new entries extracted from the Serbian part of the corpus.<\/jats:p>\n<\/jats:sec>\n<jats:sec>\n<jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title>\n<jats:p>A bilingual search of SrpNemKor in Bibli\u0161a is available within the user-friendly platform. The enriched database Termi enables semantic enhancement and refinement of user\u2019s search query based on synonyms both in Serbian and German at a very high level. Serbian morphological e-dictionaries facilitate the morphological expansion of search queries in Serbian, thereby enabling the analysis of concepts and concept structures by identifying terms assigned to the concept, and by establishing relations between terms in Serbian and German which makes Bibli\u0161a a valuable Web tool that can support research and analysis of SrpNemKor.<\/jats:p>\n<\/jats:sec>","DOI":"10.1108\/el-03-2019-0056","type":"journal-article","created":{"date-parts":[[2019,9,2]],"date-time":"2019-09-02T06:22:21Z","timestamp":1567405341000},"page":"722-739","source":"Crossref","is-referenced-by-count":3,"title":["Bilingual lexical extraction based on word alignment for improving corpus search"],"prefix":"10.1108","volume":"37","author":[{"given":"Jelena","family":"Andonovski","sequence":"first","affiliation":[]},{"given":"Branislava","family":"\u0160andrih","sequence":"additional","affiliation":[]},{"given":"Olivera","family":"Kitanovi\u0107","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2019,9,2]]},"reference":[{"issue":"5","key":"key2021102914342677300_ref001","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1017\/S1351324917000195","article-title":"Leveraging bilingual terminology to improve machine translation in a computer aided translation environment","volume":"23","year":"2017","journal-title":"Natural Language Engineering"},{"issue":"1","key":"key2021102914342677300_ref002","first-page":"279","article-title":"Impact of new technologies in the digital libraries","volume":"6","year":"2019","journal-title":"Journal of Advancements in Library Sciences"},{"issue":"2","key":"key2021102914342677300_ref003","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1108\/JD-06-2018-0087","article-title":"Metadata categorization for identifying search patterns in a digital library","volume":"75","year":"2019","journal-title":"Journal of Documentation"},{"key":"key2021102914342677300_ref004","first-page":"674","article-title":"Identifying bilingual multi-word expressions for statistical machine translation","volume-title":"Proceedings of LREC\u201912 Conference","year":"2012"},{"issue":"2","key":"key2021102914342677300_ref005","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1108\/02640471211221313","article-title":"Multilinguality in the digital library: a review","volume":"30","year":"2012","journal-title":"The Electronic Library"},{"volume-title":"WordNet: An Electronic Lexical Database","year":"1998","key":"key2021102914342677300_ref006"},{"key":"key2021102914342677300_ref007","first-page":"327","article-title":"Extraction and presentation of bilingual correspondences from Slovak-Bulgarian parallel corpus: cognitive studies","volume":"15","year":"2015","journal-title":"\u00c9tudes Cognitives"},{"key":"key2021102914342677300_ref008","unstructured":"Gravano, L. and Henzinger, M.H. (2014), \u201cSystems and methods for using anchor text as parallel corpora for cross-language information retrieval\u201d, US Patent 8,631,010."},{"issue":"1","key":"key2021102914342677300_ref009","first-page":"53a","article-title":"Personal names in information extraction","volume":"11","year":"2010","journal-title":"Infotheca"},{"key":"key2021102914342677300_ref010","first-page":"4210","article-title":"Axolotl: a web accessible parallel corpus for Spanish-Nahuatl","volume-title":"Proceedings of LREC\u201916 Conference, European Language Resources Association","year":"2016"},{"issue":"1","key":"key2021102914342677300_ref011","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1017\/S1351324915000431","article-title":"A classification approach for detecting crosslingual biomedical term translations","volume":"23","year":"2017","journal-title":"Natural Language Engineering"},{"key":"key2021102914342677300_ref012","first-page":"3401","article-title":"Efficient data selection for bilingual terminology extraction from comparable corpora","volume-title":"Proceedings of COLING\u201916","year":"2016"},{"issue":"4","key":"key2021102914342677300_ref013","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1017\/S1351324916000127","article-title":"End-to-end statistical machine translation with zero or small parallel texts","volume":"22","year":"2016","journal-title":"Natural Language Engineering"},{"key":"key2021102914342677300_ref014","first-page":"19","article-title":"Transkribus: a service platform for transcription, recognition and retrieval of historical documents","volume-title":"14th IAPR International Conference on Document Analysis and Recognition (ICDAR\u201917)","year":"2017"},{"key":"key2021102914342677300_ref015","first-page":"48","article-title":"Statistical phrase-based translation","volume-title":"Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology","year":"2003"},{"volume-title":"Bibliotekarski Terminolo\u0161ki Re\u010dnik: Englesko-Srpski, Srpsko-Engleski (Library Terminological Dictionary: English-Serbian, Serbian-English)","year":"2004","key":"key2021102914342677300_ref016"},{"volume-title":"Processing of Serbian \u2013 Automata, Texts and Electronic Dictionaries, Faculty of Philology","year":"2008","key":"key2021102914342677300_ref017"},{"issue":"1\/2","key":"key2021102914342677300_ref018","first-page":"147","article-title":"Using textual and lexical resources in developing Serbian WordNet","volume":"7","year":"2004","journal-title":"Romanian Journal of Information Science and Technology"},{"key":"key2021102914342677300_ref019","first-page":"2487","article-title":"Using English baits to catch Serbian multi-word terminology","volume-title":"Proceedings of LREC\u201918 Conference","year":"2018"},{"volume-title":"Understanding Digital Libraries","year":"2005","key":"key2021102914342677300_ref020"},{"issue":"4","key":"key2021102914342677300_ref021","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/ijl\/3.4.235","article-title":"Introduction to WordNet: an on-line lexical database","volume":"3","year":"1990","journal-title":"International Journal of Lexicography"},{"issue":"5","key":"key2021102914342677300_ref022","first-page":"317","article-title":"Bilingual lexicon extraction from Arabic-English parallel corpora with a view to machine translation","volume":"7","year":"2016","journal-title":"Arab World English Journal"},{"key":"key2021102914342677300_ref023","first-page":"563","article-title":"An integrated environment for development of parallel corpora (in Serbian)","volume-title":"Die Unterschiede Zwischen Dem Bosnischen\/Bosniakischen, Kroatischen Und Serbischen","year":"2008"},{"issue":"1","key":"key2021102914342677300_ref024","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1162\/089120103321337421","article-title":"A systematic comparison of various statistical alignment models","volume":"29","year":"2003","journal-title":"Computational Linguistics"},{"issue":"3","key":"key2021102914342677300_ref025","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1007\/s10590-017-9201-7","article-title":"A system for terminology extraction and translation equivalent detection in real time: efficient use of statistical machine translation phrase tables","volume":"31","year":"2017","journal-title":"Machine Translation"},{"key":"key2021102914342677300_ref026","unstructured":"Open DE WordNet Initiative (2019), available at: https:\/\/ikum.mediencampus.h-da.de\/projekt\/open-de-wordnet-initiative\/ (accessed 28 February 2019)."},{"key":"key2021102914342677300_ref027","unstructured":"Paumier, S. (2011), \u201cUnitex 3.0 user manual\u201d, available at: www.cis.uni-muenchen.de\/people\/lg3\/ManuelUnitex.pdf (accessed 15 December 2018)."},{"key":"key2021102914342677300_ref028","first-page":"20","article-title":"Term extraction, tagging, and mapping tools for under-resourced languages","volume-title":"Proceedings of TKE\u201912","year":"2012"},{"key":"key2021102914342677300_ref029","unstructured":"Savourel, Y. (2004), \u201cTMX 1.4b specification\u201d, available at: www.gala-global.org\/tmx-14b (accessed 25 December 2018)."},{"article-title":"A hybrid approach for automatic extraction of bilingual multiword expressions from parallel corpora","volume-title":"Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC\u201918)","year":"2018","key":"key2021102914342677300_ref030"},{"key":"key2021102914342677300_ref031","first-page":"21","article-title":"Balkanet: a multilingual semantic network for the Balkan languages","volume-title":"Proceedings of the International WordNet Conference","year":"2002"},{"key":"key2021102914342677300_ref036","first-page":"12","article-title":"Developing termbases for expert terminology under the TBX standard","volume-title":"Natural Language Processing for Serbian: Resources and Applications","year":"2014"},{"key":"key2021102914342677300_ref032","first-page":"1764","article-title":"A bilingual digital library for academic and entrepreneurial knowledge management","volume-title":"Proceeding of IFKAD\u201915: Culture, Innovation and Entrepreneurship: Connecting the Knowledge Dots","year":"2015"},{"key":"key2021102914342677300_ref033","first-page":"48","article-title":"Electronic dictionaries \u2013 from file system to lemon based lexical database","volume-title":"Proceedings of LREC\u201918 \u2013 W23 6th Workshop on Linked Data in Linguistics: Towards Linguistic Data Science (LDL\u201918)","year":"2018"},{"key":"key2021102914342677300_ref034","first-page":"507","article-title":"Rule-based automatic multi-word term extraction and lemmatization","volume-title":"Proceedings of LREC\u201916","year":"2016"},{"key":"key2021102914342677300_ref037","first-page":"77","article-title":"Production of morphological dictionaries of multi-word units using a multipurpose tool","volume-title":"Proceedings of the Computational Linguistics-Applications Conference, 17-19 October","year":"2011"},{"key":"key2021102914342677300_ref035","first-page":"112","article-title":"Keyword-based search on bilingual digital libraries","volume-title":"Semantic Keyword-Based Search on Structured Data Sources \u2013 Second COST Action IC1302 International KEYSTONE Conference, IKC\u201916","year":"2017"},{"issue":"1","key":"key2021102914342677300_ref038","first-page":"49a","article-title":"The development of the GeolISSTerm terminological dictionary","volume":"12","year":"2011","journal-title":"INFOtheca"},{"key":"key2021102914342677300_ref039","first-page":"1256","article-title":"Extraction of multi-word expressions from small parallel corpora","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING\u201910","year":"2010"},{"key":"key2021102914342677300_ref040","unstructured":"Utvi\u0107, M. (2013), \u201cThe construction of reference corpus of contemporary Serbian\u201d, PhD thesis, Filolo\u0161ki fakultet, Univerzitet u Beogradu."},{"key":"key2021102914342677300_ref041","first-page":"1091","article-title":"Harvesting multi-word expressions from parallel corpora","volume-title":"Proceedings of LREC\u201908 Conference","year":"2008"},{"key":"key2021102914342677300_ref042","first-page":"148","article-title":"Literature and aligned texts","volume-title":"Readings in Multilinguality","year":"2006"},{"key":"key2021102914342677300_ref043","first-page":"279","article-title":"Processing of corpora of serbian using electronic dictionaries","volume":"63","year":"2012","journal-title":"Prace Filologiczne"},{"key":"key2021102914342677300_ref044","first-page":"3172","article-title":"Innovations in parallel corpus search tools","volume-title":"Proceedings of LREC\u201914","year":"2014"},{"article-title":"Sustaining multilinguality: Case studies of two American multilingual digital libraries","volume-title":"iConference 2019 Proceedings","year":"2019","key":"key2021102914342677300_ref045"},{"issue":"1","key":"key2021102914342677300_ref046","article-title":"Bilingual term alignment from comparable corpora in English discharge summary and chinese discharge summary","volume":"16","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"key2021102914342677300_ref047","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1108\/02640471211221395","article-title":"Bilingual terminology extraction using multi-level termhood","volume":"30","year":"2012","journal-title":"The Electronic Library"},{"key":"key2021102914342677300_ref048","unstructured":"Collins English Dictionary (2019), available at: www.collinsdictionary.com\/dictionary\/english\/token (accessed 5 February 2019)."},{"key":"key2021102914342677300_ref049","first-page":"25","article-title":"The MULTEXT-East morphosyntactic specifications for slavic languages","volume-title":"Proceedings of the Workshop on Morphological Processing of Slavic Languages: 10th Conference of the European Chapter (EACL\u201903)","year":"2003"},{"key":"key2021102914342677300_ref050","first-page":"561","article-title":"Visualising a text with a tree cloud","volume-title":"IFCS\u201909","year":"2010"},{"key":"key2021102914342677300_ref051","first-page":"495","article-title":"An aligned English-Serbian corpus","volume-title":"ELLSIIR Proceedings Volume I, Faculty of Philology","year":"2011"},{"key":"key2021102914342677300_ref052","unstructured":"Sketch Engine (2019), \u201cToken\u201d, available at: www.sketchengine.eu\/my_keywords\/token\/ (accessed 5 February 2019)."},{"key":"key2021102914342677300_ref053","first-page":"207","article-title":"Construction and exploitation of X-Serbian bitexts","volume-title":"Multilingual Processing in Eastern and Southern EU Languages: Low-Resourced Technologies and Translation","year":"2012"}],"container-title":["The Electronic Library"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EL-03-2019-0056\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/EL-03-2019-0056\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T01:06:56Z","timestamp":1753405616000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/el\/article\/37\/4\/722-739\/91050"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,2]]},"references-count":53,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2019,9,2]]},"published-print":{"date-parts":[[2019,9,6]]}},"alternative-id":["10.1108\/EL-03-2019-0056"],"URL":"https:\/\/doi.org\/10.1108\/el-03-2019-0056","relation":{},"ISSN":["0264-0473","0264-0473"],"issn-type":[{"type":"print","value":"0264-0473"},{"type":"print","value":"0264-0473"}],"subject":[],"published":{"date-parts":[[2019,9,2]]}}}