{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T01:11:34Z","timestamp":1654132294696},"reference-count":19,"publisher":"IGI Global","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,4,1]]},"abstract":"<p>Many documents in the World Wide Web present structured information that consists of multiple pieces of data with certain relationships among them. Although it is usually not difficult to identify the individual data values in the document text, their relationships are often not explicitly described in the document content. They are expressed by visual presentation of the document content that is expected to be interpreted by a human reader. In this paper, the authors propose a formal generic model of logical relationships in a document based on an interpretation of visual presentation patterns in the documents. The model describes the visually expressed relationships between individual parts of the contents independently of the document format and the particular way of presentation. Therefore, it can be used as an appropriate document model in many information retrieval or extraction applications. The authors formally define the model, the authors introduce a method of extracting the relationships between the content parts based on the visual presentation analysis and the authors discuss the expected applications. The authors also present a new dataset consisting of programmes of conferences and other scientific events and the authors discuss its suitability for the task in hand. Finally, the authors use the dataset to evaluate results of the implemented system.<\/p>","DOI":"10.4018\/ijcini.2013040102","type":"journal-article","created":{"date-parts":[[2014,2,20]],"date-time":"2014-02-20T14:31:28Z","timestamp":1392906688000},"page":"13-29","source":"Crossref","is-referenced-by-count":0,"title":["Extracting Visually Presented Element Relationships from Web Documents"],"prefix":"10.4018","volume":"7","author":[{"given":"Radek","family":"Burget","sequence":"first","affiliation":[{"name":"Faculty of Information Technology, IT4Innovations Centre of Excellence, Brno University of Technology, Brno, Czech Republic"}]},{"given":"Pavel","family":"Smrz","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, IT4Innovations Centre of Excellence, Brno University of Technology, Brno, Czech Republic"}]}],"member":"2432","reference":[{"key":"ijcini.2013040102-0","doi-asserted-by":"crossref","unstructured":"Burget, R. (2004). Hierarchies in HTML documents: Linking text to concepts. In Proceedings of the 15th International Workshop on Database and Expert Systems Applications (pp. 186\u2013190). IEEE Computer Society.","DOI":"10.1109\/DEXA.2004.1333471"},{"key":"ijcini.2013040102-1","doi-asserted-by":"publisher","DOI":"10.1504\/IJIIDS.2011.041322"},{"key":"ijcini.2013040102-2","author":"D.Cai","year":"2003","journal-title":"VIPS: A vision-based page segmentation algorithm"},{"key":"ijcini.2013040102-3","doi-asserted-by":"crossref","unstructured":"Finkel, J. R., Grenager, T., & Manning, C. (2005). Incorporating non-local information into information extraction systems by Gibbs sampling. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (ACL \u201905) (pp. 363\u2013370).","DOI":"10.3115\/1219840.1219885"},{"key":"ijcini.2013040102-4","unstructured":"Klink, S., Dengel, A., & Kieninger, T. (2000). Document structure analysis based on layout and textual features. In Proc. of International Workshop on Document Analysis Systems, Brazil (pp. 99\u2013111). IAPR."},{"key":"ijcini.2013040102-5","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1460"},{"issue":"3","key":"ijcini.2013040102-6","first-page":"482","article-title":"Building association link network for semantic link on web resources. Automation Science and Engineering","volume":"8","author":"X.Luo","year":"2011","journal-title":"IEEE Transactions on"},{"key":"ijcini.2013040102-7","doi-asserted-by":"publisher","DOI":"10.4018\/jdls.2010100101"},{"key":"ijcini.2013040102-8","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1007\/978-1-84628-726-8_2","article-title":"Document structure and layout analysis","author":"A.Namboodiri","year":"2007","journal-title":"Digital document processing, Advances in pattern recognition"},{"key":"ijcini.2013040102-9","doi-asserted-by":"crossref","unstructured":"Nojoumian, M., & Lethbridge, T. C. (2007). Extracting document structure to facilitate a knowledge base creation for the uml superstructure specification. In Proceedings of the International Conference on Information Technology (pp. 393\u2013400).","DOI":"10.1109\/ITNG.2007.93"},{"key":"ijcini.2013040102-10","unstructured":"Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The pagerank citation ranking: Bringing order to the web. Technical Report 1999-66, Stanford InfoLab."},{"key":"ijcini.2013040102-11","doi-asserted-by":"crossref","unstructured":"Rauf, R., Antkiewicz, M., & Czarnecki, K. (2011). Logical structure extraction from software requirements documents. In Proceedings of the Requirements Engineering Conference (RE), 2011 19th IEEE International (pp. 101\u2013110).","DOI":"10.1109\/RE.2011.6051638"},{"key":"ijcini.2013040102-12","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1075\/ata.xiii.22shr","article-title":"Corpus enhancement and computer-assisted localization and translation","author":"G. M.Shreve","year":"2006","journal-title":"Perspectives on localization"},{"key":"ijcini.2013040102-13","doi-asserted-by":"crossref","unstructured":"Stoffel, A., Spretke, D., Kinnemann, H., & Keim, D. A. (2010). Enhancing document structure analysis using visual analytics. In Proceedings of the 2010 ACM Symposium on Applied Computing (SAC \u201910) (pp. 8\u201312). New York, NY: ACM.","DOI":"10.1145\/1774088.1774091"},{"key":"ijcini.2013040102-14","unstructured":"Summers, K. (1995). Toward a taxonomy of logical document structures. In Electronic Publishing and the Information Superhighway: Proceedings of the Dartmouth Institute for Advanced Graduate Studies (pp. 124\u2013133)."},{"key":"ijcini.2013040102-15","doi-asserted-by":"crossref","unstructured":"Tang, J., Hong, M., Li, J., & Liang, B. (2006). Tree-structured conditional random fields for semantic annotation. In Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., \u2026 Aroyo, L. (Eds.), The semantic web - ISWC 2006 (Vol. 4273 of Lecture Notes in Computer Science, pp. 640\u2013653). Springer Berlin Heidelberg.","DOI":"10.1007\/11926078_46"},{"key":"ijcini.2013040102-16","doi-asserted-by":"crossref","unstructured":"Yashiro, H., Murakami, T., Shima, Y., Nakano, Y., & Fujisawa, H. (1989). A new method of document structure extraction using generic layout knowledge. In Proceedings of the International Workshop on Industrial Applications of Machine Intelligence and Vision, Tokyo, Japan (pp. 282\u2013287).","DOI":"10.1109\/MIV.1989.40564"},{"key":"ijcini.2013040102-17","doi-asserted-by":"crossref","unstructured":"You, Y., Xu, G., Cao, J., Zhang, Y., & Huang, G. (2013). Leveraging visual features and hierarchical dependencies for conference information extraction. In Ishikawa, Y., Li, J., Wang, W., Zhang, R., & Zhang, W. (Eds.), Web technologies and applications (Vol. 7808 of Lecture Notes in Computer Science, pp. 404\u2013416). Springer Berlin Heidelberg.","DOI":"10.1007\/978-3-642-37401-2_41"},{"key":"ijcini.2013040102-18","author":"S.Yu","year":"2002","journal-title":"Improving pseudo-relevance feedback in web information retrieval using web page segmentation"}],"container-title":["International Journal of Cognitive Informatics and Natural Intelligence"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=101815","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T00:28:30Z","timestamp":1654129710000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/ijcini.2013040102"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2013,4,1]]},"references-count":19,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2013,4]]}},"URL":"https:\/\/doi.org\/10.4018\/ijcini.2013040102","relation":{},"ISSN":["1557-3958","1557-3966"],"issn-type":[{"value":"1557-3958","type":"print"},{"value":"1557-3966","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,4,1]]}}}