{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T22:44:43Z","timestamp":1759963483490,"version":"3.38.0"},"publisher-location":"Dordrecht","reference-count":97,"publisher":"Springer Netherlands","isbn-type":[{"type":"print","value":"9789048191772"},{"type":"electronic","value":"9789048191789"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010]]},"DOI":"10.1007\/978-90-481-9178-9_1","type":"book-chapter","created":{"date-parts":[[2010,9,30]],"date-time":"2010-09-30T18:07:57Z","timestamp":1285870077000},"page":"3-30","source":"Crossref","is-referenced-by-count":8,"title":["Riding the Rough Waves of Genre on the Web"],"prefix":"10.1007","author":[{"given":"Marina","family":"Santini","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Mehler","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Serge","family":"Sharoff","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2010,8,16]]},"reference":[{"key":"1_CR1","doi-asserted-by":"crossref","unstructured":"Amitay, E., D. Carmel, A. Darlow, R. Lempel, and A. Soffer. 2003. The connectivity sonar: Detecting site functionality by structural patterns. In Proceedings of the 14th ACM Conference on Hypertext and Hypermedia, 38\u201347. University of Nottingham, UK.","DOI":"10.1145\/900051.900060"},{"key":"1_CR2","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1002\/aris.2008.1440420115","volume":"42","author":"J. Andersen","year":"2008","unstructured":"Andersen, J. 2008. The concept of genre in information studies. 
Annual Review of Information Science & Technology 42:339, 2007.","journal-title":"Annual Review of Information Science & Technology"},{"issue":"5","key":"1_CR3","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1002\/bult.2008.1720340511","volume":"34","author":"J. Andersen","year":"2008","unstructured":"Andersen, J. 2008. Bringing genre into focus: Lis and genre between people, texts, activity and situation. Bulletin of the American Society for Information Science and Technology 34(5):31\u201334.","journal-title":"Bulletin of the American Society for Information Science and Technology"},{"issue":"2","key":"1_CR4","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1108\/09593840510601504","volume":"18","author":"I. Askehave","year":"2005","unstructured":"Askehave, I., and A.E. Nielsen. 2005. Digital genres: A challenge to traditional genre theory. Information Technology & People 18(2):120\u2013141.","journal-title":"Information Technology & People"},{"key":"1_CR5","volume-title":"The BNC handbook: Exploring the British National Corpus with SARA","author":"G. Aston","year":"1998","unstructured":"Aston, G., and L. Burnard. 1998. The BNC handbook: Exploring the British National Corpus with SARA. Edinburgh: Edinburgh University Press."},{"key":"1_CR6","doi-asserted-by":"crossref","unstructured":"Barnard, D.T., L. Burnard, S.J. DeRose, D.G. Durand, and C.M. Sperberg-McQueen. 1995. Lessons for the World Wide Web from the text encoding initiative. In Proceedings of the 4th international World Wide Web conference \u201cThe Web Revolution\u201d. Boston, MA.","DOI":"10.1145\/3592626.3592654"},{"key":"1_CR7","doi-asserted-by":"crossref","unstructured":"Baroni, M., and A. Kilgarriff. 2006. Large linguistically-processed Web corpora for multiple languages. In Companion Volume to Proceedings of the European Association of Computational Linguistics, 87\u201390. Trento.","DOI":"10.3115\/1608974.1608976"},{"key":"1_CR8","unstructured":"Baroni, M., F. Chantree, A. 
Kilgarriff, and S. Sharoff. 2008. Cleaneval: A competition for cleaning web pages. In Proceedings of the 6th Language Resources and Evaluation Conference (LREC 2008). Marrakech."},{"key":"1_CR9","doi-asserted-by":"publisher","DOI":"10.1057\/9780230582323","volume-title":"Multimodality and genre: A foundation for the systematic analysis of multimodal documents","author":"J.A. Bateman","year":"2008","unstructured":"Bateman, J.A. 2008. Multimodality and genre: A foundation for the systematic analysis of multimodal documents. London: Palgrave Macmillan."},{"issue":"3","key":"1_CR10","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1162\/089120101317066131","volume":"27","author":"J.A. Bateman","year":"2001","unstructured":"Bateman, J.A., T. Kamps, J. Kleinz, and K. Reichenberger. 2001. Towards constructive text, diagram, and layout generation for information presentation. Computational Linguistics 27(3):409\u2013449.","journal-title":"Computational Linguistics"},{"key":"1_CR11","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511621024","volume-title":"Variation across speech and writing","author":"D. Biber","year":"1988","unstructured":"Biber, D. 1988. Variation across speech and writing. Cambridge, MA: Cambridge University Press."},{"issue":"3","key":"1_CR12","first-page":"43","volume":"27","author":"D. Biber","year":"1989","unstructured":"Biber, D. 1989. A typology of English texts. Linguistics 27(3):43\u201358.","journal-title":"Linguistics"},{"key":"1_CR13","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511519871","volume-title":"Dimensions of register variation: A cross-linguistic comparison","author":"D. Biber","year":"1995","unstructured":"Biber, D. 1995. Dimensions of register variation: A cross-linguistic comparison. Cambridge, MA: Cambridge University Press."},{"key":"1_CR14","doi-asserted-by":"crossref","DOI":"10.1075\/scl.28","volume-title":"Discourse on the move: Using corpus analysis to describe discourse structure","author":"D. 
Biber","year":"2007","unstructured":"Biber, D., U. Connor, and T.A. Upton. 2007. Discourse on the move: Using corpus analysis to describe discourse structure. Amsterdam: Benjamins."},{"key":"1_CR15","unstructured":"Bj\u00f6rneborn, L. 2004. Small-world link structures across an academic web space: A library and information science approach. PhD thesis, Royal School of Library and Information Science, Department of Information Studies, Denmark."},{"key":"1_CR16","volume-title":"Genres on the web: Computational models and empirical studies","author":"L. Bj\u00f6rneborn","year":"2010","unstructured":"Bj\u00f6rneborn, L. 2010. Genre connectivity and genre drift in a web of genres. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR17","volume-title":"Genres on the web: Computational models and empirical studies","author":"P. Braslavski","year":"2010","unstructured":"Braslavski, P. 2010. Marrying relevance and genre rankings: An exploratory study. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR18","volume-title":"Academic writing and genre: A systematic analysis","author":"I. Bruce","year":"2008","unstructured":"Bruce, I. 2008. Academic writing and genre: A systematic analysis. London: Continuum."},{"key":"1_CR19","volume-title":"Genres on the web: Computational models and empirical studies","author":"I. Bruce","year":"2010","unstructured":"Bruce, I. 2010. Evolving genres in online domains: The hybrid genre of the participatory news article. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR20","doi-asserted-by":"crossref","unstructured":"Chakrabarti, S. 2001. 
Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction. In Proceedings of the 10th International World Wide Web Conference, May 1\u20135, 211\u2013220. Hong Kong.","DOI":"10.1145\/371920.372054"},{"key":"1_CR21","doi-asserted-by":"crossref","unstructured":"Chakrabarti, S., M. van den Berg, and B. Dom. 1999. Focused crawling: A new approach to topic-specific web resource discovery. In Proceedings of the 8th International World Wide Web Conference. Toronto, ON.","DOI":"10.1016\/S1389-1286(99)00052-3"},{"key":"1_CR22","doi-asserted-by":"crossref","unstructured":"Chakrabarti, S., M. Joshi, K. Punera, and D.M. Pennock. 2002. The structure of broad topics on the web. In Proceedings of the 11th International World Wide Web Conference, 251\u2013262. New York, NY: ACM Press.","DOI":"10.1145\/511446.511480"},{"key":"1_CR23","first-page":"430","volume-title":"Advances in Neural Information Processing Systems 13, Papers from Neural Information Processing Systems (NIPS)","author":"D.A. Cohn","year":"2000","unstructured":"Cohn, D.A., and T. Hofmann. 2000. The missing link \u2013 a probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems 13, Papers from Neural Information Processing Systems (NIPS), eds. T.K. Leen, T.G. Dietterich, and V. Tresp, 430\u2013436. Denver, CO: MIT Press,"},{"issue":"2","key":"1_CR24","doi-asserted-by":"publisher","first-page":"115","DOI":"10.3366\/E1749503208000129","volume":"3","author":"A. Condamines","year":"2008","unstructured":"Condamines, A. 2008. Taking genre into account when analysing conceptual relation patterns. Corpora 3(2):115\u2013140.","journal-title":"Corpora"},{"issue":"1\u20132","key":"1_CR25","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1016\/S0004-3702(00)00004-7","volume":"118","author":"M. Craven","year":"2000","unstructured":"Craven, M., D. DiPasquo, D. Freitag, A.K. McCallum, T.M. Mitchell, K. 
Nigam, and S. Slattery. 2000. Learning to construct knowledge bases from the World Wide Web. Artificial Intelligence 118(1\u20132):69\u2013113.","journal-title":"Artificial Intelligence"},{"key":"1_CR26","volume-title":"Genres on the web: Computational models and empirical studies","author":"M. Dehmer","year":"2010","unstructured":"Dehmer, M., and F. Emmert-Streib. 2010. Mining graph patterns in web-based systems: A conceptual view. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"issue":"3","key":"1_CR27","doi-asserted-by":"publisher","first-page":"35","DOI":"10.3166\/dn.8.3.35-54","volume":"8","author":"L. Denoyer","year":"2004","unstructured":"Denoyer, L., and P. Gallinari. 2004. Un mod\u00e8le de mixture de mod\u00e8les g\u00e9n\u00e9ratifs pour les documents structur\u00e9s multim\u00e9dias. Document num\u00e9rique 8(3):35\u201354.","journal-title":"Document num\u00e9rique"},{"key":"1_CR28","doi-asserted-by":"crossref","unstructured":"Diligenti, M., M. Gori, M. Maggini, and F. Scarselli. 2001. Classification of HTML documents by hidden tree-markov models. In Proceedings of the International Conference on Document Analysis and Recognition (ICDAR), 849\u2013853. Seattle, WA.","DOI":"10.1109\/ICDAR.2001.953907"},{"issue":"5","key":"1_CR29","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1002\/bult.2008.1720340507","volume":"34","author":"A. Dillon","year":"2008","unstructured":"Dillon, A. 2008. Bringing genre into focus: Why information has shape. Bulletin of the American Society for Information Science and Technology 34(5):17\u201319.","journal-title":"Bulletin of the American Society for Information Science and Technology"},{"issue":"1","key":"1_CR30","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1145\/1189740.1189744","volume":"7","author":"D. Donato","year":"2007","unstructured":"Donato, D., L. Laura, S. 
Leonardi, and S. Millozzi. 2007. The web as a graph: How far we are. ACM Transactions on Internet Technology 7(1):4.","journal-title":"ACM Transactions on Internet Technology"},{"key":"1_CR31","doi-asserted-by":"crossref","unstructured":"Eiron, N., and K.S. McCurley. 2003. Untangling compound documents on the web. In Proceedings of the 14th ACM Conference on Hypertext and Hypermedia, 85\u201394. Nottingham.","DOI":"10.1145\/900051.900070"},{"key":"1_CR32","doi-asserted-by":"crossref","unstructured":"Ester, M., H.-P. Kriegel, and M. Schubert. 2002. Web site mining: A new way to spot competitors, customers and suppliers in the world wide web. In KDD \u201902: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 249\u2013258. New York, NY: ACM Press.","DOI":"10.1145\/775047.775084"},{"key":"1_CR33","unstructured":"Ferraresi, A., E. Zanchetta, S. Bernardini, and M. Baroni. 2008. Introducing and evaluating ukWaC, a very large web-derived corpus of English. In The 4th Web as Corpus Workshop: Can We Beat Google? (At LREC 2008). Marrakech."},{"key":"1_CR34","volume-title":"Corpus linguistics in North America 2002: Selections from the 4th North American Symposium of the American Association for applied corpus linguistics","author":"W.H. Fletcher","year":"2004","unstructured":"Fletcher, W.H. 2004. Making the web more useful as a source for linguistic corpora. In Corpus linguistics in North America 2002: Selections from the 4th North American Symposium of the American Association for applied corpus linguistics, eds. U. Connor, and T. Upton. Editions Rodopi: Amsterdam\/New York."},{"issue":"2\u20133","key":"1_CR35","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1023\/A:1013681528748","volume":"18","author":"P. Frasconi","year":"2002","unstructured":"Frasconi, P., G. Soda, and A. Vullo. 2002. Hidden Markov models for text categorization in multi-page documents. 
Journal of Intelligent Information Systems 18(2\u20133):195\u2013217.","journal-title":"Journal of Intelligent Information Systems"},{"key":"1_CR36","doi-asserted-by":"crossref","unstructured":"Freund, L. 2008. Exploiting task-document relationships to support information retrieval in the workplace. PhD thesis, University of Toronto.","DOI":"10.1145\/1480506.1480529"},{"key":"1_CR37","unstructured":"Freund, L., and C. Nilsen. 2008. Assessing a genre-based approach to online government information. In Proceedings of the 36th Annual Conference of the Canadian Association for Information Science (CAIS). University of British Columbia, Vancouver."},{"key":"1_CR38","volume-title":"Genres on the web: Computational models and empirical studies","author":"J. Grieve","year":"2010","unstructured":"Grieve, J., D. Biber, E. Friginal, and T. Nekrasova. 2010. Variation among blogs: A multi-dimensional analysis. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR39","unstructured":"Gunnarsson, M. 2010. Classification along genre dimensions. PhD, Inst. f. Biblioteks- och Informationsvetenskap, G\u00f6teborgs Universitet."},{"key":"1_CR40","doi-asserted-by":"crossref","unstructured":"Gupta, S., H. Becker, G. Kaiser, and S. Stolfo. 2006. Verifying genre-based clustering approach to content extraction. In Proceedings of the 15th International Conference on World Wide Web, 875\u2013876. New York, NY: ACM Press.","DOI":"10.1145\/1135777.1135922"},{"issue":"2","key":"1_CR41","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1145\/1230819.1241670","volume":"50","author":"B. He","year":"2007","unstructured":"He, B., M. Patel, Z. Zhang, and K. Chen-Chuan Chang. 2007. Accessing the deep web: A survey. 
Communications of the ACM 50(2):94\u2013101.","journal-title":"Communications of the ACM"},{"key":"1_CR42","doi-asserted-by":"crossref","unstructured":"Herring, S.C., I. Kouper, J.C. Paolillo, L.A. Scheidt, M. Tyworth, P. Welsch, E. Wright, and N. Yu. 2005. Conversations in the blogosphere: An analysis \u201cfrom the bottom up\u201d. In Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS\u201905). Big Island, Hawaii.","DOI":"10.1109\/HICSS.2005.167"},{"key":"1_CR43","doi-asserted-by":"crossref","DOI":"10.1075\/pbns.174","volume-title":"Email hoaxes: Form, function, genre ecology","author":"T. Heyd","year":"2008","unstructured":"Heyd, T. 2008. Email hoaxes: Form, function, genre ecology. Amsterdam: Benjamins."},{"key":"1_CR44","unstructured":"Ide, N., R. Reppen, and K. Suderman. 2002. The American National Corpus: More than the Web can provide. In Proceedings of the 3rd Language Resources and Evaluation Conference, 839\u2013844. Las Palmas."},{"key":"1_CR45","unstructured":"Joachims, T., N. Cristianini, and J. Shawe-Taylor. 2001. Composite kernels for hypertext categorisation. In Proceedings of the 11th International Conference on Machine Learning, 250\u2013257. San Francisco, CA: Morgan Kaufmann."},{"key":"1_CR46","doi-asserted-by":"crossref","unstructured":"Kanaris, I., and E. Stamatatos. 2007. Webpage genre identification using variable-length character n-grams. In Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI\u201907), Washington, DC: IEEE Computer Society.","DOI":"10.1109\/ICTAI.2007.107"},{"key":"1_CR47","doi-asserted-by":"crossref","unstructured":"Karlgren, J., and D. Cutting. 1994. Recognizing text genres with simple metrics using discriminant analysis. In Proceedings of the 15th Conference on Computational Linguistics, vol. 2, 1071\u20131075. Kyoto.","DOI":"10.3115\/991250.991324"},{"key":"1_CR48","doi-asserted-by":"crossref","unstructured":"Kessler, B., G. 
Nunberg, and H. Sch\u00fctze. 1997. Automatic detection of text genre. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics. 32\u201338. Madrid, Spain.","DOI":"10.3115\/979617.979622"},{"key":"1_CR49","volume-title":"Genres on the web: Computational models and empirical studies","author":"Y. Kim","year":"2010","unstructured":"Kim, Y., and S. Ross. 2010. Formulating representative features with respect to genre classification. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR50","first-page":"127","volume-title":"Databases and applications","author":"H.-P. Kriegel","year":"2004","unstructured":"Kriegel, H.-P., and M. Schubert. 2004. Classification of websites as sets of feature vectors. In Databases and applications, ed. M.H. Hamza, 127\u2013132. Anaheim, CA: IASTED\/ACTA Press."},{"key":"1_CR51","volume-title":"Computational analysis of presentday American English","author":"H. Kucera","year":"1967","unstructured":"Kucera, H., and W.N. Francis. 1967. Computational analysis of presentday American English. Providence, RI: Brown University Press."},{"issue":"12","key":"1_CR52","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1145\/1035134.1035162","volume":"47","author":"R. Kumar","year":"2004","unstructured":"Kumar, R., J. Novak, P. Raghavan, and A. Tomkins. 2004. Structure and evolution of blogspace. Communications of the ACM 47(12):35\u201339.","journal-title":"Communications of the ACM"},{"issue":"3","key":"1_CR53","first-page":"37","volume":"5","author":"D. Lee","year":"2001","unstructured":"Lee, D. 2001. Genres, registers, text types, domains, and styles: clarifying the concepts and navigating a path through the BNC jungle. 
Language Learning and Technology 5(3): 37\u201372.","journal-title":"Language Learning and Technology"},{"key":"1_CR54","doi-asserted-by":"crossref","unstructured":"Li, W.-S., O. Kolak, Q. Vu, and H. Takano. 2000. Defining logical domains in a web site. In Proceedings of the 11th ACM on Hypertext and Hypermedia, 123\u2013132. San Antonio, TX.","DOI":"10.1145\/336296.336345"},{"issue":"4","key":"1_CR55","doi-asserted-by":"publisher","first-page":"768","DOI":"10.1109\/TKDE.2002.1019208","volume":"14","author":"W.-S. Li","year":"2002","unstructured":"Li, W.-S., K.S. Candan, Q. Vu, and D. Agrawal. 2002. Query relaxation by structure and semantics for retrieval of logical web documents. IEEE Transactions on Knowledge and Data Engineering 14(4):768\u2013791.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"5","key":"1_CR56","doi-asserted-by":"publisher","first-page":"1263","DOI":"10.1016\/j.ipm.2004.06.004","volume":"41","author":"C.S. Lim","year":"2005","unstructured":"Lim, C.S., K.J. Lee, and G.C. Kim. 2005. Multiple sets of features for automatic genre classification of web documents. Information Processing & Management 41(5):1263\u20131276.","journal-title":"Information Processing & Management"},{"key":"1_CR57","volume-title":"Genres on the web: Computational models and empirical studies","author":"C. Lindemann","year":"2010","unstructured":"Lindemann, C., and L. Littig. 2010. Classification of web sites at super-genre level. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"issue":"2","key":"1_CR58","doi-asserted-by":"publisher","first-page":"141","DOI":"10.3366\/E1749503208000130","volume":"3","author":"E. Marshman","year":"2008","unstructured":"Marshman, E., M.-C. L\u2019Homme, and V. Surtees. 2008. 
Portability of cause-effect relation markers across specialised domains and text genres: a comparative evaluation. Corpora 3(2):141\u2013172.","journal-title":"Corpora"},{"key":"1_CR59","first-page":"29","volume":"21","author":"J.R. Martin","year":"1994","unstructured":"Martin, J.R. 1994. Macro-genres: The ecology of the page. Network 21: 29\u201352.","journal-title":"Network"},{"key":"1_CR60","volume-title":"Genre relations: Mapping culture","author":"J.R. Martin","year":"2008","unstructured":"Martin, J.R., and D. Rose. 2008. Genre relations: Mapping culture. London & Oakland: Equinox Pub."},{"issue":"7&8","key":"1_CR61","doi-asserted-by":"publisher","first-page":"619","DOI":"10.1080\/08839510802164085","volume":"22","author":"A. Mehler","year":"2008","unstructured":"Mehler, A. 2008. Structural similarities of complex networks: A computational model by example of wiki graphs. Applied Artificial Intelligence 22(7&8):619\u2013683.","journal-title":"Applied Artificial Intelligence"},{"key":"1_CR62","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1007\/978-90-481-3331-4_12","volume-title":"Linguistic modeling of information and markup languages. Contributions to language technology","author":"A. Mehler","year":"2010","unstructured":"Mehler, A. 2010. Structure formation in the web. A graph-theoretical model of hypertext types. In Linguistic modeling of information and markup languages. Contributions to language technology, eds. A. Witt and D. Metzing, Text, Speech and Language Technology, 225\u2013247. Dordrecht: Springer."},{"key":"1_CR63","volume-title":"Analysis of complex networks: From biology to linguistics","author":"A. Mehler","year":"2009","unstructured":"Mehler, A. 2009b. Generalised shortest paths trees: A novel graph class applied to semiotic networks. In Analysis of complex networks: From biology to linguistics, eds. M. Dehmer and F. Emmert-Streib. 
Weinheim: Wiley-VCH."},{"key":"1_CR64","volume-title":"Towards an information theory of complex networks: Statistical methods and applications","author":"A. Mehler","year":"2010","unstructured":"Mehler, A. 2010. A quantitative graph model of social ontologies by example of Wikipedia. In Towards an information theory of complex networks: Statistical methods and applications, eds. M. Dehmer, F. Emmert-Streib, and A. Mehler. Boston, MA\/Basel: Birkh\u00e4user."},{"key":"1_CR65","doi-asserted-by":"publisher","first-page":"136","DOI":"10.1007\/11553762_14","volume-title":"Proceedings of the 4th International Workshop on Innovative Internet Computing Systems (I2CS \u201904)","author":"A. Mehler","year":"2006","unstructured":"Mehler, A., M. Dehmer, and R. Gleim. 2006. Towards logical hypertext structure: A graph-theoretic perspective. In Proceedings of the 4th International Workshop on Innovative Internet Computing Systems (I2CS \u201904), eds. T. B\u00f6hme and G. Heyer, Lecture Notes in Computer Science, vol. 3473, 136\u2013150. Berlin\/New York, NY: Springer."},{"key":"1_CR66","unstructured":"Mehler, A., R. Gleim, and A. Wegner. 2007. Structural uncertainty of hypertext types. An empirical study. In Proceedings of the Workshop \u201cTowards Genre-Enabled Search Engines: The Impact of NLP\u201d, September, 30, 2007, in Conjunction with RANLP 2007, 13\u201319. Borovets, Bulgaria."},{"issue":"14","key":"1_CR67","doi-asserted-by":"publisher","first-page":"1261","DOI":"10.1002\/asi.20081","volume":"55","author":"F. Menczer","year":"2004","unstructured":"Menczer, F. 2004. Lexical and semantic clustering by web links. Journal of the American Society for Information Science and Technology 55(14):1261\u20131269.","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"1_CR68","doi-asserted-by":"publisher","first-page":"1410","DOI":"10.1016\/j.ipm.2008.02.001","volume":"44","author":"M. 
Montesi","year":"2008","unstructured":"Montesi, M., and T. Navarrete. 2008. Classifying web genres in context: A case study documenting the web genres used by a software engineer. Information Processing and Management 44:1410\u20131430.","journal-title":"Information Processing and Management"},{"key":"1_CR69","doi-asserted-by":"crossref","unstructured":"Ounis, I., M. de Rijke, C. Macdonald, G. Mishne, and I. Soboroff. 2006. Overview of the trec 2006 blog track. In Proceedings of the Text Retrieval Conference (TREC). NIST.","DOI":"10.6028\/NIST.SP.500-272.blog-overview"},{"key":"1_CR70","unstructured":"P\u00e4iv\u00e4rinta, T., M. Shepherd, L. Svensson, and M. Rossi. 2008. A special issue editorial. Scandinavian Journal of Information Systems 20(1)."},{"key":"1_CR71","doi-asserted-by":"crossref","unstructured":"Pirolli, P., J. Pitkow, and R. Rao. 1996. Silk from a sow\u2019s ear: Extracting usable structures from the web. In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing, 118\u2013125. New York, NY: ACM Press.","DOI":"10.1145\/238386.238450"},{"issue":"2","key":"1_CR72","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1162\/089120103322145315","volume":"29","author":"R. Power","year":"2003","unstructured":"Power, R., D. Scott, and N. Bouayad-Agha. 2003. Document structure. Computational Linguistics 29(2):211\u2013260.","journal-title":"Computational Linguistics"},{"key":"1_CR73","unstructured":"Raiko, T., K. Kersting, J. Karhunen, and L. de Raedt. 2002. Bayesian learning of logical hidden Markov models. In Proceedings of the Finnish AI Conference (STeP-2002), 64\u201371. Finland."},{"key":"1_CR74","doi-asserted-by":"crossref","unstructured":"Rehm, G. 2002. Towards automatic web genre identification \u2013 A corpus-based approach in the domain of academia by example of the academic\u2019s personal homepage. In Proceedings of the Hawaii International Conference on System Sciences. 
Big Island, Hawaii.","DOI":"10.1109\/HICSS.2002.994036"},{"key":"1_CR75","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/978-90-481-3331-4_8","volume-title":"Linguistic Modeling of Information and Markup Languages. Contributions to Language Technology","author":"G. Rehm","year":"2010","unstructured":"Rehm, G. 2010. Hypertext types and markup languages. The relationship between HTML and web genres. In Linguistic Modeling of Information and Markup Languages. Contributions to Language Technology, eds. A. Witt and D. Metzing, Text, Speech and Language Technology, 143\u2013164. Dordrecht: Springer."},{"issue":"5","key":"1_CR76","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1002\/bult.2008.1720340508","volume":"34","author":"M.A. Rosso","year":"2008","unstructured":"Rosso, M.A. 2008. Bringing genre into focus: Stalking the wild web genre (with apologies to euell gibbons). Bulletin of the American Society for Information Science and Technology 34(5):20\u201322.","journal-title":"Bulletin of the American Society for Information Science and Technology"},{"key":"1_CR77","volume-title":"Genres on the web: Computational models and empirical studies","author":"M.A. Rosso","year":"2010","unstructured":"Rosso, M.A., and S.W. Haas. 2010. Identification of web genres by user warrant. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR78","doi-asserted-by":"crossref","unstructured":"Santini, M. 2007a. Characterizing genres of web pages: Genre hybridism and individualization. In Proceedings of the 40th Annual Hawaii International Conference on System Sciences (HICSS\u201907). Big Island, Hawaii.","DOI":"10.1109\/HICSS.2007.124"},{"key":"1_CR79","unstructured":"Santini, M. 2007b. Automatic identification of genre in Web pages. 
PhD thesis, University of Brighton, Brighton."},{"key":"1_CR80","volume-title":"Genres on the web: Computational models and empirical studies","author":"M. Santini","year":"2010","unstructured":"Santini, M. 2010. Cross-testing a genre classification model for the web. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR81","first-page":"63","volume-title":"WaCky! Working Papers on the Web as Corpus","author":"S. Sharoff","year":"2006","unstructured":"Sharoff, S. 2006. Creating general-purpose corpora using automated search engine queries. In WaCky! Working Papers on the Web as Corpus, eds. M. Baroni and S. Bernardini, 63\u201368. Bologna: Gedit."},{"key":"1_CR82","unstructured":"Sharoff, S. 2007. Classifying web corpora into domain and genre using automatic feature identification. In Proceedings of Web as Corpus Workshop. Louvain-la-Neuve."},{"key":"1_CR83","volume-title":"Genres on the web: Computational models and empirical studies","author":"S. Sharoff","year":"2010","unstructured":"Sharoff, S. 2010. In the garden and in the jungle. Comparing genres in the bnc and internet. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"volume-title":"Looking up: An account of the COBUILD project in lexical computing","year":"1987","key":"1_CR84","unstructured":"Sinclair, J. ed. 1987. Looking up: An account of the COBUILD project in lexical computing. London and Glasgow: Collins."},{"key":"1_CR85","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1075\/tlrp.6.21sin","volume-title":"A practical guide to lexicography","author":"J. Sinclair","year":"2003","unstructured":"Sinclair, J. 2003. Corpora for lexicography. In ed. P. van Sterkenberg, A practical guide to lexicography, 167\u2013178. 
Amsterdam: Benjamins."},{"key":"1_CR86","volume-title":"Genres on the web: Computational models and empirical studies","author":"B. Stein","year":"2010","unstructured":"Stein, B., S. Meyer zu Eissen, and N. Lipka. 2010. Web genre analysis: Use cases, retrieval models, and implementation issues. In Genres on the web: Computational models and empirical studies, eds. A. Mehler, S. Sharoff, and M. Santini, Text, Speech and Language Technology. Dordrecht: Springer."},{"key":"1_CR87","unstructured":"Stewart, J.G. 2008. Genre oriented summarization. PhD thesis, Carnegie Mellon University."},{"key":"1_CR88","doi-asserted-by":"crossref","unstructured":"Sun, A., and E.-P. Lim. 2003. Web unit mining: Finding and classifying subgraphs of web pages. In CIKM \u201903: Proceedings of the 12th International Conference on Information and Knowledge Management, 108\u2013115, New York, NY: ACM Press.","DOI":"10.1145\/956863.956885"},{"key":"1_CR89","volume-title":"Genre analysis: English in academic and research settings","author":"J.M. Swales","year":"1990","unstructured":"Swales, J.M. 1990. Genre analysis: English in academic and research settings. Cambridge, MA: Cambridge University Press."},{"key":"1_CR90","doi-asserted-by":"crossref","unstructured":"Tajima, K., Y. Mizuuchi, M. Kitagawa, and K. Tanaka. 1998. Cut as a querying unit for WWW, netnews, e-mail. In Proceedings of the 9th ACM Conference on Hypertext and Hypermedia, 235\u2013244. New York, NY: ACM Press.","DOI":"10.1145\/276627.276653"},{"key":"1_CR91","unstructured":"Tajima, K., and K. Tanaka. 1999. New techniques for the discovery of logical documents in web. In International Symposium on Database Applications in Non-traditional Environments. IEEE, 125\u2013132."},{"issue":"8","key":"1_CR92","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1002\/aris.1440390110","volume":"6","author":"M. Thelwall","year":"2006","unstructured":"Thelwall, M., L. Vaughan, and L. Bj\u00f6rneborn. 2006. Webometrics. 
Annual Review of Information Science Technology 6(8):81\u2013135.","journal-title":"Annual Review of Information Science Technology"},{"key":"1_CR93","doi-asserted-by":"crossref","unstructured":"Tian, Y.H., T.J. Huang, W. Gao, J. Cheng, and P. Bo Kang. 2003. Two-phase web site classification based on hidden Markov tree models. In WI \u201903: Proceedings of the 2003 IEEE\/WIC International Conference on Web Intelligence. IEEE Computer Society, 227, Washington, DC.","DOI":"10.1109\/WI.2003.1241198"},{"key":"1_CR94","unstructured":"Waltinger, U., A. Mehler, and A. Wegner. 2009. A two-level approach to web genre classification. In Proceedings of the 5th International Conference on Web Information Systems and Technologies (WEBIST \u201909), March 23\u201326, 2007. Lisboa."},{"issue":"1","key":"1_CR95","first-page":"151","volume":"10","author":"G. Wisniewski","year":"2007","unstructured":"Wisniewski, G., F. Maes, L. Denoyer, and P. Gallinari. 2007. Mod\u00e8le probabiliste pour l\u2019extraction de structures dans les documents web. Document num\u00e9rique, 10(1):151\u2013170.","journal-title":"Document num\u00e9rique"},{"key":"1_CR96","doi-asserted-by":"crossref","unstructured":"Wodak, R. 2008. Introduction: Discourse studies \u2013 important concepts and terms. In Qualitative Discourse Analysis in the Social Sciences, eds. Wodak, R. and Krzyzanowski, M., 1\u201329. Palgrave.","DOI":"10.1007\/978-1-137-04798-4_1"},{"key":"1_CR97","doi-asserted-by":"crossref","unstructured":"Yates, S.J., and T.R. Sumner. 1997. Digital genres and the new burden of fixity. In Proceedings of the 30th Hawaii International Conference on System Sciences, vol. 6. 
Maui, HI.","DOI":"10.1109\/HICSS.1997.665479"}],"container-title":["Text, Speech and Language Technology","Genres on the Web"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-90-481-9178-9_1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T04:25:33Z","timestamp":1740543933000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-90-481-9178-9_1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010]]},"ISBN":["9789048191772","9789048191789"],"references-count":97,"URL":"https:\/\/doi.org\/10.1007\/978-90-481-9178-9_1","relation":{},"ISSN":["1386-291X"],"issn-type":[{"type":"print","value":"1386-291X"}],"subject":[],"published":{"date-parts":[[2010]]}}}