{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T16:36:29Z","timestamp":1732034189898},"reference-count":0,"publisher":"National Library of Serbia","issue":"2","license":[{"start":{"date-parts":[[2012,1,1]],"date-time":"2012-01-01T00:00:00Z","timestamp":1325376000000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["ComSIS","COMPUT SCI INF SYST","COMPUT SCI INFORM SY","COMPUTER SCI INFORM","COMSIS J"],"published-print":{"date-parts":[[2012]]},"abstract":"<jats:p>The main goal of this study is to present a scale that classifies crawling\n   systems according to their effectiveness in traversing the \u201cclient-side\u201d\n   Hidden Web. First, we perform a thorough analysis of the different\n   client-side technologies and the main features of the web pages in order to\n   determine the basic steps of the aforementioned scale. Then, we define the\n   scale by grouping basic scenarios in terms of several common features, and we\n   propose some methods to evaluate the effectiveness of the crawlers according\n   to the levels of the scale. Finally, we present a testing web site and we\n   show the results of applying the aforementioned methods to the results\n   obtained by some open-source and commercial crawlers that tried to traverse\n   the pages. Only a few crawlers achieve good results in treating client-side\n   technologies. Regarding standalone crawlers, we highlight the open-source\n   crawlers Heritrix and Nutch and the commercial crawler WebCopierPro, which is\n   able to process very complex scenarios. With regard to the crawlers of the\n   main search engines, only Google processes most of the scenarios we have\n   proposed, while Yahoo! and Bing just deal with the basic ones. 
There are not\n   many studies that assess the capacity of the crawlers to deal with\n   client-side technologies. Also, these studies consider fewer technologies,\n   fewer crawlers and fewer combinations. Furthermore, to the best of our\n   knowledge, our article provides the first scale for classifying crawlers from\n   the point of view of the most important client-side technologies.<\/jats:p>","DOI":"10.2298\/csis111215015p","type":"journal-article","created":{"date-parts":[[2012,6,7]],"date-time":"2012-06-07T09:56:21Z","timestamp":1339062981000},"page":"561-583","source":"Crossref","is-referenced-by-count":3,"title":["A scale for crawler effectiveness on the client-side hidden web"],"prefix":"10.2298","volume":"9","author":[{"suffix":"M.","given":"V\u00edctor","family":"Prieto","sequence":"first","affiliation":[{"name":"Communication and Information Technologies Department, University of A Coru\u00f1a Campus de Elvi, A Coru\u00f1a, Spain"}]},{"given":"Manuel","family":"\u00c1lvarez","sequence":"additional","affiliation":[{"name":"Communication and Information Technologies Department, University of A Coru\u00f1a Campus de Elvi, A Coru\u00f1a, Spain"}]},{"given":"Rafael","family":"L\u00f3pez-Garc\u00eda","sequence":"additional","affiliation":[{"name":"Communication and Information Technologies Department, University of A Coru\u00f1a Campus de Elvi, A Coru\u00f1a, Spain"}]},{"given":"Fidel","family":"Cacheda","sequence":"additional","affiliation":[{"name":"Communication and Information Technologies Department, University of A Coru\u00f1a Campus de Elvi, A Coru\u00f1a, Spain"}]}],"member":"1078","container-title":["Computer Science and Information 
Systems"],"original-title":[],"language":"en","deposited":{"date-parts":[[2023,5,29]],"date-time":"2023-05-29T08:30:37Z","timestamp":1685349037000},"score":1,"resource":{"primary":{"URL":"https:\/\/doiserbia.nb.rs\/Article.aspx?ID=1820-02141200015P"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012]]},"references-count":0,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2012]]}},"URL":"https:\/\/doi.org\/10.2298\/csis111215015p","relation":{},"ISSN":["1820-0214","2406-1018"],"issn-type":[{"value":"1820-0214","type":"print"},{"value":"2406-1018","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012]]}}}