{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:33:20Z","timestamp":1750307600105,"version":"3.41.0"},"reference-count":35,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2010,4,1]],"date-time":"2010-04-01T00:00:00Z","timestamp":1270080000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000144","name":"Division of Computer and Network Systems","doi-asserted-by":"publisher","award":["CNS-0627166CNS-0925472"],"award-info":[{"award-number":["CNS-0627166CNS-0925472"]}],"id":[{"id":"10.13039\/100000144","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Web"],"published-print":{"date-parts":[[2010,4]]},"abstract":"<jats:p>\n            An ads-portal domain refers to a Web domain that shows only advertisements, served by a third-party advertisement syndication service, in the form of ads listing. We develop a machine-learning-based classifier to identify ads-portal domains, which has 96% accuracy. We use this classifier to measure the prevalence of ads-portal domains on the Internet. Surprisingly, 28.3\/25% of the (two-level) *.\n            <jats:italic>com<\/jats:italic>\n            \/*.\n            <jats:italic>net<\/jats:italic>\n            web domains are ads-portal domains. Also, 41\/39.8% of *.\n            <jats:italic>com<\/jats:italic>\n            \/*.\n            <jats:italic>net<\/jats:italic>\n            ads-portal domains are typos of well-known domains, also known as typo-squatting domains. In addition, we use the classifier along with DNS trace files to estimate how often Internet users visit ads-portal domains. 
It turns out that \u223c5% of the two-level *.\n            <jats:italic>com<\/jats:italic>\n            , *.\n            <jats:italic>net<\/jats:italic>\n            , *.\n            <jats:italic>org<\/jats:italic>\n            , *.\n            <jats:italic>biz<\/jats:italic>\n            and *.\n            <jats:italic>info<\/jats:italic>\n            web domains in the traces are ads-portal domains and \u223c50% of these accessed ads-portal domains are typos. These numbers show that ads-portal domains and typo-squatting ads-portal domains are prevalent on the Internet and successful in attracting many visits. Our classifier represents a step towards better categorization of web documents. It can also help search engines' ranking algorithms, help identify web spam that redirects to ads-portal domains, and be used to discourage access to typo-squatting ads-portal domains.\n          <\/jats:p>","DOI":"10.1145\/1734200.1734201","type":"journal-article","created":{"date-parts":[[2010,4,27]],"date-time":"2010-04-27T12:45:25Z","timestamp":1272372325000},"page":"1-34","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Ads-portal domains"],"prefix":"10.1145","volume":"4","author":[{"given":"Mishari","family":"Almishari","sequence":"first","affiliation":[{"name":"University of California, Irvine, CA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaowei","family":"Yang","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2010,4,29]]},"reference":[{"volume-title":"Proceedings of the Infocom Mini-Conference.","author":"Banerjee
A.","key":"e_1_2_1_1_1"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1018054314350"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/B978-1-55860-377-6.50023-2"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Cristianini N. and Shawe-Taylor J. 2000. An Introduction to Support Vector Machines: and Other Kernel-Based Learning Methods. Cambridge University Press.","DOI":"10.1017\/CBO9780511801389"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/72.788645"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1062745.1062796"},{"key":"e_1_2_1_8_1","unstructured":"F-Secure. 2005. Googkle.com installed malware by exploiting browser vulnerabilities. http:\/\/www.f-secure.com\/v-descs\/googkle.shtml."},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Fielding R. Gettys J. Mogul J. Frystyk H. Masinter L. Leach P. and Berners-Lee T. 1999. Hypertext transfer protocol -- HTTP\/1.1. RFC 2616.","DOI":"10.17487\/rfc2616"},{"volume-title":"Proceedings of the European Conference on Computational Learning Theory.","author":"Freund Y.","key":"e_1_2_1_10_1"},{"key":"e_1_2_1_11_1","unstructured":"Google. 2006. Google SOAP Search API. http:\/\/code.google.com\/apis\/soapsearch\/."},{"key":"e_1_2_1_12_1","unstructured":"Google. 2008. Google adsense. http:\/\/www.google.com\/adsense.
"},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Gusfield D. 1998. Algorithms on Strings Trees and Sequences. Cambridge University Press.","DOI":"10.1017\/CBO9780511574931"},{"volume-title":"Proceedings of the 9th International Conference on Machine Learning.","author":"Iba W.","key":"e_1_2_1_14_1"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/383952.383974"},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Kawakita M. Minami M. Eguchi S. and Lennert-Cody C. E. 2005. An introduction to the predictive technique AdaBoost with a comparison to generalized additive models. In Fisheries research.","DOI":"10.1016\/j.fishres.2005.07.011"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/645324.649649"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/301136.301186"},{"key":"e_1_2_1_19_1","unstructured":"McAfee. 2007. McAfee's study of typosquatting. www.mcafee.com\/typosquatters."},{"key":"e_1_2_1_20_1","unstructured":"McAfee. 2008. McAfee siteadvisor. http:\/\/www.siteadvisor.com\/."},{"key":"e_1_2_1_21_1","unstructured":"Mitchell T. 1997. Machine Learning. McGraw Hill."},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Mockapetris P. 1987. Domain names\u2014Implementation and specification.
RFC 1035.","DOI":"10.17487\/rfc1035"},{"key":"e_1_2_1_23_1","unstructured":"Mozdev. 2008. AdBlock. http:\/\/adblock.mozdev.org\/."},{"key":"e_1_2_1_24_1","unstructured":"Mozilla. 2009. JavaScript. https:\/\/developer.mozilla.org\/en\/JavaScript."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1135777.1135794"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb046814"},{"key":"e_1_2_1_27_1","unstructured":"Quinlan J. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann."},{"volume-title":"Proceedings of the 13th National Conference on Artificial Intelligence and 8th Innovative Applications of Artificial Intelligence Conference.","year":"1996","author":"Quinlan J. R.","key":"e_1_2_1_28_1"},{"key":"e_1_2_1_29_1","unstructured":"Raggett D. Hors A. L. and Jacobs I. 1998. HTML 4.0 specification. http:\/\/www.w3.org\/TR\/1998\/REC-html40-19980424."},{"volume-title":"Proceedings of the Usenix SRUTI Workshop.","author":"Wang Y.-M.","key":"e_1_2_1_30_1"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242612"},{"key":"e_1_2_1_32_1","unstructured":"Wikipedia. 2008. Type-in traffic. http:\/\/en.wikipedia.org\/wiki\/Type-in_traffic."},{"volume-title":"Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann.","year":"2005","author":"Witten I. H.","key":"e_1_2_1_33_1"},{"key":"e_1_2_1_34_1","unstructured":"Yahoo. 2007. Yahoo! directory. http:\/\/dir.yahoo.com\/."},{"key":"e_1_2_1_35_1","unstructured":"Yahoo. 2008.
Yahoo search Web services. http:\/\/developer.yahoo.com\/search\/web\/V1\/spellingSuggestion.html."}],"container-title":["ACM Transactions on the Web"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1734200.1734201","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1734200.1734201","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T12:45:28Z","timestamp":1750250728000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1734200.1734201"}},"subtitle":["Identification and measurements"],"short-title":[],"issued":{"date-parts":[[2010,4]]},"references-count":35,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,4]]}},"alternative-id":["10.1145\/1734200.1734201"],"URL":"https:\/\/doi.org\/10.1145\/1734200.1734201","relation":{},"ISSN":["1559-1131","1559-114X"],"issn-type":[{"type":"print","value":"1559-1131"},{"type":"electronic","value":"1559-114X"}],"subject":[],"published":{"date-parts":[[2010,4]]},"assertion":[{"value":"2008-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}