{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T18:06:43Z","timestamp":1772042803126,"version":"3.50.1"},"reference-count":20,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2024,4,30]],"date-time":"2024-04-30T00:00:00Z","timestamp":1714435200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGCOMM Comput. Commun. Rev."],"published-print":{"date-parts":[[2024,4,30]]},"abstract":"<jats:p>Domain lists are a key ingredient for representative censuses of the Web. Unfortunately, such censuses typically lack a view on domains under country-code top-level domains (ccTLDs). This introduces unwanted bias: many countries have a rich local Web that remains hidden if their ccTLDs are not considered. The reason ccTLDs are rarely considered is that gaining access - if possible at all - is often laborious. To tackle this, we ask: what can we learn about ccTLDs from public sources? We extract domain names under ccTLDs from 6 years of public data from Certificate Transparency logs and Common Crawl. We compare this against ground truth for 19 ccTLDs for which we have the full DNS zone. We find that public data covers 43%-80% of these ccTLDs, and that coverage grows over time. By also comparing port scan data we then show that these public sources reveal a significant part of the Web presence under a ccTLD. We conclude that in the absence of full access to ccTLDs, domain names learned from public sources can be a good proxy when performing Web censuses.<\/jats:p>","DOI":"10.1145\/3687234.3687236","type":"journal-article","created":{"date-parts":[[2024,8,6]],"date-time":"2024-08-06T18:29:00Z","timestamp":1722968940000},"page":"2-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["This Is a Local Domain: On Amassing Country-Code Top-Level Domains from Public Data"],"prefix":"10.1145","volume":"54","author":[{"given":"Raffaele","family":"Sommese","sequence":"first","affiliation":[{"name":"University of Twente"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Roland","family":"van Rijswijk-Deij","sequence":"additional","affiliation":[{"name":"University of Twente"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mattijs","family":"Jonker","sequence":"additional","affiliation":[{"name":"University of Twente"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,8,6]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 6th edition of the Network Traffic Measurement and Analysis Conference (TMA Conference","author":"Affinito Antonia","year":"2022","unstructured":"Antonia Affinito, Raffaele Sommese, Gautam Akiwate, Stefan Savage, Kimberley Claffy, Geoffrey M. Voelker, Alessio Botta, and Mattijs Jonker. 2022. Domain Name Lifetimes: Baseline and Threats. In Proceedings of the 6th edition of the Network Traffic Measurement and Analysis Conference (TMA Conference 2022). IFIP."},{"key":"e_1_2_1_2_1","unstructured":"Apple Inc. 2021. Apple's Certificate Transparency policy. https:\/\/support.apple.com\/en-ca\/HT205280."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1239971.1239973"},{"key":"e_1_2_1_4_1","doi-asserted-by":"crossref","unstructured":"N. Br\u00fcgger and D. Laursen (Eds.). 2019. The Historical Web and Digital Humanities: The Case of National Web Domains (1 ed.). Routledge.","DOI":"10.4324\/9781315231662-1"},{"key":"e_1_2_1_5_1","volume-title":"Leading country code top level domains (ccTLD) as of","author":"Stat Domain Name","year":"2023","unstructured":"Domain Name Stat. 2023. Leading country code top level domains (ccTLD) as of July 2023, by number of registered domains. https:\/\/www.statista.com\/statistics\/266721\/sales-of-cc-top-level-domains\/."},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"B. Laurie et al. 2021. Certificate Transparency Version 2.0. RFC 9162. RFC Editor. https:\/\/tools.ietf.org\/html\/rfc9162","DOI":"10.17487\/RFC9162"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Oliver Gasser Benjamin Hof Max Helm Maciej Korczynski Ralph Holz and Georg Carle. 2018. In Log We Trust: Revealing Poor Security Practices with Certificate Transparency Logs and Internet Measurements. In Passive and Active Measurement. Springer International Publishing.","DOI":"10.1007\/978-3-319-76481-8_13"},{"key":"e_1_2_1_8_1","unstructured":"Google. 2024. HTTPS encryption on the web. https:\/\/commoncrawl.org."},{"key":"e_1_2_1_9_1","unstructured":"Google Inc. 2022. Chrome Certificate Transparency Policy. https:\/\/googlechrome.github.io\/CertificateTransparency\/ct_policy.html."},{"key":"e_1_2_1_10_1","volume-title":"Ren\u00e9 Rydhof Hansen, and Jens Myrup Pedersen","author":"Hageman Kaspar","year":"2021","unstructured":"Kaspar Hageman, Ren\u00e9 Rydhof Hansen, and Jens Myrup Pedersen. 2021. Gollector: Measuring Domain Name Dark Matter from Different Vantage Points. In Secure IT Systems. Springer International Publishing, 133--152."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411740.3411742"},{"key":"e_1_2_1_12_1","unstructured":"Internet Archive. 2023. The HTTP Archive. https:\/\/httparchive.org."},{"key":"e_1_2_1_13_1","volume-title":"Domains of Control: Governance of and by the Domain Name System","author":"Merrill Kenneth","unstructured":"Kenneth Merrill. 2016. Domains of Control: Governance of and by the Domain Name System. Palgrave Macmillan US, New York, 89--106."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14722\/ndss.2019.23386"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3338498.3358655"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the Internet Measurement Conference","author":"Scheitle Quirin","year":"2018","unstructured":"Quirin Scheitle, Oliver Gasser, Theodor Nolte, Johanna Amann, Lexi Brent, Georg Carle, Ralph Holz, Thomas C. Schmidt, and Matthias W\u00e4hlisch. 2018. The Rise of Certificate Transparency and Its Implications on the Internet Ecosystem. In Proceedings of the Internet Measurement Conference 2018. Association for Computing Machinery."},{"key":"e_1_2_1_17_1","unstructured":"The CommonCrawl Foundation. 2007. Common Crawl. https:\/\/commoncrawl.org."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSAC.2016.2558918"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the 2016 Internet Measurement Conference. Association for Computing Machinery.","author":"VanderSloot Benjamin","unstructured":"Benjamin VanderSloot, Johanna Amann, Matthew Bernhard, Zakir Durumeric, Michael Bailey, and J. Alex Halderman. 2016. Towards a Complete View of the Certificate Ecosystem. In Proceedings of the 2016 Internet Measurement Conference. Association for Computing Machinery."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/NOMS54207.2022.9789881"}],"container-title":["ACM SIGCOMM Computer Communication Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687234.3687236","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3687234.3687236","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:58:01Z","timestamp":1750294681000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3687234.3687236"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,30]]},"references-count":20,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,4,30]]}},"alternative-id":["10.1145\/3687234.3687236"],"URL":"https:\/\/doi.org\/10.1145\/3687234.3687236","relation":{},"ISSN":["0146-4833"],"issn-type":[{"value":"0146-4833","type":"print"}],"subject":[],"published":{"date-parts":[[2024,4,30]]},"assertion":[{"value":"2024-08-06","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}