{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T22:36:15Z","timestamp":1777674975803,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,19]],"date-time":"2021-04-19T00:00:00Z","timestamp":1618790400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"DOI":"10.1145\/3442381.3450050","type":"proceedings-article","created":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T19:00:27Z","timestamp":1622746827000},"page":"80-91","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":23,"title":["Towards Realistic and ReproducibleWeb Crawl Measurements"],"prefix":"10.1145","author":[{"given":"Jordan","family":"Jueckstock","sequence":"first","affiliation":[{"name":"North Carolina State University, USA"}]},{"given":"Shaown","family":"Sarker","sequence":"additional","affiliation":[{"name":"North Carolina State University, USA"}]},{"given":"Peter","family":"Snyder","sequence":"additional","affiliation":[{"name":"Brave Software, USA"}]},{"given":"Aidan","family":"Beggs","sequence":"additional","affiliation":[{"name":"North Carolina State University, USA"}]},{"given":"Panagiotis","family":"Papadopoulos","sequence":"additional","affiliation":[{"name":"Telefonica Research, United Kingdom"}]},{"given":"Matteo","family":"Varvello","sequence":"additional","affiliation":[{"name":"Nokia Bell Labs, USA"}]},{"given":"Benjamin","family":"Livshits","sequence":"additional","affiliation":[{"name":"Brave Software and Imperial College London, United Kingdom"}]},{"given":"Alexandros","family":"Kapravelos","sequence":"additional","affiliation":[{"name":"North Carolina State University, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,6,3]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"[n.d.]. catapult - Git at Google. https:\/\/chromium.googlesource.com\/catapult\/. Accessed: 2019-5-12.  [n.d.]. catapult - Git at Google. https:\/\/chromium.googlesource.com\/catapult\/. Accessed: 2019-5-12."},{"key":"e_1_3_2_1_2_1","unstructured":"[n.d.]. Historical trends in the usage statistics of dns server providers. https:\/\/w3techs.com\/technologies\/history_overview\/dns_server. Accessed: 2020-5-29.  [n.d.]. Historical trends in the usage statistics of dns server providers. https:\/\/w3techs.com\/technologies\/history_overview\/dns_server. Accessed: 2020-5-29."},{"key":"e_1_3_2_1_3_1","unstructured":"[n.d.]. New Industry Benchmarks for Mobile Page Speed - Think With Google. https:\/\/www.thinkwithgoogle.com\/marketing-resources\/data-measurement\/mobile-page-speed-new-industry-benchmarks\/. Accessed: 2020-5-6.  [n.d.]. New Industry Benchmarks for Mobile Page Speed - Think With Google. https:\/\/www.thinkwithgoogle.com\/marketing-resources\/data-measurement\/mobile-page-speed-new-industry-benchmarks\/. Accessed: 2020-5-6."},{"key":"e_1_3_2_1_4_1","unstructured":"[n.d.]. Puppeteer. https:\/\/pptr.dev\/. Accessed: 2019-5-12.  [n.d.]. Puppeteer. https:\/\/pptr.dev\/. Accessed: 2019-5-12."},{"key":"e_1_3_2_1_5_1","unstructured":"2015. GO Simple Tunnel - a simple tunnel written in golang. https:\/\/github.com\/ginuerzh\/gost. Accessed: 2020-06-02.  2015. GO Simple Tunnel - a simple tunnel written in golang. https:\/\/github.com\/ginuerzh\/gost. Accessed: 2020-06-02."},{"key":"e_1_3_2_1_6_1","unstructured":"2018. . https:\/\/antoinevastel.com\/bot%20detection\/2018\/01\/17\/detect-chrome-headless-v2.html. Accessed: 2020-10-16.  2018. . https:\/\/antoinevastel.com\/bot%20detection\/2018\/01\/17\/detect-chrome-headless-v2.html. Accessed: 2020-10-16."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1498759.1498837"},{"key":"e_1_3_2_1_8_1","unstructured":"Sadia Afroz Michael\u00a0Carl Tschantz Shaarif Sajid Shoaib\u00a0Asif Qazi Mobin Javed and Vern Paxson. 2018. Exploring server-side blocking of regions. arXiv preprint arXiv:1805.11606(2018).  Sadia Afroz Michael\u00a0Carl Tschantz Shaarif Sajid Shoaib\u00a0Asif Qazi Mobin Javed and Vern Paxson. 2018. Exploring server-side blocking of regions. arXiv preprint arXiv:1805.11606(2018)."},{"key":"e_1_3_2_1_9_1","volume-title":"How Crawlers Impact Our Understanding of the Web. In The Web Conference.","author":"Ahmad Syed\u00a0Suleman","year":"2020","unstructured":"Syed\u00a0Suleman Ahmad , Muhammad\u00a0Daniyal Dar , Muhammad\u00a0Fareed Zaffar , Narseo Vallina-Rodriguez , and Rishab Nithyanand . 2020 . Apophanies or Epiphanies? How Crawlers Impact Our Understanding of the Web. In The Web Conference. Syed\u00a0Suleman Ahmad, Muhammad\u00a0Daniyal Dar, Muhammad\u00a0Fareed Zaffar, Narseo Vallina-Rodriguez, and Rishab Nithyanand. 2020. Apophanies or Epiphanies? How Crawlers Impact Our Understanding of the Web. In The Web Conference."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMST.2015.2418435"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-010-0180-z"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2976749.2978313"},{"key":"e_1_3_2_1_13_1","unstructured":"Nathaniel Fruchter Hsin Miao Scott Stevenson and Rebecca Balebako. 2015. Variations in tracking in relation to geographic location. arXiv preprint arXiv:1506.04103(2015).  Nathaniel Fruchter Hsin Miao Scott Stevenson and Rebecca Balebako. 2015. Variations in tracking in relation to geographic location. arXiv preprint arXiv:1506.04103(2015)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2016.50"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3098822.3098850"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3355369.3355599"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14722\/ndss.2016.23342"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14722\/ndss.2019.23386"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3131365.3131396"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2012.47"},{"key":"e_1_3_2_1_21_1","volume-title":"A study on tolerable waiting time: how long are web users willing to wait?Behaviour & Information Technology 23, 3","author":"Fui-Hoon Nah Fiona","year":"2004","unstructured":"Fiona Fui-Hoon Nah . 2004. A study on tolerable waiting time: how long are web users willing to wait?Behaviour & Information Technology 23, 3 ( 2004 ), 153\u2013163. Fiona Fui-Hoon Nah. 2004. A study on tolerable waiting time: how long are web users willing to wait?Behaviour & Information Technology 23, 3 (2004), 153\u2013163."},{"key":"e_1_3_2_1_22_1","volume-title":"Proceedings of the IEEE Symposium on Security and Privacy. 1344\u20131361","author":"Oest A.","unstructured":"A. Oest , Y. Safaei , A. Doup\u00e9 , G. Ahn , B. Wardman , and K. Tyers . 2019. PhishFarm: A Scalable Framework for Measuring the Effectiveness of Evasion Techniques against Browser Phishing Blacklists . In Proceedings of the IEEE Symposium on Security and Privacy. 1344\u20131361 . A. Oest, Y. Safaei, A. Doup\u00e9, G. Ahn, B. Wardman, and K. Tyers. 2019. PhishFarm: A Scalable Framework for Measuring the Effectiveness of Evasion Techniques against Browser Phishing Blacklists. In Proceedings of the IEEE Symposium on Security and Privacy. 1344\u20131361."},{"key":"e_1_3_2_1_23_1","volume-title":"Proceedings of the USENIX symposium on Networked Systems Design and Implementation (NSDI). USENIX Association.","author":"Roesner Franziska","year":"2012","unstructured":"Franziska Roesner , Tadayoshi Kohno , and David Wetherall . 2012 . Detecting and defending against third-party tracking on the web . In Proceedings of the USENIX symposium on Networked Systems Design and Implementation (NSDI). USENIX Association. Franziska Roesner, Tadayoshi Kohno, and David Wetherall. 2012. Detecting and defending against third-party tracking on the web. In Proceedings of the USENIX symposium on Networked Systems Design and Implementation (NSDI). USENIX Association."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2987443.2987466"},{"key":"e_1_3_2_1_25_1","volume-title":"8th {USENIX} Workshop on Free and Open Communications on the Internet ({FOCI} 18).","author":"Tschantz Michael\u00a0Carl","unstructured":"Michael\u00a0Carl Tschantz , Sadia Afroz , Shaarif Sajid , Shoaib\u00a0Asif Qazi , Mobin Javed , and Vern Paxson . 2018. A bestiary of blocking: The motivations and modes behind website unavailability . In 8th {USENIX} Workshop on Free and Open Communications on the Internet ({FOCI} 18). Michael\u00a0Carl Tschantz, Sadia Afroz, Shaarif Sajid, Shoaib\u00a0Asif Qazi, Mobin Javed, and Vern Paxson. 2018. A bestiary of blocking: The motivations and modes behind website unavailability. In 8th {USENIX} Workshop on Free and Open Communications on the Internet ({FOCI} 18)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3355369.3355600"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3321705.3329855"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3366423.3380104"}],"event":{"name":"WWW '21: The Web Conference 2021","location":"Ljubljana Slovenia","acronym":"WWW '21","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of the Web Conference 2021"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3450050","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3442381.3450050","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:45Z","timestamp":1750195485000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442381.3450050"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,19]]},"references-count":28,"alternative-id":["10.1145\/3442381.3450050","10.1145\/3442381"],"URL":"https:\/\/doi.org\/10.1145\/3442381.3450050","relation":{},"subject":[],"published":{"date-parts":[[2021,4,19]]},"assertion":[{"value":"2021-06-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}