{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:42:03Z","timestamp":1777653723231,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":68,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,7,30]],"date-time":"2020-07-30T00:00:00Z","timestamp":1596067200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,7,30]]},"DOI":"10.1145\/3387514.3405892","type":"proceedings-article","created":{"date-parts":[[2020,7,30]],"date-time":"2020-07-30T22:35:31Z","timestamp":1596148531000},"page":"632-647","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":234,"title":["Classic Meets Modern"],"prefix":"10.1145","author":[{"given":"Soheil","family":"Abbasloo","sequence":"first","affiliation":[{"name":"High Speed Networking Lab, NYU"}]},{"given":"Chen-Yu","family":"Yen","sequence":"additional","affiliation":[{"name":"High Speed Networking Lab, NYU"}]},{"given":"H. Jonathan","family":"Chao","sequence":"additional","affiliation":[{"name":"High Speed Networking Lab, NYU"}]}],"member":"320","published-online":{"date-parts":[[2020,7,30]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"Martin Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org.  Martin Abadi et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https:\/\/www.tensorflow.org\/ Software available from tensorflow.org."},{"key":"e_1_3_2_2_2_1","volume-title":"Bounding Queue Delay in Cellular Networks to Support Ultra-Low Latency Applications. arXiv preprint arXiv:1908.00953","author":"Abbasloo Soheil","year":"2019","unstructured":"Soheil Abbasloo and H Jonathan Chao . 2019. Bounding Queue Delay in Cellular Networks to Support Ultra-Low Latency Applications. arXiv preprint arXiv:1908.00953 ( 2019 ). Soheil Abbasloo and H Jonathan Chao. 2019. Bounding Queue Delay in Cellular Networks to Support Ultra-Low Latency Applications. arXiv preprint arXiv:1908.00953 (2019)."},{"key":"e_1_3_2_2_3_1","volume-title":"SharpEdge: An Asynchronous and Core-Agnostic Solution to Guarantee Bounded-Delays. To appear in CCF Transactions of Networking","author":"Abbasloo Soheil","year":"2020","unstructured":"Soheil Abbasloo and H Jonathan Chao . 2020. SharpEdge: An Asynchronous and Core-Agnostic Solution to Guarantee Bounded-Delays. To appear in CCF Transactions of Networking ( 2020 ). https:\/\/arxiv.org\/abs\/2001.00112 Soheil Abbasloo and H Jonathan Chao. 2020. SharpEdge: An Asynchronous and Core-Agnostic Solution to Guarantee Bounded-Delays. To appear in CCF Transactions of Networking (2020). https:\/\/arxiv.org\/abs\/2001.00112"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.23919\/IFIPNetworking.2018.8696844"},{"key":"e_1_3_2_2_5_1","volume-title":"IFIP Networking Conference (IFIP Networking)","author":"Abbasloo Soheil","year":"2018","unstructured":"Soheil Abbasloo , Yang Xu , and H. Jonathan Chao . 2018. HyLine: a Simple and Practical Flow Scheduling for Commodity Datacenters . In IFIP Networking Conference (IFIP Networking) , 2018 . Soheil Abbasloo, Yang Xu, and H. Jonathan Chao. 2018. HyLine: a Simple and Practical Flow Scheduling for Commodity Datacenters. In IFIP Networking Conference (IFIP Networking), 2018."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSAC.2019.2898758"},{"key":"e_1_3_2_2_7_1","volume-title":"2nd USENIX Workshop on Hot Topics in Edge Computing (HotEdge 19)","author":"Abbasloo Soheil","year":"2019","unstructured":"Soheil Abbasloo , Yang Xu , H Jonathon Chao , Hang Shi , Ulas C Kozat , and Yinghua Ye . 2019 . Toward Optimal Performance with Network Assisted TCP at Mobile Edge . 2nd USENIX Workshop on Hot Topics in Edge Computing (HotEdge 19) (2019). Soheil Abbasloo, Yang Xu, H Jonathon Chao, Hang Shi, Ulas C Kozat, and Yinghua Ye. 2019. Toward Optimal Performance with Network Assisted TCP at Mobile Edge. 2nd USENIX Workshop on Hot Topics in Edge Computing (HotEdge 19) (2019)."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1851182.1851192"},{"key":"e_1_3_2_2_9_1","volume-title":"Going Beyond Two Layers. CoRR abs\/1811.04918","author":"Allen-Zhu Zeyuan","year":"2018","unstructured":"Zeyuan Allen-Zhu , Yuanzhi Li , and Yingyu Liang . 2018. Learning and Generalization in Overparameterized Neural Networks , Going Beyond Two Layers. CoRR abs\/1811.04918 ( 2018 ). arXiv:1811.04918 http:\/\/arxiv.org\/abs\/1811.04918 Zeyuan Allen-Zhu, Yuanzhi Li, and Yingyu Liang. 2018. Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers. CoRR abs\/1811.04918 (2018). arXiv:1811.04918 http:\/\/arxiv.org\/abs\/1811.04918"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3232755.3232783"},{"key":"e_1_3_2_2_11_1","unstructured":"Andre Barreto et al. 2017. Successor Features for Transfer in Reinforcement Learning. In Advances in Neural Information Processing Systems 30. Curran Associates Inc. 4055--4065. http:\/\/papers.nips.cc\/paper\/6994-successor-features-for-transfer-in-reinforcement-learning.pdf  Andre Barreto et al. 2017. Successor Features for Transfer in Reinforcement Learning. In Advances in Neural Information Processing Systems 30. Curran Associates Inc. 4055--4065. http:\/\/papers.nips.cc\/paper\/6994-successor-features-for-transfer-in-reinforcement-learning.pdf"},{"key":"e_1_3_2_2_12_1","volume-title":"Dynamic Programming (1 ed.)","author":"Bellman Richard","unstructured":"Richard Bellman . 1957. Dynamic Programming (1 ed.) . Princeton University Press, Princeton, NJ , USA. Richard Bellman. 1957. Dynamic Programming (1 ed.). Princeton University Press, Princeton, NJ, USA."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.bjp.2013.12.037"},{"key":"e_1_3_2_2_14_1","volume-title":"TCP Vegas: New techniques for congestion detection and avoidance","author":"Brakmo Lawrence S","unstructured":"Lawrence S Brakmo , Sean W O'Malley , and Larry L Peterson . 1994. TCP Vegas: New techniques for congestion detection and avoidance . Vol. 24 . ACM. Lawrence S Brakmo, Sean W O'Malley, and Larry L Peterson. 1994. TCP Vegas: New techniques for congestion detection and avoidance. Vol. 24. ACM."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/1251203.1251205"},{"key":"e_1_3_2_2_16_1","first-page":"50","article-title":"BBR","volume":"14","author":"Cardwell Neal","year":"2016","unstructured":"Neal Cardwell , Yuchung Cheng , C Stephen Gunn , Soheil Hassas Yeganeh , and Van Jacobson . 2016 . BBR : Congestion-Based Congestion Control. Queue 14 , 5 (2016), 50 . Neal Cardwell, Yuchung Cheng, C Stephen Gunn, Soheil Hassas Yeganeh, and Van Jacobson. 2016. BBR: Congestion-Based Congestion Control. Queue 14, 5 (2016), 50.","journal-title":"Congestion-Based Congestion Control. Queue"},{"key":"e_1_3_2_2_17_1","volume-title":"Soheil Hassas Yeganeh, and Van Jacobson","author":"Cardwell Neal","year":"2019","unstructured":"Neal Cardwell , Yuchung Cheng , C Stephen Gunn , Soheil Hassas Yeganeh, and Van Jacobson . 2019 . TCPBBRv2 Alpha\/Preview Release . https:\/\/github.com\/google\/bbr\/blob\/v2alpha\/README.md. (2019). Neal Cardwell, Yuchung Cheng, C Stephen Gunn, Soheil Hassas Yeganeh, and Van Jacobson. 2019. TCPBBRv2 Alpha\/Preview Release. https:\/\/github.com\/google\/bbr\/blob\/v2alpha\/README.md. (2019)."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/0169-7552(89)90019-6"},{"key":"e_1_3_2_2_19_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.)","volume":"97","author":"Cobbe Karl","year":"2019","unstructured":"Karl Cobbe , Oleg Klimov , Chris Hesse , Taehoon Kim , and John Schulman . 2019 . Quantifying Generalization in Reinforcement Learning . In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.) , Vol. 97 . PMLR, Long Beach, California, USA, 1282--1289. http:\/\/proceedings.mlr.press\/v97\/cobbe19a.html Karl Cobbe, Oleg Klimov, Chris Hesse, Taehoon Kim, and John Schulman. 2019. Quantifying Generalization in Reinforcement Learning. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.), Vol. 97. PMLR, Long Beach, California, USA, 1282--1289. http:\/\/proceedings.mlr.press\/v97\/cobbe19a.html"},{"key":"e_1_3_2_2_20_1","volume-title":"15th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 18). 343--356.","author":"Dong Mo","unstructured":"Mo Dong , Tong Meng , Doron Zarchy , Engin Arslan , Yossi Gilad , Brighten Godfrey , and Michael Schapira . 2018. {PCC} Vivace : Online-Learning Congestion Control . In 15th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 18). 343--356. Mo Dong, Tong Meng, Doron Zarchy, Engin Arslan, Yossi Gilad, Brighten Godfrey, and Michael Schapira. 2018. {PCC} Vivace: Online-Learning Congestion Control. In 15th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 18). 343--356."},{"key":"e_1_3_2_2_21_1","volume-title":"Generalizing skills with semi-supervised reinforcement learning. arXiv preprint arXiv:1612.00429","author":"Finn Chelsea","year":"2016","unstructured":"Chelsea Finn , Tianhe Yu , Justin Fu , Pieter Abbeel , and Sergey Levine . 2016. Generalizing skills with semi-supervised reinforcement learning. arXiv preprint arXiv:1612.00429 ( 2016 ). Chelsea Finn, Tianhe Yu, Justin Fu, Pieter Abbeel, and Sergey Levine. 2016. Generalizing skills with semi-supervised reinforcement learning. arXiv preprint arXiv:1612.00429 (2016)."},{"key":"e_1_3_2_2_22_1","volume-title":"Catastrophic forgetting in connectionist networks. Trends in cognitive sciences 3, 4","author":"French Robert M","year":"1999","unstructured":"Robert M French . 1999. Catastrophic forgetting in connectionist networks. Trends in cognitive sciences 3, 4 ( 1999 ), 128--135. Robert M French. 1999. Catastrophic forgetting in connectionist networks. Trends in cognitive sciences 3, 4 (1999), 128--135."},{"key":"e_1_3_2_2_23_1","volume-title":"Addressing function approximation error in actor-critic methods. arXiv preprint arXiv:1802.09477","author":"Fujimoto Scott","year":"2018","unstructured":"Scott Fujimoto , Herke van Hoof , and David Meger . 2018. Addressing function approximation error in actor-critic methods. arXiv preprint arXiv:1802.09477 ( 2018 ). Scott Fujimoto, Herke van Hoof, and David Meger. 2018. Addressing function approximation error in actor-critic methods. arXiv preprint arXiv:1802.09477 (2018)."},{"issue":"1","key":"e_1_3_2_2_24_1","first-page":"1","article-title":"An Invariant Property of Computer Network Power. In Proceedings of the International Conference on Communications. Denver","volume":"63","author":"Gail R.","year":"1981","unstructured":"R. Gail and L. Kleinrock . 1981 . An Invariant Property of Computer Network Power. In Proceedings of the International Conference on Communications. Denver , Colorado , 63 . 1 . 1 - 63 .1.5. R. Gail and L. Kleinrock. 1981. An Invariant Property of Computer Network Power. In Proceedings of the International Conference on Communications. Denver, Colorado, 63.1.1-63.1.5.","journal-title":"Colorado"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2390231.2390242"},{"key":"e_1_3_2_2_26_1","volume-title":"Free buffer allocation---An investigation by simulation. Computer Networks (1976) 2, 3","author":"Giessler Alfred","year":"1978","unstructured":"Alfred Giessler , J Haenle , Andreas K\u00f6nig , and E Pade . 1978. Free buffer allocation---An investigation by simulation. Computer Networks (1976) 2, 3 ( 1978 ), 191--208. Alfred Giessler, J Haenle, Andreas K\u00f6nig, and E Pade. 1978. Free buffer allocation---An investigation by simulation. Computer Networks (1976) 2, 3 (1978), 191--208."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1400097.1400105"},{"key":"e_1_3_2_2_29_1","volume-title":"Proceedings of the 34th International Conference on Machine Learning-Volume 70","author":"Irina","unstructured":"Irina Higgins et al. 2017. Darla: Improving zero-shot transfer in reinforcement learning . In Proceedings of the 34th International Conference on Machine Learning-Volume 70 . JMLR. org, 1480--1490. Irina Higgins et al. 2017. Darla: Improving zero-shot transfer in reinforcement learning. In Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 1480--1490."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_2_31_1","volume-title":"Hado Van Hasselt, and David Silver","author":"Horgan Dan","year":"2018","unstructured":"Dan Horgan , John Quan , David Budden , Gabriel Barth-Maron , Matteo Hessel , Hado Van Hasselt, and David Silver . 2018 . Distributed prioritized experience replay. arXiv preprint arXiv:1803.00933 (2018). Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado Van Hasselt, and David Silver. 2018. Distributed prioritized experience replay. arXiv preprint arXiv:1803.00933 (2018)."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/52324.52356"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCOM.1981.1095152"},{"key":"e_1_3_2_2_34_1","volume-title":"International Conference on Machine Learning. 3050--3059","author":"Jay Nathan","year":"2019","unstructured":"Nathan Jay , Noga Rotman , Brighten Godfrey , Michael Schapira , and Aviv Tamar . 2019 . A Deep Reinforcement Learning Perspective on Internet Congestion Control . In International Conference on Machine Learning. 3050--3059 . Nathan Jay, Noga Rotman, Brighten Godfrey, Michael Schapira, and Aviv Tamar. 2019. A Deep Reinforcement Learning Perspective on Internet Congestion Control. In International Conference on Machine Learning. 3050--3059."},{"key":"e_1_3_2_2_35_1","volume-title":"Congestion control for high bandwidth-delay product networks. ACM SIGCOMM computer communication review 32, 4","author":"Katabi Dina","year":"2002","unstructured":"Dina Katabi , Mark Handley , and Charlie Rohrs . 2002. Congestion control for high bandwidth-delay product networks. ACM SIGCOMM computer communication review 32, 4 ( 2002 ), 89--102. Dina Katabi, Mark Handley, and Charlie Rohrs. 2002. Congestion control for high bandwidth-delay product networks. ACM SIGCOMM computer communication review 32, 4 (2002), 89--102."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1611835114"},{"key":"e_1_3_2_2_37_1","volume-title":"Proceedings of the International Conference on Communications","volume":"2","author":"Kleinrock Leonard","year":"1978","unstructured":"Leonard Kleinrock . 1978 . On flow control in computer networks . In Proceedings of the International Conference on Communications , Vol. 2 . 27--2. Leonard Kleinrock. 1978. On flow control in computer networks. In Proceedings of the International Conference on Communications, Vol. 2. 27--2."},{"key":"e_1_3_2_2_38_1","unstructured":"Yan Lecun. 2017. My take on Ali Rahimi's \"Test of Time\" award talk at NIPS. (2017). https:\/\/www.facebook.com\/yann.lecun\/posts\/10154938130592143  Yan Lecun. 2017. My take on Ali Rahimi's \"Test of Time\" award talk at NIPS. (2017). https:\/\/www.facebook.com\/yann.lecun\/posts\/10154938130592143"},{"key":"e_1_3_2_2_39_1","volume-title":"Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274","author":"Yuxi Li.","year":"2017","unstructured":"Yuxi Li. 2017. Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274 ( 2017 ). Yuxi Li. 2017. Deep reinforcement learning: An overview. arXiv preprint arXiv:1701.07274 (2017)."},{"key":"e_1_3_2_2_40_1","volume-title":"Proceedings of the ACM Special Interest Group on Data Communication. 44--58","author":"Yuliang","unstructured":"Yuliang Li et al. 2019. HPCC: high precision congestion control . In Proceedings of the ACM Special Interest Group on Data Communication. 44--58 . Yuliang Li et al. 2019. HPCC: high precision congestion control. In Proceedings of the ACM Special Interest Group on Data Communication. 44--58."},{"key":"e_1_3_2_2_41_1","volume-title":"Continuous control with deep reinforcement learning. CoRR abs\/1509.02971","author":"Lillicrap Timothy P.","year":"2015","unstructured":"Timothy P. Lillicrap , Jonathan J. Hunt , Alexander Pritzel , Nicolas Heess , Tom Erez , Yuval Tassa , David Silver , and Daan Wierstra . 2015. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971 ( 2015 ). arXiv:1509.02971 Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971 (2015). arXiv:1509.02971"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.peva.2007.12.007"},{"key":"e_1_3_2_2_43_1","volume-title":"Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychological review 102, 3","author":"McClelland James L","year":"1995","unstructured":"James L McClelland , Bruce L McNaughton , and Randall C O'Reilly . 1995. Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychological review 102, 3 ( 1995 ), 419. James L McClelland, Bruce L McNaughton, and Randall C O'Reilly. 1995. Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychological review 102, 3 (1995), 419."},{"key":"e_1_3_2_2_44_1","volume-title":"Psychology of learning and motivation.","author":"McCloskey Michael","unstructured":"Michael McCloskey and Neal J Cohen . 1989. Catastrophic interference in connectionist networks: The sequential learning problem . In Psychology of learning and motivation. Vol. 24 . Elsevier , 109--165. Michael McCloskey and Neal J Cohen. 1989. Catastrophic interference in connectionist networks: The sequential learning problem. In Psychology of learning and motivation. Vol. 24. Elsevier, 109--165."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2829988.2787510"},{"key":"e_1_3_2_2_46_1","volume-title":"Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602","author":"Mnih Volodymyr","year":"2013","unstructured":"Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Alex Graves , Ioannis Antonoglou , Daan Wierstra , and Martin Riedmiller . 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 ( 2013 ). Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013)."},{"key":"e_1_3_2_2_47_1","doi-asserted-by":"crossref","unstructured":"Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski etal 2015. Human-level control through deep reinforcement learning. Nature 518 7540 (2015) 529.  Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski et al. 2015. Human-level control through deep reinforcement learning. Nature 518 7540 (2015) 529.","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_2_48_1","volume-title":"Learning to adapt in dynamic, real-world environments through meta-reinforcement learning. arXiv preprint arXiv:1803.11347","author":"Nagabandi Anusha","year":"2018","unstructured":"Anusha Nagabandi , Ignasi Clavera , Simin Liu , Ronald S Fearing , Pieter Abbeel , Sergey Levine , and Chelsea Finn . 2018. Learning to adapt in dynamic, real-world environments through meta-reinforcement learning. arXiv preprint arXiv:1803.11347 ( 2018 ). Anusha Nagabandi, Ignasi Clavera, Simin Liu, Ronald S Fearing, Pieter Abbeel, Sergey Levine, and Chelsea Finn. 2018. Learning to adapt in dynamic, real-world environments through meta-reinforcement learning. arXiv preprint arXiv:1803.11347 (2018)."},{"key":"e_1_3_2_2_49_1","volume-title":"Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, et al.","author":"Nair Arun","year":"2015","unstructured":"Arun Nair , Praveen Srinivasan , Sam Blackwell , Cagdas Alcicek , Rory Fearon , Alessandro De Maria , Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, et al. 2015 . Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv:1507.04296 (2015). Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, et al. 2015. Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv:1507.04296 (2015)."},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2619239.2631455"},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2209249.2209264"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPSR.2013.6602305"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3281411.3281430"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/72.410363"},{"key":"e_1_3_2_2_55_1","unstructured":"Ali Rahimi and Ben Recht. 2017. Back when we were kids. (2017). https:\/\/www.youtube.com\/watch?v=Qi1Yry33TQE \"Test of Time\" award talk at NIPS.  Ali Rahimi and Ben Recht. 2017. Back when we were kids. (2017). https:\/\/www.youtube.com\/watch?v=Qi1Yry33TQE \"Test of Time\" award talk at NIPS."},{"key":"e_1_3_2_2_56_1","volume-title":"Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. Psychological review 97, 2","author":"Ratcliff Roger","year":"1990","unstructured":"Roger Ratcliff . 1990. Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. Psychological review 97, 2 ( 1990 ), 285. Roger Ratcliff. 1990. Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. Psychological review 97, 2 (1990), 285."},{"key":"e_1_3_2_2_57_1","volume-title":"LEDBAT: The New BitTorrent Congestion Control Protocol.. In ICCCN. 1--6.","author":"Rossi Dario","year":"2010","unstructured":"Dario Rossi , Claudio Testa , Silvio Valenti , and Luca Muscariello . 2010 . LEDBAT: The New BitTorrent Congestion Control Protocol.. In ICCCN. 1--6. Dario Rossi, Claudio Testa, Silvio Valenti, and Luca Muscariello. 2010. LEDBAT: The New BitTorrent Congestion Control Protocol.. In ICCCN. 1--6."},{"key":"e_1_3_2_2_58_1","volume-title":"Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al.","author":"Silver David","year":"2016","unstructured":"David Silver , Aja Huang , Chris J Maddison , Arthur Guez , Laurent Sifre , George Van Den Driessche , Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016 . Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484. David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484."},{"key":"e_1_3_2_2_59_1","unstructured":"David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai Arthur Guez Marc Lanctot Laurent Sifre Dharshan Kumaran Thore Graepel etal 2017. Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815 (2017).  David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai Arthur Guez Marc Lanctot Laurent Sifre Dharshan Kumaran Thore Graepel et al. 2017. Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815 (2017)."},{"key":"e_1_3_2_2_60_1","unstructured":"David Silver Guy Lever Nicolas Heess Thomas Degris Daan Wierstra and Martin Riedmiller. 2014. Deterministic policy gradient algorithms.  David Silver Guy Lever Nicolas Heess Thomas Degris Daan Wierstra and Martin Riedmiller. 2014. Deterministic policy gradient algorithms."},{"key":"e_1_3_2_2_61_1","volume-title":"Reinforcement Learning: An Introduction","author":"Sutton R.S.","year":"2018","unstructured":"R.S. Sutton and A.G. Barto . 2018 . Reinforcement Learning: An Introduction . MIT Press . https:\/\/books.google.com\/books?id=uWV0DwAAQBAJ R.S. Sutton and A.G. Barto. 2018. Reinforcement Learning: An Introduction. MIT Press. https:\/\/books.google.com\/books?id=uWV0DwAAQBAJ"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2006.188"},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2007.893885"},{"key":"e_1_3_2_2_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2009.2034963"},{"key":"e_1_3_2_2_65_1","volume-title":"On the Generalization Gap in Reparameterizable Reinforcement Learning. CoRR abs\/1905.12654","author":"Wang Huan","year":"2019","unstructured":"Huan Wang , Stephan Zheng , Caiming Xiong , and Richard Socher . 2019. On the Generalization Gap in Reparameterizable Reinforcement Learning. CoRR abs\/1905.12654 ( 2019 ). arXiv:1905.12654 http:\/\/arxiv.org\/abs\/1905.12654 Huan Wang, Stephan Zheng, Caiming Xiong, and Richard Socher. 2019. On the Generalization Gap in Reparameterizable Reinforcement Learning. CoRR abs\/1905.12654 (2019). arXiv:1905.12654 http:\/\/arxiv.org\/abs\/1905.12654"},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/2486001.2486020"},{"key":"e_1_3_2_2_67_1","unstructured":"Keith Winstein Anirudh Sivaraman Hari Balakrishnan etal 2013. Stochastic Forecasts Achieve High Throughput and Low Delay over Cellular Networks.. In NSDI. 459--471.  Keith Winstein Anirudh Sivaraman Hari Balakrishnan et al. 2013. Stochastic Forecasts Achieve High Throughput and Low Delay over Cellular Networks.. In NSDI. 459--471."},{"key":"e_1_3_2_2_68_1","volume-title":"Proceedings-IEEE INFOCOM","volume":"4","author":"Xu Lisong","year":"2004","unstructured":"Lisong Xu , Khaled Harfoush , and Injong Rhee . 2004 . Binary increase congestion control (BIC) for fast long-distance networks . In Proceedings-IEEE INFOCOM , Vol. 4 . IEEE, 2514--2524. Lisong Xu, Khaled Harfoush, and Injong Rhee. 2004. Binary increase congestion control (BIC) for fast long-distance networks. In Proceedings-IEEE INFOCOM, Vol. 4. IEEE, 2514--2524."},{"key":"e_1_3_2_2_69_1","unstructured":"Francis Y Yan Jestin Ma Greg D Hill Deepti Raghavan Riad S Wahby Philip Levis and Keith Winstein. 2018. Pantheon: the training ground for Internet congestion-control research. In 2018 {USENIX} Annual Technical Conference ({USENIX}{ATC} 18). 731--743.  Francis Y Yan Jestin Ma Greg D Hill Deepti Raghavan Riad S Wahby Philip Levis and Keith Winstein. 2018. Pantheon: the training ground for Internet congestion-control research. In 2018 {USENIX} Annual Technical Conference ({USENIX}{ATC} 18). 731--743."}],"event":{"name":"SIGCOMM '20: Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication","location":"Virtual Event USA","acronym":"SIGCOMM '20","sponsor":["SIGCOMM ACM Special Interest Group on Data Communication"]},"container-title":["Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3387514.3405892","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3387514.3405892","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:41:36Z","timestamp":1750200096000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3387514.3405892"}},"subtitle":["a Pragmatic Learning-Based Congestion Control for the Internet"],"short-title":[],"issued":{"date-parts":[[2020,7,30]]},"references-count":68,"alternative-id":["10.1145\/3387514.3405892","10.1145\/3387514"],"URL":"https:\/\/doi.org\/10.1145\/3387514.3405892","relation":{},"subject":[],"published":{"date-parts":[[2020,7,30]]},"assertion":[{"value":"2020-07-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}