{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T02:16:32Z","timestamp":1775873792305,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":41,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,4,4]],"date-time":"2017-04-04T00:00:00Z","timestamp":1491264000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1464216 and 1618912"],"award-info":[{"award-number":["1464216 and 1618912"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,4,4]]},"DOI":"10.1145\/3037697.3037742","type":"proceedings-article","created":{"date-parts":[[2017,4,5]],"date-time":"2017-04-05T08:47:40Z","timestamp":1491382060000},"page":"483-496","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":40,"title":["FLEP"],"prefix":"10.1145","author":[{"given":"Bo","family":"Wu","sequence":"first","affiliation":[{"name":"Colorado School of Mines, Golden, CO, USA"}]},{"given":"Xu","family":"Liu","sequence":"additional","affiliation":[{"name":"College of William and Mary, Williamsburg, VA, USA"}]},{"given":"Xiaobo","family":"Zhou","sequence":"additional","affiliation":[{"name":"University of Colorado, Colorado Springs, Colorado Springs, CO, USA"}]},{"given":"Changjun","family":"Jiang","sequence":"additional","affiliation":[{"name":"Tongji University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2017,4,4]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"clang: a C language family frontend for LLVM. http:\/\/clang.llvm.org\/; accessed 23-02-2016.  clang: a C language family frontend for LLVM. http:\/\/clang.llvm.org\/; accessed 23-02-2016."},{"key":"e_1_3_2_1_2_1","unstructured":"NVLink Communication Protocol. https:\/\/en.wikipedia.org\/wiki\/NVLink.  NVLink Communication Protocol. https:\/\/en.wikipedia.org\/wiki\/NVLink."},{"key":"e_1_3_2_1_3_1","unstructured":"OpenCL. http:\/\/www.khronos.org\/opencl\/.  OpenCL. http:\/\/www.khronos.org\/opencl\/."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2012.6168946"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ECRTS.2012.15"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"e_1_3_2_1_7_1","volume-title":"A software framework for efficient preemptive scheduling on gpu. Technical report","author":"Chen G.","year":"2016","unstructured":"G. Chen , X. Shen , and H. Zhou . A software framework for efficient preemptive scheduling on gpu. Technical report , North Carolina State University , 2016 . G. Chen, X. Shen, and H. Zhou. A software framework for efficient preemptive scheduling on gpu. Technical report, North Carolina State University, 2016."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3018743.3018748"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872362.2872368"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735702"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2408776.2408794"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/2813767.2813806"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/RTCSA.2011.46"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2008.44"},{"key":"e_1_3_2_1_15_1","volume-title":"Presented as part of the 4th USENIX Workshop on Hot Topics in Parallelism","author":"Gregg C.","year":"2012","unstructured":"C. Gregg , J. Dorn , K. Hazelwood , and K. Skadron . Fine-grained resource sharing for concurrent gpgpu kernels . In Presented as part of the 4th USENIX Workshop on Hot Topics in Parallelism , Berkeley, CA , 2012 . USENIX. C. Gregg, J. Dorn, K. Hazelwood, and K. Skadron. Fine-grained resource sharing for concurrent gpgpu kernels. In Presented as part of the 4th USENIX Workshop on Hot Topics in Parallelism, Berkeley, CA, 2012. USENIX."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/InPar.2012.6339596"},{"key":"e_1_3_2_1_17_1","volume-title":"The Datacenter As a Computer: An Introduction to the Design of Warehouse-Scale Machines","author":"Hoelzle U.","year":"2009","unstructured":"U. Hoelzle and L. A. Barroso . The Datacenter As a Computer: An Introduction to the Design of Warehouse-Scale Machines . Morgan and Claypool Publishers , 1 st edition, 2009 . U. Hoelzle and L. A. Barroso. The Datacenter As a Computer: An Introduction to the Design of Warehouse-Scale Machines. Morgan and Claypool Publishers, 1st edition, 2009.","edition":"1"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818950.2818979"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/2120959.2121120"},{"key":"e_1_3_2_1_20_1","first-page":"2","volume-title":"Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, USENIXATC'11","author":"Kato S.","year":"2011","unstructured":"S. Kato , K. Lakshmanan , R. Rajkumar , and Y. Ishikawa . Timegraph: Gpu scheduling for real-time multi-tasking environments . In Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, USENIXATC'11 , pages 2 -- 2 , Berkeley, CA, USA , 2011 . USENIX Association. S. Kato, K. Lakshmanan, R. Rajkumar, and Y. Ishikawa. Timegraph: Gpu scheduling for real-time multi-tasking environments. In Proceedings of the 2011 USENIX Conference on USENIX Annual Technical Conference, USENIXATC'11, pages 2--2, Berkeley, CA, USA, 2011. USENIX Association."},{"key":"e_1_3_2_1_21_1","first-page":"401","volume-title":"Presented as part of the 2012 USENIX Annual Technical Conference (USENIX ATC 12)","author":"Kato S.","year":"2012","unstructured":"S. Kato , M. McThrow , C. Maltzahn , and S. Brandt . Gdev: First-class gpu resource management in the operating system . In Presented as part of the 2012 USENIX Annual Technical Conference (USENIX ATC 12) , pages 401 -- 412 , Boston, MA , 2012 . USENIX. S. Kato, M. McThrow, C. Maltzahn, and S. Brandt. Gdev: First-class gpu resource management in the operating system. In Presented as part of the 2012 USENIX Annual Technical Conference (USENIX ATC 12), pages 401--412, Boston, MA, 2012. USENIX."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2731186.2731192"},{"key":"e_1_3_2_1_23_1","volume-title":"Reordering GPU kernel launches to enable efficient concurrent execution. CoRR, abs\/1511.07983","author":"Li T.","year":"2015","unstructured":"T. Li , V. K. Narayana , and T. A. El-Ghazawi . Reordering GPU kernel launches to enable efficient concurrent execution. CoRR, abs\/1511.07983 , 2015 . T. Li, V. K. Narayana, and T. A. El-Ghazawi. Reordering GPU kernel launches to enable efficient concurrent execution. CoRR, abs\/1511.07983, 2015."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2014.2313342"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2854038.2854040"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/SFFCS.1999.814615"},{"key":"e_1_3_2_1_27_1","unstructured":"NVIDIA. Cuda software development toolkit v7.0\\\\.texttt https:\/\/developer.nvidia.com\/cuda-toolkit-70.  NVIDIA. Cuda software development toolkit v7.0\\\\.texttt https:\/\/developer.nvidia.com\/cuda-toolkit-70."},{"key":"e_1_3_2_1_28_1","unstructured":"NVIDIA. Nvidia's next generation cuda computer architecture: Fermi. Technical report.  NVIDIA. Nvidia's next generation cuda computer architecture: Fermi. Technical report."},{"key":"e_1_3_2_1_29_1","volume-title":"Next generation cuda computer architecture kepler gk110. Technical report","author":"NVIDIA.","year":"2012","unstructured":"NVIDIA. Next generation cuda computer architecture kepler gk110. Technical report , 2012 . NVIDIA. Next generation cuda computer architecture kepler gk110. Technical report, 2012."},{"key":"e_1_3_2_1_30_1","volume-title":"Sharing a gpu between mpi processes: multi-process service (mps) overview. Technical report","author":"NVIDIA.","year":"2013","unstructured":"NVIDIA. Sharing a gpu between mpi processes: multi-process service (mps) overview. Technical report , 2013 . NVIDIA. Sharing a gpu between mpi processes: multi-process service (mps) overview. Technical report, 2013."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451160"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2694344.2694346"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2043556.2043579"},{"key":"e_1_3_2_1_34_1","first-page":"109","volume-title":"2014 USENIX Annual Technical Conference (USENIX ATC 14)","author":"Suzuki Y.","year":"2014","unstructured":"Y. Suzuki , S. Kato , H. Yamada , and K. Kono . Gpuvm: Why not virtualizing gpus at the hypervisor? In 2014 USENIX Annual Technical Conference (USENIX ATC 14) , pages 109 -- 120 , Philadelphia, PA , June 2014 . USENIX Association. Y. Suzuki, S. Kato, H. Yamada, and K. Kono. Gpuvm: Why not virtualizing gpus at the hypervisor? In 2014 USENIX Annual Technical Conference (USENIX ATC 14), pages 109--120, Philadelphia, PA, June 2014. USENIX Association."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2014.6853208"},{"key":"e_1_3_2_1_36_1","first-page":"121","volume-title":"2014 USENIX Annual Technical Conference (USENIX ATC 14)","author":"Tian K.","year":"2014","unstructured":"K. Tian , Y. Dong , and D. Cowperthwaite . A full gpu virtualization solution with mediated pass-through . In 2014 USENIX Annual Technical Conference (USENIX ATC 14) , pages 121 -- 132 , Philadelphia, PA , June 2014 . USENIX Association. K. Tian, Y. Dong, and D. Cowperthwaite. A full gpu virtualization solution with mediated pass-through. In 2014 USENIX Annual Technical Conference (USENIX ATC 14), pages 121--132, Philadelphia, PA, June 2014. USENIX Association."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2591971.2592002"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732967.2732976"},{"key":"e_1_3_2_1_39_1","volume-title":"Simultaneous multikernel: Fine-grained sharing of gpgpus","author":"Wang Z.","year":"2015","unstructured":"Z. Wang , J. Yang , R. Melhem , B. Childers , Y. Zhang , and M. Guo . Simultaneous multikernel: Fine-grained sharing of gpgpus . IEEE COMPUTER ARCHITECTURE LETTERS , PP(99):748--760, 2015 . Z. Wang, J. Yang, R. Melhem, B. Childers, Y. Zhang, and M. Guo. Simultaneous multikernel: Fine-grained sharing of gpgpus. IEEE COMPUTER ARCHITECTURE LETTERS, PP(99):748--760, 2015."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2751205.2751213"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/RTAS.2015.7108420"}],"event":{"name":"ASPLOS '17: Architectural Support for Programming Languages and Operating Systems","location":"Xi'an China","acronym":"ASPLOS '17","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages","SIGOPS ACM Special Interest Group on Operating Systems","SIGARCH ACM Special Interest Group on Computer Architecture","SIGBED ACM Special Interest Group on Embedded Systems"]},"container-title":["Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3037697.3037742","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3037697.3037742","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3037697.3037742","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:03:11Z","timestamp":1750201391000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3037697.3037742"}},"subtitle":["Enabling Flexible and Efficient Preemption on GPUs"],"short-title":[],"issued":{"date-parts":[[2017,4,4]]},"references-count":41,"alternative-id":["10.1145\/3037697.3037742","10.1145\/3037697"],"URL":"https:\/\/doi.org\/10.1145\/3037697.3037742","relation":{"is-identical-to":[{"id-type":"doi","id":"10.1145\/3093336.3037742","asserted-by":"object"},{"id-type":"doi","id":"10.1145\/3093337.3037742","asserted-by":"object"}]},"subject":[],"published":{"date-parts":[[2017,4,4]]},"assertion":[{"value":"2017-04-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}