{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,1]],"date-time":"2022-04-01T14:03:24Z","timestamp":1648821804960},"reference-count":45,"publisher":"Institute of Electronics, Information and Communications Engineers (IEICE)","issue":"10","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IEICE Trans. Inf. &amp; Syst."],"published-print":{"date-parts":[[2018,10,1]]},"DOI":"10.1587\/transinf.2018edp7038","type":"journal-article","created":{"date-parts":[[2018,9,30]],"date-time":"2018-09-30T18:09:53Z","timestamp":1538330993000},"page":"2423-2435","source":"Crossref","is-referenced-by-count":0,"title":["Improving Per-Node Computing Efficiency by an Adaptive Lock-Free Scheduling Model"],"prefix":"10.1587","volume":"E101.D","author":[{"given":"Zhishuo","family":"ZHENG","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, South China University of Technology"}]},{"given":"Deyu","family":"QI","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, South China University of Technology"}]},{"given":"Naqin","family":"ZHOU","sequence":"additional","affiliation":[{"name":"Guangzhou University"}]},{"given":"Xinyang","family":"WANG","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, South China University of Technology"}]},{"given":"Mincong","family":"YU","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, South China University of Technology"}]}],"member":"532","reference":[{"key":"1","doi-asserted-by":"crossref","unstructured":"[1] J. Emeras, S. Varrette, M. Guzek, and P. Bouvry, \u201cEvalix: Classification and prediction of job resource consumption on hpc platforms,\u201d Job Scheduling Strategies for Parallel Processing, Lecture Notes in Computer Science, vol.10353, pp.102-122, 2017. 10.1007\/978-3-319-61756-5_6","DOI":"10.1007\/978-3-319-61756-5_6"},{"key":"2","doi-asserted-by":"crossref","unstructured":"[2] H. Li, D. Groep, and L. Wolters, \u201cWorkload characteristics of amulti-cluster supercomputer,\u201d International Conference on Job Scheduling Strategies for Parallel Processing, vol.3277, pp.176-193, 2004. 10.1007\/11407522_10","DOI":"10.1007\/11407522_10"},{"key":"3","doi-asserted-by":"crossref","unstructured":"[3] A. Batat and D.G. Feitelson, \u201cGang scheduling with memory considerations,\u201d Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000, pp.109-114, 2000. 10.1109\/ipdps.2000.845971","DOI":"10.1109\/IPDPS.2000.845971"},{"key":"4","doi-asserted-by":"crossref","unstructured":"[4] A.D. Breslow, A. Tiwari, M. Schulz, L. Carrington, L. Tang, and J. Mars, \u201cEnabling fair pricing on HPC systems with node sharing,\u201d SC &apos;13-International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2013. 10.1145\/2503210.2503256","DOI":"10.1145\/2503210.2503256"},{"key":"5","doi-asserted-by":"crossref","unstructured":"[5] M. Stillwell, F. Vivien, and H. Casanova, \u201cDynamic fractional resource scheduling for HPC workloads,\u201d IEEE International Symposium on Parallel and Distributed Processing, pp.1-12, 2010. 10.1109\/ipdps.2010.5470356","DOI":"10.1109\/IPDPS.2010.5470356"},{"key":"6","doi-asserted-by":"publisher","unstructured":"[6] Y. Zhang, H. Franke, J. Moreira, and A. Sivasubramaniam, \u201cAn integrated approach to parallel scheduling using gang-scheduling, backfilling, and migration,\u201d IEEE Transactions on Parallel and Distributed Systems, vol.14, no.3, pp.236-247, 2003. 10.1109\/tpds.2003.1189582","DOI":"10.1109\/TPDS.2003.1189582"},{"key":"7","doi-asserted-by":"crossref","unstructured":"[7] Y. Zhang, M.S. Squillante, A. Sivasubramaniam, and R.K. Sahoo, \u201cPerformance implications of failures in large-scale cluster scheduling,\u201d Lecture Notes in Computer Science, vol.3277, pp.233-252, 2004. 10.1007\/11407522_13","DOI":"10.1007\/11407522_13"},{"key":"8","doi-asserted-by":"crossref","unstructured":"[8] N. Fallenbeck, H.-J. Picht, M. Smith, and B. Freisleben, \u201cXen and the art of cluster scheduling,\u201d International Workshop on Virtualization Technology in Distributed Computing, p.4, 2006. 10.1109\/vtdc.2006.18","DOI":"10.1109\/VTDC.2006.18"},{"key":"9","doi-asserted-by":"crossref","unstructured":"[9] G. Fox, J. Qiu, S. Jha, S. Ekanayake, and S. Kamburugamuve, \u201cBig data, simulations and HPC convergence,\u201d Big Data Benchmarking, Lecture Notes in Computer Science, vol.10044, pp.3-17, 2015. 10.1007\/978-3-319-49748-8_1","DOI":"10.1007\/978-3-319-49748-8_1"},{"key":"10","doi-asserted-by":"publisher","unstructured":"[10] D.A. Reed and J. Dongarra, \u201cExascale computing and big data,\u201d Communications of the Acm, vol.58, no.7, pp.56-68, 2015. 10.1145\/2699414","DOI":"10.1145\/2699414"},{"key":"11","unstructured":"[11] G.P. Rodriguez, \u2018HPC scheduling in a brave new world\u2019, Ph.D. dissertation, Ume\u00e5 universitet, Ume\u00e5, 2017."},{"key":"12","doi-asserted-by":"publisher","unstructured":"[12] L. Adhianto and B. Chapman, \u201cPerformance modeling of communication and computation in hybrid mpi and openmp applications,\u201d Simulation Modelling Practice and Theory, vol.15, no.4, pp.481-491, 2007. 10.1016\/j.simpat.2006.11.014","DOI":"10.1016\/j.simpat.2006.11.014"},{"key":"13","doi-asserted-by":"crossref","unstructured":"[13] J. Ekanayake, S. Pallickara, and G. Fox, \u201cMapreduce for data intensive scientific analyses,\u201d IEEE Fourth International Conference on Escience, 2008. Escience, pp.277-284, 2009. 10.1109\/escience.2008.59","DOI":"10.1109\/eScience.2008.59"},{"key":"14","doi-asserted-by":"crossref","unstructured":"[14] D.G. Feitelson, \u201cWorkload modeling for performance evaluation,\u201d Performance Evaluation of Complex Systems:Techniques and Tools, vol.2459, pp.114-141, 2002. 10.1007\/3-540-45798-4_6","DOI":"10.1007\/3-540-45798-4_6"},{"key":"15","unstructured":"[15] J. Gray, \u201cSort benchmark home page,\u201d http:\/\/sortbenchmark.org\/"},{"key":"16","doi-asserted-by":"crossref","unstructured":"[16] D.G. Feitelson and L. Rudolph, \u201cToward convergence in job schedulers for parallel supercomputers,\u201d The Workshop on Job Scheduling Strategies for Parallel Processing, vol.1162, pp.1-26, 1996. 10.1007\/bfb0022284","DOI":"10.1007\/BFb0022284"},{"key":"17","doi-asserted-by":"crossref","unstructured":"[17] J.-P. Lozi, B. Lepers, J. Funston, F. Gaud, V. Qu\u00e9ma, and A. Fedorova, \u201cThe linux scheduler: a decade of wasted cores,\u201d Eleventh European Conference on Computer Systems, p.1, 2016. 10.1145\/2901318.2901326","DOI":"10.1145\/2901318.2901326"},{"key":"18","unstructured":"[18] S. Boyd-Wickizer, A.T. Clements, Y. Mao, A. Pesterev, M.F. Kaashoek, R. Morris, and N. Zeldovich, \u201cAn analysis of linux scalability to many cores,\u201d Proc. Usenix Symposium on Operating Systems Design and Implementation, OSDI 2010, October 4-6, 2010, Vancouver, Canada, pp.1-16, 2010."},{"key":"19","unstructured":"[19] J.A. Colmenares, S. Bird, H. Cook, P. Pearce, D. Zhu, J. Shalf, S. Hofmeyr, K. Asanovi, and J. Kubiatowicz, \u201cResource management in the tessellation manycore os,\u201d Usenix Workshop on Hot Topics in Parallelism, 2010."},{"key":"20","doi-asserted-by":"crossref","unstructured":"[20] B. Rhoden, K. Klues, D. Zhu, and E. Brewer, \u201cImproving per-node efficiency in the datacenter with new os abstractions,\u201d ACM Symposium on Cloud Computing, SOCC, pp.1-8, 2011. 10.1145\/2038916.2038941","DOI":"10.1145\/2038916.2038941"},{"key":"21","doi-asserted-by":"crossref","unstructured":"[21] C. Reiss, A. Tumanov, G.R. Ganger, R.H. Katz, and M.A. Kozuch, \u201cHeterogeneity and dynamicity of clouds at scale: Google trace analysis,\u201d ACM Symposium on Cloud Computing, pp.1-13, 2012. 10.1145\/2391229.2391236","DOI":"10.1145\/2391229.2391236"},{"key":"22","doi-asserted-by":"crossref","unstructured":"[22] S. Di, D. Kondo, and W. Cirne, \u201cCharacterization and comparison of google cloud load versus grids,\u201d 2012 IEEE International Conference on Cluster Computing, pp.230-238, 2012. 10.1109\/cluster.2012.35","DOI":"10.1109\/CLUSTER.2012.35"},{"key":"23","doi-asserted-by":"crossref","unstructured":"[23] H. You and H. Zhang, \u201cComprehensive workload analysis and modeling of a petascale supercomputer,\u201d Job Scheduling Strategies for Parallel Processing, Lecture Notes in Computer Science, vol.7698, pp.253-271, 2012. 10.1007\/978-3-642-35867-8_14","DOI":"10.1007\/978-3-642-35867-8_14"},{"key":"24","doi-asserted-by":"crossref","unstructured":"[24] A. Baumann, P. Barham, P.-E. Dagand, T. Harris, R. Isaacs, S. Peter, T. Roscoe, A. Sch\u00fcpbach, and A. Singhania, \u201cThe multikernel: a new os architecture for scalable multicore systems,\u201d ACM Symposium on Operating Systems Principles 2009, SOSP 2009, Big Sky, Montana, Usa, pp.29-44, Oct. 2009. 10.1145\/1629575.1629579","DOI":"10.1145\/1629575.1629579"},{"key":"25","unstructured":"[25] S. Boyd-Wickizer, H. Chen, R. Chen, Y. Mao, F. Kaashoek, R. Morris, A. Pesterev, L. Stein, M. Wu, and Y. Dai, \u201cCorey: An operating system for many cores,\u201d Usenix Conference on Operating Systems Design and Implementation, pp.43-57, 2008."},{"key":"26","doi-asserted-by":"publisher","unstructured":"[26] D. Wentzlaff and A. Agarwal, \u201cFactored operating systems (fos): the case for a scalable operating system for multicores,\u201d Acm Sigops Operating Systems Review, vol.43, no.2, pp.76-85, 2009. 10.1145\/1531793.1531805","DOI":"10.1145\/1531793.1531805"},{"key":"27","doi-asserted-by":"publisher","unstructured":"[27] N. Shavit and D. Touitou, \u201cSoftware transactional memory,\u201d Distributed Computing, vol.10, no.2, pp.99-116, 1997. 10.1007\/s004460050028","DOI":"10.1007\/s004460050028"},{"key":"28","doi-asserted-by":"publisher","unstructured":"[28] D.R. Engler, M.F. Kaashoek, and J. O&apos;Toole, \u201cExokernel: an operating system architecture for application-level resource management,\u201d Proc. ACM Symposium on Operating System Principles, vol.29, no.5, pp.251-266, 1995. 10.1145\/224057.224076","DOI":"10.1145\/224057.224076"},{"key":"29","unstructured":"[29] A. Baumann, S. Peter, A. Singhania, T. Roscoe, P. Barham, and R. Isaacs, \u201cYour computer is already a distributed system. Why isn&apos;t your os?,\u201d Conference on Hot Topics in Operating Systems, pp.12-12, 2009."},{"key":"30","doi-asserted-by":"crossref","unstructured":"[30] G.P. Rodrigo, P.O. \u00d6stberg, E. Elmroth, K. Antypas, R. Gerber, and L. Ramakrishnan, \u201cTowards understanding HPC users and systems: A nersc case study,\u201d Journal of Parallel and Distributed Computing, vol.111, 2017.","DOI":"10.1016\/j.jpdc.2017.09.002"},{"key":"31","doi-asserted-by":"publisher","unstructured":"[31] S. Peter, J. Li, I. Zhang, R.K.P. Dan, D. Woos, A. Krishnamurthy, T. Anderson, and T. Roscoe, \u201cArrakis: the operating system is the control plane,\u201d ACM Transactions on Computer Systems, vol.33, no.4, pp.1-30, 2015. 10.1145\/2812806","DOI":"10.1145\/2812806"},{"key":"32","doi-asserted-by":"publisher","unstructured":"[32] R. Sakellariou and R. Sakellariou, \u201cMapping Virtual Machines onto Physical Machines in Cloud Computing: A Survey,\u201d ACM Computing Surveys, vol.49, no.3, pp.1-30, 2016. 10.1145\/2983575","DOI":"10.1145\/2983575"},{"key":"33","doi-asserted-by":"publisher","unstructured":"[33] R.J. Creasy, \u201cThe origin of the vm\/370 time-sharing system,\u201d IBM J. Research and Development, vol.25, no.5, pp.483-490, 1981. 10.1147\/rd.255.0483","DOI":"10.1147\/rd.255.0483"},{"key":"34","doi-asserted-by":"publisher","unstructured":"[34] H. Alam, T. Zhang, M. Erez, and Y. Etsion, \u201cDo-it-yourself virtual memory translation,\u201d ACM\/IEEE International Symposium on Computer Architecture, pp.457-468, 2017. 10.1145\/3140659.3080209","DOI":"10.1145\/3140659.3080209"},{"key":"35","unstructured":"[35] Chen and Benjie, \u201cMultiprocessing with the exokernel operating system,\u201d Massachusetts Institute of Technology, vol.13, no.1, pp.5, 7, 18-24, 2000."},{"key":"36","doi-asserted-by":"publisher","unstructured":"[36] V. Uhlig, \u201cThe mechanics of in kernel synchronization for a scalable microkernel,\u201d ACM Sigops Operating Systems Review, vol.41, no.4, pp.49-58, 2007. 10.1145\/1278901.1278909","DOI":"10.1145\/1278901.1278909"},{"key":"37","doi-asserted-by":"crossref","unstructured":"[37] I. Watson, C. Kirkham, and M. Lujan, \u201cA study of a transactional parallel routing algorithm,\u201d International Conference on Parallel Architecture and Compilation Techniques, pp.388-398, 2007. 10.1109\/pact.2007.4336228","DOI":"10.1109\/PACT.2007.4336228"},{"key":"38","unstructured":"[38] J.T. Havill, W. Mao, and V. Dimitrov, \u201cImproved parallel job scheduling with overhead,\u201d Joint Conference on Information Sciences, 2008."},{"key":"39","doi-asserted-by":"crossref","unstructured":"[39] D. Akhmetova, G. Kestor, R. Gioiosa, S. Markidis, and E. Laure, \u201cOn the application task granularity and the interplay with the scheduling overhead in many-core shared memory systems,\u201d IEEE International Conference on CLUSTER Computing, pp.428-437, 2015. 10.1109\/cluster.2015.65","DOI":"10.1109\/CLUSTER.2015.65"},{"key":"40","doi-asserted-by":"crossref","unstructured":"[40] M. Schwarzkopf, A. Konwinski, M. Abd-El-Malek, and J. Wilkes, \u201cOmega: flexible, scalable schedulers for large compute clusters,\u201d ACM European Conference on Computer Systems, pp.351-364, 2013. 10.1145\/2465351.2465386","DOI":"10.1145\/2465351.2465386"},{"key":"41","unstructured":"[41] Parallel workloads archive, http:\/\/www.cs.huji.ac.il\/labs\/parallel\/workload\/"},{"key":"42","doi-asserted-by":"publisher","unstructured":"[42] S. Blagodurov, S. Zhuravlev, and A. Fedorova, \u201cContention-aware scheduling on multicore systems,\u201d ACM Transactions on Computer Systems, vol.28, no.4, pp.1-45, 2010. 10.1145\/1880018.1880019","DOI":"10.1145\/1880018.1880019"},{"key":"43","doi-asserted-by":"publisher","unstructured":"[43] J. Feliu, S. Petit, J. Sahuquillo, and J. Duato, \u201cCache-hierarchycontention-aware scheduling in cmps,\u201d IEEE Transactions on Parallel and Distributed Systems, vol.25, no.3, pp.581-590, 2014. 10.1109\/tpds.2013.61","DOI":"10.1109\/TPDS.2013.61"},{"key":"44","doi-asserted-by":"crossref","unstructured":"[44] X. Zhang, S. Dwarkadas, and K. Shen, \u201cTowards practical page coloring-based multicore cache management,\u201d Proc. 4th ACM european conference on Computer systems-EuroSys &apos;09, Nuremberg, Germany, pp.89-102, April 2009. 10.1145\/1519065.1519076","DOI":"10.1145\/1519065.1519076"},{"key":"45","doi-asserted-by":"crossref","unstructured":"[45] M. Frigo, C.E. Leiserson, and K.H. Randall, \u201cThe implementation of the cilk-5 multithreaded language,\u201d ACM Sigplan 1998 Conference on Programming Language Design and Implementation, pp.212-223, 1998. 10.1145\/277650.277725","DOI":"10.1145\/277652.277725"}],"container-title":["IEICE Transactions on Information and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E101.D\/10\/E101.D_2018EDP7038\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,10,24]],"date-time":"2019-10-24T23:54:30Z","timestamp":1571961270000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E101.D\/10\/E101.D_2018EDP7038\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,1]]},"references-count":45,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2018]]}},"URL":"https:\/\/doi.org\/10.1587\/transinf.2018edp7038","relation":{},"ISSN":["0916-8532","1745-1361"],"issn-type":[{"value":"0916-8532","type":"print"},{"value":"1745-1361","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,10,1]]}}}