{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T14:13:25Z","timestamp":1760710405322,"version":"3.37.3"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T00:00:00Z","timestamp":1604620800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T00:00:00Z","timestamp":1604620800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"funder":[{"DOI":"10.13039\/100000181","name":"Air Force Office of Scientific Research","doi-asserted-by":"publisher","award":["AFOSR Grant FA9550-17-1-0205"],"award-info":[{"award-number":["AFOSR Grant FA9550-17-1-0205"]}],"id":[{"id":"10.13039\/100000181","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Sign Process Syst"],"published-print":{"date-parts":[[2021,4]]},"DOI":"10.1007\/s11265-020-01604-4","type":"journal-article","created":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T02:02:59Z","timestamp":1604628179000},"page":"391-403","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["IterML: Iterative Machine Learning for Intelligent Parameter Pruning and Tuning in Graphics Processing Units"],"prefix":"10.1007","volume":"93","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0173-5215","authenticated-orcid":false,"given":"Xuewen","family":"Cui","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wu-chun","family":"Feng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,11,6]]},"reference":[{"issue":"3","key":"1604_CR1","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1080\/00031305.1992.10475879","volume":"46","author":"NS Altman","year":"1992","unstructured":"Altman, N. S. (1992). An introduction to kernel and nearest-neighbor nonparametric regression. The American Statistician, 46(3), 175\u2013185.","journal-title":"The American Statistician"},{"key":"1604_CR2","doi-asserted-by":"crossref","unstructured":"Breiman, L. (2017). Classification and regression trees. Routledge.","DOI":"10.1201\/9781315139470"},{"key":"1604_CR3","unstructured":"Choi, J. W., Singh, A., & Vuduc, R. W. (2010). Model-driven autotuning of sparse matrix-vector multiply on gpus. In ACM Sigplan notices, (Vol. 45 pp. 115\u2013126): ACM."},{"key":"1604_CR4","unstructured":"Cui, X., Scogland, T. R., de Supinski, B. R., & Feng, W. C. (2017). Directive-based partitioning and pipelining for graphics processing units. In Parallel and distributed processing symposium (IPDPS), 2017 IEEE international (pp. 575\u2013584): IEEE."},{"key":"1604_CR5","unstructured":"Dongarra, J. J., Meuer, H. W., & Strohmaier, E. (1994). Top500 Supercomputer sites."},{"key":"1604_CR6","unstructured":"Drucker, H., Burges, C. J., Kaufman, L., Smola, A. J., & Vapnik, V. (1997). Support vector regression machines. In Advances in neural information processing systems (pp. 155\u2013161)."},{"key":"1604_CR7","unstructured":"Hong, S., & Kim, H. (2009). An analytical model for a gpu architecture with memory-level and thread-level parallelism awareness. In ACM SIGARCH Computer architecture news, (Vol. 37 pp. 152\u2013163): ACM."},{"key":"1604_CR8","unstructured":"Hou, K., Feng, W. C., & Che, S. (2017). Auto-tuning strategies for parallelizing sparse matrix-vector (spmv) multiplication on multi-and many-core processors. In Parallel and distributed processing symposium workshops (IPDPSW), 2017 IEEE international (pp. 713\u2013722): IEEE."},{"key":"1604_CR9","unstructured":"Hou, K., Wang, H., & Feng, W. C. (2017). Gpu-unicache: Automatic code generation of spatial blocking for stencils on gpus. In Proceedings of the computing frontiers conference (pp. 107\u2013116): ACM."},{"key":"1604_CR10","unstructured":"Hou, K., Wang, H., Feng, W. C., Vetter, J. S., & Lee, S. (2018). Highly efficient compensation-based parallelism for wavefront loops on gpus. In 2018 IEEE International parallel and distributed processing symposium (IPDPS) (pp. 276\u2013285): IEEE."},{"key":"1604_CR11","unstructured":"Johnson, N. (2013). Epcc openacc benchmark suite."},{"key":"1604_CR12","unstructured":"Joseph, P., Vaswani, K., & Thazhuthaveetil, M. J. (2006). Construction and use of linear regression models for processor performance analysis. In The twelfth international symposium on high-performance computer architecture, 2006 (pp. 99\u2013108): IEEE."},{"key":"1604_CR13","unstructured":"Lee, R., Wang, H., & Zhang, X. (2018). Software-defined software: a perspective of machine learning-based software production. In 2018 IEEE 38th international conference on distributed computing systems (ICDCS) (pp. 1270\u20131275): IEEE."},{"key":"1604_CR14","unstructured":"Li, W., Jin, G., Cui, X., & See, S. (2015). An evaluation of unified memory technology on nvidia gpus. In 2015 15th IEEE\/ACM international symposium on cluster, cloud and grid computing (pp. 1092\u20131098): IEEE."},{"key":"1604_CR15","unstructured":"Li, Y., Chang, K., Bel, O., Miller, E. L., & Long, D. D. (2017). Capes: unsupervised storage performance tuning using neural network-based deep reinforcement learning. In Proceedings of the international conference for high performance computing, networking, storage and analysis (p. 42): ACM."},{"key":"1604_CR16","unstructured":"Li, Y., Dongarra, J., & Tomov, S. (2009). A note on auto-tuning gemm for gpus. In International conference on computational science (pp. 884\u2013892): Springer."},{"issue":"3","key":"1604_CR17","first-page":"18","volume":"2","author":"A Liaw","year":"2002","unstructured":"Liaw, A., Wiener, M., & et al. (2002). Classification and regression by randomforest. R News, 2(3), 18\u201322.","journal-title":"R News"},{"issue":"2","key":"1604_CR18","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1145\/2636342","volume":"47","author":"S Mittal","year":"2015","unstructured":"Mittal, S., & Vetter, J. S. (2015). A survey of methods for analyzing and improving gpu energy efficiency. ACM Computing Surveys (CSUR), 47(2), 19.","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"1604_CR19","doi-asserted-by":"crossref","unstructured":"Pal, S. K., & Mitra, S. (1992). Multilayer perceptron, fuzzy sets, classification.","DOI":"10.1109\/72.159058"},{"key":"1604_CR20","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., & et al. (2011). Scikit-learn: machine learning in python. Journal of Machine Learning Research, 12, 2825\u20132830.","journal-title":"Journal of Machine Learning Research"},{"key":"1604_CR21","unstructured":"Pouchet, L.N. (2012). Polybench: the polyhedral benchmark suite. URL: http:\/\/www.cs.ucla.edu\/pouchet\/software\/polybench."},{"key":"1604_CR22","unstructured":"Ryoo, S., Rodrigues, C. I., Stone, S. S., Baghsorkhi, S. S., Ueng, S. Z., Stratton, J. A., & Hwu, W. M. W. (2008). Program optimization space pruning for a multithreaded gpu. In Proceedings of the 6th annual IEEE\/ACM international symposium on Code generation and optimization (pp. 195\u2013204): ACM."},{"issue":"3","key":"1604_CR23","doi-asserted-by":"publisher","first-page":"2133","DOI":"10.1007\/s10586-017-1003-4","volume":"20","author":"NP Tran","year":"2017","unstructured":"Tran, N. P., Lee, M., & Choi, J. (2017). Parameter based tuning model for optimizing performance on gpu. Cluster Computing, 20(3), 2133\u20132142.","journal-title":"Cluster Computing"}],"container-title":["Journal of Signal Processing Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11265-020-01604-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11265-020-01604-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11265-020-01604-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,17]],"date-time":"2024-08-17T00:28:12Z","timestamp":1723854492000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11265-020-01604-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,6]]},"references-count":23,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,4]]}},"alternative-id":["1604"],"URL":"https:\/\/doi.org\/10.1007\/s11265-020-01604-4","relation":{},"ISSN":["1939-8018","1939-8115"],"issn-type":[{"type":"print","value":"1939-8018"},{"type":"electronic","value":"1939-8115"}],"subject":[],"published":{"date-parts":[[2020,11,6]]},"assertion":[{"value":"5 November 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 July 2020","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 October 2020","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 November 2020","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}