{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T09:09:34Z","timestamp":1769936974535,"version":"3.49.0"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2021,2,13]],"date-time":"2021-02-13T00:00:00Z","timestamp":1613174400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,13]],"date-time":"2021-02-13T00:00:00Z","timestamp":1613174400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Sign Process Syst"],"published-print":{"date-parts":[[2021,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Whilst FPGAs have been used in cloud ecosystems, it is still extremely challenging to achieve high compute density when mapping heterogeneous multi-tasks on shared resources at runtime. This work addresses this by treating the FPGA resource as a service and employing multi-task processing at the high level, design space exploration and static off-line partitioning in order to allow more efficient mapping of heterogeneous tasks onto the FPGA. In addition, a new, comprehensive runtime functional simulator is used to evaluate the effect of various spatial and temporal constraints on both the existing and new approaches when varying system design parameters. A comprehensive suite of real high performance computing tasks was implemented on a Nallatech 385 FPGA card and show that our approach can provide on average 2.9 \u00d7 and 2.3 \u00d7 higher system throughput for compute and mixed intensity tasks, while 0.2 \u00d7 lower for memory intensive tasks due to external memory access latency and bandwidth limitations. The work has been extended by introducing a novel scheduling scheme to enhance temporal utilization of resources when using the proposed approach. Additional results for large queues of mixed intensity tasks (compute and memory) show that the proposed partitioning and scheduling approach can provide higher than 3 \u00d7 system speedup over previous schemes.<\/jats:p>","DOI":"10.1007\/s11265-020-01633-z","type":"journal-article","created":{"date-parts":[[2021,2,14]],"date-time":"2021-02-14T08:30:19Z","timestamp":1613291419000},"page":"587-602","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Evaluation of Static Mapping for Dynamic Space-Shared Multi-task Processing on FPGAs"],"prefix":"10.1007","volume":"93","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9702-3070","authenticated-orcid":false,"given":"Umar Ibrahim","family":"Minhas","sequence":"first","affiliation":[]},{"given":"Roger","family":"Woods","sequence":"additional","affiliation":[]},{"given":"Georgios","family":"Karakonstantis","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,13]]},"reference":[{"key":"1633_CR1","unstructured":"Developer Zone. Intel FPGA SDK for OpenCL. https:\/\/www.intel.co.uk\/content\/www\/uk\/en\/programmable\/products\/design-software\/embedded-software-developers\/opencl\/support.html (2018)."},{"key":"1633_CR2","doi-asserted-by":"crossref","unstructured":"Abdul-Rahman, O.A., & Aida, K. (2014). Towards understanding the usage behavior of Google cloud users: the mice and elephants phenomenon. In International conference on cloud computing technology and science: IEEE.","DOI":"10.1109\/CloudCom.2014.75"},{"key":"1633_CR3","unstructured":"Asanovic, K., Bodik, R., Catanzaro, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., et al. (2006). The landscape of parallel computing research: a view from berkeley. Technical Report UCB\/EECS-2006-183, EECS Department, University of California, Berkeley."},{"issue":"11","key":"1633_CR4","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1109\/TVLSI.2006.886411","volume":"14","author":"S Banerjee","year":"2006","unstructured":"Banerjee, S., Bozorgzadeh, E., Dutt, N.D. (2006). Integrating physical constraints in HW-SW partitioning for architectures with partial dynamic reconfiguration. IEEE Transactions on VLSI Systems, 14(11), 1189\u20131202.","journal-title":"IEEE Transactions on VLSI Systems"},{"key":"1633_CR5","doi-asserted-by":"crossref","unstructured":"Cattaneo, R., Bellini, R., Durelli, G., Pilato, C., Santambrogio, M.D., Sciuto, D. (2014). Para-sched: a reconfiguration-aware scheduler for reconfigurable architectures. In IEEE International parallel & distributed processing symposium workshops (pp. 243\u2013250).","DOI":"10.1109\/IPDPSW.2014.32"},{"issue":"4","key":"1633_CR6","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1109\/TCC.2016.2560158","volume":"6","author":"X Chang","year":"2016","unstructured":"Chang, X., Xia, R., Muppala, J.K., Trivedi, K.S., Liu, J. (2016). Effective modeling approach for IAAS data center performance analysis under heterogeneous workload. IEEE Transactions on Cloud Computing, 6(4), 991\u20131003.","journal-title":"IEEE Transactions on Cloud Computing"},{"key":"1633_CR7","doi-asserted-by":"crossref","unstructured":"Charitopoulos, G., Koidis, I., Papadimitriou, K., Pnevmatikatos, D. (2017). Run-time management of systems with partially reconfigurable FPGAs. Integration, the VLSI Journal, 57.","DOI":"10.1016\/j.vlsi.2016.11.008"},{"key":"1633_CR8","doi-asserted-by":"crossref","unstructured":"Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Lee, S.H., Skadron, K. (2009). Rodinia: a benchmark suite for heterogeneous computing. In IEEE International symposium on workload characterization (pp. 44\u201354).","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"1633_CR9","doi-asserted-by":"crossref","unstructured":"Chen, F., Shan, Y., Zhang, Y., Wang, Y., Franke, H., Chang, X., Wang, K. (2014). Enabling FPGAs in the cloud. In Proceedings of ACM conference on computing frontiers.","DOI":"10.1145\/2597917.2597929"},{"key":"1633_CR10","doi-asserted-by":"crossref","unstructured":"Eyerman, S., & Eeckhout, L. (2008). System-level performance metrics for multiprogram workloads. IEEE Micro, 28(3).","DOI":"10.1109\/MM.2008.44"},{"key":"1633_CR11","doi-asserted-by":"crossref","unstructured":"Gautier, Q., Althoff, A., Meng, P., Kastner, R. (2016). Spector: an openCL FPGA benchmark suite. In IEEE International conference on field-programmable technology (pp. 141\u2013148).","DOI":"10.1109\/FPT.2016.7929519"},{"issue":"5-6","key":"1633_CR12","doi-asserted-by":"publisher","first-page":"374","DOI":"10.1016\/j.micpro.2009.05.001","volume":"33","author":"Z Gu","year":"2009","unstructured":"Gu, Z., Liu, W., Xu, J., Cui, J., He, X., Deng, Q. (2009). Efficient algorithms for 2D area management and online task placement on runtime reconfigurable FPGAs. Microprocessors and Microsystems, 33 (5-6), 374\u2013387.","journal-title":"Microprocessors and Microsystems"},{"key":"1633_CR13","volume-title":"Clustering algorithms","author":"JA Hartigan","year":"1975","unstructured":"Hartigan, J.A. (1975). Clustering algorithms. New York: Wiley."},{"issue":"1","key":"1633_CR14","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1080\/00401706.1970.10488634","volume":"12","author":"AE Hoerl","year":"1970","unstructured":"Hoerl, A.E., & Kennard, R.W. (1970). Ridge regression: biased estimation for nonorthogonal problems. Technometrics, 12(1), 55\u201367.","journal-title":"Technometrics"},{"key":"1633_CR15","doi-asserted-by":"crossref","unstructured":"Huang, M., Wu, D., Yu, C.H., Fang, Z., Interlandi, M., Condie, T., Cong, J. (2016). Programming and runtime support to blaze FPGA accelerator deployment at datacenter scale. In Proceedings of ACM symposium on cloud computing (pp. 456\u2013469).","DOI":"10.1145\/2987550.2987569"},{"key":"1633_CR16","doi-asserted-by":"crossref","unstructured":"Huang, S., Huang, J., Dai, J., Xie, T., Huang, B. (2010). The HiBench benchmark suite: characterization of the Mapreduce-based data analysis. In IEEE International conference on data engineering workshops (pp. 41\u201351).","DOI":"10.1109\/ICDEW.2010.5452747"},{"issue":"2","key":"1633_CR17","doi-asserted-by":"publisher","first-page":"350","DOI":"10.1109\/TCAD.2017.2697952","volume":"37","author":"H Liang","year":"2018","unstructured":"Liang, H., Sinha, S., Zhang, W. (2018). Parallelizing hardware tasks on multicontext FPGA with efficient placement and scheduling algorithms. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 37(2), 350\u2013 363.","journal-title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems"},{"key":"1633_CR18","doi-asserted-by":"publisher","first-page":"396","DOI":"10.1109\/TMSCS.2017.2764087","volume":"4","author":"U Minhas","year":"2018","unstructured":"Minhas, U., Russell, M., Kaloutsakis, S., Barber, P., Woods, R., Georgakoudis, G., Gillan, C., Nikolopoulos, D., Bilas, A. (2018). Nanostreams: a microserver architecture for real-time analytics on fast data streams. IEEE Transactions on Multi-Scale Computing Systems, 4, 396\u2013409.","journal-title":"IEEE Transactions on Multi-Scale Computing Systems"},{"key":"1633_CR19","doi-asserted-by":"crossref","unstructured":"Minhas, U.I., Woods, R., Karakonstantis, G. (2019). Evaluation of FPGA partitioning schemes for time and space sharing of heterogeneous tasks. In International symposium on applied reconfigurable computing (pp. 334\u2013349).","DOI":"10.1007\/978-3-030-17227-5_24"},{"key":"1633_CR20","doi-asserted-by":"crossref","unstructured":"Minhas, U.I., Woods, R.F., Karakonstantis, G. (2018). Exploring functional acceleration of opencl on FPGAs and GPUs through platform-independent optimizations. In International symposium on applied reconfigurable computing (pp. 551\u2013 563).","DOI":"10.1007\/978-3-319-78890-6_44"},{"key":"1633_CR21","unstructured":"Page, L., Brin, S., Motwani, R., Winograd, T., et al. (1998). The Pagerank citation ranking: bringing order to the web."},{"key":"1633_CR22","unstructured":"Pham, K.D., Horta, E., Koch, D. (2017). Bitman: a tool and API for FPGA bitstream manipulations. In IEEE design, automation & test in europe conference & exhibition (pp. 894\u2013897)."},{"key":"1633_CR23","doi-asserted-by":"crossref","unstructured":"Redaelli, F., Santambrogio, M.D., Memik, S.O. (2009). An ILP formulation for the task graph scheduling problem tailored to bi-dimensional reconfigurable architectures. International Journal of Reconfigurable Computing.","DOI":"10.1155\/2009\/541067"},{"key":"1633_CR24","doi-asserted-by":"crossref","unstructured":"Reiss, C., Tumanov, A., Ganger, G.R., Katz, R.H., Kozuch, M.A. (2012). Heterogeneity and dynamicity of clouds at scale: google trace analysis. In Proceedings of ACM symposium on cloud computing.","DOI":"10.1145\/2391229.2391236"},{"key":"1633_CR25","doi-asserted-by":"crossref","unstructured":"Sengupta, D., Goswami, A., Schwan, K., Pallavi, K. (2014). Scheduling multi-tenant cloud workloads on accelerator-based systems. In IEEE Supercomputing conference.","DOI":"10.1109\/SC.2014.47"},{"key":"1633_CR26","doi-asserted-by":"crossref","unstructured":"Vaishnav, A., Pham, K.D., Koch, D. (2018). A survey on FPGA virtualization. In IEEE International conference on field programmable logic and applications (pp. 131\u20131317).","DOI":"10.1109\/FPL.2018.00031"},{"key":"1633_CR27","doi-asserted-by":"crossref","unstructured":"Vaishnav, A., Pham, K.D., Koch, D., Garside, J. (2018). Resource elastic virtualization for FPGAs using openCL. International Conference onField Programmable Logic and Applications (FPL).","DOI":"10.1109\/FPL.2018.00028"},{"key":"1633_CR28","doi-asserted-by":"crossref","unstructured":"Vipin, K., & Fahmy, S.A. (2012). Architecture-aware reconfiguration-centric floorplanning for partial reconfiguration. In International symposium on applied reconfigurable computing: Springer.","DOI":"10.1007\/978-3-642-28365-9_2"},{"key":"1633_CR29","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R. (2008). Large-scale parallel collaborative filtering for the Netflix prize. In International conference on algorithmic applications in management (pp. 337\u2013348): Springer.","DOI":"10.1007\/978-3-540-68880-8_32"}],"container-title":["Journal of Signal Processing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11265-020-01633-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11265-020-01633-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11265-020-01633-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,5,19]],"date-time":"2021-05-19T05:05:51Z","timestamp":1621400751000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11265-020-01633-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,13]]},"references-count":29,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,5]]}},"alternative-id":["1633"],"URL":"https:\/\/doi.org\/10.1007\/s11265-020-01633-z","relation":{},"ISSN":["1939-8018","1939-8115"],"issn-type":[{"value":"1939-8018","type":"print"},{"value":"1939-8115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,13]]},"assertion":[{"value":"30 April 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 October 2020","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 December 2020","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 February 2021","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}