{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T15:59:07Z","timestamp":1780675147677,"version":"3.54.1"},"reference-count":28,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2021,10,12]],"date-time":"2021-10-12T00:00:00Z","timestamp":1633996800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100009877","name":"Regione Calabria","doi-asserted-by":"publisher","award":["POR Calabria FSE\/FESR 2014-2020 International  Mobility of PhD students and research grants\/type A Researchers"],"award-info":[{"award-number":["POR Calabria FSE\/FESR 2014-2020 International  Mobility of PhD students and research grants\/type A Researchers"]}],"id":[{"id":"10.13039\/501100009877","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>Nowadays, computer vision relies heavily on convolutional neural networks (CNNs) to perform complex and accurate tasks. Among them, super-resolution CNNs represent a meaningful example, due to the presence of both convolutional (CONV) and transposed convolutional (TCONV) layers. While the former exploit multiply-and-accumulate (MAC) operations to extract features of interest from incoming feature maps (fmaps), the latter perform MACs to tune the spatial resolution of the received fmaps properly. The ever-growing real-time and low-power requirements of modern computer vision applications represent a stimulus for the research community to investigate the deployment of CNNs on well-suited hardware platforms, such as field programmable gate arrays (FPGAs). FPGAs are widely recognized as valid candidates for trading off computational speed and power consumption, thanks to their flexibility and their capability to also deal with computationally intensive models. In order to reduce the number of operations to be performed, this paper presents a novel hardware-oriented algorithm able to efficiently accelerate both CONVs and TCONVs. The proposed strategy was validated by employing it within a reconfigurable hardware accelerator purposely designed to adapt itself to different operating modes set at run-time. When characterized using the Xilinx XC7K410T FPGA device, the proposed accelerator achieved a throughput of up to 2022.2 GOPS and, in comparison to state-of-the-art competitors, it reached an energy efficiency up to 2.3 times higher, without compromising the overall accuracy.<\/jats:p>","DOI":"10.3390\/jimaging7100210","type":"journal-article","created":{"date-parts":[[2021,10,13]],"date-time":"2021-10-13T06:38:41Z","timestamp":1634107121000},"page":"210","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Design of Flexible Hardware Accelerators for Image Convolutions and Transposed Convolutions"],"prefix":"10.3390","volume":"7","author":[{"given":"Cristian","family":"Sestito","sequence":"first","affiliation":[{"name":"Department of Informatics, Modeling, Electronics and System Engineering, University of Calabria, 87036 Rende, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2197-4563","authenticated-orcid":false,"given":"Fanny","family":"Spagnolo","sequence":"additional","affiliation":[{"name":"Department of Informatics, Modeling, Electronics and System Engineering, University of Calabria, 87036 Rende, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1363-9201","authenticated-orcid":false,"given":"Stefania","family":"Perri","sequence":"additional","affiliation":[{"name":"Department of Mechanical, Energy and Management Engineering, University of Calabria, 87036 Rende, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2021,10,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1901","DOI":"10.1109\/TPAMI.2015.2491929","article-title":"HCP: A Flexible CNN Framework for Multi-Label Image Classification","volume":"38","author":"Wei","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1116","DOI":"10.1109\/TMI.2018.2878669","article-title":"HyperDense-Net: A Hyper-Densely Connected CNN for Multi-Modal Image Segmentation","volume":"38","author":"Dolz","year":"2019","journal-title":"IEEE Trans. Medic. Imaging"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Wu, R., Guo, X., Du, J., and Li, J. (2021). Accelerating Neural Network Inference on FPGA-Based Platforms\u2014A Survey. Electronics, 10.","DOI":"10.3390\/electronics10091025"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1109\/TPAMI.2015.2439281","article-title":"Image Super-Resolution Using Deep Convolutional Networks","volume":"38","author":"Dong","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_8","unstructured":"Chang, H., Yeung, D.Y., and Xiong, Y. (July, January 27). Super-resolution through neighbor embedding. Proceedings of the 2004 IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, USA."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2861","DOI":"10.1109\/TIP.2010.2050625","article-title":"\u201cImage super-resolution via sparse representation","volume":"19","author":"Yang","year":"2010","journal-title":"IEEE Trans. Image Process."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Dong, C., Loy, C.C., and Tang, X. (2016, January 8\u201316). Accelerating the super-resolution convolutional neural network. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46475-6_25"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1109\/TCSVT.2018.2888898","article-title":"An Energy-Efficient FPGA-Based Deconvolutional Neural Networks Accelerator for Single Image Super-Resolution","volume":"30","author":"Chang","year":"2020","journal-title":"IEEE Trans. Circ. Syst. Video Tech."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Perri, S., Sestito, C., Spagnolo, F., and Corsonello, P. (2020). Efficient Deconvolution Architecture for Heterogeneous Systems-on-Chip. J. Imaging, 6.","DOI":"10.3390\/jimaging6090085"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1867","DOI":"10.1109\/TVLSI.2020.3000519","article-title":"F-DNA: Fast Convolution Architecture for Deconvolutional Neural Network Acceleration","volume":"28","author":"Mao","year":"2020","journal-title":"IEEE Trans. VLSI"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Tang, Z., Luo, G., and Jiang, M. (2019, January 24\u201326). FTConv: FPGA Acceleration for Transposed Convolution Layers in Deep Neural Networks. Proceedings of the 2019 ACM\/SIGDA International Symposium on Field-Programmable Gate Arrays, Seaside, CA, USA.","DOI":"10.1145\/3289602.3293976"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Shi, B., Tang, Z., Luo, G., and Jiang, M. (2019, January 9\u201313). Winograd-based Real-Time Super-Resolution System on FPGA. Proceedings of the 2019 International Conference on Field-Programmable Technology, Tianjin, China.","DOI":"10.1109\/ICFPT47387.2019.00083"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2521","DOI":"10.1109\/TCSVT.2018.2864321","article-title":"A Real-Time Convolutional Neural Network for Super-Resolution on FPGA with Applications to 4k UHD 60 fps Video Services","volume":"29","author":"Kim","year":"2019","journal-title":"IEEE Trans. Circ. Syst. Video Tech."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"187754","DOI":"10.1109\/ACCESS.2020.3031055","article-title":"CNN Acceleration with Hardware-Efficient Dataflow for Super-Resolution","volume":"8","author":"Lee","year":"2020","journal-title":"IEEE Access"},{"key":"ref_18","unstructured":"Dumoulin, V., and Visin, F. (2021). A Guide to convolution arithmetic for deep learning. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1545","DOI":"10.1109\/TVLSI.2020.2995741","article-title":"Uni-OPU: An FPGA-Based Uniform Accelerator for Convolutional and Transposed Convolutional Networks","volume":"28","author":"Yu","year":"2020","journal-title":"IEEE Trans. VLSI"},{"key":"ref_20","unstructured":"Sestito, C., Spagnolo, F., Corsonello, P., and Perri, S. (2021, January 19\u201322). Run-Time Adaptive Hardware Accelerator for Convolutional Neural Networks. Proceedings of the 16th Conference on PhD Research in Microelectronics and Electronics, online."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Yazdanbakhsh, A., Brzozowki, M., Khaleghu, B., Ghodrati, S., Samadi, K., Kim, N.S., and Esmeilzadeh, H. (May, January 29). FlexiGAN: An End-to-End Solution for FPGA Acceleration of Generative Adversarial Networks. Proceedings of the 2018 IEEE 26th Annual Symposium on Field-Programmable Custom Computing Machines, Boulder, CO, USA.","DOI":"10.1109\/FCCM.2018.00019"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, D., Shen, J., Wen, M., and Zhang, C. (2019). Efficient Implementation of 2D and 3D Sparse Deconvolutional Neural Networks with a Uniform Architecture on FPGAs. Electronics, 8.","DOI":"10.3390\/electronics8070803"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Di, X., Yang, H.-G., Jia, Y., Huang, Z., and Mao, N. (2020). Exploring Efficient Acceleration Architecture for Winograd-Transformed Transposed Convolution of GAN on FPGAs. Electronics, 9.","DOI":"10.3390\/electronics9020286"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 11\u201318). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the 2015 IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_25","unstructured":"(2021, September 24). ZCU102Evaluation Board User Guide UG1182 (v1.6). Available online: https:\/\/www.xilinx.com\/content\/dam\/xilinx\/support\/documentation\/boards_and_kits\/zcu102\/ug1182-zcu102-eval-bd.pdf."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Huszar, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., and Wang, Z. (2016, January 27\u201330). Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. Proceedings of the the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.207"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1109\/TIP.2003.819861","article-title":"Image quality assessment: From error visibility to structural similarity","volume":"13","author":"Wang","year":"2004","journal-title":"IEEE Trans. Image Processing"},{"key":"ref_28","unstructured":"Bevilacqua, M., Roumy, A., Guillemot, C., and Morel, M.L.A. (2020, January 7\u201310). Low-complexity single-image super-resolution based on nonnegative neighbor embedding. Proceedings of the British Machine Vision Conference, Surrey, UK."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/7\/10\/210\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:12:10Z","timestamp":1760166730000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/7\/10\/210"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,12]]},"references-count":28,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2021,10]]}},"alternative-id":["jimaging7100210"],"URL":"https:\/\/doi.org\/10.3390\/jimaging7100210","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,12]]}}}