{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T03:18:45Z","timestamp":1760239125981,"version":"build-2065373602"},"reference-count":35,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T00:00:00Z","timestamp":1602547200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>We propose a novel end-to-end image colorization framework which integrates attention mechanism and a learnable adaptive normalization function. In contrast to previous colorization methods that directly generate the whole image, we believe that the color of the significant area determines the quality of the colorized image. The attention mechanism uses the attention map which is obtained by the auxiliary classifier to guide our framework to produce more subtle content and visually pleasing color in salient visual regions. Furthermore, we apply Adaptive Group Instance Normalization (AGIN) function to promote our framework to generate vivid colorized images flexibly, under the circumstance that we consider colorization as a particular style transfer task. Experiments show that our model is superior to previous the state-of-the-art models in coloring foreground objects.<\/jats:p>","DOI":"10.3390\/info11100479","type":"journal-article","created":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T21:48:38Z","timestamp":1602625718000},"page":"479","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Attentional Colorization Networks with Adaptive Group-Instance Normalization"],"prefix":"10.3390","volume":"11","author":[{"given":"Yuzhen","family":"Gao","sequence":"first","affiliation":[{"name":"Shanghai Film Academy, Shanghai University, Shanghai 200444, China"},{"name":"Shanghai Engineering Research Center for Motion Picture Special Effects, Shanghai 200444, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Youdong","family":"Ding","sequence":"additional","affiliation":[{"name":"Shanghai Film Academy, Shanghai University, Shanghai 200444, China"},{"name":"Shanghai Engineering Research Center for Motion Picture Special Effects, Shanghai 200444, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fei","family":"Wang","sequence":"additional","affiliation":[{"name":"Shanghai Film Academy, Shanghai University, Shanghai 200444, China"},{"name":"Shanghai Engineering Research Center for Motion Picture Special Effects, Shanghai 200444, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Huan","family":"Liang","sequence":"additional","affiliation":[{"name":"Shanghai Film Academy, Shanghai University, Shanghai 200444, China"},{"name":"Shanghai Engineering Research Center for Motion Picture Special Effects, Shanghai 200444, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,10,13]]},"reference":[{"key":"ref_1","unstructured":"Kim, J., Kim, M., Kang, H., and Lee, K. (2019). U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation. arXiv."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wu, Y., and He, K. (2018, January 8\u201314). Group Normalization. Proceedings of the Computer Vision\u2014ECCV 2018\u20145th European Conference, Part XIII, Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_1"},{"key":"ref_3","unstructured":"Ulyanov, D., Vedaldi, A., and Lempitsky, V.S. (2016). Instance Normalization: The Missing Ingredient for Fast Stylization. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"240","DOI":"10.4304\/jmm.4.4.240-247","article-title":"Film Colorization Using Texture Feature Coding and Artificial Neural Networks","volume":"4","author":"Koleini","year":"2009","journal-title":"J. Multim."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Cheng, Z., Yang, Q., and Sheng, B. (2015, January 7\u201313). Deep Colorization. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.55"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Putri, V.K., and Fanany, M.I. (2017, January 19\u201321). Sketch plus colorization deep convolutional neural networks for photos generation from sketches. Proceedings of the 2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI), Yogyakarta, Indonesia.","DOI":"10.1109\/EECSI.2017.8239116"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Vitynskyi, P., Tkachenko, R., Izonin, I., and Kutucu, H. (2018, January 21\u201325). Hybridization of the SGTM neural-like structure through inputs polynomial extension. Proceedings of the 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP), Lviv, Ukraine.","DOI":"10.1109\/DSMP.2018.8478456"},{"key":"ref_8","unstructured":"Radford, A., Metz, L., and Chintala, S. (2016, January 2\u20134). Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. Proceedings of the 4th International Conference on Learning Representations (ICLR 2016), San Juan, Puerto Rico."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Isola, P., Zhu, J., Zhou, T., and Efros, A.A. (2017, January 21\u201326). Image-to-Image Translation with Conditional Adversarial Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.632"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Karras, T., Laine, S., and Aila, T. (2019, January 16\u201320). A Style-Based Generator Architecture for Generative Adversarial Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00453"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhu, J., Park, T., Isola, P., and Efros, A.A. (2017, January 22\u201329). Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy.","DOI":"10.1109\/ICCV.2017.244"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1145\/1015706.1015780","article-title":"Colorization using optimization","volume":"23","author":"Levin","year":"2004","journal-title":"ACM Trans. Graph."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zhang, R., Zhu, J., Isola, P., Geng, X., Lin, A.S., Yu, T., and Efros, A.A. (2017). Real-time user-guided image colorization with learned deep priors. arXiv.","DOI":"10.1145\/3072959.3073703"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Sangkloy, P., Lu, J., Fang, C., Yu, F., and Hays, J. (2017, January 21\u201326). Scribbler: Controlling Deep Image Synthesis with Sketch and Color. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.723"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1145\/566654.566576","article-title":"Transferring color to greyscale images","volume":"21","author":"Welsh","year":"2002","journal-title":"ACM Trans. Graph."},{"key":"ref_16","unstructured":"Tai, Y., Jia, J., and Tang, C. (2005, January 20\u201326). Local Color Transfer via Probabilistic Segmentation by Expectation-Maximization. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), San Diego, CA, USA."},{"key":"ref_17","first-page":"1","article-title":"Deep exemplar-based colorization","volume":"37","author":"He","year":"2018","journal-title":"ACM Trans. Graph."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhang, B., He, M., Liao, J., Sander, P.V., Yuan, L., Bermak, A., and Chen, D. (2019, January 16\u201320). Deep Exemplar-Based Video Colorization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00824"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Yoo, S., Bahng, H., Chung, S., Lee, J., Chang, J., and Choo, J. (2019, January 16\u201320). Coloring With Limited Data: Few-Shot Colorization via Memory Augmented Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01154"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Larsson, G., Maire, M., and Shakhnarovich, G. (2016, January 11\u201314). Learning Representations for Automatic Colorization. Proceedings of the Computer Vision\u2014ECCV 2016\u201414th European Conference, Part IV, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46493-0_35"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2897824.2925974","article-title":"Let there be color!: Joint end-to-end learning of global and local image priors for automatic image colorization with simultaneous classification","volume":"35","author":"Iizuka","year":"2016","journal-title":"ACM Trans. Graph."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhang, R., Isola, P., and Efros, A.A. (2016, January 11\u201314). Colorful Image Colorization. Proceedings of the Computer Vision\u2014ECCV 2016\u201414th European Conference, Part III, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46487-9_40"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Messaoud, S., Forsyth, D.A., and Schwing, A.G. (2018, January 8\u201314). Structural Consistency and Controllability for Diverse Colorization. Proceedings of the Computer Vision\u2014ECCV 2018\u201415th European Conference, Part VI, Munich, Germany.","DOI":"10.1007\/978-3-030-01231-1_37"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Cao, Y., Zhou, Z., Zhang, W., and Yu, Y. (2017, January 18\u201322). Unsupervised Diverse Colorization via Generative Adversarial Networks. Proceedings of the Machine Learning and Knowledge Discovery in Databases\u2014European Conference (ECML PKDD 2017), Part I, Skopje, Macedonia.","DOI":"10.1007\/978-3-319-71249-9_10"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Zhao, J., Han, J., Shao, L., and Snoek, C.G.M. (2019). Pixelated Semantic Colorization. arXiv.","DOI":"10.1007\/s11263-019-01271-4"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhou, B., Khosla, A., Lapedriza, \u00c0., Oliva, A., and Torralba, A. (2016, January 27\u201330). Learning Deep Features for Discriminative Localization. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.319"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"},{"key":"ref_28","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 32nd International Conference on Machine Learning (ICML 2015), Lille, France."},{"key":"ref_29","unstructured":"Ba, L.J., Kiros, J.R., and Hinton, G.E. (2016). Layer Normalization. arXiv."},{"key":"ref_30","unstructured":"Nam, H., and Kim, H. (2018, January 3\u20138). Batch-Instance Normalization for Adaptively Style-Invariant Neural Networks. Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018 (NeurIPS 2018), Montr\u00e9al, QC, Canada."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Huang, X., and Belongie, S.J. (2017, January 22\u201329). Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization. Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy.","DOI":"10.1109\/ICCV.2017.167"},{"key":"ref_32","unstructured":"Dumoulin, V., Shlens, J., and Kudlur, M. (2017, January 24\u201326). A Learned Representation For Artistic Style. Proceedings of the 5th International Conference on Learning Representations (ICLR 2017), Toulon, France."},{"key":"ref_33","unstructured":"Kingma, D.P., and Ba, J. (2015, January 7\u20139). Adam: A Method for Stochastic Optimization. Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015), San Diego, CA, USA."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1007\/978-3-319-10602-1_48","article-title":"Microsoft COCO: Common Objects in Context. ECCV (5)","volume":"8693","author":"Lin","year":"2014","journal-title":"Lect. Notes Comput. Sci."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1007\/s11263-016-0981-7","article-title":"Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations","volume":"123","author":"Krishna","year":"2017","journal-title":"Int. J. Comput. Vis."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/11\/10\/479\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:20:32Z","timestamp":1760178032000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/11\/10\/479"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,13]]},"references-count":35,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2020,10]]}},"alternative-id":["info11100479"],"URL":"https:\/\/doi.org\/10.3390\/info11100479","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2020,10,13]]}}}