{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,29]],"date-time":"2026-01-29T21:08:24Z","timestamp":1769720904270,"version":"3.49.0"},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2017,10,23]],"date-time":"2017-10-23T00:00:00Z","timestamp":1508716800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61773361, 61473273, 91546122, 61573335, 61602438"],"award-info":[{"award-number":["61773361, 61473273, 91546122, 61573335, 61602438"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Guangdong provincial science and technology plan projects","award":["2015 B010109005"],"award-info":[{"award-number":["2015 B010109005"]}]},{"name":"the Youth Innovation Promotion Association CAS","award":["2017146"],"award-info":[{"award-number":["2017146"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2018,3,31]]},"abstract":"<jats:p>Transfer learning has gained a lot of attention and interest in the past decade. One crucial research issue in transfer learning is how to find a good representation for instances of different domains such that the divergence between domains can be reduced with the new representation. Recently, deep learning has been proposed to learn more robust or higher-level features for transfer learning. In this article, we adapt the autoencoder technique to transfer learning and propose a supervised representation learning method based on double encoding-layer autoencoder. The proposed framework consists of two encoding layers: one for embedding and the other one for label encoding. In the embedding layer, the distribution distance of the embedded instances between the source and target domains is minimized in terms of KL-Divergence. In the label encoding layer, label information of the source domain is encoded using a softmax regression model. Moreover, to empirically explore why the proposed framework can work well for transfer learning, we propose a new effective measure based on autoencoder to compute the distribution distance between different domains. Experimental results show that the proposed new measure can better reflect the degree of transfer difficulty and has stronger correlation with the performance from supervised learning algorithms (e.g., Logistic Regression), compared with previous ones, such as KL-Divergence and Maximum Mean Discrepancy. Therefore, in our model, we have incorporated two distribution distance measures to minimize the difference between source and target domains in the embedding representations. Extensive experiments conducted on three real-world image datasets and one text data demonstrate the effectiveness of our proposed method compared with several state-of-the-art baseline methods.<\/jats:p>","DOI":"10.1145\/3108257","type":"journal-article","created":{"date-parts":[[2017,10,23]],"date-time":"2017-10-23T19:19:16Z","timestamp":1508786356000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":35,"title":["Supervised Representation Learning with Double Encoding-Layer Autoencoder for Transfer Learning"],"prefix":"10.1145","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0520-2619","authenticated-orcid":false,"given":"Fuzhen","family":"Zhuang","sequence":"first","affiliation":[{"name":"Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, and University of Chinese Academy of Sciences, Haidian District, Beijing"}]},{"given":"Xiaohu","family":"Cheng","sequence":"additional","affiliation":[{"name":"Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, and University of Chinese Academy of Sciences, Haidian District, Beijing"}]},{"given":"Ping","family":"Luo","sequence":"additional","affiliation":[{"name":"Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, and University of Chinese Academy of Sciences, Haidian District, Beijing"}]},{"given":"Sinno Jialin","family":"Pan","sequence":"additional","affiliation":[{"name":"Nanyang Technological University, Singapore"}]},{"given":"Qing","family":"He","sequence":"additional","affiliation":[{"name":"Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, and University of Chinese Academy of Sciences, Haidian District, Beijing"}]}],"member":"320","published-online":{"date-parts":[[2017,10,23]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/WIIAT.2008.291"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1561\/2200000006"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/1610075.1610094"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 29th International Conference on Machine Learning.","author":"Chen Minmin","year":"2012","unstructured":"Minmin Chen , Zhixiang Eddie Xu , Kilian Q. Weinberger , and Fei Sha . 2012 . Marginalized denoising autoencoders for domain adaptation . In Proceedings of the 29th International Conference on Machine Learning. Minmin Chen, Zhixiang Eddie Xu, Kilian Q. Weinberger, and Fei Sha. 2012. Marginalized denoising autoencoders for domain adaptation. In Proceedings of the 29th International Conference on Machine Learning."},{"key":"e_1_2_1_5_1","first-page":"1891","article-title":"Confidence-weighted linear classification for text categorization","volume":"13","author":"Crammer Koby","year":"2012","unstructured":"Koby Crammer , Mark Dredze , and Fernando Pereira . 2012 . Confidence-weighted linear classification for text categorization . J. Mach. Learn. Res. 13 , 1 (2012), 1891 -- 1926 . Koby Crammer, Mark Dredze, and Fernando Pereira. 2012. Confidence-weighted linear classification for text categorization. J. Mach. Learn. Res. 13, 1 (2012), 1891--1926.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281218"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273521"},{"key":"e_1_2_1_8_1","first-page":"1","article-title":"Regularization paths for generalized linear models via coordinate descent","volume":"33","author":"Hastie Trevor","year":"2010","unstructured":"Trevor Hastie , Friedman Jerome , and Rob Tibshirani . 2010 . Regularization paths for generalized linear models via coordinate descent . J. Stat. Softw. 33 , 1 (2010), 1 . Trevor Hastie, Friedman Jerome, and Rob Tibshirani. 2010. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1 (2010), 1.","journal-title":"J. Stat. Softw."},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 32nd International Conference on Machine Learning (ICML\u201915)","author":"Ganin Yaroslav","year":"2015","unstructured":"Yaroslav Ganin and Victor Lempitsky . 2015 . Unsupervised domain adaptation by backpropagation . In Proceedings of the 32nd International Conference on Machine Learning (ICML\u201915) . 1180--1189. Yaroslav Ganin and Victor Lempitsky. 2015. Unsupervised domain adaptation by backpropagation. In Proceedings of the 32nd International Conference on Machine Learning (ICML\u201915). 1180--1189."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401928"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the 33rd International Conference on Machine Learning. 2839--2848","author":"Gong Mingming","year":"2016","unstructured":"Mingming Gong , Kun Zhang , Tongliang Liu , Dacheng Tao , Clark Glymour , and Bernhard Sch\u00f6lkopf . 2016 . Domain adaptation with conditional transferable components . In Proceedings of the 33rd International Conference on Machine Learning. 2839--2848 . Mingming Gong, Kun Zhang, Tongliang Liu, Dacheng Tao, Clark Glymour, and Bernhard Sch\u00f6lkopf. 2016. Domain adaptation with conditional transferable components. In Proceedings of the 33rd International Conference on Machine Learning. 2839--2848."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the 2007 Conference of the Association for Computational Linguistics. 264--271","author":"Jiang Jing","year":"2007","unstructured":"Jing Jiang and Chengxiang Zhai . 2007 . Instance weighting for domain adaptation in NLP . In Proceedings of the 2007 Conference of the Association for Computational Linguistics. 264--271 . Jing Jiang and Chengxiang Zhai. 2007. Instance weighting for domain adaptation in NLP. In Proceedings of the 2007 Conference of the Association for Computational Linguistics. 264--271."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 28th AAAI Conference on Artificial Intelligence. 2213--2220","author":"Tsang Ivor","year":"2014","unstructured":"Ivor Tsang , Joey Tianyi Zhou , Sinno Jialin Pan , and Yan Yan . 2014 . Hybrid heterogeneous transfer learning through deep learning . In Proceedings of the 28th AAAI Conference on Artificial Intelligence. 2213--2220 . Ivor Tsang, Joey Tianyi Zhou, Sinno Jialin Pan, and Yan Yan. 2014. Hybrid heterogeneous transfer learning through deep learning. In Proceedings of the 28th AAAI Conference on Artificial Intelligence. 2213--2220."},{"key":"e_1_2_1_14_1","volume-title":"Letter to the editor","author":"Kullback Solomon","year":"1987","unstructured":"Solomon Kullback . 1987. Letter to the editor : The Kullback-Leibler distance ( 1987 ). Solomon Kullback. 1987. Letter to the editor: The Kullback-Leibler distance (1987)."},{"key":"e_1_2_1_15_1","first-page":"79","article-title":"Model selection and multi-model inference","volume":"1","author":"Liddle Andrew R.","year":"2010","unstructured":"Andrew R. Liddle , Pia Mukherjee , and David Parkinson . 2010 . Model selection and multi-model inference . Bayes. Meth. Cosmol. 1 (2010), 79 . Andrew R. Liddle, Pia Mukherjee, and David Parkinson. 2010. Model selection and multi-model inference. Bayes. Meth. Cosmol. 1 (2010), 79.","journal-title":"Bayes. Meth. Cosmol."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2544314"},{"key":"e_1_2_1_17_1","volume-title":"Jordan","author":"Long Mingsheng","year":"2015","unstructured":"Mingsheng Long , Yue Cao , Jianmin Wang , and Michael I . Jordan . 2015 . Learning transferable features with deep adaptation networks. In Proceedings of the International Machine Learning Society (ICML\u2019 15). 97--105. Mingsheng Long, Yue Cao, Jianmin Wang, and Michael I. Jordan. 2015. Learning transferable features with deep adaptation networks. In Proceedings of the International Machine Learning Society (ICML\u201915). 97--105."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2554549"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2332398"},{"key":"e_1_2_1_20_1","volume-title":"Plant leaf classification using probabilistic integration of shape, texture and margin features. Signal Processing, Pattern Recognition and Applications","author":"Mallah Cope","year":"2013","unstructured":"Cope Mallah and Orwell. 2013. Plant leaf classification using probabilistic integration of shape, texture and margin features. Signal Processing, Pattern Recognition and Applications ( 2013 ). Cope Mallah and Orwell. 2013. Plant leaf classification using probabilistic integration of shape, texture and margin features. Signal Processing, Pattern Recognition and Applications (2013)."},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the 23rd AAAI Conference on Artificial Intelligence.","author":"Pan S. J.","unstructured":"S. J. Pan , J. T. Kwok , and Q. Yang . 2008. Transfer learning via dimensionality reduction . In Proceedings of the 23rd AAAI Conference on Artificial Intelligence. S. J. Pan, J. T. Kwok, and Q. Yang. 2008. Transfer learning via dimensionality reduction. In Proceedings of the 23rd AAAI Conference on Artificial Intelligence."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2010.2091281"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_2_1_24_1","doi-asserted-by":"crossref","unstructured":"Christopher Poultney Sumit Chopra Yann L. Cun and others. 2006. Efficient learning of sparse representations with an energy-based model. In Advances in Neural Information Processing Systems. 1137--1144. Christopher Poultney Sumit Chopra Yann L. Cun and others. 2006. Efficient learning of sparse representations with an energy-based model. In Advances in Neural Information Processing Systems. 1137--1144.","DOI":"10.7551\/mitpress\/7503.003.0147"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.126"},{"key":"e_1_2_1_26_1","volume-title":"Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-based Algorithms","author":"Snyman Jan","unstructured":"Jan Snyman . 2005. Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-based Algorithms . Vol. 97 . Springer Science 8 Business Media. Jan Snyman. 2005. Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-based Algorithms. Vol. 97. Springer Science 8 Business Media."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.463"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390294"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/1756006.1953039"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the 28th International Conference on Machine Learning. 513--520","author":"Xavier Antoine","year":"2011","unstructured":"Antoine Xavier and Bengio. 2011 . Domain adaptation for large-scale sentiment classification: A deep learning approach . In Proceedings of the 28th International Conference on Machine Learning. 513--520 . Antoine Xavier and Bengio. 2011. Domain adaptation for large-scale sentiment classification: A deep learning approach. In Proceedings of the 28th International Conference on Machine Learning. 513--520."},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 10th Pacific Asia Knowledge Discovery and Data Mining.","author":"Xing D. K.","unstructured":"D. K. Xing , W. Y. Dai , G. R. Xue , and Y. Yu . 2007. Bridged refinement for transfer learning . In Proceedings of the 10th Pacific Asia Knowledge Discovery and Data Mining. D. K. Xing, W. Y. Dai, G. R. Xue, and Y. Yu. 2007. Bridged refinement for transfer learning. In Proceedings of the 10th Pacific Asia Knowledge Discovery and Data Mining."},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of 24th International Joint Conference on Artificial Intelligence. 4119--4125","author":"Zhuang Fuzhen","year":"2015","unstructured":"Fuzhen Zhuang , Xiaohu Cheng , Ping Luo , Sinno Jialin Pan , and Qing He . 2015 . Supervised representation learning: Transfer learning with deep autoencoders . In Proceedings of 24th International Joint Conference on Artificial Intelligence. 4119--4125 . Fuzhen Zhuang, Xiaohu Cheng, Ping Luo, Sinno Jialin Pan, and Qing He. 2015. Supervised representation learning: Transfer learning with deep autoencoders. In Proceedings of 24th International Joint Conference on Artificial Intelligence. 4119--4125."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-44845-8_27"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.205"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3108257","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3108257","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:13:43Z","timestamp":1750212823000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3108257"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,23]]},"references-count":34,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,3,31]]}},"alternative-id":["10.1145\/3108257"],"URL":"https:\/\/doi.org\/10.1145\/3108257","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,10,23]]},"assertion":[{"value":"2017-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-10-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}