{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T07:24:42Z","timestamp":1767597882081,"version":"build-2065373602"},"reference-count":22,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2020,10,3]],"date-time":"2020-10-03T00:00:00Z","timestamp":1601683200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>In this paper, we study how to extract visual concepts to understand landscape scenicness. Using visual feature representations from a Convolutional Neural Network (CNN), we learn a number of Concept Activation Vectors (CAV) aligned with semantic concepts from ancillary datasets. These concepts represent objects, attributes or scene categories that describe outdoor images. We then use these CAVs to study their impact on the (crowdsourced) perception of beauty of landscapes in the United Kingdom. Finally, we deploy a technique to explore new concepts beyond those initially available in the ancillary dataset: Using a semi-supervised manifold alignment technique, we align the CNN image representation to a large set of word embeddings, therefore giving access to entire dictionaries of concepts. This allows us to obtain a list of new concept candidates to improve our understanding of the elements that contribute the most to the perception of scenicness. We do this without the need for any additional data by leveraging the commonalities in the visual and word vector spaces. Our results suggest that new and potentially useful concepts can be discovered by leveraging neighbourhood structures in the word vector spaces.<\/jats:p>","DOI":"10.3390\/make2040022","type":"journal-article","created":{"date-parts":[[2020,10,3]],"date-time":"2020-10-03T07:22:16Z","timestamp":1601709736000},"page":"397-413","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Concept Discovery for The Interpretation of Landscape Scenicness"],"prefix":"10.3390","volume":"2","author":[{"given":"Pim","family":"Arendsen","sequence":"first","affiliation":[{"name":"Laboratory of Geo-information Science and Remote Sensing, Wageningen University, 6708 PB Wageningen, The Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego","family":"Marcos","sequence":"additional","affiliation":[{"name":"Laboratory of Geo-information Science and Remote Sensing, Wageningen University, 6708 PB Wageningen, The Netherlands"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Devis","family":"Tuia","sequence":"additional","affiliation":[{"name":"Laboratory of Geo-information Science and Remote Sensing, Wageningen University, 6708 PB Wageningen, The Netherlands"},{"name":"Environmental Computational Science and Earth Observation Laboratory, Ecole Polytechnique F\u00e9d\u00e9rale de Lausanne, 1950 Sion, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,10,3]]},"reference":[{"key":"ref_1","unstructured":"(2020, September 14). ScenicOrNot. Available online: http:\/\/scenicornot.datasciencelab.co.uk\/."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Workman, S., Souvenir, R., and Jacobs, N. (2017, January 22\u201329). Understanding and Mapping Natural Beauty. Proceedings of the IEEE International Conference on Computer, Venice, Italy.","DOI":"10.1109\/ICCV.2017.596"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"170170","DOI":"10.1098\/rsos.170170","article-title":"Using deep learning to quantify the beauty of outdoor places","volume":"4","author":"Seresinhe","year":"2017","journal-title":"R. Soc. Open Sci."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Marcos, D., Lobry, S., and Tuia, D. (2019, January 27\u201328). Semantically Interpretable Activation Maps: What-where-how explanations within CNNs. Proceedings of the International Conference on Computer Vision Workshops (ICCVw), Seoul, Korea.","DOI":"10.1109\/ICCVW.2019.00518"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Collins, E., Achanta, R., and Susstrunk, S. (2018, January 8\u201314). Deep feature factorization for concept discovery. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_21"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Wigness, M., Draper, B.A., and Beveridge, J.R. (2014, January 24\u201326). Selectively guiding visual concept discovery. Proceedings of the Winter Conference on Applications of Computer Vision, Steamboat Springs, CO, USA.","DOI":"10.1109\/WACV.2014.6836093"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Sun, C., Gan, C., and Nevatia, R. (2015, January 7\u201313). Automatic concept discovery from parallel text and visual corpora. Proceedings of the International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.298"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Ma, Y., and Fu, Y. (2011). Manifold alignment. Manifold Learning: Theory and Applications, CRC Press.","DOI":"10.1201\/b11431"},{"key":"ref_9","unstructured":"Kim, B., Wattenberg, M., Gilmer, J., Cai, C., Wexler, J., Viegas, F., and Sayres, R. (2018, January 10\u201315). Interpretability beyond feature attribution: Quantitative Testing with Concept Activation Vectors. Proceedings of the International Conference on Machine Learning, Stockholm, Sweden."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1093\/biomet\/30.1-2.81","article-title":"A new measure of rank correlation","volume":"30","author":"Kendall","year":"1938","journal-title":"Biometrika"},{"key":"ref_12","unstructured":"Wang, C., and Mahadevan, S. (2011, January 16\u201322). Heterogeneous domain adaptation using manifold alignment. Proceedings of the International Joint Conference on Artificial Intelligence, Barcelona, Spain."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Tuia, D., and Camps-Valls, G. (2016). Kernel Manifold Alignment for Domain Adaptation. PLoS ONE, 11.","DOI":"10.1371\/journal.pone.0148655"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"7708","DOI":"10.1109\/TGRS.2014.2317499","article-title":"Semisupervised Manifold Alignment of Multimodal Remote Sensing Images","volume":"52","author":"Tuia","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","unstructured":"(2020, September 14). Geograph Britain and Ireland. Available online: http:\/\/www.geograph.org.uk."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Bau, D., Zhou, B., Khosla, A., Oliva, A., and Torralba, A. (2017, January 21\u201326). Network dissection: Quantifying interpretability of deep visual representations. Proceedings of the Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.354"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhou, B., Zhao, H., Puig, X., Fidler, S., Barriuso, A., and Torralba, A. (2017, January 21\u201326). Scene Parsing through ADE20K Dataset. Proceedings of the Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.544"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2601097.2601206","article-title":"Intrinsic Images in the Wild","volume":"33","author":"Bell","year":"2014","journal-title":"ACM Trans. Graph."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Mottaghi, R., Chen, X., Liu, X., Cho, N.G., Lee, S.W., Fidler, S., Urtasun, R., and Yuille, A. (2014, January 23\u201328). The role of context for object detection and semantic segmentation in the wild. Proceedings of the Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.119"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Chen, X., Mottaghi, R., Liu, X., Fidler, S., Urtasun, R., and Yuille, A. (2014, January 23\u201328). Detect what you can: Detecting and representing objects using holistic models and body parts. Proceedings of the Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.254"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., and Vedaldi, A. (2014, January 23\u201328). Describing Textures in the Wild. Proceedings of the Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.461"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C.D. (2014, January 25\u201329). GloVe: Global Vectors for Word Representation. Proceedings of the Empirical Methods in Natural Language Processing, Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/2\/4\/22\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:16:14Z","timestamp":1760177774000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/2\/4\/22"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,3]]},"references-count":22,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2020,12]]}},"alternative-id":["make2040022"],"URL":"https:\/\/doi.org\/10.3390\/make2040022","relation":{},"ISSN":["2504-4990"],"issn-type":[{"type":"electronic","value":"2504-4990"}],"subject":[],"published":{"date-parts":[[2020,10,3]]}}}