{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T20:20:11Z","timestamp":1780431611086,"version":"3.54.1"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"1s","license":[{"start":{"date-parts":[[2023,1,23]],"date-time":"2023-01-23T00:00:00Z","timestamp":1674432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Italian MIUR within PRIN 2017","award":["20172BH297"],"award-info":[{"award-number":["20172BH297"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2023,2,28]]},"abstract":"<jats:p>Online stores have become fundamental for the fashion industry, revolving around recommendation systems to suggest appropriate items to customers. Such recommendations often suffer from a lack of diversity and propose items that are similar to previous purchases of a user. Recently, a novel kind of approach based on Memory Augmented Neural Networks (MANNs) has been proposed, aimed at recommending a variety of garments to create an outfit by complementing a given fashion item. In this article we address the task of compatible garment recommendation developing a MANN architecture by taking into account the co-occurrence of clothing attributes, such as shape and color, to compose an outfit. To this end we obtain disentangled representations of fashion items and store them in external memory modules, used to guide recommendations at inference time. We show that our disentangled representations are able to achieve significantly better performance compared to the state of the art and also provide interpretable latent spaces, giving a qualitative explanation of the recommendations.<\/jats:p>","DOI":"10.1145\/3531017","type":"journal-article","created":{"date-parts":[[2022,4,20]],"date-time":"2022-04-20T12:11:28Z","timestamp":1650456688000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":48,"title":["Disentangling Features for Fashion Recommendation"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2485-8218","authenticated-orcid":false,"given":"Lavinia","family":"De Divitiis","sequence":"first","affiliation":[{"name":"University of Florence, Florence, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2537-2700","authenticated-orcid":false,"given":"Federico","family":"Becattini","sequence":"additional","affiliation":[{"name":"University of Florence, Florence, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8294-4539","authenticated-orcid":false,"given":"Claudio","family":"Baecchi","sequence":"additional","affiliation":[{"name":"University of Florence, Florence, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1052-8322","authenticated-orcid":false,"given":"Alberto","family":"Del Bimbo","sequence":"additional","affiliation":[{"name":"University of Florence, Florence, Italy"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,1,23]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","unstructured":"Vassileios Balntas Edgar Riba Daniel Ponsa and Krystian Mikolajczyk. 2016. Learning local feature descriptors with triplets and shallow convolutional neural networks. In Proceedings of the British Machine Vision Conference 2016 (BMVC\u201916) Vol. 1 3.","DOI":"10.5244\/C.30.119"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/3469877.3490621"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413530"},{"key":"e_1_3_2_5_2","article-title":"Fashion meets computer vision: A survey","author":"Cheng Wen-Huang","year":"2020","unstructured":"Wen-Huang Cheng, Sijie Song, Chieh-Yun Chen, Shintami Chusnul Hidayati, and Jiaying Liu. 2020. Fashion meets computer vision: A survey. arXiv preprint arXiv:2003.13988 (2020).","journal-title":"arXiv preprint arXiv:2003.13988"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01290"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/CBMI50038.2021.9461912"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-68790-8_23"},{"key":"e_1_3_2_9_2","article-title":"Interpretable partitioned embedding for customized fashion outfit composition","author":"Feng Zunlei","year":"2018","unstructured":"Zunlei Feng, Zhenyun Yu, Yezhou Yang, Yongcheng Jing, Junxiao Jiang, and Mingli Song. 2018. Interpretable partitioned embedding for customized fashion outfit composition. arXiv preprint arXiv:1806.04845 (2018).","journal-title":"arXiv preprint arXiv:1806.04845"},{"key":"e_1_3_2_10_2","unstructured":"Guangyu Gao Liling Liu Li Wang and Yihang Zhang. 2019. Fashion clothes matching scheme based on siamese network and autoencoder. In Proceedings of Multimedia Systems (MMSys\u201919) ."},{"key":"e_1_3_2_11_2","article-title":"Neural Turing machines","author":"Graves Alex","year":"2014","unstructured":"Alex Graves, Greg Wayne, and Ivo Danihelka. 2014. Neural Turing machines. arXiv preprint arXiv:1410.5401 (2014).","journal-title":"arXiv preprint arXiv:1410.5401"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123394"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v30i1.9973"},{"key":"e_1_3_2_14_2","article-title":"An LSTM-based dynamic customer model for fashion recommendation","author":"Heinz Sebastian","year":"2017","unstructured":"Sebastian Heinz, Christian Bracher, and Roland Vollgraf. 2017. An LSTM-based dynamic customer model for fashion recommendation. arXiv preprint arXiv:1708.07347 (2017).","journal-title":"arXiv preprint arXiv:1708.07347"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01193"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806239"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/2623330.2623332"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2019.05.081"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01081"},{"key":"e_1_3_2_20_2","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Maaten Laurens van der","year":"2008","unstructured":"Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, (November2008), 2579\u20132605.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00717"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2020.3008558"},{"key":"e_1_3_2_23_2","article-title":"SMEMO: Social memory for trajectory forecasting","author":"Marchetti Francesco","year":"2022","unstructured":"Francesco Marchetti, Federico Becattini, Lorenzo Seidenari, and Alberto Del Bimbo. 2022. SMEMO: Social memory for trajectory forecasting. arXiv preprint arXiv:2203.12446 (2022).","journal-title":"arXiv preprint arXiv:2203.12446"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/2766462.2767755"},{"key":"e_1_3_2_25_2","first-page":"197","article-title":"Self-supervised on-line cumulative learning from video streams","author":"Pernici Federico","year":"2020","unstructured":"Federico Pernici, Matteo Bruni, and Alberto Del Bimbo. 2020. Self-supervised on-line cumulative learning from video streams. Computer Vision and Image Understanding (2020), 197\u2013198.","journal-title":"Computer Vision and Image Understanding"},{"key":"e_1_3_2_26_2","article-title":"BPR: Bayesian personalized ranking from implicit feedback","author":"Rendle Steffen","year":"2012","unstructured":"Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2012. BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618 (2012).","journal-title":"arXiv preprint arXiv:1205.2618"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM50055.2020.00039"},{"key":"e_1_3_2_28_2","doi-asserted-by":"crossref","unstructured":"Xuemeng Song Fuli Feng Xianjing Han Xin Yang Wei Liu and Liqiang Nie. 2018. Neural compatibility modeling with attentive knowledge distillation. In Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR\u201918) . 5\u201314.","DOI":"10.1145\/3209978.3209996"},{"key":"e_1_3_2_29_2","doi-asserted-by":"crossref","unstructured":"Xuemeng Song Fuli Feng Jinhuan Liu Zekun Li Liqiang Nie and Jun Ma. 2017. Neurostylist: Neural compatibility modeling for clothing matching. In Proceedings of the 25th ACM international conference on Multimedia (ACMMM\u201917) . 753\u2013761.","DOI":"10.1145\/3123266.3123314"},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","unstructured":"Xuemeng Song Xianjing Han Yunkai Li Jingyuan Chen Xin-Shun Xu and Liqiang Nie. 2019. GP-BPR: Personalized compatibility modeling for clothing matching. In Proceedings of the 27th ACM International Conference on Multimedia (ACM MM\u201919) . 320\u2013328.","DOI":"10.1145\/3343031.3350956"},{"key":"e_1_3_2_31_2","unstructured":"Sainbayar Sukhbaatar Jason Weston Rob Fergus et\u00a0al. 2015. End-to-end memory networks. In Proceedings of Advances in neural information processing systems (NIPS\u201915) . 2440\u20132448."},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01270-0_24"},{"key":"e_1_3_2_33_2","doi-asserted-by":"crossref","unstructured":"Andreas Veit Balazs Kovacs Sean Bell Julian McAuley Kavita Bala and Serge Belongie. 2015. Learning visual clothing style with heterogeneous dyadic co-occurrences. In Proceedings of the IEEE International Conference on Computer Vision (CVPR\u201915) . 4642\u20134650.","DOI":"10.1109\/ICCV.2015.527"},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","unstructured":"Xin Wang Bo Wu and Yueqi Zhong. 2019. Outfit compatibility prediction and diagnosis with multi-layered comparison network. In Proceedings of the 27th ACM International Conference on Multimedia (ACMMM\u201919) . 329\u2013337.","DOI":"10.1145\/3343031.3350909"},{"key":"e_1_3_2_35_2","article-title":"Visually-aware fashion recommendation and design with generative image models","volume":"1711","author":"Kang Chen Fang Wang-Cheng","year":"2017","unstructured":"Chen Fang Wang-Cheng Kang, Zhaowen Wang, and Julian J. McAuley. 2017. Visually-aware fashion recommendation and design with generative image models. CoRR abs\/1711.02231 (2017). arXiv:1711.02231http:\/\/arxiv.org\/abs\/1711.02231.","journal-title":"CoRR"},{"key":"e_1_3_2_36_2","article-title":"Memory networks","author":"Weston Jason","year":"2014","unstructured":"Jason Weston, Sumit Chopra, and Antoine Bordes. 2014. Memory networks. arXiv preprint arXiv:1410.3916 (2014).","journal-title":"arXiv preprint arXiv:1410.3916"},{"key":"e_1_3_2_37_2","doi-asserted-by":"crossref","unstructured":"Wenhui Yu Huidi Zhang Xiangnan He Xu Chen Li Xiong and Zheng Qin. 2018. Aesthetic-based clothing recommendation. In Proceedings of the 2018 World Wide Web Conference (WebConf\u201918) . 649\u2013658.","DOI":"10.1145\/3178876.3186146"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1080\/00405000.2019.1694351"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3531017","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3531017","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:26Z","timestamp":1750186826000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3531017"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,23]]},"references-count":37,"journal-issue":{"issue":"1s","published-print":{"date-parts":[[2023,2,28]]}},"alternative-id":["10.1145\/3531017"],"URL":"https:\/\/doi.org\/10.1145\/3531017","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,23]]},"assertion":[{"value":"2022-01-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-04-10","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}