{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:21:32Z","timestamp":1750220492129,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":24,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,24]],"date-time":"2021-08-24T00:00:00Z","timestamp":1629763200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,24]]},"DOI":"10.1145\/3460426.3463590","type":"proceedings-article","created":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:50:29Z","timestamp":1630536629000},"page":"442-446","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Semi-supervised Many-to-many Music Timbre Transfer"],"prefix":"10.1145","author":[{"given":"Yu-Chen","family":"Chang","sequence":"first","affiliation":[{"name":"National Cheng Kung University, Tainan, Taiwan Roc"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wen-Cheng","family":"Chen","sequence":"additional","affiliation":[{"name":"National Cheng Kung University, Tainan, Taiwan Roc"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min-Chun","family":"Hu","sequence":"additional","affiliation":[{"name":"National Tsing Hua University, Hsinchu, Taiwan Roc"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,9]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Feross Aboukhadijeh. [n.d.]. Bitmidi.com. https:\/\/github.com\/feross\/bitmidi.com ( [n. d.]).  Feross Aboukhadijeh. [n.d.]. Bitmidi.com. https:\/\/github.com\/feross\/bitmidi.com ( [n. d.])."},{"key":"e_1_3_2_1_2_1","volume-title":"Modulated Variational auto-Encoders for many-to-many musical timbre transfer. arXiv preprint arXiv:1810.00222","author":"Bitton Adrien","year":"2018","unstructured":"Adrien Bitton , Philippe Esling , and Axel Chemla-Romeu-Santos . 2018. Modulated Variational auto-Encoders for many-to-many musical timbre transfer. arXiv preprint arXiv:1810.00222 ( 2018 ). Adrien Bitton, Philippe Esling, and Axel Chemla-Romeu-Santos. 2018. Modulated Variational auto-Encoders for many-to-many musical timbre transfer. arXiv preprint arXiv:1810.00222 (2018)."},{"volume-title":"One-shot voice conversion by separating speaker and content representations with instance normalization. arXiv preprint arXiv:1904.05742","year":"2019","key":"e_1_3_2_1_3_1","unstructured":"Ju-chieh Chou, Cheng-chieh Yeh, and Hung-yi Lee. 2019. One-shot voice conversion by separating speaker and content representations with instance normalization. arXiv preprint arXiv:1904.05742 ( 2019 ). Ju-chieh Chou, Cheng-chieh Yeh, and Hung-yi Lee. 2019. One-shot voice conversion by separating speaker and content representations with instance normalization. arXiv preprint arXiv:1904.05742 (2019)."},{"volume-title":"Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations. arXiv preprint arXiv:1804.02812","year":"2018","key":"e_1_3_2_1_4_1","unstructured":"Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, and Lin-shan Lee. 2018. Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations. arXiv preprint arXiv:1804.02812 ( 2018 ). Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, and Lin-shan Lee. 2018. Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations. arXiv preprint arXiv:1804.02812 (2018)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.265"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1984.1164317"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24261-3_7"},{"key":"e_1_3_2_1_8_1","volume-title":"Timbretron: A wavenet (cyclegan (cqt (audio))) pipeline for musical timbre transfer. arXiv preprint arXiv:1811.09620","author":"Huang Sicong","year":"2018","unstructured":"Sicong Huang , Qiyang Li , Cem Anil , Xuchan Bao , Sageev Oore , and Roger B Grosse . 2018 a. Timbretron: A wavenet (cyclegan (cqt (audio))) pipeline for musical timbre transfer. arXiv preprint arXiv:1811.09620 (2018). Sicong Huang, Qiyang Li, Cem Anil, Xuchan Bao, Sageev Oore, and Roger B Grosse. 2018a. Timbretron: A wavenet (cyclegan (cqt (audio))) pipeline for musical timbre transfer. arXiv preprint arXiv:1811.09620 (2018)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.167"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01219-9_11"},{"key":"e_1_3_2_1_11_1","volume-title":"Frame-level instrument recognition by timbre and pitch. arXiv preprint arXiv:1806.09587","author":"Hung Yun-Ning","year":"2018","unstructured":"Yun-Ning Hung and Yi-Hsuan Yang . 2018. Frame-level instrument recognition by timbre and pitch. arXiv preprint arXiv:1806.09587 ( 2018 ). Yun-Ning Hung and Yi-Hsuan Yang. 2018. Frame-level instrument recognition by timbre and pitch. arXiv preprint arXiv:1806.09587 (2018)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.632"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.wocn.2018.07.001"},{"key":"e_1_3_2_1_14_1","volume-title":"Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114","author":"Kingma Diederik P","year":"2013","unstructured":"Diederik P Kingma and Max Welling . 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 ( 2013 ). Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)."},{"key":"e_1_3_2_1_15_1","unstructured":"Hyungui Lim and Jeongsoo Park. [n.d.]. Rare sound event detection using 1D convolutional recurrent neural networks.  Hyungui Lim and Jeongsoo Park. [n.d.]. Rare sound event detection using 1D convolutional recurrent neural networks."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33011061"},{"key":"e_1_3_2_1_17_1","volume-title":"URL https:\/\/github.com\/hmartelb\/Pix2Pix-Timbre-Transfer","author":"Martel H\u00e9ctor","year":"2019","unstructured":"H\u00e9ctor Martel . 2019. Pix2Pix-Timbre-Transfer. URL https:\/\/github.com\/hmartelb\/Pix2Pix-Timbre-Transfer ( 2019 ). H\u00e9ctor Martel. 2019. Pix2Pix-Timbre-Transfer. URL https:\/\/github.com\/hmartelb\/Pix2Pix-Timbre-Transfer (2019)."},{"key":"e_1_3_2_1_18_1","volume-title":"A universal music translation network. arXiv preprint arXiv:1805.07848","author":"Mor Noam","year":"2018","unstructured":"Noam Mor , Lior Wolf , Adam Polyak , and Yaniv Taigman . 2018. A universal music translation network. arXiv preprint arXiv:1805.07848 ( 2018 ). Noam Mor, Lior Wolf, Adam Polyak, and Yaniv Taigman. 2018. A universal music translation network. arXiv preprint arXiv:1805.07848 (2018)."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11671"},{"key":"e_1_3_2_1_20_1","volume-title":"Autovc: Zero-shot voice style transfer with only autoencoder loss. arXiv preprint arXiv:1905.05879","author":"Qian Kaizhi","year":"2019","unstructured":"Kaizhi Qian , Yang Zhang , Shiyu Chang , Xuesong Yang , and Mark Hasegawa-Johnson . 2019 . Autovc: Zero-shot voice style transfer with only autoencoder loss. arXiv preprint arXiv:1905.05879 (2019). Kaizhi Qian, Yang Zhang, Shiyu Chang, Xuesong Yang, and Mark Hasegawa-Johnson. 2019. Autovc: Zero-shot voice style transfer with only autoencoder loss. arXiv preprint arXiv:1905.05879 (2019)."},{"volume-title":"Theory and applications of digital speech processing","author":"Rabiner Lawrence","key":"e_1_3_2_1_21_1","unstructured":"Lawrence Rabiner and Ronald Schafer . 2010. Theory and applications of digital speech processing . Prentice Hall Press . Lawrence Rabiner and Ronald Schafer. 2010. Theory and applications of digital speech processing .Prentice Hall Press."},{"key":"e_1_3_2_1_22_1","volume-title":"In Proceedings of the 15th International Society for Music Information Retrieval Conference, ISMIR . Citeseer.","author":"Raffel Colin","year":"2014","unstructured":"Colin Raffel , Brian McFee , Eric J Humphrey , Justin Salamon , Oriol Nieto , Dawen Liang , Daniel PW Ellis , and C Colin Raffel . 2014 . mir_eval: A transparent implementation of common MIR metrics . In In Proceedings of the 15th International Society for Music Information Retrieval Conference, ISMIR . Citeseer. Colin Raffel, Brian McFee, Eric J Humphrey, Justin Salamon, Oriol Nieto, Dawen Liang, Daniel PW Ellis, and C Colin Raffel. 2014. mir_eval: A transparent implementation of common MIR metrics. In In Proceedings of the 15th International Society for Music Information Retrieval Conference, ISMIR . Citeseer."},{"key":"e_1_3_2_1_23_1","volume-title":"Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022","author":"Ulyanov Dmitry","year":"2016","unstructured":"Dmitry Ulyanov , Andrea Vedaldi , and Victor Lempitsky . 2016. Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022 ( 2016 ). Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky. 2016. Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022 (2016)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"}],"event":{"name":"ICMR '21: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Taipei Taiwan","acronym":"ICMR '21"},"container-title":["Proceedings of the 2021 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463590","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3460426.3463590","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:49:22Z","timestamp":1750193362000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3460426.3463590"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,24]]},"references-count":24,"alternative-id":["10.1145\/3460426.3463590","10.1145\/3460426"],"URL":"https:\/\/doi.org\/10.1145\/3460426.3463590","relation":{},"subject":[],"published":{"date-parts":[[2021,8,24]]},"assertion":[{"value":"2021-09-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}