{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,29]],"date-time":"2025-11-29T16:19:24Z","timestamp":1764433164417,"version":"3.37.3"},"reference-count":8,"publisher":"Wiley","license":[{"start":{"date-parts":[[2018,7,17]],"date-time":"2018-07-17T00:00:00Z","timestamp":1531785600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61631016","3132017XNG1750","2018XNG1857"],"award-info":[{"award-number":["61631016","3132017XNG1750","2018XNG1857"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Cross Project \u201cResearch on 3D Audio Space and Panoramic Interaction Based on VR\u201d","award":["61631016","3132017XNG1750","2018XNG1857"],"award-info":[{"award-number":["61631016","3132017XNG1750","2018XNG1857"]}]},{"name":"School Project Funding","award":["61631016","3132017XNG1750","2018XNG1857"],"award-info":[{"award-number":["61631016","3132017XNG1750","2018XNG1857"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Advances in Multimedia"],"published-print":{"date-parts":[[2018,7,17]]},"abstract":"<jats:p>Speech synthesis is an important research content in the field of human-computer interaction and has a wide range of applications. As one of its branches, singing synthesis plays an important role. Beijing Opera is a famous traditional Chinese opera, and it is called Chinese quintessence. The singing of Beijing Opera carries some features of speech but it has its own unique pronunciation rules and rhythms which differ from ordinary speech and singing. In this paper, we propose three models for the synthesis of Beijing Opera. Firstly, the speech signals of the source speaker and the target speaker are extracted by using the straight algorithm. And then through the training of GMM, we complete the voice control model to input the voice to be converted and output the voice after the voice conversion. Finally, by modeling the fundamental frequency, duration, and frequency separately, a melodic control model is constructed using GAN to realize the synthesis of the Beijing Opera fragment. We connect the fragments and superimpose the background music to achieve the synthesis of Beijing Opera. The experimental results show that the synthesized Beijing Opera has some audibility and can basically complete the composition of Beijing Opera. We also extend our models to human-AI cooperative music generation: given a target voice of human, we can generate a Beijing Opera which is sung by a new target voice.<\/jats:p>","DOI":"10.1155\/2018\/5158164","type":"journal-article","created":{"date-parts":[[2018,7,18]],"date-time":"2018-07-18T11:43:51Z","timestamp":1531914231000},"page":"1-14","source":"Crossref","is-referenced-by-count":3,"title":["Beijing Opera Synthesis Based on Straight Algorithm and Deep Learning"],"prefix":"10.1155","volume":"2018","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7731-1120","authenticated-orcid":true,"given":"XueTing","family":"Wang","sequence":"first","affiliation":[{"name":"College of Science and Technology, Communication University of China, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0464-9862","authenticated-orcid":true,"given":"Cong","family":"Jin","sequence":"additional","affiliation":[{"name":"Key Laboratory of Media Audio & Video, Communication University of China, Beijing, China"}]},{"given":"Wei","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Science and Technology, Communication University of China, Beijing, China"}]}],"member":"311","reference":[{"key":"1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2007.323274"},{"issue":"4","key":"2","first-page":"63","volume":"18","year":"2013","journal-title":"Computational Linguistics and Chinese Language Processing"},{"year":"2014","key":"3"},{"volume":"9","year":"2017","key":"6"},{"key":"7","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2007.323266"},{"key":"10","doi-asserted-by":"publisher","DOI":"10.1121\/1.1458024"},{"volume":"39","journal-title":"Journal of Communications","year":"2018","key":"12"},{"year":"2016","key":"17"}],"container-title":["Advances in Multimedia"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/am\/2018\/5158164.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/am\/2018\/5158164.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/am\/2018\/5158164.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2018,7,18]],"date-time":"2018-07-18T11:44:00Z","timestamp":1531914240000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/am\/2018\/5158164\/"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,17]]},"references-count":8,"alternative-id":["5158164","5158164"],"URL":"https:\/\/doi.org\/10.1155\/2018\/5158164","relation":{},"ISSN":["1687-5680","1687-5699"],"issn-type":[{"type":"print","value":"1687-5680"},{"type":"electronic","value":"1687-5699"}],"subject":[],"published":{"date-parts":[[2018,7,17]]}}}