{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,3,29]],"date-time":"2022-03-29T12:45:42Z","timestamp":1648557942994},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2014,3,24]],"date-time":"2014-03-24T00:00:00Z","timestamp":1395619200000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"published-print":{"date-parts":[[2014,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Three-dimensional (3D) audio technologies are booming with the success of 3D video technology. The surge in audio channels makes its huge data unacceptable for transmitting bandwidth and storage media, and the signal compression algorithm for 3D audio systems becomes an important task. This paper investigates the conventional mid\/side (M\/S) coding method and discusses the signal correlation property of three-dimensional multichannel systems. Then based on the channel triple, a three-channel dependent M\/S coding (3D-M\/S) method is proposed to reduce interchannel redundancy and corresponding transform matrices are presented. Furthermore, a framework is proposed to enable 3D-M\/S compress any number of audio channels. Finally, the masking threshold of the perceptual audio core codec is modified, which guarantees the final coding noise to meet the perceptual threshold constraint of the original channel signals. Objective and subjective tests with panning signals indicate an increase in coding efficiency compared to Independent channel coding and a moderate complexity increase compared to a PCA method.<\/jats:p>","DOI":"10.1186\/1687-4722-2014-10","type":"journal-article","created":{"date-parts":[[2014,3,25]],"date-time":"2014-03-25T04:26:54Z","timestamp":1395721614000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Expanded three-channel mid\/side coding for three-dimensional multichannel audio systems"],"prefix":"10.1186","volume":"2014","author":[{"given":"Shi","family":"Dong","sequence":"first","affiliation":[]},{"given":"Ruimin","family":"Hu","sequence":"additional","affiliation":[]},{"given":"Xiaochen","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Yuhong","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Weiping","family":"Tu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2014,3,24]]},"reference":[{"issue":"5","key":"105_CR1","doi-asserted-by":"publisher","first-page":"2764","DOI":"10.1121\/1.405852","volume":"93","author":"AJ Berkhout","year":"1993","unstructured":"Berkhout AJ, de Vries D, Vogel P: Acoustic control by wave field synthesis. J. Acoust. Soc. Am 1993, 93(5):2764-2778. 10.1121\/1.405852","journal-title":"J. Acoust. Soc. Am"},{"issue":"11","key":"105_CR2","first-page":"859","volume":"33","author":"MA Gerzon","year":"1985","unstructured":"Gerzon MA: Ambisonics in multichannel broadcasting and video. J. Audio Eng. Soc 1985, 33(11):859-871.","journal-title":"J. Audio Eng. Soc"},{"key":"105_CR3","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1109\/MSP.2010.939040","volume":"28","author":"J Cooperstock","year":"2011","unstructured":"Cooperstock J: Multimodal telepresence systems. IEEE Signal Process. Mag 2011, 28: 77-86.","journal-title":"IEEE Signal Process. Mag"},{"issue":"4","key":"105_CR4","first-page":"329","volume":"53","author":"A Staff","year":"2005","unstructured":"Staff A: Multichannel audio systems and techniques. J. Audio Eng. Soc 2005, 53(4):329-335.","journal-title":"J. Audio Eng. Soc"},{"issue":"5","key":"105_CR5","first-page":"340","volume":"61","author":"F Rumsey","year":"2013","unstructured":"Rumsey F: Cinema sound for the 3-D era. J. Audio Eng. Soc 2013, 61(5):340-344.","journal-title":"J. Audio Eng. Soc"},{"key":"105_CR6","volume-title":"Birds on the wire - WFS live transmission project report","author":"J Nettingsmeier","year":"2008","unstructured":"Nettingsmeier J: Birds on the wire - WFS live transmission project report. Tech. rep., Fraunhofer 2008"},{"key":"105_CR7","first-page":"I","volume-title":"IEEE International Conference on Image Processing, 2007. ICIP 2007, Volume 1","author":"S Sakaida","year":"2007","unstructured":"Sakaida S, Iguchi K, Nakajima N, Nishida Y, Ichigaya A, Nakasu E, Kurozumi M, Gohshi S: The super hi-vision codec. IEEE International Conference on Image Processing, 2007. ICIP 2007, Volume 1 2007, I-21\u2013I-24."},{"issue":"6","key":"105_CR8","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1109\/TSA.2003.818109","volume":"11","author":"F Baumgarte","year":"2003","unstructured":"Baumgarte F, Faller C: Binaural cue coding-part I: psychoacoustic fundamentals and design principles. IEEE Trans. Speech Audio Process 2003, 11(6):509-519. 10.1109\/TSA.2003.818109","journal-title":"IEEE Trans. Speech Audio Process"},{"issue":"6","key":"105_CR9","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1109\/TSA.2003.818108","volume":"11","author":"C Faller","year":"2003","unstructured":"Faller C, Baumgarte F: Binaural cue coding-part II: schemes and applications. IEEE Trans. Speech Audio Process 2003, 11(6):520-531. 10.1109\/TSA.2003.818108","journal-title":"IEEE Trans. Speech Audio Process"},{"key":"105_CR10","volume-title":"Audio Engineering Society Convention 114","author":"W Oomen","year":"2003","unstructured":"Oomen W, Schuijers E, Brinker den B, Breebaart J: Advances in parametric coding for high-quality audio. Audio Engineering Society Convention 114 2003."},{"issue":"9","key":"105_CR11","doi-asserted-by":"crossref","first-page":"561917","DOI":"10.1155\/ASP.2005.1305","volume":"2005","author":"J Breebaart","year":"2005","unstructured":"Breebaart J, van de Par S, Kohlrausch A, Schuijers E: Parametric coding of stereo audio. EURASIP J. Adv. Sig. Pr 2005, 2005(9):561917.","journal-title":"EURASIP J. Adv. Sig. Pr"},{"key":"105_CR12","doi-asserted-by":"publisher","first-page":"1894","DOI":"10.1109\/ICME.2007.4285045","volume-title":"2007 IEEE International Conference on Multimedia and Expo","author":"J Herre","year":"2007","unstructured":"Herre J, Disch S: New concepts in parametric coding of spatial audio: from SAC to SAOC. 2007 IEEE International Conference on Multimedia and Expo 2007, 1894-1897."},{"key":"105_CR13","first-page":"I","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007, Volume 1","author":"M Goodwin","year":"2007","unstructured":"Goodwin M, Jot J: Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007, Volume 1 2007, I-9\u2013I-12."},{"key":"105_CR14","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1109\/ICASSP.2008.4517623","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008","author":"B Cheng","year":"2008","unstructured":"Cheng B, Ritz C, Burnett I: A spatial squeezing approach to ambisonic audio compression. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008 2008, 369-372."},{"key":"105_CR15","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1109\/ICASSP.2009.4959572","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing, 2009. ICASSP 2009","author":"E Hellerud","year":"2009","unstructured":"Hellerud E, Solvang A, Svensson U: Spatial redundancy in Higher Order Ambisonics and its use for lowdelay lossless compression. IEEE International Conference on Acoustics, Speech and Signal Processing, 2009. ICASSP 2009 2009, 269-272."},{"issue":"8","key":"105_CR16","doi-asserted-by":"publisher","first-page":"1483","DOI":"10.1109\/TASL.2009.2021716","volume":"17","author":"C Tzagkarakis","year":"2009","unstructured":"Tzagkarakis C, Mouchtaris A, Tsakalides P: A multichannel sinusoidal model applied to spot microphone signals for immersive audio. IEEE Trans. Audio Speech Lang. Process 2009, 17(8):1483-1497.","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"key":"105_CR17","doi-asserted-by":"publisher","first-page":"365","DOI":"10.1109\/ICASSP.2008.4517622","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008","author":"F Pinto","year":"2008","unstructured":"Pinto F, Vetterli M: Wave field coding in the spacetime frequency domain. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008 2008, 365-368."},{"issue":"9","key":"105_CR18","doi-asserted-by":"publisher","first-page":"4608","DOI":"10.1109\/TSP.2010.2052045","volume":"58","author":"F Pinto","year":"2010","unstructured":"Pinto F, Vetterli M: space-time-frequency processing of acoustic wave fields: theory, algorithms, and applications. IEEE Trans. Signal Process 2010, 58(9):4608-4620.","journal-title":"IEEE Trans. Signal Process"},{"key":"105_CR19","volume-title":"Spatial squeezing techniques for low bit-rate multichannel audio coding","author":"B Cheng","year":"2011","unstructured":"Cheng B: Spatial squeezing techniques for low bit-rate multichannel audio coding. PhD thesis. University of Wollongong 2011"},{"issue":"8","key":"105_CR20","doi-asserted-by":"publisher","first-page":"1676","DOI":"10.1109\/TASL.2013.2260156","volume":"21","author":"B Cheng","year":"2013","unstructured":"Cheng B, Ritz C, Burnett I, Zheng X: A general compression approach to multi-channel three-dimensional audio. IEEE Trans. Audio Speech Lang. Process 2013, 21(8):1676-1688.","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"issue":"4","key":"105_CR21","doi-asserted-by":"publisher","first-page":"365","DOI":"10.1109\/TSA.2003.814375","volume":"11","author":"D Yang","year":"2003","unstructured":"Yang D, Ai H, Kyriakakis C, Kuo CC: High-fidelity multichannel audio coding with Karhunen-Loeve transform. IEEE Trans. Speech Audio Process 2003, 11(4):365-380. 10.1109\/TSA.2003.814375","journal-title":"IEEE Trans. Speech Audio Process"},{"key":"105_CR22","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1109\/ICASSP.1992.225993","volume-title":"1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1992. ICASSP-92, Volume 2","author":"J Johnston","year":"1992","unstructured":"Johnston J, Ferreira A: Sum-difference stereo transform coding. 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1992. ICASSP-92, Volume 2 1992, 569-572."},{"key":"105_CR23","volume-title":"Proceedings of the 6th International Conference on Digital Audio Effects (DAFx-03)","author":"CM Liu","year":"2003","unstructured":"Liu CM, Lee WC, Hsiao YH: M\/S coding based on allocation entropy. Proceedings of the 6th International Conference on Digital Audio Effects (DAFx-03) 2003."},{"issue":"8","key":"105_CR24","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1109\/TASL.2008.2002068","volume":"16","author":"O Derrien","year":"2008","unstructured":"Derrien O, Richard G: A new model-based algorithm for optimizing the MPEG-AAC in MS-Stereo. IEEE Trans. Audio Speech Lang. Process 2008, 16(8):1373-1382.","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"key":"105_CR25","first-page":"1","volume-title":"2008 ITG Conference on Voice Communication (SprachKommunikation)","author":"H Krueger","year":"2008","unstructured":"Krueger H, Vary P: A new approach for low-delay joint-stereo coding. 2008 ITG Conference on Voice Communication (SprachKommunikation) 2008, 1-4."},{"key":"105_CR26","first-page":"2148","volume-title":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","author":"M Schafer","year":"2012","unstructured":"Schafer M, Vary P: Hierarchical multi-channel audio coding based on time-domain linear prediction. 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO) 2012, 2148-2152."},{"key":"105_CR27","volume-title":"In Audio Engineering Society Convention 132","author":"M Neuendorf","year":"2012","unstructured":"Neuendorf M, Multrus M, Rettelbach N, Fuchs G, Robilliard J, Lecomte J, Wilde S, Bayer S, Disch S, Helmrich C, Lefebvre R, Gournay P, Bessette B, Lapierre J, Kjorling K, Purnhagen H, Villemoes L, Oomen W, Schuijers E, Kikuiri K, Chinen T, Norimatsu T, Seng CK, Oh E, Kim M, Quackenbush S, Grill B: MPEG unified speech and audio coding-the ISO\/MPEG standard for high-efficiency audio coding of all content types. In Audio Engineering Society Convention 132. Audio Engineering Society, 2012);"},{"key":"105_CR28","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1007\/978-3-642-23071-4_33","volume-title":"Microelectronic Systems","author":"M Multrus","year":"2011","unstructured":"Multrus M, Neuendorf M, Lecomte J, Fuchs G, Bayer S, Robilliard J, Nagel F, Wilde S, Fischer D, Hilpert J, Rettelbach N, Helmrich C, Disch S, Geiger R, Grill B: MPEG unified speech and audio coding - bridging the gap. In Microelectronic Systems. Edited by: Heuberger A, Elst G, Hanke R. Berlin, Heidelberg: (Springer Berlin Heidelberg; 2011:351-362."},{"key":"105_CR29","doi-asserted-by":"publisher","first-page":"497","DOI":"10.1109\/ICASSP.2011.5946449","volume-title":"2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"C Helmrich","year":"2011","unstructured":"Helmrich C, Carlsson P, Disch S, Edler B, Hilpert J, Neusinger M, Purnhagen H, Robilliard J, Villemoes L, RettelbachN: Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011, 497-500."},{"issue":"4","key":"105_CR30","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1109\/TASL.2008.918979","volume":"16","author":"CM Liu","year":"2008","unstructured":"Liu CM, Hsu HW, Lee WC: Compression artifacts in perceptual audio coding. IEEE Trans. Audio Speech Lang. Process 2008, 16(4):681-695.","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"issue":"10","key":"105_CR31","first-page":"807","volume":"60","author":"F Zotter","year":"2012","unstructured":"Zotter F, Frank M: All-round ambisonic panning and decoding. J. Audio Eng. Soc 2012, 60(10):807-820.","journal-title":"J. Audio Eng. Soc"},{"key":"105_CR32","first-page":"75","volume-title":"IEICE Tech. Rep., Volume 113 of EA2013-46","author":"A Ando","year":"2013","unstructured":"Ando A, Sugimoto T, Irie K: Coding of 22.2 multichannel audio signal by MPEG-AAC. IEICE Tech. Rep., Volume 113 of EA2013-46 2013, 75-80."},{"issue":"6","key":"105_CR33","doi-asserted-by":"publisher","first-page":"1467","DOI":"10.1109\/TASL.2010.2092429","volume":"19","author":"A Ando","year":"2011","unstructured":"Ando A: Conversion of multichannel sound signal maintaining physical properties of sound in reproduced sound field. IEEE Trans. Audio Speech Lang. Process 2011, 19(6):1467-1475.","journal-title":"IEEE Trans. Audio Speech Lang. Process"},{"key":"105_CR34","volume-title":"Method for the subjective assessment of intermediate sound quality (MUSHRA)","author":"ITU-T","year":"2001","unstructured":"ITU-T: Method for the subjective assessment of intermediate sound quality (MUSHRA). 2001."}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2014-10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1687-4722-2014-10\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2014-10.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T03:24:06Z","timestamp":1630553046000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/1687-4722-2014-10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,3,24]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,12]]}},"alternative-id":["105"],"URL":"https:\/\/doi.org\/10.1186\/1687-4722-2014-10","relation":{},"ISSN":["1687-4722"],"issn-type":[{"value":"1687-4722","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,3,24]]},"assertion":[{"value":"1 November 2013","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 March 2014","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 March 2014","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"10"}}