{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,2]],"date-time":"2022-04-02T08:44:53Z","timestamp":1648889093704},"reference-count":121,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2013,8,6]],"date-time":"2013-08-06T00:00:00Z","timestamp":1375747200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/2.0"},{"start":{"date-parts":[[2013,8,6]],"date-time":"2013-08-06T00:00:00Z","timestamp":1375747200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Braz Comput Soc"],"published-print":{"date-parts":[[2013,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>An automatic music transcriber is a device that detects, without human interference, the musical gestures required to play a particular piece. Many techniques have been proposed to solve the problem of automatic music transcription. This paper presents an overview on the theme, discussing digital signal processing techniques, pattern classification techniques and heuristic assumptions derived from music knowledge that were used to build some of the main systems found in the literature. The paper is focused on the motivations behind each technique, aiming to serve both as an introduction to the theme and as resource for the development of new solutions for automatic transcription.<\/jats:p>","DOI":"10.1007\/s13173-013-0118-6","type":"journal-article","created":{"date-parts":[[2013,8,5]],"date-time":"2013-08-05T08:54:29Z","timestamp":1375692869000},"page":"589-604","update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Survey on automatic transcription of music"],"prefix":"10.1007","volume":"19","author":[{"given":"Tiago","family":"Fernandes Tavares","sequence":"first","affiliation":[]},{"given":"Jayme","family":"Garcia Arnal Barbedo","sequence":"additional","affiliation":[]},{"given":"Romis","family":"Attux","sequence":"additional","affiliation":[]},{"given":"Amauri","family":"Lopes","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,8,6]]},"reference":[{"key":"118_CR1","unstructured":"Abdallah SA, Plumbley MD (2003) An independent component analysis approach to automatic music transcription. In: Proceedings of 114th AES convention 2003, Amsterdam, The Netherlands"},{"key":"118_CR2","unstructured":"Abdallah SA, Plumbley MD (2004) Polyphonic music transcription by non-negative sparse coding of power spectra. In: Proceedings 5th international conference music information retrieval (ISMIR\u201904), Barcelona, Spain"},{"key":"118_CR3","doi-asserted-by":"crossref","unstructured":"Al-Ghawanmeh F, Jafar IF, A.Al-Taee M, Al-Ghawanmeh MT, Muhsin ZJ (2011) Development of improved automatic music transcription system for the arabian flute (nay). In: Proceedings fo the 8th international multi-conference on systems, signals and devices (SSD), 22\u201325 Mar 2011","DOI":"10.1109\/SSD.2011.5993561"},{"issue":"6","key":"118_CR4","doi-asserted-by":"publisher","first-page":"1610","DOI":"10.1109\/TASL.2010.2093894","volume":"19","author":"F Argenti","year":"2011","unstructured":"Argenti F, Nesi P, Pantaleo G (2011) Automatic transcription of polyphonic music based on the constant-q bispectral analysis. IEEE Trans Audio Speech Lang Process 19(6):1610\u20131630","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR5","volume-title":"Modern information retrieval","author":"R Baeza-Yates","year":"1999","unstructured":"Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM Press, Addison-Wesley, New York"},{"issue":"12","key":"118_CR6","doi-asserted-by":"publisher","first-page":"1261","DOI":"10.1016\/j.apacoust.2004.05.007","volume":"65","author":"I Barbancho","year":"2004","unstructured":"Barbancho I, Barbancho A, Jurado A, Tardon L (2004) Transcription of piano recordings. Appl Acoust 65(12):1261\u20131287. doi:10.1016\/j.apacoust.2004.05.007. http:\/\/www.sciencedirect.com\/science\/article\/B6V1S-4D7CDP7-2\/","journal-title":"Appl Acoust"},{"issue":"6","key":"118_CR7","doi-asserted-by":"publisher","first-page":"2242","DOI":"10.1109\/TASL.2006.872609","volume":"14","author":"J Bello","year":"2006","unstructured":"Bello J, Daudet L, Sandler M (2006) Automatic piano transcription using frequency and time-domain information. IEEE Trans Audio Speech Lang Process 14(6):2242\u20132251. doi:10.1109\/TASL.2006.872609","journal-title":"IEEE Trans Audio Speech Lang Process"},{"issue":"5","key":"118_CR8","doi-asserted-by":"publisher","first-page":"1035","DOI":"10.1109\/TASL.2006.872609","volume":"14","author":"JP Bello","year":"2005","unstructured":"Bello JP, Daudet L, Abdallah S, Duxbury C, Davies M, Sandler MB (2005) A tutorial on onset detection in music signals. IEEE Trans Audio Speech Lang Process 14(5):1035\u20131047. doi:10.1109\/TASL.2006.872609","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR9","unstructured":"Bello JP, Monti G, Sandler M, S, M.: Techniques for automatic music transcription. In: Proceedings of the international symposium on music, information retrieval (ISMIR-00), Plymouth, MA, USA, Oct 2000, pp 23\u201325"},{"issue":"6","key":"118_CR10","doi-asserted-by":"publisher","first-page":"1111","DOI":"10.1109\/JSTSP.2011.2162394","volume":"5","author":"E Benetos","year":"2011","unstructured":"Benetos E, Dixon S (2011) Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription. IEEE J Sel Topics Signal Process 5(6):1111\u20131123","journal-title":"IEEE J Sel Topics Signal Process"},{"key":"118_CR11","doi-asserted-by":"crossref","unstructured":"Benetos E, Dixon S (2011) Multiple-instrument polyphonic music transcription using a convolutive probabilistic model. In: Sound and music computing (SMC 2011)","DOI":"10.1109\/ICASSP.2011.5946322"},{"key":"118_CR12","unstructured":"Benetos E, Klapuri A, Dixon S (2012) Score-informed transcription for automatic piano tutoring. In: Proceedings of the 20th European signal processing conference (EUSIPCO 2012)"},{"key":"118_CR13","doi-asserted-by":"publisher","unstructured":"Bertin N, Badeau R, Richard G (2007) Blind signal decompositions for automatic transcription of polyphonic music: Nmf and k-svd on the benchmark. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, ICASSP 2007, vol 1, pp I65\u2013I68. doi:10.1109\/ICASSP.2007.366617","DOI":"10.1109\/ICASSP.2007.366617"},{"key":"118_CR14","doi-asserted-by":"publisher","unstructured":"Bertin N, Badeau R, Vincent E (2009) Fast Bayesian nmf algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription. In: IEEE workshop on applications of signal processing to audio and acoustics, WASPAA \u201909, pp 29\u201332. doi:10.1109\/ASPAA.2009.5346531","DOI":"10.1109\/ASPAA.2009.5346531"},{"issue":"3","key":"118_CR15","doi-asserted-by":"publisher","first-page":"538","DOI":"10.1109\/TASL.2010.2041381","volume":"18","author":"N Bertin","year":"2010","unstructured":"Bertin N, Badeau R, Vincent E (2010) Enforcing harmonicity and smoothness in bayesian non-negative matrix factorization applied to polyphonic music transcription. IEEE Trans Audio Speech Lang Process 18(3):538\u2013549. doi:10.1109\/TASL.2010.2041381","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR16","doi-asserted-by":"publisher","unstructured":"Bertin N, Fevotte C, Badeau R (2009) A tempering approach for itakura-saito non-negative matrix factorization with application to music transcription. In: IEEE international conference on acoustics, speech and signal processing, ICASSP 2009, pp 1545\u20131548 (2009). doi:10.1109\/ICASSP.2009.4959891","DOI":"10.1109\/ICASSP.2009.4959891"},{"key":"118_CR17","doi-asserted-by":"publisher","unstructured":"Boo W, Wang Y, Loscos A (2006) A violin music transcriber for personalized learning. In: Proceedings of the IEEE international conference on multimedia and expo 2006, pp 2081\u20132084. doi:10.1109\/ICME.2006.262644","DOI":"10.1109\/ICME.2006.262644"},{"key":"118_CR18","doi-asserted-by":"publisher","unstructured":"Boogaart C, Lienhart R (2009) Note onset detection for the transcription of polyphonic piano music. In: Proceedings of the IEEE international conference on multimedia and expo, ICME 2009, pp 446\u2013449. doi:10.1109\/ICME.2009.5202530","DOI":"10.1109\/ICME.2009.5202530"},{"key":"118_CR19","unstructured":"Boulanger-Lewandowski N, Bengio Y, Vincent P (2012) Discriminative non-negative matrix factorization for multiple pitch estimation. Proceedings of the ISMIR 2012, Porto, Portugal"},{"issue":"1","key":"118_CR20","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1121\/1.400476","volume":"89","author":"JC Brown","year":"1991","unstructured":"Brown JC (1991) Calculation of a constant q spectral transform. J Acoust Soc Am 89(1):425\u2013434","journal-title":"J Acoust Soc Am"},{"key":"118_CR21","doi-asserted-by":"publisher","unstructured":"Bruno I, Monni S, Nesi P (2003) Automatic music transcription supporting different instruments. In: Proceedings of the 3rd international conference on web delivering of music, 2003 WEDELMUSIC, pp 37\u201344. doi:10.1109\/WDM.2003.1233871","DOI":"10.1109\/WDM.2003.1233871"},{"key":"118_CR22","doi-asserted-by":"publisher","unstructured":"Cemgil A, Kappen B, Barber D (2003) Generative model based polyphonic music transcription. In: Proceedings of the 2003 IEEE workshop on applications of signal processing to audio and acoustics, pp 181\u2013184. doi:10.1109\/ASPAA.2003.1285861","DOI":"10.1109\/ASPAA.2003.1285861"},{"key":"118_CR23","doi-asserted-by":"crossref","unstructured":"Chien YR, Jeng SK (2002) An automatic transcription system with octave detection. In: Proceedings of the 2002 IEEE international conference on acoustics, speech, and signal (ICASSP), vol 2, pp II\u20131865.","DOI":"10.1109\/ICASSP.2002.5744990"},{"issue":"9","key":"118_CR24","doi-asserted-by":"publisher","first-page":"1798","DOI":"10.1016\/j.sigpro.2009.03.024","volume":"89","author":"G Costantini","year":"2009","unstructured":"Costantini G, Perfetti R, Todisco M (2009) Event based\u00a0transcription system for polyphonic piano music. Signal Process 89(9): 1798\u20131811 (2009). doi:10.1016\/j.sigpro.2009.03.024. http:\/\/www.sciencedirect.com\/science\/article\/B6V18-4W0R0H7-2\/","journal-title":"Signal Process"},{"key":"118_CR25","doi-asserted-by":"publisher","unstructured":"Costantini G, Todisco M, Perfetti R (2009) On the use of memory for detecting musical notes in polyphonic piano music. In: Proceedings of the European conference on circuit theory and design, ECCTD 2009, pp 806\u2013809. doi:10.1109\/ECCTD.2009.5275106","DOI":"10.1109\/ECCTD.2009.5275106"},{"key":"118_CR26","doi-asserted-by":"publisher","unstructured":"Costantini G, Todisco M, Perfetti R, Basili R, Casali D (2010) Svm based transcription system with short-term memory oriented to polyphonic piano music. In: Proceedings of the 15th IEEE Mediterranean electrotechnical conference, MELECON 2010\u20132010, pp 196\u2013201. doi:10.1109\/MELCON.2010.5476305","DOI":"10.1109\/MELCON.2010.5476305"},{"key":"118_CR27","unstructured":"Daniel A, Emiya V (2008) Perceptually-based evaluation of the errors usually made when automatically transcribing music. In: Proceedings of the ISMIR 2008, Philadelphia, PA"},{"key":"118_CR28","doi-asserted-by":"publisher","unstructured":"Derrien, O.: Multi-scale frame-based analysis of audio signals for musical transcription using a dictionary of chromatic waveforms. In: Proceedings 2006 IEEE international conference on acoustics, speech and signal processing, ICASSP 2006, vol 5, p V. doi:10.1109\/ICASSP.2006.1661211","DOI":"10.1109\/ICASSP.2006.1661211"},{"key":"118_CR29","unstructured":"Dessein A, Cont A, Lemaitre G (2010) Real-time polyphonic music transcription with non-negative matrix factorization and beta-divergence. In: Proceedings of the 11th international Society for Music Information Retrieval conference (ISMIR 2010), Utrecht, Netherlands"},{"key":"118_CR30","doi-asserted-by":"crossref","unstructured":"Downie JS (2006) The music information retrieval evaluation exchange (mirex). D-Lib Magaz 12(12)","DOI":"10.1045\/december2006-downie"},{"key":"118_CR31","unstructured":"Dressler K (2011) Pitch estimation by the pair-wise evaluation of spectral peaks. In: Proceedings of the AES 42nd international conference, Ilmenau, Germany"},{"issue":"8","key":"118_CR32","doi-asserted-by":"publisher","first-page":"2121","DOI":"10.1109\/TASL.2010.2042119","volume":"18","author":"Z Duan","year":"2010","unstructured":"Duan Z, Pardo B, Zhang C (2010) Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions. IEEE Trans Audio Speech Lang Process 18(8):2121\u20132133","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR33","unstructured":"Duda RO, Hart PE, Stork DG (2000) Pattern classification, 2nd edn. Wiley-Interscience, New York"},{"issue":"6","key":"118_CR34","doi-asserted-by":"crossref","first-page":"1643","DOI":"10.1109\/TASL.2009.2038819","volume":"18","author":"V Emiya","year":"2010","unstructured":"Emiya V, Badeau R, David B (2010) Multipitch estimation of piano sounds using a new probabilistic spectral smoothness principle. IEEE Trans Audio Speech Lang Process 18(6):1643\u20131654","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR35","unstructured":"Fonseca N, Ferreira A (2009) Measuring music transcription results based on a hybrid decay\/sustain evaluation. In: Proceedings of the 7th Triennial conference of European Society for the cognitive sciences of music (ESCOM 2009), 12\u201316 Aug, Jyv\u00e4skyl\u00e4, Finland"},{"key":"118_CR36","doi-asserted-by":"publisher","unstructured":"Foo SW, Lee EWT (2002) Transcription of polyphonic signals using fast filter bank. In: Proceedings of the IEEE International Symposium on circuits and systems, ISCAS 2002, vol 3, pp III-241\u2013III-244. dooi:10.1109\/ISCAS.2002.1010205","DOI":"10.1109\/ISCAS.2002.1010205"},{"key":"118_CR37","doi-asserted-by":"publisher","unstructured":"Gillet O, Richard G (2004) Automatic transcription of drum loops. In: Proceedings IEEE international conference on acoustics, speech, and signal processing (ICASSP \u201904), vol 4, pp iv-269\u2013iv-272. doi:10.1109\/ICASSP.2004.1326815","DOI":"10.1109\/ICASSP.2004.1326815"},{"key":"118_CR38","doi-asserted-by":"publisher","unstructured":"Gillet O, Richard G (2005) Automatic transcription of drum sequences using audiovisual features. In: Proceedings IEEE international conference on acoustics, speech, and signal processing (ICASSP \u201905), vol 3, pp iii-205\u2013iii-208. doi:10.1109\/ICASSP.2005.1415682","DOI":"10.1109\/ICASSP.2005.1415682"},{"issue":"3","key":"118_CR39","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1109\/TASL.2007.914120","volume":"16","author":"O Gillet","year":"2008","unstructured":"Gillet O, Richard G (2008) Transcription and separation of drum signals from polyphonic music. IEEE Trans Audio Speech Lang Process 16(3):529\u2013540. doi:10.1109\/TASL.2007.914120","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR40","unstructured":"Gomez E, Canadas F, Salamon J, Bonada J, Vera P, Cabanas P (2012) Predominant fundamental frequency estimation vs singing voice separation for the automatic transcription of accompanied flamenco singing. In: Proceedings of the 13th international society for music information retrieval conference (ISMIR), 8\u201312 Oct, Porto, Portugal"},{"key":"118_CR41","doi-asserted-by":"publisher","unstructured":"Goto M (2004) A real-time music-scene-description system: predominant-f0 estimation for detecting melody and bass lines in real-world audio signals. Speech Commun 43(4):311\u2013329. doi:10.1016\/j.specom.2004.07.001. http:\/\/www.sciencedirect.com\/science\/article\/B6V1C-4D07TBJ-6\/","DOI":"10.1016\/j.specom.2004.07.001"},{"key":"118_CR42","unstructured":"Goto M, Hashiguchi H, Nishimura T, Oka R (2002) Rwc music database: Popular, classical, and jazz music databases. In: Proceedings of the 3rd international conference on music information retrieval (ISMIR 2002), Oct 2002, pp 287\u2013288"},{"key":"118_CR43","doi-asserted-by":"publisher","unstructured":"Grindlay G, Ellis D (2009) Multi-voice polyphonic music transcription using eigeninstruments. In: Proceedinngs of the IEEE workshop on applications of signal processing to audio and acoustics, WASPAA \u201909, pp 53\u201356. doi:10.1109\/ASPAA.2009.5346514","DOI":"10.1109\/ASPAA.2009.5346514"},{"key":"118_CR44","doi-asserted-by":"publisher","unstructured":"Guibin Z, Sheng L (2007) Automatic transcription method for polyphonic music based on adaptive comb filter and neural network. In: Proceedings of the international conference on mechatronics and automation, ICMA 2007, pp 2592\u20132597. doi:10.1109\/ICMA.2007.4303965","DOI":"10.1109\/ICMA.2007.4303965"},{"key":"118_CR45","unstructured":"Hainsworth S, Macleod MD (2001) Automatic bass line transcription from polyphonic music. In: Proceedings of the international computer music conference, Havana"},{"key":"118_CR46","unstructured":"Hainsworth S, Macleod MD, Wolfe PJ (2001) Analysis of reassigned spectrograms for musical transcription. In: Proceedings of the IEEE workshop on applications of signal processing to audio and acoustics, Mohonk Mountain Resort, NY"},{"key":"118_CR47","unstructured":"Hainsworth SW, Macleod MD (2007) The automated music transcription problem. Cambridge University Engineering Department, Cambridge"},{"key":"118_CR48","doi-asserted-by":"crossref","unstructured":"Han J, Chen CW (2011) Improving melody extraction using probabilistic latent component analysis. In: Proceedings of the ICASSP 2011, pp 33\u201336","DOI":"10.1109\/ICASSP.2011.5946321"},{"key":"118_CR49","unstructured":"Hanson RJ (1995) Lawson. Solving least squares problems. Philadelphia, CL"},{"key":"118_CR50","unstructured":"Haykin S (2000) Neural networks: a comprehensive foundation, 2nd edn. Pearson Education, Prentice Hall"},{"key":"118_CR51","unstructured":"Helmholtz H (1885) On the sensation of tone, 4th edn. Dover Publications Inc., New York"},{"key":"118_CR52","unstructured":"Hsu CL, Jang JSR (2010) Singing pitch extraction by voice vibrato\/tremolo estimation and instrument partial deletion. In: Proceedings of the 11th international society for music information retrieval conference (ISMIR 2010), Utrecht, Netherlands"},{"key":"118_CR53","doi-asserted-by":"publisher","unstructured":"Ning Jiang D, Picheny M, Qin Y (2007) Voice-melody transcription under a speech recognition framework. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, ICASSP 2007, vol 4, pp IV-617\u2013IV-620. doi:10.1109\/ICASSP.2007.366988","DOI":"10.1109\/ICASSP.2007.366988"},{"key":"118_CR54","unstructured":"Karkoschka E (1972) Notation in new music: a critical guide to interpretation and realisation. Praeger. http:\/\/books.google.ca\/books?id=O4MYAQAAIAAJ"},{"key":"118_CR55","unstructured":"Keren R, Zeevi YY, Chazan D (1998) Multiresolution time-frequency analysis of polyphonic music. In: Proceedings of the IEEE-SP international symposium on time-frequency and time-scale analysis, pp 565\u2013568, Pittsburgh, PA, USA"},{"key":"118_CR56","doi-asserted-by":"crossref","unstructured":"Kirchhoff H, Dixon S, Klapuri A (2012) Multi-template shift-variant non-negative matrix deconvolution for semi-automatic music transcription. In: Proceedings of the 13th international conference on music information retrieval (ISMIR), Porto, Portugal","DOI":"10.1109\/ICASSP.2012.6287833"},{"key":"118_CR57","doi-asserted-by":"crossref","unstructured":"Klapuri A, Davy M (2006) Signal processing methods for music transcription. Springer, Berlin","DOI":"10.1007\/0-387-32845-9"},{"key":"118_CR58","doi-asserted-by":"publisher","unstructured":"Kobzantsev A, Chazan D, Zeevi Y (2005) Automatic transcription of piano polyphonic music. In: Proceedings of the 4th international symposium on image and signal processing and analysis, ISPA 2005, pp 414\u2013418. doi:10.1109\/ISPA.2005.195447","DOI":"10.1109\/ISPA.2005.195447"},{"issue":"1","key":"118_CR59","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1109\/TASSP.1978.1163047","volume":"26","author":"K Kodera","year":"1978","unstructured":"Kodera K, Gendrin R, Villedary C (1978) Analysis of time-varying signals with small bt values. IEEE Trans Acoust Speech Signal Process 26(1):64\u201376. doi:10.1109\/TASSP.1978.1163047","journal-title":"IEEE Trans Acoust Speech Signal Process"},{"key":"118_CR60","doi-asserted-by":"publisher","unstructured":"Lao W, Tan ET, Kam A (2004) Computationally inexpensive and effective scheme for automatic transcription of polyphonic music. In: Proceedings of the 2004 IEEE international conference on multimedia and expo, ICME \u201904, vol 3, pp 1775\u20131778. doi:10.1109\/ICME.2004.1394599","DOI":"10.1109\/ICME.2004.1394599"},{"key":"118_CR61","doi-asserted-by":"publisher","unstructured":"Lee CT, Yang YH, Chen H (2011) Automatic transcription of piano music by sparse representation of magnitude spectra. In: Proceedings of the 2011 IEEE international conference on multimedia and expo (ICME), pp 1\u20136. doi:10.1109\/ICME.2011.6012000","DOI":"10.1109\/ICME.2011.6012000"},{"key":"118_CR62","doi-asserted-by":"publisher","unstructured":"Li J, Han J, Shi Z, Li J (2010) An efficient approach to humming transcription for query-by-humming system. In: Proceedings of the 3rd international congress on image and signal processing (CISP 2010), vol 8, pp 3746\u20133749. doi:10.1109\/CISP.2010.5646801","DOI":"10.1109\/CISP.2010.5646801"},{"issue":"4","key":"118_CR63","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1109\/TCS.1986.1085930","volume":"33","author":"Y Lim","year":"1986","unstructured":"Lim Y (1986) Frequency-response masking approach for the synthesis of sharp linear phase digital filters. IEEE Trans Circ Syst 33(4):357\u2013364. doi:10.1109\/TCS.1986.1085930","journal-title":"IEEE Trans Circ Syst"},{"issue":"1","key":"118_CR64","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1109\/78.651200","volume":"46","author":"M Macleod","year":"1998","unstructured":"Macleod M (1998) Fast nearly ml estimation of the parameters of real or complex single tones or resolved multiple tones. IEEE Trans Signal Process 46(1):141\u2013148. doi:10.1109\/78.651200","journal-title":"IEEE Trans Signal Process"},{"issue":"12","key":"118_CR65","doi-asserted-by":"publisher","first-page":"3397","DOI":"10.1109\/78.258082","volume":"41","author":"S Mallat","year":"1993","unstructured":"Mallat S, Zhang Z (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397\u20133415. doi:10.1109\/78.258082","journal-title":"IEEE Trans Signal Process"},{"key":"118_CR66","doi-asserted-by":"crossref","unstructured":"Marolt M (2000) Transcription of polyphonic piano music with neural networks. In: Proceedings of the 10th Mediterranean electrotechnical conference, MEleCon 2000, vol 11, pp 512\u2013515","DOI":"10.1109\/MELCON.2000.879982"},{"issue":"3","key":"118_CR67","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1109\/TMM.2004.827507","volume":"6","author":"M Marolt","year":"2004","unstructured":"Marolt M (2004) A connectionist approach to automatic transcription of polyphonic piano music. IEEE Trans Multimed 6(3): 439\u2013449. doi:10.1109\/TMM.2004.827507","journal-title":"IEEE Trans Multimed"},{"key":"118_CR68","doi-asserted-by":"crossref","unstructured":"Martin KD (1996) A blackboard system for automatic transcription of simple polyphonic music. Technical report","DOI":"10.1121\/1.416589"},{"key":"118_CR69","unstructured":"Mauch M, Dixon S (2010) Approximate note transcription for the improved identification of difficult chords. In: Proceedings of the 11th international society for music information retrieval conference (ISMIR 2010), Utrecht, Netherlands"},{"key":"118_CR70","unstructured":"Miwa T, Tadokoro Y, Saito T (1999) Musical pitch estimation and discrimination of musical instruments using comb filters for transcription. In: Proceedings of the 42nd Midwest symposium on circuits and systems, 1999, vol 1, pp 105\u2013108"},{"issue":"4","key":"118_CR71","first-page":"32","volume":"1","author":"JA Moorer","year":"1977","unstructured":"Moorer JA (1977) On the transcription of musical sound by computer. Comput Music J 1(4):32\u201338","journal-title":"Comput Music J"},{"key":"118_CR72","doi-asserted-by":"publisher","unstructured":"Muto Y, Tanaka T (2002) Transcription system for music by two instruments. In: Proceedings of the 6th international conference on signal processing, vol 2, pp 1676\u20131679. doi:10.1109\/ICOSP.2002.1180123","DOI":"10.1109\/ICOSP.2002.1180123"},{"key":"118_CR73","unstructured":"Nam J, Ngiam J, Lee H, Slaney M (2011) A classification-based polyphonic piano transcription approach using learned feature representations. In: Proceedings of the 12th international society for music information retrieval conference (ISMIR 2011), 24\u201328 Oct 2011, Miami, FL, USA"},{"key":"118_CR74","unstructured":"Niedermayer B (2008) Non-negative matrix division for the automatic transcription of polyphonic music. In: Proceedings of the ISMIR, pp 544\u2013549"},{"key":"118_CR75","doi-asserted-by":"publisher","unstructured":"O\u2019Grady PD, Rickard ST (2009) Automatic hexaphonic guitar transcription using non-negative constraints. In: Proceedings of signals and systems conference (ISSC 2009), IET Irish, pp 1\u20136. doi:10.1049\/cp.2009.1699","DOI":"10.1049\/cp.2009.1699"},{"key":"118_CR76","doi-asserted-by":"publisher","unstructured":"O\u2019Hanlon K, Nagano H, Plumbley M (2012) Structured sparsity for automatic music transcription. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 441\u2013444. doi:10.1109\/ICASSP.2012.6287911","DOI":"10.1109\/ICASSP.2012.6287911"},{"key":"118_CR77","unstructured":"Olson HF (1967) Music, physics and engineering, 2nd edn. Dover Publications Inc., New York"},{"key":"118_CR78","unstructured":"Oppenheim AV, Schafer R (1975) Digital signal processing. Prentice-Hall international editions. Prentice-Hall. http:\/\/books.google.ca\/books?id=vfdSAAAAMAAJ"},{"key":"118_CR79","doi-asserted-by":"publisher","unstructured":"Oudre L, Grenier Y, Fevotte C (2009) Chord recognition using measures of fit, chord templates and filtering methods. In: Proceedings of the IEEE workshop on applications of signal processing to audio and acoustics, WASPAA \u201909, pp 9\u201312. doi:10.1109\/ASPAA.2009.5346546","DOI":"10.1109\/ASPAA.2009.5346546"},{"key":"118_CR80","doi-asserted-by":"publisher","unstructured":"Oudre L, Grenier Y, Fevotte C (2011) Chord recognition by fitting rescaled chroma vectors to chord templates. In: Processings of the IEEE transactions on audio, speech, and language, vol 17(7):2222\u20132233. doi:10.1109\/TASL.2011.2139205","DOI":"10.1109\/TASL.2011.2139205"},{"key":"118_CR81","doi-asserted-by":"crossref","unstructured":"Patterson R, Robinson K, Holdsworth J, McKeown D, Allerhand C (1992) Auditory Physiiikigy und perception, chap. complex sounds and auditory images, Exford","DOI":"10.1016\/B978-0-08-041847-6.50054-X"},{"key":"118_CR82","doi-asserted-by":"publisher","unstructured":"Peeling P, Cemgil A, Godsill S (2008) Bayesian hierarchical models and inference for musical audio processing. In: Proceedings of the 3rd international symposium on wireless pervasive computing, ISWPC 2008, pp 278\u2013282. doi:10.1109\/ISWPC.2008.4556214","DOI":"10.1109\/ISWPC.2008.4556214"},{"issue":"3","key":"118_CR83","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1109\/TASL.2009.2029769","volume":"18","author":"P Peeling","year":"2010","unstructured":"Peeling P, Cemgil A, Godsill S (2010) Generative spectrogram factorization models for polyphonic piano transcription. IEEE Trans Audio Speech Lang Process 18(3):519\u2013527. doi:10.1109\/TASL.2009.2029769","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR84","doi-asserted-by":"publisher","unstructured":"Pertusa A, I\u00c3\u015besta JM (2005) Polyphonic monotimbral music transcription using dynamic networks. Pattern Recogn Lett 26(12):1809\u20131818. doi:10.1016\/j.patrec.2005.03.001. http:\/\/www.sciencedirect.com\/science\/article\/B6V15-4FY3NWX-C\/","DOI":"10.1016\/j.patrec.2005.03.001"},{"key":"118_CR85","doi-asserted-by":"publisher","unstructured":"Phon-Amnuaisuk S (2010) Transcribing bach chorales using non-negative matrix factorisation. In: Proceedings of the 2010 international conference on audio language and image processing (ICALIP), pp 688\u2013693. doi:10.1109\/ICALIP.2010.5685059","DOI":"10.1109\/ICALIP.2010.5685059"},{"issue":"4","key":"118_CR86","doi-asserted-by":"publisher","first-page":"2382","DOI":"10.1121\/1.415426","volume":"99","author":"WJ Pielemeier","year":"1996","unstructured":"Pielemeier WJ, Wakefield GH (1996) A high-resolution time-frequency representation for musical instrument signals. J Acoust Soc Am 99(4):2382\u20132396","journal-title":"J Acoust Soc Am"},{"issue":"1","key":"118_CR87","first-page":"24","volume":"4","author":"M Piszczalski","year":"1977","unstructured":"Piszczalski M, Galler BA (1977) Automatic music transcription. Comput Music J 4(1):24\u201331","journal-title":"Comput Music J"},{"key":"118_CR88","doi-asserted-by":"publisher","unstructured":"Poliner GE, Ellis DP (2007) Improving generalization for classification-based polyphonic piano transcription. In: Proceedings of the 2007 IEEE workshop on applications of signal processing to audio and acoustics, pp 86\u201389. doi:10.1109\/ASPAA.2007.4393050","DOI":"10.1109\/ASPAA.2007.4393050"},{"key":"118_CR89","unstructured":"Privosnik M, Marolt M (1998) A system for automatic transcription of music based on multiple agents architecture. In: Proceedings of MELECON\u201998, pp 169\u2013172 (Tel Aviv 1998)"},{"issue":"2","key":"118_CR90","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1109\/5.18626","volume":"77","author":"LR Rabiner","year":"1989","unstructured":"Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2):257\u2013286","journal-title":"Proc. IEEE"},{"key":"118_CR91","unstructured":"Raczynski SA, Vincent E, Bimbot F, Sagayama S (2010) Multiple pitch transcription using dbn-based musicological models. In: Proceedings of the 11th international society for music information retrieval conference (ISMIR 2010), Utrecht, Netherlands"},{"issue":"8","key":"118_CR92","doi-asserted-by":"publisher","first-page":"2145","DOI":"10.1109\/TASL.2010.2042124","volume":"18","author":"V Rao","year":"2010","unstructured":"Rao V, Rao P (2010) Vocal melody extraction in the presence of pitched accompaniment in polyphonic music. IEEE Trans Audio Speech Lang Process 18(8):2145\u20132154. doi:10.1109\/TASL.2010.2042124","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR93","unstructured":"Raphael C (2002) Automatic transcription of piano music. In: Proceedings of the 3rd international conference on music information retrieval: ISMIR 2002, pp 15\u201319, Paris, France"},{"key":"118_CR94","doi-asserted-by":"publisher","unstructured":"Reis G, Fonseca N, Ferndandez F (2007) Genetic algorithm approach to polyphonic music transcription. In: Proceedings of the IEEE international symposium on intelligent signal processing, WISP 2007, pp 1\u20136. doi:10.1109\/WISP.2007.4447608","DOI":"10.1109\/WISP.2007.4447608"},{"issue":"8","key":"118_CR95","doi-asserted-by":"publisher","first-page":"2313","DOI":"10.1109\/TASL.2012.2201475","volume":"20","author":"G Reis","year":"2012","unstructured":"Reis G, Fernandez de Vega F, Ferreira A (2012) Automatic transcription of polyphonic piano music using genetic algorithms, adaptive spectral envelope modeling, and dynamic noise level estimation. IEEE Trans Audio Speech Lang Process 20(8):2313\u20132328. doi:10.1109\/TASL.2012.2201475","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR96","doi-asserted-by":"publisher","unstructured":"Ryynanen M, Klapuri A (2005) Polyphonic music transcription using note event modeling. In: Proceedings of the IEEE workshop on applications of signal processing to audio and acoustics, pp 319\u2013322. doi:10.1109\/ASPAA.2005.1540233","DOI":"10.1109\/ASPAA.2005.1540233"},{"key":"118_CR97","unstructured":"Ryynanen M, Klapuri A (2006) Transcription of the singing melody in polyphonic music. In: Proceedings of the 7th international conference on music information retrieval, Victoria, BC, Canada, pp 222\u2013227"},{"key":"118_CR98","doi-asserted-by":"publisher","unstructured":"Ryynanen M, Klapuri A (2007) Automatic bass line transcription from streaming polyphonic audio. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, ICASSP 2007, vol 4, pp. IV-1437\u2013IV-1440. doi:10.1109\/ICASSP.2007.367350","DOI":"10.1109\/ICASSP.2007.367350"},{"issue":"3","key":"118_CR99","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1162\/comj.2008.32.3.72","volume":"32","author":"M Ryynanen","year":"2008","unstructured":"Ryynanen M, Klapuri A (2008) Automatic transcription of melody, bass line, and chords in polyphonic music. Comput Music J 32(3):72\u201386","journal-title":"Comput Music J"},{"key":"118_CR100","unstructured":"Salamon J, Gulati S, Serra X (2012) A multipitch approach to tonic identification in indian classical music. In: Proceedings of the 13th international society for music information retrieval conference of the (ISMIR), Porto, Portugal"},{"key":"118_CR101","doi-asserted-by":"publisher","unstructured":"Shih HH, Narayanan S, Kuo CC (2002) An hmm-based approach to humming transcription. In: Proceedings of the 2002 IEEE international conference on multimedia and expo, ICME \u201902, vol 1, pp 337\u2013340. doi:10.1109\/ICME.2002.1035787","DOI":"10.1109\/ICME.2002.1035787"},{"key":"118_CR102","unstructured":"Simsekli U, Cemgil AT (2010) A comparison of probabilistic models for online pitch tracking. In: Proceedings of the 7th conference on sound and music computing (SMC), Barcelona, Spain"},{"key":"118_CR103","doi-asserted-by":"publisher","unstructured":"Smaragdis P, Brown J (2003) Non-negative matrix factorization for polyphonic music transcription. In: Proceedings of the 2003 IEEE workshop on applications of signal processing to audio and acoustics, pp 177\u2013180. doi:10.1109\/ASPAA.2003.1285860","DOI":"10.1109\/ASPAA.2003.1285860"},{"key":"118_CR104","doi-asserted-by":"publisher","unstructured":"Sophea S., Phon-Amnuaisuk S (2007) Determining a suitable desired factors for nonnegative matrix factorization in polyphonic music transcription. In: Proceedings of the international symposium on information technology convergence, ISITC 2007, pp 166\u2013170. doi:10.1109\/ISITC.2007.50","DOI":"10.1109\/ISITC.2007.50"},{"key":"118_CR105","unstructured":"Sterian A, Simoni MH, Wakefield GH (1999) Model-based musical transcription. In: Proceedings of the international computer music conference, Beijing, China"},{"key":"118_CR106","unstructured":"Sterian A, Wakefield GH (1996) Robust automated music transcription systems. In: Proceedings of the international computer music conference, Hong Kong"},{"key":"118_CR107","unstructured":"Sterian A, Wakefield GH (1997) A frequency-dependent bilinear time-frequency distribution for improved event detection. In: Proceedings of the international computer music conference, Thessaloniki, Greece"},{"key":"118_CR108","doi-asserted-by":"publisher","unstructured":"Tanaka T, Tagami Y (2002) Automatic midi data making from music wave data performed by 2 instruments using blind signal separation. In: Proceedings of the 41st SICE annual conference SICE 2002, vol 1, pp 451\u2013456. doi:10.1109\/SICE.2002.1195442","DOI":"10.1109\/SICE.2002.1195442"},{"key":"118_CR109","doi-asserted-by":"publisher","unstructured":"Tavares T, Odowichuck G, Zehtabi S, Tzanetakis G (2012) Audio-visual vibraphone transcription in real time. In: Proceedings of the IEEE 14th international workshop on multimedia signal processing (MMSP), pp 215\u2013220. doi:10.1109\/MMSP.2012.6343443","DOI":"10.1109\/MMSP.2012.6343443"},{"key":"118_CR110","unstructured":"Tavares TF, Barbedo JGA, Lopes A (2008) Towards the evaluation of automatic transcription of music. In: Proceedings of the VI Brazilian congress of audio, engineering (AES2008), Sao Paulo, Brazil"},{"issue":"4","key":"118_CR111","doi-asserted-by":"publisher","first-page":"1257","DOI":"10.1109\/TASL.2006.889801","volume":"15","author":"H Thornburg","year":"2007","unstructured":"Thornburg H, Leistikow R, Berger J (2007) Melody extraction and musical onset detection via probabilistic models of framewise stft peak data. IEEE Trans Audio Speech Lang Process 15(4):1257\u20131272. doi:10.1109\/TASL.2006.889801","journal-title":"IEEE Trans Audio Speech Lang Process"},{"key":"118_CR112","doi-asserted-by":"publisher","unstructured":"Tjahyanto A, Suprapto Y, Purnomo M, Wulandari D (2012) Fft-based features selection for javanese music note and instrument identification using support vector machines. In: Proceedings of the 2012 IEEE international conference on computer science and automation engineering (CSAE), vol 1, pp 439\u2013443. doi:10.1109\/CSAE.2012.6272633","DOI":"10.1109\/CSAE.2012.6272633"},{"key":"118_CR113","doi-asserted-by":"publisher","unstructured":"Triki M, Slock D (2009) Perceptually motivated quasi-periodic signal selection for polyphonic music transcription. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, ICASSP 2009, pp 305\u2013308. doi:10.1109\/ICASSP.2009.4959581","DOI":"10.1109\/ICASSP.2009.4959581"},{"key":"118_CR114","doi-asserted-by":"publisher","unstructured":"Uchida Y, Wada S (2011) Melody and bass line estimation method using audio feature database. In: Proceedins of the 2011 IEEE international conference on signal processing, communications and computing (ICSPCC), pp 1\u20136. doi:10.1109\/ICSPCC.2011.6061662","DOI":"10.1109\/ICSPCC.2011.6061662"},{"key":"118_CR115","doi-asserted-by":"publisher","unstructured":"Vincent E, Berlin N, Badeau R (2008) Harmonic and inharmonic nonnegative matrix factorization for polyphonic pitch transcription. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, ICASSP 2008, pp 109\u2013112. doi:10.1109\/ICASSP.2008.4517558","DOI":"10.1109\/ICASSP.2008.4517558"},{"key":"118_CR116","doi-asserted-by":"crossref","unstructured":"Vincent E, Rodet X (2004) Music transcription with ISA and HMM. In: Proceedings of the 5th international conference on independent component analysis and blind signal separation (ICA), Granada, Espagne, pp 1197\u20131204. http:\/\/hal.inria.fr\/inria-00544697","DOI":"10.1007\/978-3-540-30110-3_151"},{"key":"118_CR117","doi-asserted-by":"publisher","unstructured":"Wang Y, Zhang B, Schleusing O (2007) Educational violin transcription by fusing multimedia streams. In: Proceedings of the international workshop on Educational multimedia and multimedia education, Emme \u201907, ACM, New York, NY, USA, pp 57\u201366. doi:10.1145\/1290144.1290154. http:\/\/doi.acm.org\/10.1145\/1290144.1290154","DOI":"10.1145\/1290144.1290154"},{"key":"118_CR118","doi-asserted-by":"publisher","unstructured":"Wang YS, Hu TY, Jeng SK (2010) Automatic transcription for music with two timbres from monaural sound source. In: Proceedings of the 2010 IEEE international symposium on multimedia (ISM), pp 314\u2013317. doi:10.1109\/ISM.2010.54","DOI":"10.1109\/ISM.2010.54"},{"key":"118_CR119","doi-asserted-by":"publisher","unstructured":"Weller A, Ellis D, Jebara T (2009) Structured prediction models for chord transcription of music audio. In: Proceedings of the 2009 international conference on machine learning and applications, ICMLA \u201909, pp 590\u2013595. doi:10.1109\/ICMLA.2009.132","DOI":"10.1109\/ICMLA.2009.132"},{"issue":"6","key":"118_CR120","doi-asserted-by":"publisher","first-page":"787","DOI":"10.1109\/TPAMI.1987.4767985","volume":"9","author":"R Wilson","year":"1987","unstructured":"Wilson R (1987) Finite prolate spheroidal sequences and their applications i: generation and properties. IEEE Trans Pattern Anal Mach Intell 9(6):787","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"118_CR121","doi-asserted-by":"publisher","unstructured":"Yin J, Wang Y, Hsu D (2005) Digital violin tutor: an\u00a0integrated system for beginning violin learners. In: Proceedings of the 13th annual ACM international conference on Multimedia, MULTIMEDIA \u201905, pp 976\u2013985. ACM, New York. doi:10.1145\/1101149.1101353. http:\/\/doi.acm.org\/10.1145\/1101149.1101353","DOI":"10.1145\/1101149.1101353"}],"container-title":["Journal of the Brazilian Computer Society"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-013-0118-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13173-013-0118-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-013-0118-6","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13173-013-0118-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T00:53:01Z","timestamp":1630543981000},"score":1,"resource":{"primary":{"URL":"https:\/\/journal-bcs.springeropen.com\/articles\/10.1007\/s13173-013-0118-6"}},"subtitle":["Historical overview of techniques"],"short-title":[],"issued":{"date-parts":[[2013,8,6]]},"references-count":121,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,11]]}},"alternative-id":["118"],"URL":"https:\/\/doi.org\/10.1007\/s13173-013-0118-6","relation":{},"ISSN":["0104-6500","1678-4804"],"issn-type":[{"value":"0104-6500","type":"print"},{"value":"1678-4804","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,8,6]]},"assertion":[{"value":"11 October 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 July 2013","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 August 2013","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}