{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T05:40:00Z","timestamp":1775281200044,"version":"3.50.1"},"publisher-location":"Berlin, Heidelberg","reference-count":30,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"value":"9783540653110","type":"print"},{"value":"9783540494300","type":"electronic"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1998]]},"DOI":"10.1007\/3-540-49430-8_2","type":"book-chapter","created":{"date-parts":[[2007,8,11]],"date-time":"2007-08-11T14:57:33Z","timestamp":1186844253000},"page":"9-50","source":"Crossref","is-referenced-by-count":813,"title":["Efficient BackProp"],"prefix":"10.1007","author":[{"given":"Yann","family":"LeCun","sequence":"first","affiliation":[]},{"given":"Leon","family":"Bottou","sequence":"additional","affiliation":[]},{"given":"Genevieve B.","family":"Orr","sequence":"additional","affiliation":[]},{"given":"Klaus -Robert","family":"M\u00fcller","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2002,3,28]]},"reference":[{"key":"2_CR1","unstructured":"S. Amari. Neural learning in structuredparameter spaces \u2014 natural riemannian gradient. In Michael C. Mozer, Michael I. Jordan, and Thomas Petsche, editors, Advances in Neural Information Processing Systems, volume 9, page 127. The MIT Press, 1997."},{"issue":"2","key":"2_CR2","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1162\/089976698300017746","volume":"10","author":"S. Amari","year":"1998","unstructured":"S. Amari. Natural gradient works e.ciently in learning. Neural Computation, 10(2):251\u2013276, 1998.","journal-title":"Neural Computation"},{"key":"2_CR3","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1162\/neco.1992.4.2.141","volume":"4","author":"R. Battiti","year":"1992","unstructured":"R. Battiti. First-and second-order methods for learning: Between steepest descent andnewton\u2019s method. Neural Computation, 4:141\u2013166, 1992.","journal-title":"Neural Computation"},{"key":"2_CR4","unstructured":"S. Becker and Y. LeCun. Improving the convergence of backbropagation learning with secondo der metho ds. In David Touretzky, Geofrey Hinton, and T errence Sejnowski, editors, Proceedings of the 1988 Connectionist Models Summer School, pages 29\u201337. Lawrence Erlbaum Associates, 1989."},{"key":"2_CR5","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198538493.001.0001","volume-title":"Neural Networks for Pattern Recognition","author":"C. M. Bishop","year":"1995","unstructured":"C. M. Bishop. Neural Networks for Pattern Recognition. Clarendon Press, Oxford, 1995."},{"key":"2_CR6","unstructured":"L. Bottou. Online algorithms andsto chastic approximations. In David Saad, editor, Online Learning in Neural Networks (1997 Workshop at the Newton Institute), Cambridge, 1998. The Newton Institute Series, Cambridge University Press."},{"key":"2_CR7","first-page":"321","volume":"2","author":"D. S. Broomheadand","year":"1988","unstructured":"D. S. Broomheadand D. Lowe. Multivariable function interpolation andad aptive networks. Complex Systems, 2:321\u2013355, 1988.","journal-title":"Complex Systems"},{"key":"2_CR8","doi-asserted-by":"crossref","unstructured":"W. L. Buntine and A. S. Weigend. Computing second order derivatives in Feed-Forwardnet works: A review. IEEE Transactions on Neural Networks, 1993. To appear.","DOI":"10.1109\/72.286919"},{"key":"2_CR9","first-page":"832","volume-title":"Advances in Neural Information Processing Systems","author":"C. Darken","year":"1991","unstructured":"C. Darken and J. E. Moody. Note on learning rate schedules for stochastic optimization. In R. P. Lippmann, J. E. Moody, and D. S. Touretzky, editors, Advances in Neural Information Processing Systems, volume 3, pages 832\u2013838. Morgan Kaufmann, San Mateo,CA, 1991."},{"key":"2_CR10","volume-title":"Principal Component Neural Networks","author":"K. I. Diamantaras","year":"1996","unstructured":"K. I. Diamantaras and S. Y. Kung. Principal Component Neural Networks. Wiley, New York, 1996."},{"key":"2_CR11","series-title":"chapter 8.7: Polynomial time algorithms","first-page":"183","volume-title":"Practical Methods of Optimization","author":"R. Fletcher","year":"1987","unstructured":"R. Fletcher. Practical Methods of Optimization, chapter 8.7: Polynomial time algorithms, pages 183\u2013188. John Wiley & Sons, New York, second edition, 1987.","edition":"second edition"},{"issue":"1","key":"2_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1162\/neco.1992.4.1.1","volume":"4","author":"S. Geman","year":"1992","unstructured":"S. Geman, E. Bienenstock, and R. Doursat. Neural networks andthe bias\/variance dilemma. Neural Computation, 4(1):1\u201358, 1992.","journal-title":"Neural Computation"},{"key":"2_CR13","volume-title":"Technical Report DRB-306","author":"L. Goldstein","year":"1987","unstructured":"L. Goldstein. Mean square optimality in the continuous time Robbins Monro procedure. Technical Report DRB-306, Dept. of Mathematics, University of Southern California, LA, 1987."},{"key":"2_CR14","volume-title":"Matrix Computations","author":"G. H. Golub","year":"1989","unstructured":"G. H. Golub and C. F. Van Loan. Matrix Computations, 2nd ed. Johns Hopkins University Press, Baltimore, 1989.","edition":"2nd ed."},{"key":"2_CR15","first-page":"199","volume-title":"Mathematical Approaches to Neural Networks","author":"T.M. Heskes","year":"1993","unstructured":"T.M. Heskes and B. Kappen. On-line learning processes in arti.cial neural networks. In J. G. Tayler, editor, Mathematical Approaches to Neural Networks, volume 51, pages 199\u2013233. Elsevier, Amsterdam, 1993."},{"key":"2_CR16","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1016\/0893-6080(88)90003-2","volume":"1","author":"R. A. Jacobs","year":"1988","unstructured":"Robert A. Jacobs. Increasedrates of convergence through learning rate adaptation. Neural Networks, 1:295\u2013307, 1988.","journal-title":"Neural Networks"},{"key":"2_CR17","unstructured":"A. H. Kramer and A. Sangiovanni-Vincentelli. Efficient parallel learning algorithms for neural networks. In D. S. Touretzky, editor, Advances in Neural Information Processing Systems. Proceedings of the 1988 Conference, pages 40\u201348, San Mateo, CA, 1989. Morgan Kaufmann."},{"key":"2_CR18","series-title":"PhD thesis","volume-title":"Modeles connexionnistes de l\u2019apprentissage (connectionist learning models)","author":"Y. LeCun","year":"1987","unstructured":"Y. LeCun. Modeles connexionnistes de l\u2019apprentissage (connectionist learning models). PhD thesis, Universit\u00e9 P. et M. Curie (Paris VI), 1987."},{"key":"2_CR19","unstructured":"Y. LeCun. Generalization andnet work design strategies. In R. Pfeifer, Z. Schreter, F. Fogelman, and L. Steels, editors, Connectionism in Perspective, Amsterdam, 1989. Elsevier. Proceedings of the International Conference Connectionism in Perspective, University of Z\u00fcrich, 10.\u201313. October 1988."},{"key":"2_CR20","unstructured":"Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Handwritten digit recognition with a backpropagation network. In D. S. Touretsky, editor, Advances in Neural Information Processing Systems, vol. 2, San Mateo, CA, 1990. Morgan Kaufman."},{"key":"2_CR21","unstructured":"Y. LeCun, J.S. Denker, and S.A. Solla. Optimal brain damage. In D. S. Touretsky, editor, Advances in Neural Information Processing Systems, vol. 2, pages 598\u2013605, 1990."},{"key":"2_CR22","unstructured":"Y. LeCun, I. Kanter, and S. A. Solla. Secondord er properties of error surfaces. In Advances in Neural Information Processing Systems, vol. 3, San Mateo, CA, 1991. Morgan Kaufmann."},{"key":"2_CR23","unstructured":"Y. LeCun, P. Y. Simard, and B. Pearlmutter. Automatic learning rate maximization by on-line estimation of the hessian\u2019s eigenvectors. In Giles, Hanson, and Cowan, editors, Advances in Neural Information Processing Systems, vol. 5, San Mateo, CA, 1993. Morgan Kaufmann."},{"key":"2_CR24","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1016\/S0893-6080(05)80056-5","volume":"6","author":"M. M\u00d8ller","year":"1993","unstructured":"M. M\u00d8ller. A scaledconjugate gradient algorithm for fast supervisedlearning. Neural Networks, 6:525\u2013533, 1993.","journal-title":"Neural Networks"},{"issue":"1","key":"2_CR25","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1142\/S0129065793000031","volume":"4","author":"M. M\u00d8ller","year":"1993","unstructured":"M. M\u00d8ller. Supervised learning on large redundant training sets. International Journal of Neural Systems, 4(1):15\u201325, 1993.","journal-title":"International Journal of Neural Systems"},{"key":"2_CR26","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1162\/neco.1989.1.2.281","volume":"1","author":"J. E. Moody","year":"1989","unstructured":"J. E. Moody and C. J. Darken. Fast learning in networks of locally-tunedpro cessing units. Neural Computation, 1:281\u2013294, 1989.","journal-title":"Neural Computation"},{"key":"2_CR27","unstructured":"N. Murata. (in Japanese). PhD thesis, University of Tokyo, 1992."},{"key":"2_CR28","unstructured":"N. Murata, K.-R. M\u00fcller, A. Ziehe, and S. Amari. Adaptive on-line learning in changing environments. In Michael C. Mozer, Michael I. Jordan, and Thomas Petsche, editors, Advances in Neural Information Processing Systems, volume 9, page 599. The MIT Press, 1997."},{"key":"2_CR29","volume-title":"Digital Signal Processing","author":"A.V. Oppenheim","year":"1975","unstructured":"A.V. Oppenheim and R.W. Schafer. Digital Signal Processing. Prentice Hall, Englewood Cliffs, 1975."},{"key":"2_CR30","unstructured":"G. B. Orr. Dynamics and Algorithms for Stochastic learning. PhD thesis, Oregon Graduate Institute, 1995."}],"container-title":["Lecture Notes in Computer Science","Neural Networks: Tricks of the Trade"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/3-540-49430-8_2","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,17]],"date-time":"2024-02-17T10:24:15Z","timestamp":1708165455000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/3-540-49430-8_2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1998]]},"ISBN":["9783540653110","9783540494300"],"references-count":30,"URL":"https:\/\/doi.org\/10.1007\/3-540-49430-8_2","relation":{},"ISSN":["0302-9743"],"issn-type":[{"value":"0302-9743","type":"print"}],"subject":[],"published":{"date-parts":[[1998]]}}}