{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T20:13:40Z","timestamp":1777407220856,"version":"3.51.4"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"3-4","license":[{"start":{"date-parts":[[1992,5,1]],"date-time":"1992-05-01T00:00:00Z","timestamp":704678400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[1992,5]]},"DOI":"10.1007\/bf00992697","type":"journal-article","created":{"date-parts":[[2005,1,9]],"date-time":"2005-01-09T16:35:16Z","timestamp":1105288516000},"page":"257-277","source":"Crossref","is-referenced-by-count":265,"title":["Practical issues in temporal difference learning"],"prefix":"10.1007","volume":"8","author":[{"given":"Gerald","family":"Tesauro","sequence":"first","affiliation":[]}],"member":"297","reference":[{"key":"CR1","doi-asserted-by":"crossref","unstructured":"Anderson, C.W. (1987). Strategy learning with multilayer connectionist representations.Proceedings of the Fourth International Workshop on Machine Learning (pp. 103?114).","DOI":"10.1016\/B978-0-934613-41-5.50014-3"},{"key":"CR2","first-page":"835","volume":"13","author":"A.G. Barto","year":"1983","unstructured":"Barto, A.G., Sutton, R.S., & Anderson, C.W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems.IEEE Transactions on Systems, Man and Cybernetics, 13 835?846.","journal-title":"IEEE Transactions on Systems, Man and Cybernetics"},{"key":"CR3","unstructured":"Berliner, H. (1977). Experiences in evaluation with BKG?a program that plays backgammon.Proceedings of IJCAI (pp. 428?433)."},{"key":"CR4","unstructured":"Berliner, H. (1979). On the construction of evaluation functions for large domains.Proceedings of IJCAI (pp. 53?55)."},{"key":"CR5","doi-asserted-by":"crossref","first-page":"929","DOI":"10.1145\/76359.76371","volume":"36","author":"A. Blumer","year":"1989","unstructured":"Blumer, A., Ehrenfeucht, A., Haussler, D., & Warmuth, M. (1989). Learnability and the Vapnik-Chervonenkis dimension.JACM, 36 929?965.","journal-title":"JACM"},{"key":"CR6","unstructured":"Christensen, J. & Korf, R. (1986). A unified theory of heuristic evaluation functions and its application to learning.Proceeding of AAAI-86 (pp. 148?152)."},{"key":"CR7","first-page":"341","volume":"8","author":"P. Dayan","year":"1992","unstructured":"Dayan, P. (1992). The convergence of TD(?).Machine Learning, 8 341?362.","journal-title":"Machine Learning"},{"key":"CR8","volume-title":"Evolution, games and learning","author":"P.W. Frey","year":"1986","unstructured":"Frey, P.W. (1986). Algorithmic strategies for improving the performance of game playing programs. In: D. Farmer, et al. (Eds.),Evolution, games and learning. Amsterdam: North Holland."},{"key":"CR9","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/0004-3702(74)90027-7","volume":"5","author":"A.K. Griffith","year":"1974","unstructured":"Griffith, A.K. (1974). A comparison and evaluation of three machine learning procedures as applied to the game of checkers.Artificial Intelligence, 5 137?148.","journal-title":"Artificial Intelligence"},{"key":"CR10","volume-title":"Machine learning: An artificial intelligence approach (Vol. 2)","author":"J.H. Holland","year":"1986","unstructured":"Holland, J.H. (1986). Escaping brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In: R.S. Michalski, J.G. Carbonell & T.M. Mitchell, (Eds.),Machine learning: An artificial intelligence approach (Vol. 2). Los Altos, CA: Morgan Kaufmann."},{"key":"CR11","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1016\/0893-6080(89)90020-8","volume":"2","author":"K. Hornik","year":"1989","unstructured":"Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators.Neural Networks, 2 359?366.","journal-title":"Neural Networks"},{"key":"CR12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/0004-3702(88)90076-8","volume":"36","author":"K.-F. Lee","year":"1988","unstructured":"Lee, K.-F. & Majahan, S. (1988). A pattern classification approach to evaluation function learning.Artificial Intelligence, 36 1?25.","journal-title":"Artificial Intelligence"},{"key":"CR13","volume-title":"Backgammon","author":"P. Magriel","year":"1976","unstructured":"Magriel, P. (1976).Backgammon. New York: Times Books."},{"key":"CR14","volume-title":"Perceptrons","author":"M.L. Minsky","year":"1969","unstructured":"Minsky, M.L. & Papert, S.A. (1969).Perceptrons. Cambridge, MA: MIT Press. (Republished as an expanded edition in 1988)."},{"key":"CR15","volume-title":"Using features to evaluate positions in experts' and novices' Othello games","author":"D.H. Mitchell","year":"1984","unstructured":"Mitchell, D.H. (1984). Using features to evaluate positions in experts' and novices' Othello games. Master's Thesis, Northwestern Univ., Evanston, IL."},{"key":"CR16","volume-title":"Machine learning","author":"J.R. Quinlan","year":"1983","unstructured":"Quinlan, J.R. (1983). Learning efficient classification procedures and their application to chess end games. In: R.S. Michalski, J.G. Carbonell & T.M. Mitchell (Eds.),Machine learning. Palo Alto, CA: Tioga."},{"key":"CR17","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1214\/aoms\/1177729586","volume":"22","author":"H. Robbins","year":"1951","unstructured":"Robbins, H. & Monro, S. (1951). A stochastic approximation method.Annals of Mathematical Statistics, 22 400?407.","journal-title":"Annals of Mathematical Statistics"},{"key":"CR18","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/5236.001.0001","volume-title":"Parallel distributed processing. Vol. 1","author":"D.E. Rumelhart","year":"1986","unstructured":"Rumelhart, D.E., Hinton, G.E., & Williams, R.J. (1986). Learning internal representations by error propagation. In: D. Rumelhart & J. McClelland, (Eds.),Parallel distributed processing. Vol. 1. Cambridge, MA: MIT Press."},{"key":"CR19","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1147\/rd.33.0210","volume":"3","author":"A. Samuel","year":"1959","unstructured":"Samuel, A. (1959). Some studies in machine learning using the game of checkers.IBM J. of Research and Development, 3 210?229.","journal-title":"IBM J. of Research and Development"},{"key":"CR20","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1147\/rd.116.0601","volume":"11","author":"A. Samuel","year":"1967","unstructured":"Samuel, A. (1967). Some studies in machine learning using the game of checkers, II?recent progress.IBM J. of Research and Development, 11 601?617.","journal-title":"IBM J. of Research and Development"},{"key":"CR21","volume-title":"Temporal credit assignment in reinforcement learning","author":"R.S. Sutton","year":"1984","unstructured":"Sutton, R.S. (1984). Temporal credit assignment in reinforcement learning. Doctoral Dissertation, Dept. of Computer and Information Science, Univ. of Massachusetts, Amherst."},{"key":"CR22","first-page":"9","volume":"3","author":"R.S. Sutton","year":"1988","unstructured":"Sutton, R.S. (1988). Learning to predict by the methods of temporal differences.Machine Learning, 3 9?44.","journal-title":"Machine Learning"},{"key":"CR23","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1016\/0004-3702(89)90017-9","volume":"39","author":"G. Tesauro","year":"1989","unstructured":"Tesauro, G. & Sejnowski, T.J. (1989). A parallel network that learns to play backgammon.Artificial Intelligence, 39 357?390.","journal-title":"Artificial Intelligence"},{"key":"CR24","first-page":"99","volume":"1","author":"G. Tesauro","year":"1989","unstructured":"Tesauro, G. (1989). Connectionist learning of expert preferences by comparison training. In D. Touretzky (Ed.),Advances in neural information processing, 1 99?106.","journal-title":"Advances in neural information processing"},{"key":"CR25","first-page":"33","volume":"III","author":"G. Tesauro","year":"1990","unstructured":"Tesauro, G. (1990). Neurogammon: a neural network backgammon program.IJCNN Proceedings III, 33?39.","journal-title":"IJCNN Proceedings"},{"key":"CR26","unstructured":"Utgoff, P.E. & Clouse, J.A. (1991). Two kinds of training information for evaluation function training. To appear in:Proceedings of AAAI-91."},{"key":"CR27","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1137\/1116025","volume":"16","author":"V.N. Vapnik","year":"1971","unstructured":"Vapnik, V.N. & Chervonenkis (1971). On the uniform convergence of relative frequencies of events to their probabilities.Theory Prob. Appl., 16 264?280.","journal-title":"Theory Prob. Appl."},{"key":"CR28","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1109\/PROC.1976.10286","volume":"64","author":"B. Widrow","year":"1976","unstructured":"Widrow, B., et al. (1976). Stationary and nonstationary learning characteristics of the LMS adaptive filter.Proceedings of the IEEE, 64 1151?1162.","journal-title":"Proceedings of the IEEE"},{"key":"CR29","doi-asserted-by":"crossref","first-page":"853","DOI":"10.1287\/mnsc.23.8.853","volume":"23","author":"N. Zadeh","year":"1977","unstructured":"Zadeh, N. & Kobliska, G. (1977). On optimal doubling in backgammon.Management Science, 23 853?858.","journal-title":"Management Science"}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BF00992697.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/BF00992697\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BF00992697","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,4,5]],"date-time":"2020-04-05T08:10:57Z","timestamp":1586074257000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/BF00992697"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1992,5]]},"references-count":29,"journal-issue":{"issue":"3-4","published-print":{"date-parts":[[1992,5]]}},"alternative-id":["BF00992697"],"URL":"https:\/\/doi.org\/10.1007\/bf00992697","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[1992,5]]}}}