{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T12:43:17Z","timestamp":1778071397517,"version":"3.51.4"},"reference-count":50,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2020,6,27]],"date-time":"2020-06-27T00:00:00Z","timestamp":1593216000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2020,6,27]],"date-time":"2020-06-27T00:00:00Z","timestamp":1593216000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-1564080"],"award-info":[{"award-number":["IIS-1564080"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-1724157"],"award-info":[{"award-number":["IIS-1724157"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000006","name":"Office of Naval Research","doi-asserted-by":"publisher","award":["N000141612785"],"award-info":[{"award-number":["N000141612785"]}],"id":[{"id":"10.13039\/100000006","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton Robot"],"published-print":{"date-parts":[[2020,9]]},"DOI":"10.1007\/s10514-020-09925-w","type":"journal-article","created":{"date-parts":[[2020,6,27]],"date-time":"2020-06-27T20:04:48Z","timestamp":1593288288000},"page":"1167-1182","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":19,"title":["Iterative residual tuning for system identification and sim-to-real robot learning"],"prefix":"10.1007","volume":"44","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8795-1498","authenticated-orcid":false,"given":"Adam David","family":"Allevato","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Elaine","family":"Schaertl\u00a0Short","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mitch","family":"Pryor","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrea L.","family":"Thomaz","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,6,27]]},"reference":[{"key":"9925_CR1","doi-asserted-by":"crossref","unstructured":"Ajay, A., Wu, J., Fazeli, N., Bauza, M., Kaelbling, L. P., Tenenbaum, J. B., et al. (2018). Augmenting physical simulators with stochastic neural networks: Case study of planar pushing and bouncing. In International conference on intelligent robots and systems (IROS).","DOI":"10.1109\/IROS.2018.8593995"},{"key":"9925_CR2","unstructured":"Allevato, A., Schaertl\u00a0Short, E., Pryor, M., & Thomaz, A. L. (2019). TuneNet: One-shot residual tuning for system identification and sim-to-real robot task transfer. CoRR arXiv:1907.11200v3."},{"key":"9925_CR3","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1177\/0278364913495932","volume":"33","author":"K Ayusawa","year":"2013","unstructured":"Ayusawa, K., Venture, G., & Nakamura, Y. (2013). Identifiability and identification of inertial parameters using the underactuated base-link dynamics for legged multibody systems. The International Journal of Robotics Research, 33, 446\u2013468. https:\/\/doi.org\/10.1177\/0278364913495932.","journal-title":"The International Journal of Robotics Research"},{"key":"9925_CR4","doi-asserted-by":"crossref","unstructured":"Bottou, L. (2010). Large-scale machine learning with stochastic gradient descent. In Proceedings of COMPSTAT (pp. 177\u2013186). Berlin: Springer.","DOI":"10.1007\/978-3-7908-2604-3_16"},{"key":"9925_CR5","unstructured":"Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., & Zaremba, W. (2016). OpenAI Gym. CoRR arXiv:1606.01540v1."},{"key":"9925_CR6","unstructured":"Chang, M. B., Ullman, T., Torralba, A., & Tenenbaum, J. B. (2016). A compositional object-based approach to learning physical dynamics. Preprint arXiv:1612.00341."},{"key":"9925_CR7","doi-asserted-by":"crossref","unstructured":"Chebotar, Y., Handa, A., Makoviychuk, V., Macklin, M., Issac, J., Ratliff, N., et al. (2018). Closing the sim-to-real loop: Adapting simulation randomization with real world experience. CoRR.","DOI":"10.1109\/ICRA.2019.8793789"},{"key":"9925_CR8","unstructured":"de\u00a0Avila Belbute-Peres, F., Smith, K., Allen, K., Tenenbaum, J., & Kolter, J. Z. (2018). End-to-end differentiable physics for learning and control. In Advances in neural information processing systems (Vol. 31, pp. 7178\u20137189). Montreal. https:\/\/papers.nips.cc\/paper\/7948-end-to-end-differentiable-physics-for-learning-and-control."},{"key":"9925_CR9","unstructured":"Dietrich, O. E. (2015). USA racquetball official rules of racquetball. USA Racquetball."},{"key":"9925_CR10","unstructured":"Fazeli, N., Tedrake, R., & Rodriguez, A. (2015). Identifiability analysis of planar rigid-body frictional contact. In International symposium on robotics research (ISRR). Berlin: Springer."},{"key":"9925_CR11","doi-asserted-by":"publisher","unstructured":"Gautier, M., & Khalil, W. (1988). On the identification of the inertial parameters of robots. In Proceedings of the 27th IEEE conference on decision and control (pp. 2264\u20132269). New York: IEEE. https:\/\/doi.org\/10.1109\/CDC.1988.194738.","DOI":"10.1109\/CDC.1988.194738"},{"key":"9925_CR12","unstructured":"Golemo, F., Ta\u00efga, A. A., Oudeyer, P. Y., & Courville, A. (2018). Sim-to-real transfer with neural-augmented robot simulation. In 2nd conference on robot learning (CoRL18) (Vol. 87, pp. 817\u2013828)."},{"key":"9925_CR13","doi-asserted-by":"crossref","unstructured":"Hanna, J. P., & Stone, P. (2017). Grounded action transformation for robot learning in simulation. In Proceedings of the 31st AAAI conference on artificial intelligence (AAAI-17) (pp. 3834\u20133840).","DOI":"10.1609\/aaai.v31i1.11044"},{"key":"9925_CR14","unstructured":"Hanna, J. P., Thomas, P. S., Stone, P., & Niekum, S. (2017). Data-efficient policy evaluation through behavior policy search. In D. Precup, & Y. W. Teh (Eds.), Proceedings of the 34th international conference on machine learning, PMLR, Sydney, Australia, proceedings of machine learning research (Vol. 70, pp. 1394\u20131403)."},{"key":"9925_CR15","unstructured":"Hansen, N. (2016). The CMA evolution strategy: A tutorial. CoRR arXiv:1604.00772."},{"key":"9925_CR16","unstructured":"Heiden, E., Millard, D., Zhang, H., & Sukhatme, G. S. (2019). Interactive differentiable simulation. CoRR arXiv:1905.10706."},{"issue":"June","key":"9925_CR17","first-page":"1809","volume":"13","author":"P Hennig","year":"2012","unstructured":"Hennig, P., & Schuler, C. J. (2012). Entropy search for information-efficient global optimization. Journal of Machine Learning Research, 13(June), 1809\u20131837.","journal-title":"Journal of Machine Learning Research"},{"key":"9925_CR18","unstructured":"Hoffman, J., Tzeng, E., Park, T., Zhu, J. Y., Isola, P., Saenko, K., et al. (2017). CyCADA: Cycle-consistent adversarial domain adaptation. CoRR arXiv:1711.03213."},{"key":"9925_CR19","doi-asserted-by":"publisher","unstructured":"Hollerbach, J., Khalil, W., & Gautier, M. (2008). Model identification. In B. Siciliano & O. Khatib (Eds.), Springer handbook of robotics (pp. 321\u2013344). Berlin, Heidelberg: Springer. https:\/\/doi.org\/10.1007\/978-3-540-30301-5_15.","DOI":"10.1007\/978-3-540-30301-5_15"},{"key":"9925_CR20","doi-asserted-by":"crossref","unstructured":"James, S., Wohlhart, P., Kalakrishnan, M., Kalashnikov, D., Irpan, A., Ibarz, J., et al. (2019). Sim-to-real via sim-to-sim: Data-efficient robotic grasping via randomized-to-canonical adaptation networks. In Computer vision and pattern recognition (CVPR).","DOI":"10.1109\/CVPR.2019.01291"},{"key":"9925_CR21","doi-asserted-by":"crossref","unstructured":"Johannink, T., Bahl, S., Nair, A., Luo, J., Kumar, A., Loskyll, M., et al. (2018). Residual reinforcement learning for robot control. CoRR arXiv:1812.03201.","DOI":"10.1109\/ICRA.2019.8794127"},{"key":"9925_CR22","doi-asserted-by":"publisher","unstructured":"Khosla, P., & Kanade, T. (1985). Parameter identification of robot dynamics. In 1985 24th IEEE conference on decision and control (pp. 1754\u20131760). New York: IEEE. https:\/\/doi.org\/10.1109\/CDC.1985.268838.","DOI":"10.1109\/CDC.1985.268838"},{"key":"9925_CR23","unstructured":"Kingma, D. P, & Ba, J. (2014). Adam: A method for stochastic optimization. Preprint arXiv:1412.6980."},{"key":"9925_CR24","unstructured":"Kloss, A., Schaal, S., & Bohg, J. (2017). Combining learned and analytical models for predicting action effects. CoRR."},{"key":"9925_CR25","doi-asserted-by":"publisher","unstructured":"Kolev, S., & Todorov, E. (2015 December). Physically consistent state estimation and system identification for contacts. In IEEE-RAS international conference on humanoid robots (Vol. 2015, pp. 1036\u20131043). New York: IEEE. https:\/\/doi.org\/10.1109\/HUMANOIDS.2015.7363481.","DOI":"10.1109\/HUMANOIDS.2015.7363481"},{"issue":"1","key":"9925_CR26","doi-asserted-by":"publisher","first-page":"503","DOI":"10.1007\/BF01589116","volume":"45","author":"DC Liu","year":"1989","unstructured":"Liu, D. C., & Nocedal, J. (1989). On the limited memory BFGS method for large scale optimization. Mathematical Programming, 45(1), 503\u2013528. https:\/\/doi.org\/10.1007\/BF01589116.","journal-title":"Mathematical Programming"},{"key":"9925_CR27","unstructured":"Lutter, M., Ritter, C., & Peters, J. (2019). Deep Lagrangian networks: Using physics as model prior for deep learning. In 7th international conference on learning representations (ICLR)."},{"issue":"4","key":"9925_CR28","doi-asserted-by":"publisher","first-page":"3232","DOI":"10.1109\/LRA.2018.2851026","volume":"3","author":"D Ma","year":"2018","unstructured":"Ma, D., & Rodriguez, A. (2018). Friction variability in auto-collected dataset of planar pushing experiments and anisotropic friction. IEEE Robotics and Automation Letters, 3(4), 3232\u20133239. https:\/\/doi.org\/10.1109\/LRA.2018.2851026.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"9925_CR29","unstructured":"Mehta, B., Diaz, M., Golemo, F., Pal, C. J., & Paull, L. (2019). Active domain randomization. CoRR arXiv:1904.04762."},{"key":"9925_CR30","doi-asserted-by":"publisher","unstructured":"Paszke, A., Chanan, G., Lin, Z., Gross, S., Yang, E., Antiga, L., & Devito, Z. (2017). Automatic differentiation in PyTorch. In 31st conference on neural information processing systems (NIPS), Long Beach, CA (pp. 1\u20134). https:\/\/doi.org\/10.1017\/CBO9781107707221.009.","DOI":"10.1017\/CBO9781107707221.009"},{"key":"9925_CR31","doi-asserted-by":"crossref","unstructured":"Peng, X. B., Andrychowicz, M., Zaremba, W., & Abbeel, P. (2018). Sim-to-real transfer of robotic control with dynamics randomization. In IEEE international conference on robotics and Automation (ICRA) (pp. 1\u20138).","DOI":"10.1109\/ICRA.2018.8460528"},{"key":"9925_CR32","doi-asserted-by":"publisher","unstructured":"Pinto, L., Andrychowicz, M., Welinder, P., Zaremba, W., & Abbeel, P. (2017). Asymmetric actor critic for image-based robot learning. In Robotics: Science and systems XIV, robotics: Science and systems foundation. https:\/\/doi.org\/10.1007\/s10869-008-9083-z.","DOI":"10.1007\/s10869-008-9083-z"},{"issue":"1","key":"9925_CR33","doi-asserted-by":"publisher","first-page":"145","DOI":"10.1016\/S0893-6080(98)00116-6","volume":"12","author":"N Qian","year":"1999","unstructured":"Qian, N. (1999). On the momentum term in gradient descent learning algorithms. Neural Networks, 12(1), 145\u2013151.","journal-title":"Neural Networks"},{"key":"9925_CR34","doi-asserted-by":"publisher","unstructured":"Rajeswaran, A., Ghotra, S., Ravindran, B., & Levine, S. (2016). EPOpt: Learning robust neural network policies using model ensembles. CoRR. https:\/\/doi.org\/10.1073\/pnas.211563298.","DOI":"10.1073\/pnas.211563298"},{"key":"9925_CR35","doi-asserted-by":"publisher","unstructured":"Ramos, F., Possas, R. C., & Fox, D. (2019). BayesSim: Adaptive domain randomization via probabilistic inference for robotics simulators. In Proceedings of robotics: Science and systems, Freiburg im Breisgau, Germany. https:\/\/doi.org\/10.15607\/RSS.2019.XV.029.","DOI":"10.15607\/RSS.2019.XV.029"},{"key":"9925_CR36","unstructured":"Ruder, S. (2016). An overview of gradient descent optimization algorithms. CoRR arXiv:1609.04747."},{"key":"9925_CR37","unstructured":"Rusu, A. A., Ve\u010der\u00edk, M., Roth\u00f6rl, T., Heess, N., Pascanu, R., & Hadsell, R. (2017). Sim-to-real robot learning from pixels with progressive nets. In Conference on robot learning (pp. 262\u2013270)."},{"key":"9925_CR38","unstructured":"Silver, T., Allen, K., Tenenbaum, J., & Kaelbling, L. P. (2018). Residual policy learning. CoRR arXiv:1907.11200."},{"key":"9925_CR39","doi-asserted-by":"publisher","unstructured":"Tan, J., Xie, Z., Boots, B., & Liu, C. K. (2016). Simulation-based design of dynamic controllers for humanoid balancing. In IEEE international conference on intelligent robots and systems (pp. 2729\u20132736). New York: IEEE. https:\/\/doi.org\/10.1109\/IROS.2016.7759424.","DOI":"10.1109\/IROS.2016.7759424"},{"key":"9925_CR40","doi-asserted-by":"crossref","unstructured":"Tan, J., Zhang, T., Coumans, E., Iscen, A., Bai, Y., Hafner, D., Bohez, S., Vanhoucke, V. (2018). Sim-to-real: Learning agile locomotion for quadruped robots. In Robotics: Science and systems XIV, robotics: Science and systems foundation. arXiv:1804.10332v2.","DOI":"10.15607\/RSS.2018.XIV.010"},{"issue":"2","key":"9925_CR41","first-page":"26","volume":"4","author":"T Tieleman","year":"2012","unstructured":"Tieleman, T., & Hinton, G. (2012). Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2), 26\u201331.","journal-title":"COURSERA: Neural Networks for Machine Learning"},{"key":"9925_CR42","doi-asserted-by":"publisher","unstructured":"Tobin, J., Fong, R., Ray, A., Schneider, J., Zaremba, W., & Abbeel, P. (2017). Domain randomization for transferring deep neural networks from simulation to the real world. In IEEE international conference on intelligent robots and systems (pp. 23\u201330). New York: IEEE https:\/\/doi.org\/10.1109\/IROS.2017.8202133.","DOI":"10.1109\/IROS.2017.8202133"},{"key":"9925_CR43","doi-asserted-by":"publisher","unstructured":"Todorov, E. (2011). A convex, smooth and invertible contact model for trajectory optimization. In Proceedings\u2014IEEE international conference on robotics and automation (pp. 1071\u20131076). New York: IEEE. https:\/\/doi.org\/10.1109\/ICRA.2011.5979814.","DOI":"10.1109\/ICRA.2011.5979814"},{"key":"9925_CR44","doi-asserted-by":"publisher","unstructured":"Toussaint, M., Allen, K., Smith, K., & Tenenbaum, J. (2018). Differentiable physics and stable modes for tool-use and manipulation planning. In Robotics: Science and systems XIV. https:\/\/doi.org\/10.15607\/RSS.2018.XIV.044.","DOI":"10.15607\/RSS.2018.XIV.044"},{"key":"9925_CR45","doi-asserted-by":"publisher","unstructured":"Xu, Z., Wu, J., Zeng, A., Tenenbaum, J., Song, S. (2019). DensePhysNet: Learning dense physical object representations via multi-step dynamic interactions. In Proceedings of robotics: Science and systems, Freiburg im Breisgau, Germany. https:\/\/doi.org\/10.15607\/RSS.2019.XV.046.","DOI":"10.15607\/RSS.2019.XV.046"},{"key":"9925_CR46","doi-asserted-by":"publisher","unstructured":"Yu, W., Tan, J., Liu, C. K., & Turk, G. (2017). Preparing for the unknown: Learning a universal policy with online system identification. In Proceedings of robotics: Science and systems, Cambridge, MA. https:\/\/doi.org\/10.15607\/RSS.2017.XIII.048.","DOI":"10.15607\/RSS.2017.XIII.048"},{"key":"9925_CR47","doi-asserted-by":"publisher","unstructured":"Zeng, A., Song, S., Lee, J., Rodriguez, A., & Funkhouser, T. A. (2019). TossingBot: Learning to throw arbitrary objects with residual physics. In Proceedings of robotics: Science and systems, Freiburg im Breisgau, Germany. https:\/\/doi.org\/10.15607\/RSS.2019.XV.004.","DOI":"10.15607\/RSS.2019.XV.004"},{"issue":"2","key":"9925_CR48","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.1109\/LRA.2019.2894216","volume":"4","author":"J Zhang","year":"2019","unstructured":"Zhang, J., Tai, L., Yun, P., Xiong, Y., Liu, M., Boedecker, J., et al. (2019). Vr-goggles for robots: Real-to-sim domain adaptation for visual control. IEEE Robotics and Automation Letters, 4(2), 1148\u20131155.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"9925_CR49","doi-asserted-by":"crossref","unstructured":"Zhu, S., Kimmel, A., Bekris, K. E., Boularias, A. (2018). Fast model identification via physics engines for data-efficient policy search. In Proceedings of the 27th international joint conference on artificial intelligence (IJCAI) (pp. 3249\u20133256).","DOI":"10.24963\/ijcai.2018\/451"},{"key":"9925_CR50","first-page":"1","volume":"20","author":"S Zhu","year":"2019","unstructured":"Zhu, S., Surovik, D., Bekris, K. E., & Boularias, A. (2019). Closing the reality gap of robotic simulators through task-oriented Bayesian optimization. Journal of Machine Learning Research, 20, 1\u201324.","journal-title":"Journal of Machine Learning Research"}],"container-title":["Autonomous Robots"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-020-09925-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10514-020-09925-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-020-09925-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,30]],"date-time":"2022-10-30T17:09:02Z","timestamp":1667149742000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10514-020-09925-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,27]]},"references-count":50,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2020,9]]}},"alternative-id":["9925"],"URL":"https:\/\/doi.org\/10.1007\/s10514-020-09925-w","relation":{},"ISSN":["0929-5593","1573-7527"],"issn-type":[{"value":"0929-5593","type":"print"},{"value":"1573-7527","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,27]]},"assertion":[{"value":"20 December 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 June 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Compliance with ethical standards"}},{"value":"In addition to the funding listed above, the authors state the following relationships and interests. Adam Allevato holds stock options in Diligent Robotics, Inc. Elaine Schaertl Short is supported by a Clare Boothe Luce Professorship from the Henry Luce Foundation and was funded by Microsoft for travel to the AI Breakthoughs workshop. Mitch Pryor is a consultant for Finnegan, LLP, received travel support from the British Consulate, and is funded under the research Grants DOE-LANL Grant 407626, DOE-IRP Grant DE-EM0004384, Army Research Office Grant W911NF-17-2-0180, Army Research Office (CMU Subcontract) W911NF-18-2-0218, Phillips 66 Project #UTA19-000187, Woodside Project #UTA17-001210, Wilder Systems Project #UTA18-000696, as well as the RAPID UT Industry Affiliate Program. Andrea L. Thomaz is the CEO of, and holds stock in, Diligent Robotics, Inc.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}