{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,5]],"date-time":"2025-11-05T10:28:57Z","timestamp":1762338537045,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":65,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,5,9]],"date-time":"2023-05-09T00:00:00Z","timestamp":1683590400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSF","award":["CNS-1837499"],"award-info":[{"award-number":["CNS-1837499"]}]},{"name":"National AI Institute for Edge Computing Leveraging Next Generation Wireless Networks","award":["CNS-2112562"],"award-info":[{"award-number":["CNS-2112562"]}]},{"name":"NIH","award":["UH3 NS103468"],"award-info":[{"award-number":["UH3 NS103468"]}]},{"name":"Medtronic PLC"},{"name":"Rune Labs"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,5,9]]},"DOI":"10.1145\/3576841.3585925","type":"proceedings-article","created":{"date-parts":[[2023,5,4]],"date-time":"2023-05-04T16:18:19Z","timestamp":1683217099000},"page":"44-55","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5542-3631","authenticated-orcid":false,"given":"Qitong","family":"Gao","sequence":"first","affiliation":[{"name":"Duke University, Durham, NC, United States of America"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1547-3305","authenticated-orcid":false,"given":"Stephen L.","family":"Schmidt","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4154-0225","authenticated-orcid":false,"given":"Afsana","family":"Chowdhury","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-8213-0704","authenticated-orcid":false,"given":"Guangyu","family":"Feng","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2428-8833","authenticated-orcid":false,"given":"Jennifer J.","family":"Peters","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-6424-3333","authenticated-orcid":false,"given":"Katherine","family":"Genty","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5240-6588","authenticated-orcid":false,"given":"Warren M.","family":"Grill","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7118-0764","authenticated-orcid":false,"given":"Dennis A.","family":"Turner","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5357-0117","authenticated-orcid":false,"given":"Miroslav","family":"Pajic","sequence":"additional","affiliation":[{"name":"Duke University, Durham, NC, United States of America"}]}],"member":"320","published-online":{"date-parts":[[2023,5,9]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Mattia Arlotti Manuela Rosa etal 2016. The adaptive deep brain stimulation challenge. Parkinsonism & related disorders 28 (2016) 12--17.  Mattia Arlotti Manuela Rosa et al. 2016. The adaptive deep brain stimulation challenge. Parkinsonism & related disorders 28 (2016) 12--17.","DOI":"10.1016\/j.parkreldis.2016.03.020"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Mattia Arlotti Lorenzo Rossi etal 2016. An external portable device for adaptive deep brain stimulation (aDBS) clinical research in advanced Parkinson's Disease. Medical engineering & physics 38 5 (2016) 498--505.  Mattia Arlotti Lorenzo Rossi et al. 2016. An external portable device for adaptive deep brain stimulation (aDBS) clinical research in advanced Parkinson's Disease. Medical engineering & physics 38 5 (2016) 498--505.","DOI":"10.1016\/j.medengphy.2016.02.007"},{"key":"e_1_3_2_1_3_1","volume-title":"Deep brain stimulation for Parkinson's disease. Current opinion in neurobiology 13, 6","author":"Benabid Alim Louis","year":"2003","unstructured":"Alim Louis Benabid . 2003. Deep brain stimulation for Parkinson's disease. Current opinion in neurobiology 13, 6 ( 2003 ), 696--706. Alim Louis Benabid. 2003. Deep brain stimulation for Parkinson's disease. Current opinion in neurobiology 13, 6 (2003), 696--706."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Aleksandar Beric Patrick J Kelly etal 2001. Complications of deep brain stimulation surgery. Stereotactic and functional neurosurgery 77 1--4 (2001) 73--78.  Aleksandar Beric Patrick J Kelly et al. 2001. Complications of deep brain stimulation surgery. Stereotactic and functional neurosurgery 77 1--4 (2001) 73--78.","DOI":"10.1159\/000064600"},{"key":"e_1_3_2_1_5_1","volume-title":"Adaptive deep brain stimulation in Parkinson's disease. Parkinsonism & related disorders 22","author":"Beudel M","year":"2016","unstructured":"M Beudel and P Brown . 2016. Adaptive deep brain stimulation in Parkinson's disease. Parkinsonism & related disorders 22 ( 2016 ), S123--S126. M Beudel and P Brown. 2016. Adaptive deep brain stimulation in Parkinson's disease. Parkinsonism & related disorders 22 (2016), S123--S126."},{"volume-title":"Pattern recognition and machine learning","author":"Bishop Christopher","key":"e_1_3_2_1_6_1","unstructured":"Christopher Bishop . 2006. Pattern recognition and machine learning . Springer . Christopher Bishop. 2006. Pattern recognition and machine learning. Springer."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.21-03-01033.2001"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"A H Butt E Rovini etal 2018. Objective and automatic classification of Parkinson disease with Leap Motion controller. Biomedical engineering 17 1 (2018) 1--21.  A H Butt E Rovini et al. 2018. Objective and automatic classification of Parkinson disease with Leap Motion controller. Biomedical engineering 17 1 (2018) 1--21.","DOI":"10.1186\/s12938-018-0600-7"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3389\/fnhum.2021.717401"},{"key":"e_1_3_2_1_10_1","volume-title":"Coindice: Off-policy confidence interval estimation. arXiv preprint arXiv:2010.11652","author":"Dai Bo","year":"2020","unstructured":"Bo Dai , Ofir Nachum , 2020 . Coindice: Off-policy confidence interval estimation. arXiv preprint arXiv:2010.11652 (2020). Bo Dai, Ofir Nachum, et al. 2020. Coindice: Off-policy confidence interval estimation. arXiv preprint arXiv:2010.11652 (2020)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/S1474-4422(06)70471-9"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMoa060281"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMoa0907083"},{"key":"e_1_3_2_1_14_1","unstructured":"Justin Fu Mohammad Norouzi etal 2020. Benchmarks for Deep Off-Policy Evaluation. In ICLR.  Justin Fu Mohammad Norouzi et al. 2020. Benchmarks for Deep Off-Policy Evaluation. In ICLR."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Ge Gao Qitong Gao etal 2022. A Reinforcement Learning-Informed Pattern Mining Framework for Multivariate Time Series Classification. In IJCAI.  Ge Gao Qitong Gao et al. 2022. A Reinforcement Learning-Informed Pattern Mining Framework for Multivariate Time Series Classification. In IJCAI.","DOI":"10.24963\/ijcai.2022\/415"},{"key":"e_1_3_2_1_16_1","volume-title":"Markel Sanz Ausin, and Min Chi","author":"Gao Ge","year":"2023","unstructured":"Ge Gao , Song Ju , Markel Sanz Ausin, and Min Chi . 2023 . Hope : Human-centric off-policy evaluation for e-learning and healthcare. In AAMAS. Ge Gao, Song Ju, Markel Sanz Ausin, and Min Chi. 2023. Hope: Human-centric off-policy evaluation for e-learning and healthcare. In AAMAS."},{"key":"e_1_3_2_1_17_1","unstructured":"Qitong Gao Ge Gao Min Chi and Miroslav Pajic. 2023. Variational Latent Branching Model for Off-Policy Evaluation. In ICLR.  Qitong Gao Ge Gao Min Chi and Miroslav Pajic. 2023. Variational Latent Branching Model for Off-Policy Evaluation. In ICLR."},{"key":"e_1_3_2_1_18_1","unstructured":"Qitong Gao Davood Hajinezhad etal 2019. Reduced Variance Deep Reinforcement Learning with Temporal Logic Specifications. In ICCPS. ACM.  Qitong Gao Davood Hajinezhad et al. 2019. Reduced Variance Deep Reinforcement Learning with Temporal Logic Specifications. In ICCPS. ACM."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCPS48487.2020.00018"},{"key":"e_1_3_2_1_20_1","volume-title":"Offline Policy Evaluation for Learning-based Deep Brain Stimulation Controllers. In 2022 ACM\/IEEE 13th International Conference on Cyber-Physical Systems (ICCPS). IEEE, 80--91","author":"Gao Qitong","year":"2022","unstructured":"Qitong Gao , Stephen L Schmidt , 2022 . Offline Policy Evaluation for Learning-based Deep Brain Stimulation Controllers. In 2022 ACM\/IEEE 13th International Conference on Cyber-Physical Systems (ICCPS). IEEE, 80--91 . Qitong Gao, Stephen L Schmidt, et al. 2022. Offline Policy Evaluation for Learning-based Deep Brain Stimulation Controllers. In 2022 ACM\/IEEE 13th International Conference on Cyber-Physical Systems (ICCPS). IEEE, 80--91."},{"key":"e_1_3_2_1_21_1","volume-title":"Gradient Importance Learning for Incomplete Observations. In International Conference on Learning Representations.","author":"Gao Qitong","year":"2022","unstructured":"Qitong Gao , Dong Wang , 2022 . Gradient Importance Learning for Incomplete Observations. In International Conference on Learning Representations. Qitong Gao, Dong Wang, et al. 2022. Gradient Importance Learning for Incomplete Observations. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989385"},{"key":"e_1_3_2_1_23_1","unstructured":"A. Guez R. D. Vincent M. Avoli and J. Pineau. 2008. Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning. In AAAI. 1671--1678.  A. Guez R. D. Vincent M. Avoli and J. Pineau. 2008. Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning. In AAAI. 1671--1678."},{"key":"e_1_3_2_1_24_1","volume-title":"ICML. PMLR","author":"Haarnoja Tuomas","year":"2018","unstructured":"Tuomas Haarnoja , Aurick Zhou , Pieter Abbeel , and Sergey Levine . 2018 . Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor . In ICML. PMLR , 1861--1870. Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In ICML. PMLR, 1861--1870."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/mds.115"},{"key":"e_1_3_2_1_26_1","unstructured":"Geoffrey Hinton Oriol Vinyals Jeff Dean and others. [n.d.]. Distilling the knowledge in a neural network. ([n. d.]).  Geoffrey Hinton Oriol Vinyals Jeff Dean and others. [n.d.]. Distilling the knowledge in a neural network. ([n. d.])."},{"key":"e_1_3_2_1_27_1","volume-title":"Long short-term memory. Neural computation 9, 8","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long short-term memory. Neural computation 9, 8 ( 1997 ), 1735--1780. Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735--1780."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"crossref","unstructured":"S. Ishii W. Yoshida and J. Yoshimoto. 2002. Control of exploitation-exploration meta-parameter in reinforcement learning. Neural networks 15 (2002) 665--687.  S. Ishii W. Yoshida and J. Yoshimoto. 2002. Control of exploitation-exploration meta-parameter in reinforcement learning. Neural networks 15 (2002) 665--687.","DOI":"10.1016\/S0893-6080(02)00056-4"},{"key":"e_1_3_2_1_29_1","unstructured":"Nan Jiang and Lihong Li. 2016. Doubly robust off-policy value evaluation for reinforcement learning. In ICML. PMLR 652--661.  Nan Jiang and Lihong Li. 2016. Doubly robust off-policy value evaluation for reinforcement learning. In ICML. PMLR 652--661."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Ilija Jovanov Michael Naumann etal 2018. Platform for model-based design and testing for deep brain stimulation. In ICCPS.  Ilija Jovanov Michael Naumann et al. 2018. Platform for model-based design and testing for deep brain stimulation. In ICCPS.","DOI":"10.1109\/ICCPS.2018.00033"},{"key":"e_1_3_2_1_31_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_32_1","volume-title":"Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114","author":"Kingma Diederik P","year":"2013","unstructured":"Diederik P Kingma and Max Welling . 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 ( 2013 ). Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)."},{"key":"e_1_3_2_1_33_1","unstructured":"Ilya Kostrikov Ashvin Nair and Sergey Levine. 2022. Offline Reinforcement Learning with Implicit Q-Learning. In ICLR.  Ilya Kostrikov Ashvin Nair and Sergey Levine. 2022. Offline Reinforcement Learning with Implicit Q-Learning. In ICLR."},{"key":"e_1_3_2_1_34_1","first-page":"1956","article-title":"Reduction in subthalamic 8--35 Hz oscillatory activity correlates with clinical improvement in Parkinson's disease. Euro","volume":"23","author":"K\u00fchn A.A.","year":"2006","unstructured":"A.A. K\u00fchn , A. Kupsch , GH. Schneider , and P Brown . 2006 . Reduction in subthalamic 8--35 Hz oscillatory activity correlates with clinical improvement in Parkinson's disease. Euro . J. of Neuroscience 23 , 7 (2006), 1956 -- 1960 . A.A. K\u00fchn, A. Kupsch, GH. Schneider, and P Brown. 2006. Reduction in subthalamic 8--35 Hz oscillatory activity correlates with clinical improvement in Parkinson's disease. Euro. J. of Neuroscience 23, 7 (2006), 1956--1960.","journal-title":"J. of Neuroscience"},{"key":"e_1_3_2_1_35_1","volume-title":"On information and sufficiency. The annals of mathematical statistics 22, 1","author":"Kullback Solomon","year":"1951","unstructured":"Solomon Kullback and Richard A Leibler . 1951. On information and sufficiency. The annals of mathematical statistics 22, 1 ( 1951 ), 79--86. Solomon Kullback and Richard A Leibler. 1951. On information and sufficiency. The annals of mathematical statistics 22, 1 (1951), 79--86."},{"key":"e_1_3_2_1_36_1","unstructured":"Aviral Kumar Aurick Zhou George Tucker and Sergey Levine. 2020. Conservative q-learning for offline reinforcement learning. In NeurIPS.  Aviral Kumar Aurick Zhou George Tucker and Sergey Levine. 2020. Conservative q-learning for offline reinforcement learning. In NeurIPS."},{"key":"e_1_3_2_1_37_1","volume-title":"Selection of stimulus parameters for deep brain stimulation. Clinical neurophysiology 115, 11","author":"Kuncel Alexis M","year":"2004","unstructured":"Alexis M Kuncel and Warren M Grill . 2004. Selection of stimulus parameters for deep brain stimulation. Clinical neurophysiology 115, 11 ( 2004 ), 2431--2441. Alexis M Kuncel and Warren M Grill. 2004. Selection of stimulus parameters for deep brain stimulation. Clinical neurophysiology 115, 11 (2004), 2431--2441."},{"key":"e_1_3_2_1_38_1","first-page":"741","article-title":"Stochastic latent actor-critic: Deep reinforcement learning with a latent variable model","volume":"33","author":"Lee Alex X","year":"2020","unstructured":"Alex X Lee , Anusha Nagabandi , Pieter Abbeel , and Sergey Levine . 2020 . Stochastic latent actor-critic: Deep reinforcement learning with a latent variable model . Advances in Neural Information Processing Systems 33 (2020), 741 -- 752 . Alex X Lee, Anusha Nagabandi, Pieter Abbeel, and Sergey Levine. 2020. Stochastic latent actor-critic: Deep reinforcement learning with a latent variable model. Advances in Neural Information Processing Systems 33 (2020), 741--752.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_39_1","unstructured":"Timothy P Lillicrap Jonathan J Hunt etal 2016. Continuous control with deep reinforcement learning. ICLR (2016).  Timothy P Lillicrap Jonathan J Hunt et al. 2016. Continuous control with deep reinforcement learning. ICLR (2016)."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1002\/ana.23951"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1136\/jnnp-2016-313518"},{"key":"e_1_3_2_1_42_1","unstructured":"Qiang Liu Lihong Li Ziyang Tang and Dengyong Zhou. 2018. Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation. In NeurIPS.  Qiang Liu Lihong Li Ziyang Tang and Dengyong Zhou. 2018. Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation. In NeurIPS."},{"key":"e_1_3_2_1_43_1","volume-title":"On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics","author":"Mann Henry B","year":"1947","unstructured":"Henry B Mann and Donald R Whitney . 1947. On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics ( 1947 ), 50--60. Henry B Mann and Donald R Whitney. 1947. On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics (1947), 50--60."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"crossref","unstructured":"C Marras JC Beck etal 2018. Prevalence of Parkinson's disease across North America. NPJ Parkinson's disease 4 1 (2018) 21.  C Marras JC Beck et al. 2018. Prevalence of Parkinson's disease across North America. NPJ Parkinson's disease 4 1 (2018) 21.","DOI":"10.1038\/s41531-018-0058-0"},{"key":"e_1_3_2_1_45_1","volume-title":"Adria Puigdomenech Badia, et al","author":"Mnih Volodymyr","year":"2016","unstructured":"Volodymyr Mnih , Adria Puigdomenech Badia, et al . 2016 . Asynchronous methods for deep reinforcement learning. In ICML. 1928--1937. Volodymyr Mnih, Adria Puigdomenech Badia, et al. 2016. Asynchronous methods for deep reinforcement learning. In ICML. 1928--1937."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Volodymyr Mnih Koray Kavukcuoglu etal 2015. Human-level control through deep reinforcement learning. Nature 518 7540 (2015) 529.  Volodymyr Mnih Koray Kavukcuoglu et al. 2015. Human-level control through deep reinforcement learning. Nature 518 7540 (2015) 529.","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_1_47_1","volume-title":"Dualdice: Behavior-agnostic estimation of discounted stationary distribution corrections. NeurIPS 32","author":"Nachum Ofir","year":"2019","unstructured":"Ofir Nachum , Yinlam Chow , Bo Dai , and Lihong Li . 2019 . Dualdice: Behavior-agnostic estimation of discounted stationary distribution corrections. NeurIPS 32 (2019). Ofir Nachum, Yinlam Chow, Bo Dai, and Lihong Li. 2019. Dualdice: Behavior-agnostic estimation of discounted stationary distribution corrections. NeurIPS 32 (2019)."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065717500125"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMct1208070"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Enrico Opri Stephanie Cernera etal 2020. Chronic embedded cortico-thalamic closed-loop deep brain stimulation for the treatment of essential tremor. Science translational medicine 12 572 (2020) eaay7680.  Enrico Opri Stephanie Cernera et al. 2020. Chronic embedded cortico-thalamic closed-loop deep brain stimulation for the treatment of essential tremor. Science translational medicine 12 572 (2020) eaay7680.","DOI":"10.1126\/scitranslmed.aay7680"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"crossref","unstructured":"Bahram Parvinian Christopher Scully etal 2018. Regulatory considerations for physiological closed-loop controlled medical devices used for automated critical care: food and drug administration workshop discussion topics. Anesthesia and analgesia 126 6 (2018) 1916.  Bahram Parvinian Christopher Scully et al. 2018. Regulatory considerations for physiological closed-loop controlled medical devices used for automated critical care: food and drug administration workshop discussion topics. Anesthesia and analgesia 126 6 (2018) 1916.","DOI":"10.1213\/ANE.0000000000002329"},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0129065709001987"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Rob Powers Maryam Etezadi-Amoli etal 2021. Smartwatch inertial sensors continuously monitor real-world motor fluctuations in Parkinson's disease. Science translational medicine 13 579 (2021) eabd7865.  Rob Powers Maryam Etezadi-Amoli et al. 2021. Smartwatch inertial sensors continuously monitor real-world motor fluctuations in Parkinson's disease. Science translational medicine 13 579 (2021) eabd7865.","DOI":"10.1126\/scitranslmed.abd7865"},{"key":"e_1_3_2_1_54_1","volume-title":"Eligibility traces for off-policy policy evaluation","author":"Precup Doina","year":"2000","unstructured":"Doina Precup . 2000. Eligibility traces for off-policy policy evaluation . Computer Science Department Faculty Publication Series ( 2000 ), 80. Doina Precup. 2000. Eligibility traces for off-policy policy evaluation. Computer Science Department Faculty Publication Series (2000), 80."},{"key":"e_1_3_2_1_55_1","volume-title":"Anne Margarethe Stiggelbout, and Bob Johannes Van Hilten","author":"Ramaker Claudia","year":"2002","unstructured":"Claudia Ramaker , Johan Marinus , Anne Margarethe Stiggelbout, and Bob Johannes Van Hilten . 2002 . Systematic evaluation of rating scales for impairment and disability in Parkinson's disease. Movement disorders 17, 5 (2002), 867--876. Claudia Ramaker, Johan Marinus, Anne Margarethe Stiggelbout, and Bob Johannes Van Hilten. 2002. Systematic evaluation of rating scales for impairment and disability in Parkinson's disease. Movement disorders 17, 5 (2002), 867--876."},{"key":"e_1_3_2_1_56_1","unstructured":"Andrei A Rusu Sergio G Colmenarejo etal 2016. Policy Distillation. In ICLR.  Andrei A Rusu Sergio G Colmenarejo et al. 2016. Policy Distillation. In ICLR."},{"key":"e_1_3_2_1_57_1","unstructured":"David Silver Guy Lever etal 2014. Deterministic policy gradient algorithms.  David Silver Guy Lever et al. 2014. Deterministic policy gradient algorithms."},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10827-011-0366-4"},{"key":"e_1_3_2_1_59_1","doi-asserted-by":"crossref","unstructured":"Scott Stanslaski Jeffrey Herron etal 2018. A chronically implantable neural coprocessor for investigating the treatment of neurological disorders. IEEE transactions on biomedical circuits and systems 12 6 (2018) 1230--1245.  Scott Stanslaski Jeffrey Herron et al. 2018. A chronically implantable neural coprocessor for investigating the treatment of neurological disorders. IEEE transactions on biomedical circuits and systems 12 6 (2018) 1230--1245.","DOI":"10.1109\/TBCAS.2018.2880148"},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.1128-16.2016"},{"key":"e_1_3_2_1_61_1","unstructured":"Ziyang Tang Yihao Feng etal 2019. Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation. In ICLR.  Ziyang Tang Yihao Feng et al. 2019. Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation. In ICLR."},{"key":"e_1_3_2_1_62_1","unstructured":"Philip Thomas and Emma Brunskill. 2016. Data-efficient off-policy policy evaluation for reinforcement learning. In ICML. PMLR 2139--2148.  Philip Thomas and Emma Brunskill. 2016. Data-efficient off-policy policy evaluation for reinforcement learning. In ICML. PMLR 2139--2148."},{"key":"e_1_3_2_1_63_1","unstructured":"Joshua K Wong G\u00fcnther Deuschl etal 2022. Proc. the 9th Annual Deep Brain Stimulation Think Tank: Advances in Cutting Edge Technologies Artificial Intelligence Neuromodulation Neuroethics Pain Interventional Psychiatry Epilepsy and Traumatic Brain Injury. Frontiers in Human Neuroscience (2022) 25.  Joshua K Wong G\u00fcnther Deuschl et al. 2022. Proc. the 9th Annual Deep Brain Stimulation Think Tank: Advances in Cutting Edge Technologies Artificial Intelligence Neuromodulation Neuroethics Pain Interventional Psychiatry Epilepsy and Traumatic Brain Injury. Frontiers in Human Neuroscience (2022) 25."},{"key":"e_1_3_2_1_64_1","unstructured":"Yuhuai Wu Elman Mansimov etal 2017. Scalable trust-region method for deep reinforcement learning using kronecker-factored approximation. In NeurIPS.  Yuhuai Wu Elman Mansimov et al. 2017. Scalable trust-region method for deep reinforcement learning using kronecker-factored approximation. In NeurIPS."},{"key":"e_1_3_2_1_65_1","volume-title":"NeurIPS","volume":"33","author":"Yang Mengjiao","year":"2020","unstructured":"Mengjiao Yang , Ofir Nachum , 2020 . Off-Policy Evaluation via the Regularized Lagrangian . In NeurIPS , Vol. 33 . Mengjiao Yang, Ofir Nachum, et al. 2020. Off-Policy Evaluation via the Regularized Lagrangian. In NeurIPS, Vol. 33."}],"event":{"name":"ICCPS '23: ACM\/IEEE 14th International Conference on Cyber-Physical Systems (with CPS-IoT Week 2023)","sponsor":["SIGBED ACM Special Interest Group on Embedded Systems","IEEE TCRTS"],"location":"San Antonio TX USA","acronym":"ICCPS '23"},"container-title":["Proceedings of the ACM\/IEEE 14th International Conference on Cyber-Physical Systems (with CPS-IoT Week 2023)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3576841.3585925","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:47:27Z","timestamp":1750178847000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3576841.3585925"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,9]]},"references-count":65,"alternative-id":["10.1145\/3576841.3585925","10.1145\/3576841"],"URL":"https:\/\/doi.org\/10.1145\/3576841.3585925","relation":{},"subject":[],"published":{"date-parts":[[2023,5,9]]},"assertion":[{"value":"2023-05-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}