{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,28]],"date-time":"2025-05-28T21:34:18Z","timestamp":1748468058269},"reference-count":47,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01n02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2005,2]]},"abstract":"<jats:p>The Intensive Care Unit (ICU) is a challenging environment to both patient and caregiver. Continued shortages in staffing increase risk to patients. To evaluate the use of intelligent systems in the improvement of patient care, an intelligent agent was developed to regulate ICU patient sedation. A temporal differencing form of reinforcement learning was used to train the agent in the administration of intravenous propofol in simulated ICU patients. The agent utilized a well-studied pharmacokinetic model to calculate the distribution of drug within the patient. Pharmacodynamics were then estimated for the drug effect. A derivative of the electroencephalograms, the bispectral index, served as the system control variable. The agent demonstrated satisfactory control of the simulated patient's consciousness level in static and dynamic setpoint conditions. The agent demonstrated superior stability and responsiveness when compared to a well-tuned PID controller, the control method of choice in closed-loop sedation control literature.<\/jats:p>","DOI":"10.1142\/s021821300500203x","type":"journal-article","created":{"date-parts":[[2005,5,12]],"date-time":"2005-05-12T11:59:43Z","timestamp":1115899183000},"page":"137-156","source":"Crossref","is-referenced-by-count":8,"title":["SEDATION OF SIMULATED ICU PATIENTS USING REINFORCEMENT LEARNING BASED CONTROL"],"prefix":"10.1142","volume":"14","author":[{"given":"ERIC D.","family":"SINZINGER","sequence":"first","affiliation":[{"name":"Department of Computer Science, Texas Tech University, Lubbock TX 79409-3104, USA"}]},{"given":"BRETT","family":"MOORE","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Texas Tech University, Lubbock TX 79409-3104, USA"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1097\/00000542-200201000-00017"},{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1001\/jama.288.16.1987"},{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1046\/j.1365-2346.1999.00557.x"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1109\/10.81576"},{"key":"rf6","first-page":"248","volume":"68","author":"Ball","journal-title":"Minerva Anestesiol."},{"key":"rf7","doi-asserted-by":"crossref","first-page":"877","DOI":"10.1097\/00000539-200104000-00015","volume":"92","author":"Bannister","journal-title":"Anesth. Analg."},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1097\/00000542-200108000-00011"},{"key":"rf9","first-page":"155","volume":"17","author":"Bellman","journal-title":"J. Math. Comput."},{"key":"rf10","doi-asserted-by":"publisher","DOI":"10.1097\/00000539-200203000-00006"},{"key":"rf11","unstructured":"\u00a0Boyan and \u00a0Moore, Advances in Neural Information Processing Systems 7 (The MIT Press, 1995)\u00a0pp. 369\u2013376."},{"key":"rf12","first-page":"304","volume":"2","author":"Cybenko G.","journal-title":"Mathematics of Control Signals and Systems"},{"key":"rf13","unstructured":"\u00a0Davies, Advances in Neural Information Processing Systems\u00a09 (The MIT Press, 1997)\u00a0pp. 1005\u20131011."},{"key":"rf14","first-page":"341","volume":"8","author":"Dayan","journal-title":"Machine Learning"},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1007\/s001340050765"},{"key":"rf16","doi-asserted-by":"publisher","DOI":"10.1093\/bja\/78.2.180"},{"key":"rf17","first-page":"128","volume":"98","author":"Gurses E.","journal-title":"Anesthesia and Analgesia"},{"key":"rf18","doi-asserted-by":"publisher","DOI":"10.2165\/00003495-199550040-00006"},{"key":"rf20","unstructured":"\u00a0Gullapalli, Advances in Neural Information Processing Systems\u00a05 (Morgan Kaufmann, San Mateo, CA, 1993)\u00a0pp. 327\u2013334."},{"key":"rf21","doi-asserted-by":"publisher","DOI":"10.1007\/BF02353793"},{"key":"rf22","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1177\/0310057X9902700306","volume":"27","author":"Hunt-Smith","journal-title":"Anaesth. Intensive Care"},{"key":"rf23","doi-asserted-by":"publisher","DOI":"10.1093\/bja\/83.2.223"},{"key":"rf24","doi-asserted-by":"crossref","first-page":"1210","DOI":"10.1097\/00000539-200105000-00024","volume":"92","author":"Kerssens","journal-title":"Anesth. Analg."},{"key":"rf25","first-page":"953","volume":"114","author":"Kolmogorov A. N.","journal-title":"Doklady Akademiia Nauk SSSR"},{"key":"rf26","doi-asserted-by":"crossref","first-page":"507","DOI":"10.4037\/ajcc1999.8.1.507","volume":"8","author":"Kowalski","journal-title":"Am. J. Crit. Care"},{"key":"rf27","doi-asserted-by":"publisher","DOI":"10.1056\/NEJM200005183422002"},{"key":"rf28","first-page":"690","volume":"57","author":"Leslie","journal-title":"Anaesthesia"},{"key":"rf30","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1213\/00000539-199701000-00033","volume":"84","author":"Liu","journal-title":"Anesth. Analg."},{"key":"rf31","doi-asserted-by":"publisher","DOI":"10.1093\/bja\/67.1.41"},{"key":"rf32","doi-asserted-by":"publisher","DOI":"10.1023\/A:1021250320103"},{"key":"rf33","doi-asserted-by":"publisher","DOI":"10.1046\/j.1365-2044.1998.00467.x"},{"key":"rf34","first-page":"1348","author":"Munos","journal-title":"IJCAI"},{"key":"rf35","first-page":"121","volume":"2","author":"Norrie","journal-title":"Nurs. Crit. Care"},{"key":"rf36","first-page":"748","author":"Papavassiliou","journal-title":"IJCAI"},{"key":"rf37","doi-asserted-by":"publisher","DOI":"10.1097\/00003246-199502000-00014"},{"key":"rf38","volume-title":"Artificial intelligence","author":"Russell","year":"1995"},{"key":"rf39","doi-asserted-by":"publisher","DOI":"10.1034\/j.1399-6576.2000.440819.x"},{"key":"rf40","doi-asserted-by":"publisher","DOI":"10.1053\/joms.2001.23366"},{"key":"rf41","doi-asserted-by":"publisher","DOI":"10.1016\/S0278-2391(00)90911-X"},{"key":"rf42","doi-asserted-by":"publisher","DOI":"10.1097\/00000542-199906000-00003"},{"key":"rf43","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2044.1998.53s107.x"},{"key":"rf45","doi-asserted-by":"publisher","DOI":"10.1007\/BF01618421"},{"key":"rf46","doi-asserted-by":"publisher","DOI":"10.1097\/00000542-200002000-00021"},{"key":"rf47","doi-asserted-by":"publisher","DOI":"10.1097\/00000542-200107000-00007"},{"key":"rf48","volume-title":"Reinforcement learning: An introduction","author":"Sutton","year":"1998"},{"key":"rf49","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2044.1998.53s104.x"},{"key":"rf50","doi-asserted-by":"publisher","DOI":"10.1016\/1053-0770(93)90009-A"},{"key":"rf53","first-page":"1275","volume":"81","author":"Vuyk","journal-title":"Anesth. Analg."}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S021821300500203X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,3]],"date-time":"2023-05-03T06:21:28Z","timestamp":1683094888000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S021821300500203X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,2]]},"references-count":47,"journal-issue":{"issue":"01n02","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2005,2]]}},"alternative-id":["10.1142\/S021821300500203X"],"URL":"https:\/\/doi.org\/10.1142\/s021821300500203x","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,2]]}}}