{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T09:54:11Z","timestamp":1777715651166,"version":"3.51.4"},"reference-count":39,"publisher":"SAGE Publications","issue":"12","license":[{"start":{"date-parts":[[1999,12,1]],"date-time":"1999-12-01T00:00:00Z","timestamp":944006400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["The International Journal of Robotics Research"],"published-print":{"date-parts":[[1999,12]]},"abstract":"<jats:p>This paper describes what I have done and learned in robot-vision research, and speculates what is necessary and promising in the future. In early artificial intelligence research, it was proved in the context of recognition of polyhedra that a hierarchical method in which a higher process assumes an ideal output of lower processes has limitations. A similar approach has been effective for vision processes that use the result of color segmentation, stereo vision, optical flow, or correspondence between multiple images. It is often more effective for recognition of difficult scenes to increase available input data than to try to make a clever procedure. There was an attempt to approach more flexible vision systems like human vision, which include reliable stereo vision and integration of multiple visual cues. 
Now a tightly coupled perception-action paradigm is an important issue, where significant research themes are person tracking and recognition, flexible real-time vision processors, and planning of perception-action considering the planning cost.<\/jats:p>","DOI":"10.1177\/02783649922067799","type":"journal-article","created":{"date-parts":[[2003,7,19]],"date-time":"2003-07-19T02:59:46Z","timestamp":1058583586000},"page":"1185-1200","source":"Crossref","is-referenced-by-count":0,"title":["Robot Vision Research: Past and Future Roles"],"prefix":"10.1177","volume":"18","author":[{"given":"Yoshiaki","family":"Shirai","sequence":"first","affiliation":[{"name":"Department of Computer-Controlled Mechanical Systems, Osaka University, Suita, Osaka 565-0871, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[1999,12,1]]},"reference":[{"key":"atypb1","doi-asserted-by":"crossref","unstructured":"Barnard, S. T., and Thompson, W. B. 1980. Disparity analysis of images . IEEE Trans. Pattern Analysis Machine Intell. 2(4): 333\u2013340 .","DOI":"10.1109\/TPAMI.1980.4767032"},{"key":"atypb2","doi-asserted-by":"publisher","DOI":"10.1109\/5.381842"},{"key":"atypb3","unstructured":"Dean, T., and Boddy, M. 1988. An analysis of time-dependent planning. Proc. of AAAI-88. Cambridge, MA: AAAI Press , pp. 49\u201354."},{"key":"atypb4","doi-asserted-by":"publisher","DOI":"10.1016\/0146-664X(73)90011-7"},{"key":"atypb5","unstructured":"Feldman, J. A., Feldman, G. M., Falk, G., Grape, G., Pearlman, J., Sobel, I., and Tenenbaum, J. M. 1969. The Stanford Hand-Eye project. Proc. of the 1st IJCAI, Washington, DC , pp. 521\u2013526."},{"key":"atypb6","unstructured":"Guzman, A. 1968. Decomposition of a visual scene into three-dimensional bodies . Proc. of the AFIPS Fall Joint Comp. Conf., vol. 33, pp. 291\u2013304 ."},{"key":"atypb7","doi-asserted-by":"crossref","unstructured":"Hild, M., and Shirai, Y. 1993 (Berlin). 
Interpretation of natural scenes using multi-parameter default models and qualitative constraints . Proc. ICCV\u201993, pp. 497\u2013501 .","DOI":"10.1109\/ICCV.1993.378173"},{"key":"atypb8","doi-asserted-by":"crossref","unstructured":"Hirata, S., Shirai, Y., and Asada, M. 1992 (Raleigh, NC). Scene interpretation using 3-D information extracted from monocular color images . Proc. IROS, pp. 1603\u20131610 .","DOI":"10.1109\/IROS.1992.594232"},{"key":"atypb9","doi-asserted-by":"crossref","unstructured":"Iketani, A., Nagai, A., Kuno, Y., and Shirai, Y. 1998 (Brisbane). Detecting persons on changing background . Proc. of the 14th ICPR, pp. 74\u201376 .","DOI":"10.1109\/ICPR.1998.711083"},{"key":"atypb10","doi-asserted-by":"crossref","unstructured":"Jo, K., Kuno, Y., and Shirai, Y. 1998 (Hong Kong). Context-based recognition of manipulative hand gestures for human computer interaction . Proc. ACCV, pp. 368\u2013375 .","DOI":"10.1007\/3-540-63931-4_238"},{"key":"atypb11","doi-asserted-by":"crossref","unstructured":"Konolige, K. 1998. Small vision systems: Hardware and implementation. In Shirai, Y., and Hirose, S. (eds.) Robotics Research. New York: Springer-Verlag , pp. 203\u2013212.","DOI":"10.1007\/978-1-4471-1580-9_19"},{"key":"atypb12","unstructured":"Marr, D. 1982. Vision. San Francisco, CA: Freeman ."},{"key":"atypb13","doi-asserted-by":"publisher","DOI":"10.1177\/027836499701600606"},{"key":"atypb14","unstructured":"Miura J., and Shirai, Y. 1997b (Nagoya, Japan). Vision-motion planning for a mobile robot considering vision uncertainty and planning cost . Proc. IJCAI-97, pp. 1194\u20131200 ."},{"key":"atypb15","doi-asserted-by":"crossref","unstructured":"Nakayama, O., Shirai, Y., and Asada, M. 1992. Multistage stereo method giving priority to reliable matching. Proc. of the IEEE Intl. Conf. on Robot. and Automat. Los Alamitos, CA: IEEE , pp. 1753\u20131758.","DOI":"10.1109\/ROBOT.1992.220126"},{"key":"atypb16","unstructured":"Nishimoto, Y. and Shirai, Y. 
1985 (Los Angeles, CA). A parallel matching algorithm for stereo vision . Proc. 9th IJCAI, pp. 977\u2013980 ."},{"key":"atypb17","doi-asserted-by":"crossref","unstructured":"Okada, R., Shirai, Y., and Miura, J. 1996. Object tracking based on optical flow and disparity. Proc. of the IEEE\/SICE\/RSJ Intl. Conf. on Multisensor Fusion and Integration for Intell. Sys. Washington, DC: IEEE , pp. 565\u2013571.","DOI":"10.1109\/MFI.1996.572231"},{"key":"atypb18","unstructured":"Okamoto, A., Shirai, Y., and Asada, M. 1993. Integration of color and range data for three-dimensional scene description . Trans. IEICE Japan E76-D(4): 501\u2013506 ."},{"key":"atypb19","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(79)90024-4"},{"key":"atypb20","unstructured":"Oshima, M., and Shirai, Y. 1981 (Vancouver, Canada). Object recognition using three-dimensional information . Proc. of the 7th IJCAI, pp. 601\u2013606 ."},{"key":"atypb21","doi-asserted-by":"publisher","DOI":"10.1016\/S0262-8856(97)00070-X"},{"key":"atypb22","unstructured":"Roberts, L. G. 1963. Machine perception of three-dimensional solids. In Tippett, J. T. et al. (eds.) Optical and Electro-Optical Information Processing. Cambridge, MA: MIT Press , pp. 159\u2013197."},{"key":"atypb23","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(73)90002-7"},{"key":"atypb24","unstructured":"Shirai, Y. 1975 (Tbilisi, USSR). Edge finding, segmentation of edges and recognition of complex objects . Proc. of the 4th IJCAI, pp. 674\u2013681 ."},{"key":"atypb25","unstructured":"Shirai, Y. 1984. An approach to object recognition using 3-D solid models . Intl. J. Robot. Res. 465\u2013474 ."},{"key":"atypb26","unstructured":"Shirai, Y. 1989 (Oulu, Finland). Robot vision: Range data acquisition and utilization . Proc. of the 6th Scandinavian Conf. on Image Analysis, pp. 15\u201323 ."},{"key":"atypb27","doi-asserted-by":"crossref","unstructured":"Shirai, Y. 1992 (The Hague, The Netherlands). 3-D computer vision and applications . Proc. 
of the ICPR, pp. 236\u2013245 .","DOI":"10.1109\/ICPR.1992.201549"},{"key":"atypb28","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(73)90015-0"},{"key":"atypb29","unstructured":"Shirai, Y., Inoue, H., Inaba, M., Terada, M., and Tateyama, Y. 1999 (Tokyo). Robot as friendly artifact\u2014human recognition and interaction . Proc. of the Intl. Conf. on Adv. Robot."},{"key":"atypb30","unstructured":"Shirai, Y., and Suwa, M. 1971 (London). Recognition of polyhedrons with a range finder . Proc. of the 2nd IJCAI, pp. 80\u201387 ."},{"key":"atypb31","unstructured":"Shirai, Y., and Tsuji, S. 1971. Extraction of the line drawing of 3-dimensional objects by sequential illumination from several directions . Proc. of the 2nd IJCAI, pp. 71\u201389 ."},{"key":"atypb32","doi-asserted-by":"crossref","unstructured":"Takizawa, H., Shirai, Y., Kuno, Y., and Miura, J. 1996 (Munich, Germany). Recognition of intersection scene by attentive observation for a mobile robot . Proc. of the IEEE\/RSJ IROS. Washington, DC: IEEE, pp. 1648\u20131654 .","DOI":"10.1109\/IROS.1996.569033"},{"key":"atypb33","doi-asserted-by":"crossref","unstructured":"Takizawa, H., Shirai, Y., Miura, J., and Kuno, Y. 1998 (Victoria, Canada). Planning of observation and motion for interpretation of road intersection scenes considering uncertainty . Proc. of IROS, pp. 520\u2013525 .","DOI":"10.1109\/IROS.1998.724671"},{"key":"atypb34","doi-asserted-by":"crossref","unstructured":"Taniguchi, Y., and Shirai, Y. 1998 (Hong Kong). Evidence-based scene interpretation considering subjective certainty of recognition . Proc. of ACCV, pp. 432\u2013439 .","DOI":"10.1007\/3-540-63931-4_246"},{"key":"atypb35","doi-asserted-by":"crossref","unstructured":"Taniguchi, Y., Shirai, Y., and Asada, M. 1994. Scene interpretation by fusing intermediate results of multiple visual sensory information processing. Proc. of the IEEE Intl. Conf. on Multisensor Fusion and Integration for Intell. Sys. Washington, DC: IEEE , pp. 
699\u2013706.","DOI":"10.1109\/MFI.1994.398386"},{"key":"atypb36","unstructured":"Uemachi, S., Ayaki, Y., Ishibashi, T., and Shirai, Y. 1998 (Takamatsu, Japan). Trinocular vision system using local disparity histogram for detection of heavy machinery approaching to power transmission lines . Proc. of the Intl. Conf. on Quality Control by Art. Vision, pp. 397\u2013402 ."},{"key":"atypb37","doi-asserted-by":"crossref","unstructured":"Winston, P. H. 1972. The MIT robot . Machine Intell. 7: 431\u2013463 .","DOI":"10.1016\/0094-114X(72)90065-1"},{"key":"atypb38","doi-asserted-by":"crossref","unstructured":"Yamamoto, S., Mae, Y., Shirai, Y., and Miura, J. 1995 (Nagoya). Real-time multiple object tracking based on optical flows . Proc. of the IEEE Intl. Conf. on Robot. and Automat. Washington, DC: IEEE, pp. 2328\u20132333 .","DOI":"10.1109\/ROBOT.1995.525608"},{"key":"atypb39","doi-asserted-by":"crossref","unstructured":"Yamane, T., Shirai, Y., and Miura, J. 1998 (Leuven, Belgium). Person tracking by integrating optical flow and uniform brightness regions . Proc. of the IEEE Intl. Conf. on Robot. and Automat. Washington, DC: IEEE, pp. 
3267\u20133272 .","DOI":"10.1109\/ROBOT.1998.680942"}],"container-title":["The International Journal of Robotics Research"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/02783649922067799","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/02783649922067799","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T10:15:47Z","timestamp":1777457747000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/02783649922067799"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1999,12]]},"references-count":39,"journal-issue":{"issue":"12","published-print":{"date-parts":[[1999,12]]}},"alternative-id":["10.1177\/02783649922067799"],"URL":"https:\/\/doi.org\/10.1177\/02783649922067799","relation":{},"ISSN":["0278-3649","1741-3176"],"issn-type":[{"value":"0278-3649","type":"print"},{"value":"1741-3176","type":"electronic"}],"subject":[],"published":{"date-parts":[[1999,12]]}}}