{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T03:47:16Z","timestamp":1760240836392,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2019,10,11]],"date-time":"2019-10-11T00:00:00Z","timestamp":1570752000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>In this paper, we propose an open-set object detection framework based on a dynamic hierarchical structure with incremental learning capabilities for unseen object classes. We were motivated by the observation that deep features extracted from visual objects show a strong hierarchical clustering property. The hierarchical feature model (HFM) was used to learn a new object class by using collaborative sampling (CS), and open-set-aware active semi-supervised learning (ASSL) algorithms. We divided object proposals into superclasses by using the agglomerative clustering algorithm. Data samples in each superclass node were classified into multiple augmented class nodes instead of directly associating with regular object classes. One or more augmented class nodes are related to a regular object class, and each augmented class has only one superclass. Object proposals from inexperienced data distribution are assigned to an augmented class node. Dynamic HFM nodes in the decision path are assembled to constitute an ensemble prediction, and the new augmented object is associated with a new regular object class. Our experimental results showed that the proposed method uses standard benchmark datasets such as PASCAL VOC, MS COCO, ILSVRC DET, and local datasets to perform better than state-of-the-art techniques.<\/jats:p>","DOI":"10.3390\/sym11101271","type":"journal-article","created":{"date-parts":[[2019,10,11]],"date-time":"2019-10-11T10:53:03Z","timestamp":1570791183000},"page":"1271","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Hierarchical Open-Set Object Detection in Unseen Data"],"prefix":"10.3390","volume":"11","author":[{"given":"Yeong Hyeon","family":"Kim","sequence":"first","affiliation":[{"name":"Intelligence Technology Lab, Inha University, 100 Inha-Ro, Nam Gu, Incheon 22212, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dong Kyun","family":"Shin","sequence":"additional","affiliation":[{"name":"KT R&amp;D Center, 151 Taebong-ro, Seocho-gu, Seoul 06763, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Minhaz Uddin","family":"Ahmed","sequence":"additional","affiliation":[{"name":"Intelligence Technology Lab, Inha University, 100 Inha-Ro, Nam Gu, Incheon 22212, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Phill Kyu","family":"Rhee","sequence":"additional","affiliation":[{"name":"Intelligence Technology Lab, Inha University, 100 Inha-Ro, Nam Gu, Incheon 22212, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2019,10,11]]},"reference":[{"key":"ref_1","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Bendale, A., and Boult, T. (2015, January 7\u201312). Towards Open Set Deep Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2016.173"},{"key":"ref_3","unstructured":"Geng, C., and Chen, S. (2018). Hierarchical Dirichlet Process-based Open Set Recognition. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1109\/TPAMI.2012.256","article-title":"Toward Open Set Recognition","volume":"35","author":"Scheirer","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","unstructured":"Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., and LeCun, Y. (2013). OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks. arXiv."},{"key":"ref_6","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Advances in Neural Information Processing Systems, Nips."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_8","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_10","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Dollar, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1109\/TIP.2017.2667405","article-title":"HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition","volume":"26","author":"Fan","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2665","DOI":"10.1109\/TKDE.2016.2581161","article-title":"ML-FOREST: A multi-label tree ensemble method for multi-label classification","volume":"28","author":"Wu","year":"2016","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1690","DOI":"10.1109\/TPAMI.2016.2613924","article-title":"Sparse representation-based open set recognition","volume":"39","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1109\/TPAMI.2017.2707495","article-title":"The Extreme Value Machine","volume":"40","author":"Rudd","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/j.cogsys.2017.05.006","article-title":"Active and semi-supervised learning for object detection with imperfect data","volume":"45","author":"Rhee","year":"2017","journal-title":"Cognit. Syst. Res."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (VOC) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., and Zitnick, C.L. (2014). Microsoft COCO: Common objects in context. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8693 LNCS(PART 5). European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"ImageNet Large Scale Visual Recognition Challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1177\/0037549717709932","article-title":"Object recognition and detection with deep learning for autonomous driving applications","volume":"93","author":"Demir","year":"2017","journal-title":"Simulation"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","article-title":"Representation Learning: A Review and New Perspectives. Pattern Analysis and Machine Intelligence","volume":"35","author":"Bengio","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_22","first-page":"1","article-title":"Object Detection with Discriminatively Trained Part Based Models","volume":"32","author":"Felzenszwalb","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Makantasis, K., Doulamis, A., Doulamis, N., and Psychas, K. (2016, January 25\u201328). Deep learning based human behavior recognition in industrial workflows. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA.","DOI":"10.1109\/ICIP.2016.7532630"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Cao, Z., Simon, T., Wei, S.E., and Sheikh, Y. (2017, January 21\u201326). Realtime multi-person 2D pose estimation using part affinity fields. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.143"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Han, J., Zhang, D., Cheng, G., Liu, N., and Xu, D. (2018). Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection: A Survey. IEEE Signal Process. Mag.","DOI":"10.1109\/MSP.2017.2749125"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1639","DOI":"10.1109\/TIP.2017.2781424","article-title":"Robust Object Co-Segmentation Using Background Prior","volume":"27","author":"Han","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1909","DOI":"10.1109\/TMM.2015.2477242","article-title":"A Continuous Learning Framework for Activity Recognition Using Deep Hybrid Feature Models","volume":"17","author":"Hasan","year":"2015","journal-title":"IEEE Trans. Multimed."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Tang, Y., Wang, J., Gao, B., Dellandrea, E., Gaizauskas, R., and Chen, L. (2016, January 27\u201330). Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.233"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Nguyen, A., Yosinski, J., and Clune, J. (2015, January 7\u201312). Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298640"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Moosavi-Dezfooli, S.M., Fawzi, A., Fawzi, O., and Frossard, P. (, January 21\u201326January). Universal adversarial perturbations. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.17"},{"key":"ref_31","unstructured":"Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., and Fergus, R. (2013). Intriguing properties of neural networks. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Yang, J., Guan, Y., and Dong, X. (2014, January 28\u201330). Lymphocyte-style word representations. Proceedings of the 2014 IEEE International Conference on Information and Automation (ICIA 2014), Hailar, Chian.","DOI":"10.1109\/ICInfA.2014.6932783"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1109\/TPAMI.2012.59","article-title":"3D convolutional neural networks for human action recognition","volume":"35","author":"Ji","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"41","DOI":"10.5121\/ijdkp.2014.4104","article-title":"ENHANCING THE LABELLING TECHNIQUE OF SUFFIX TREE CLUSTERING ALGORITHM","volume":"4","author":"Mahalakshmi","year":"2014","journal-title":"Int. J. Data Min. Knowl. Manag. Process"},{"key":"ref_35","unstructured":"Muslea, I., Minton, S.N., and Knoblock, C.A. (2003, January 9\u201315). Active learning with strong and weak views: A case study on wrapper induction. Proceedings of the IJCAI International Joint Conference on Artificial Intelligence, Acapulco, Mexico."},{"key":"ref_36","first-page":"201","article-title":"Active Learning Literature Survey","volume":"15","author":"Settles","year":"2010","journal-title":"Mach. Learn."},{"key":"ref_37","unstructured":"Gordon, J., and Hern\u00e1ndez-Lobato, J.M. (2017). Bayesian Semisupervised Learning with Deep Generative Models. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1499","DOI":"10.1109\/LSP.2016.2603342","article-title":"Joint Face Detection and Alignment using Multitask Cascaded Convolutional Networks","volume":"23","author":"Zhang","year":"2016","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_39","first-page":"10","article-title":"Semi-Supervised Learning Literature Survey Contents","volume":"10","author":"Zhu","year":"2008","journal-title":"Sciences"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"2384","DOI":"10.1109\/TGRS.2014.2359933","article-title":"Collaborative Active and Semisupervised Learning for Hyperspectral Remote Sensing Image Classification. Geoscience and Remote Sensing","volume":"53","author":"Wan","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Sorokin, A., and Forsyth, D. (2008, January 23\u201328). Utility data annotation with Amazon Mechanical Turk. Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), Anchorage, AK, USA.","DOI":"10.1109\/CVPRW.2008.4562953"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"4163","DOI":"10.1109\/TSP.2012.2196696","article-title":"Robust clustering using outlier-sparsity regularization","volume":"60","author":"Forero","year":"2012","journal-title":"IEEE Trans. Signal Process."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Settles, B., and Craven, M. (2008, January 25\u201327). An analysis of active learning strategies for sequence labeling tasks. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP \u201908), Honolulu, Hawaii.","DOI":"10.3115\/1613715.1613855"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"2169","DOI":"10.1109\/ACCESS.2018.2875720","article-title":"Incremental Deep Learning for Robust Object Detection in Unknown Cluttered Environments","volume":"6","author":"Shin","year":"2018","journal-title":"IEEE Access"},{"key":"ref_45","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., and Zheng, X. (2016). TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/10\/1271\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:25:06Z","timestamp":1760189106000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/11\/10\/1271"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,11]]},"references-count":45,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2019,10]]}},"alternative-id":["sym11101271"],"URL":"https:\/\/doi.org\/10.3390\/sym11101271","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2019,10,11]]}}}