{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T00:52:57Z","timestamp":1775263977492,"version":"3.50.1"},"reference-count":34,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2018,11,5]],"date-time":"2018-11-05T00:00:00Z","timestamp":1541376000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Natural Science Foundation of Jiangsu Province, China","award":["BK20141004"],"award-info":[{"award-number":["BK20141004"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61601230"],"award-info":[{"award-number":["61601230"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Existing object trackers are mostly based on correlation filtering and neural network frameworks. Correlation filtering is fast but has poor accuracy. Although a neural network can achieve high precision, a large amount of computation increases the tracking time. To address this problem, we utilize a convolutional neural network (CNN) to learn object direction. We propose a target direction classification network based on CNNs that has a directional shortcut to the tracking target, unlike the particle filter that randomly finds the target. Our network uses an end-to-end approach to determine scale variation that has good robustness to scale variation sequences. In the pretraining stage, the Visual Object Tracking Challenges (VOT) dataset is used to train the network for learning positive and negative sample classification and direction classification. In the online tracking stage, the sliding window operation is performed by using the obtained directional information to determine the exact position of the object. The network only calculates a single sample, which guarantees a low computational burden. The positive and negative sample redetection strategies can successfully ensure that the samples are not lost. The one-pass evaluation (OPE) evaluation results of the object tracking benchmark (OTB) demonstrate that the algorithm is very robust and is also faster than several deep trackers.<\/jats:p>","DOI":"10.3390\/a11110178","type":"journal-article","created":{"date-parts":[[2018,11,5]],"date-time":"2018-11-05T10:43:45Z","timestamp":1541414625000},"page":"178","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Deep Directional Network for Object Tracking"],"prefix":"10.3390","volume":"11","author":[{"given":"Zhaohua","family":"Hu","sequence":"first","affiliation":[{"name":"School of Electronic &amp; Information Engineering, Nanjing University of Information Science &amp; Technology, Nanjing 210044, China"},{"name":"Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology, Nanjing University of Information Science &amp; Technology, Nanjing 210044, China"}]},{"given":"Xiaoyi","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Electronic &amp; Information Engineering, Nanjing University of Information Science &amp; Technology, Nanjing 210044, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,5]]},"reference":[{"key":"ref_1","unstructured":"Wang, N., and Yeung, D.Y. (2013). Learning a deep compact image representation for visual tracking. International Conference on Neural Information Processing Systems, Curran Associates Inc."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1958","DOI":"10.1109\/TPAMI.2008.128","article-title":"80 million tiny images: A large data set for nonparametric object and scene recognition","volume":"30","author":"Torralba","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Kristan, M., Pflugfelder, R., Leonardis, A., Matas, J., Porikli, F., Cehovin, L., Nebehay, G., Fernandez, G., Vojir, T., and Gatt, A. (2013, January 2\u20138). The visual object tracking vot2013 challenge results. Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops (ICCVW), Sydney, Australia.","DOI":"10.1109\/ICCVW.2013.20"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Wu, Y., Lim, J., and Yang, M.H. (2013, January 23\u201328). Online object tracking: A. benchmark. Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.312"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Nam, H., and Han, B. (July, January 26). Learning multi-domain convolutional neural networks for visual tracking. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.465"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Shahbaz Khan, F., Felsberg, M., and Van de Weijer, J. (2014, January 24\u201327). Adaptive color attributes for real-time visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.143"},{"key":"ref_7","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the CVPR 2005 IEEE Computer Society Conference on Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Robinson, A., Khan, F.S., and Felsberg, M. (2016). Beyond correlation filters: Learning continuous convolution operators for visual tracking. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46454-1_29"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Valmadre, J., Bertinetto, L., Henriques, J., Vedaldi, A., and Torr, P.H. (2017, January 21\u201326). End-to-end representation learning for correlation filter based tracking. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.531"},{"key":"ref_10","unstructured":"Wang, Q., Gao, J., Xing, J., Zhang, M., and Hu, W. (arXiv, 2017). DCFNet: Discriminant Correlation Filters Network for Visual Tracking, arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2015, January 7\u201312). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_12","unstructured":"David, H., Thrun, S., and Savarese, S. (2016). Learning to track at 100 fps with deep regression networks. European Conference on Computer Vision, Springer."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Danelljan, M., H\u00e4ger, G., Khan, F., and Felsberg, M. (2014, January 1\u20135). Accurate scale estimation for robust visual tracking. Proceedings of the British Machine Vision Conference, Nottingham, UK.","DOI":"10.5244\/C.28.65"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Babenko, B., Yang, M., and Belongie, S. (2009, January 20\u201325). Visual tracking with online multiple instance learning. Proceedings of the CVPR 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPRW.2009.5206737"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"2002","DOI":"10.1109\/TPAMI.2014.2315808","article-title":"Fast compressive tracking","volume":"36","author":"Zhang","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1007\/s11263-007-0075-7","article-title":"Incremental learning for robust visual tracking","volume":"77","author":"Ross","year":"2008","journal-title":"Int. J. Comput. Vis."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhang, T., Bibi, A., and Ghanem, B. (2016, January 27\u201330). In defense of sparse tracking: Circulant sparse tracker. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.421"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2259","DOI":"10.1109\/TPAMI.2011.66","article-title":"Robust visual tracking and vehicle classification via sparse representation","volume":"33","author":"Mei","year":"2011","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1409","DOI":"10.1109\/TPAMI.2011.239","article-title":"Tracking-learning-detection","volume":"34","author":"Kalal","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1109\/TPAMI.2014.2345390","article-title":"High-speed tracking with kernelized correlation filters","volume":"37","author":"Henriques","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, N., Shi, J., Yeung, D.Y., and Jia, J. (2015, January 7\u201313). Understanding and diagnosing visual tracking systems. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.355"},{"key":"ref_22","unstructured":"Wang, N., Li, S., Gupta, A., and Yeung, D.Y. (arXiv, 2015). Transferring rich feature hierarchies for robust visual tracking, arXiv."},{"key":"ref_23","unstructured":"Nam, H., Baek, M., and Han, B. (arXiv, 2016). Modeling and propagating cnns in a tree structure for visual tracking, arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Bhat, G., Khan, F.S., and Felsberg, M. (2017, January 21\u201326). ECO: Efficient convolution operators for tracking. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.733"},{"key":"ref_25","unstructured":"Kristan, M., and Matas, J. (2015, January 7\u201313). The visual object tracking vot2015 challenge results. Proceedings of the Visual Object Tracking Workshop 2015 at ICCV2015, Santiago, Chile."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Chatfield, K., Simonyan, K., Vedaldi, A., and Zisserman, A. (arXiv, 2014). Return of the devil in the details: Delving deep into convolutional nets, arXiv.","DOI":"10.5244\/C.28.6"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Vedaldi, A., and Lenc, K. (2015, January 26\u201330). MatConvnet: Convolutional Neural Networks for MATLAB. Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia.","DOI":"10.1145\/2733373.2807412"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1779","DOI":"10.1109\/TED.2016.2529656","article-title":"Robust visual tracking via convolutional networks without training","volume":"25","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Image Process."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Ma, C., Huang, J.B., Yang, X., and Yang, M.H. (2015, January 7\u201313). Hierarchical convolutional features for visual tracking. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.352"},{"key":"ref_31","unstructured":"Li, Y., and Zhu, J. (2014). A scale adaptive kernel correlation filter tracker with feature integration. European Conference on Computer Vision, Springer."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Danelljan, M., Hager, G., Shahbaz Khan, F., and Felsberg, M. (2015, January 7\u201313). Convolutional features for correlation filter based visual tracking. Proceedings of the IEEE International Conference on Computer Vision Workshops, Washington, DC, USA.","DOI":"10.1109\/ICCVW.2015.84"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Song, Y., Ma, C., Gong, L., Zhang, J., Lau, R.W., and Yang, M.H. (2017, January 22\u201329). Crest: Convolutional residual learning for visual tracking. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.279"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Bertinetto, L., Valmadre, J., Golodetz, S., Miksik, O., and Torr, P.H. (2016, January 27\u201330). Staple: Complementary learners for real-time tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.156"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/11\/178\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T23:43:46Z","timestamp":1775259826000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/11\/11\/178"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,5]]},"references-count":34,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2018,11]]}},"alternative-id":["a11110178"],"URL":"https:\/\/doi.org\/10.3390\/a11110178","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,5]]}}}