{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:24:30Z","timestamp":1777656270334,"version":"3.51.4"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T00:00:00Z","timestamp":1641168000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T00:00:00Z","timestamp":1641168000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41971424"],"award-info":[{"award-number":["41971424"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003392","name":"Natural Science Foundation of Fujian Province","doi-asserted-by":"publisher","award":["2020J01701"],"award-info":[{"award-number":["2020J01701"]}],"id":[{"id":"10.13039\/501100003392","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005270","name":"Fujian Provincial Department of Science and Technology","doi-asserted-by":"publisher","award":["JAT190318"],"award-info":[{"award-number":["JAT190318"]}],"id":[{"id":"10.13039\/501100005270","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Scientific Research Foundation of Jimei University","award":["ZP2022008"],"award-info":[{"award-number":["ZP2022008"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Intell Syst"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Recent advances in camera-equipped drone applications increased the demand for visual object detection algorithms with deep 
learning for aerial images. A single deep learning model is limited in the accuracy it can reach. Inspired by the observation that ensemble learning can significantly improve the generalization ability of a model in the machine learning field, we introduce a novel integration strategy that combines the inference results of two different methods without non-maximum suppression. In this paper, a global and local ensemble network (GLE-Net) is proposed to increase the quality of predictions by considering global weights for the different models and adjusting local weights for the bounding boxes. Specifically, the global module assigns different weights to the models. In the local module, we group the bounding boxes that correspond to the same object into a cluster. Each cluster generates a final predicted box, which is assigned the highest score in the cluster. Experiments on the VisDrone2019 benchmark show promising performance of GLE-Net compared with the baseline network.<\/jats:p>","DOI":"10.1007\/s44196-021-00056-3","type":"journal-article","created":{"date-parts":[[2022,1,3]],"date-time":"2022-01-03T13:03:22Z","timestamp":1641215002000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["GLE-Net: A Global and Local Ensemble Network for Aerial Object 
Detection"],"prefix":"10.1007","volume":"15","author":[{"given":"Jiajia","family":"Liao","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yujun","family":"Liu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yingchao","family":"Piao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1707-5685","authenticated-orcid":false,"given":"Jinhe","family":"Su","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guorong","family":"Cai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yundong","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,1,3]]},"reference":[{"key":"56_CR1","first-page":"1","volume":"99","author":"G Zhang","year":"2019","unstructured":"Zhang, G., Lu, S., Zhang, W.: CAD-Net: A context-aware detection network for objects in remote sensing imagery. Remote Sensing 99, 1\u201310 (2019)","journal-title":"Remote Sensing"},{"key":"56_CR2","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., Luo, Z. R2CNN: Rotational region CNN for orientation robust scene text detection. arXiv preprint arXiv:1706.09579 (2017)","DOI":"10.1109\/ICPR.2018.8545598"},{"key":"56_CR3","first-page":"2688","volume-title":"Ensemble Methods for Object Detection","author":"A Casado-Garc\u0131a","year":"2020","unstructured":"Casado-Garc\u0131a, A., Heras, J.: Ensemble Methods for Object Detection, pp. 2688\u20132695. 
IOS Press, Amsterdam (2020)"},{"key":"56_CR4","unstructured":"Jocher, G., Nishimura, K., Mineeva, T., Vilari\u00f1o, R.: YOLOv5 (2020)"},{"key":"56_CR5","unstructured":"Zhou, X., Wang, D., Krhenb\u00fchl, P.: Objects as points. arXiv preprint arXiv:1904.07850 (2019)"},{"key":"56_CR6","doi-asserted-by":"crossref","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., Tian, Q.: Centernet: Keypoint triplets for object detection. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 6569\u20136578 (2019)","DOI":"10.1109\/ICCV.2019.00667"},{"key":"56_CR7","unstructured":"Du, D., Zhu, P., Wen, L., Bian, X., Liu, Z.M.: VisDrone-DET2019: the vision meets drone object detection in image challenge results. In: ICCV Visdrone Workshop (2019)"},{"key":"56_CR8","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779\u2013788 (2016)","DOI":"10.1109\/CVPR.2016.91"},{"key":"56_CR9","doi-asserted-by":"crossref","unstructured":"Redmon, J., Farhadi, A.: YOLO9000: Better, Faster, Stronger, pp. 6517\u20136525 (2017).","DOI":"10.1109\/CVPR.2017.690"},{"key":"56_CR10","unstructured":"Redmon, J., Farhadi, A.: YOLOv3: An Incremental Improvement. arXiv preprint arXiv:1804.02767 (2018)"},{"key":"56_CR11","unstructured":"Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)"},{"key":"56_CR12","first-page":"21","volume-title":"European Conference on Computer Vision","author":"W Liu","year":"2016","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: SSD: single shot MultiBox detector. In: European Conference on Computer Vision, pp. 21\u201337. 
Springer, Cham (2016)"},{"key":"56_CR13","first-page":"2999","volume":"99","author":"TY Lin","year":"2017","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., Doll\u00e1r, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 99, 2999\u20133007 (2017)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"56_CR14","doi-asserted-by":"crossref","unstructured":"Zhang, S., Wen, L., Bian, X., Lei, Z., Li, S.Z.: Single-shot refinement neural network for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4203\u20134212 (2018)","DOI":"10.1109\/CVPR.2018.00442"},{"key":"56_CR15","doi-asserted-by":"crossref","unstructured":"Law, H., Deng, J.: CornerNet: Detecting objects as paired keypoints. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 734\u2013750 (2018)","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"56_CR16","unstructured":"Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91\u201399 (2015)"},{"key":"56_CR17","doi-asserted-by":"crossref","unstructured":"Girshick, R. J. C. S. Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440\u20131448 (2015)","DOI":"10.1109\/ICCV.2015.169"},{"key":"56_CR18","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580\u2013587 (2014)","DOI":"10.1109\/CVPR.2014.81"},{"key":"56_CR19","unstructured":"Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 
379\u2013387 (2016)"},{"key":"56_CR20","doi-asserted-by":"crossref","unstructured":"Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018)","DOI":"10.1109\/CVPR.2018.00644"},{"key":"56_CR21","doi-asserted-by":"crossref","unstructured":"Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., Sun, J. You only look one-level feature. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 13039\u201313048 (2021)","DOI":"10.1109\/CVPR46437.2021.01284"},{"key":"56_CR22","doi-asserted-by":"crossref","unstructured":"Luo, Y., Cao, X., Zhang, J., Guo, J., Shen, H., Wang, T., Feng, Q.: CE-FPN: enhancing channel information for object detection. arXiv preprint arXiv:2103.10643 (2021)","DOI":"10.1007\/s11042-022-11940-1"},{"key":"56_CR23","first-page":"549","volume-title":"European Conference on Computer Vision","author":"H Qiu","year":"2020","unstructured":"Qiu, H., Ma, Y., Li, Z., Liu, S., Sun, J.: Borderdet: border feature for dense object detection. In: European Conference on Computer Vision, pp. 549\u2013564. Springer (2020)"},{"key":"56_CR24","doi-asserted-by":"crossref","unstructured":"Jin, W., Yu, H.J.: CvT-ASSD: convolutional vision-transformer based attentive single shot MultiBox detector. arXiv preprint arXiv:2110.12364 (2021)","DOI":"10.1109\/ICTAI52525.2021.00117"},{"key":"56_CR25","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., Zhang, L.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 
3974\u20133983 (2018)","DOI":"10.1109\/CVPR.2018.00418"},{"key":"56_CR26","first-page":"296","volume":"159","author":"K Li","year":"2020","unstructured":"Li, K., Wan, G., Cheng, G., Meng, L., Han, J.: Object detection in optical remote sensing images: a survey and a new benchmark. Remote Sensing 159, 296\u2013307 (2020)","journal-title":"Remote Sensing"},{"issue":"12","key":"56_CR27","first-page":"7405","volume":"54","author":"C Gong","year":"2016","unstructured":"Gong, C., Zhou, P., Han, J.: Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images. Remote Sensing 54(12), 7405\u20137415 (2016)","journal-title":"Remote Sensing"},{"key":"56_CR28","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., Lu, Q.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2849\u20132858 (2020)","DOI":"10.1109\/CVPR.2019.00296"},{"issue":"3","key":"56_CR29","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1109\/TIP.2017.2773199","volume":"27","author":"Z Zou","year":"2018","unstructured":"Zou, Z., Shi, Z.: random access memories: a new paradigm for target detection in high resolution aerial remote sensing images. IEEE Trans. Image Process. 27(3), 1100\u20131111 (2018)","journal-title":"IEEE Trans. Image Process."},{"issue":"4","key":"56_CR30","first-page":"297","volume":"85","author":"MY Yang","year":"2019","unstructured":"Yang, M.Y., Liao, W., Li, X., Cao, Y., Rosenhahn, B.J.P.E.: Vehicle detection in aerial images. Remote Sensing 85(4), 297\u2013304 (2019)","journal-title":"Remote Sensing"},{"key":"56_CR31","doi-asserted-by":"crossref","unstructured":"Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., Xian, S., Fu, K.: SCRDet: Towards more robust detection for small, cluttered and rotated objects. 
In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 8232\u20138241 (2019)","DOI":"10.1109\/ICCV.2019.00832"},{"key":"56_CR32","unstructured":"Yang, X., Yan, J., Yang, X., Tang, J., Liao, W., He, T.: SCRDet++: detecting small, cluttered and rotated objects via instance-level feature denoising and rotation loss smoothing. arXiv preprint arXiv:2004.13316 (2020)"},{"issue":"5","key":"56_CR33","first-page":"3377","volume":"58","author":"P Wang","year":"2019","unstructured":"Wang, P., Sun, X., Diao, W., Fu, K.: FMSSD: feature-merged single-shot detection for multiscale objects in large-scale remote sensing imagery. Remote Sensing 58(5), 3377\u20133390 (2019)","journal-title":"Remote Sensing"},{"key":"56_CR34","doi-asserted-by":"crossref","unstructured":"Albaba, B.M., Ozer, S. SyNet: an ensemble network for object detection in UAV images. In: 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, pp. 10227\u201310234 (2021)","DOI":"10.1109\/ICPR48806.2021.9412847"},{"key":"56_CR35","doi-asserted-by":"crossref","unstructured":"Qin, R., Liu, Q., Gao, G., Huang, D., Wang, Y. MRDet: a multi-head network for accurate oriented object detection in aerial images. arXiv preprint arXiv:2012.13135 (2020)","DOI":"10.1109\/TGRS.2021.3113473"},{"key":"56_CR36","unstructured":"Yang, X., Liu, Q., Yan, J., Li, A., Zhang, Z., Yu, G.: R3det: refined single-stage detector with feature refinement for rotating object. arXiv preprint arXiv:1908.05612 (2019)"},{"key":"56_CR37","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., Girshick, R.B.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961\u20132969 (2017)","DOI":"10.1109\/ICCV.2017.322"},{"key":"56_CR38","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 
770\u2013778 (2016)","DOI":"10.1109\/CVPR.2016.90"},{"key":"56_CR39","doi-asserted-by":"crossref","unstructured":"Yu, F., Wang, D., Shelhamer, E., Darrell, T.: Deep layer aggregation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2403\u20132412 (2018)","DOI":"10.1109\/CVPR.2018.00255"},{"key":"56_CR40","first-page":"483","volume-title":"European Conference on Computer Vision","author":"A Newell","year":"2016","unstructured":"Newell, A., Yang, K., Jia, D.: Stacked hourglass networks for human pose estimation. In: European Conference on Computer Vision, pp. 483\u2013499. Springer, Cham (2016)"},{"key":"56_CR41","first-page":"740","volume-title":"European Conference on Computer Vision","author":"S Belongie","year":"2014","unstructured":"Belongie, S.: Microsoft COCO: common objects in context. In: European Conference on Computer Vision, pp. 740\u2013755. Springer, Cham (2014)"},{"issue":"3","key":"56_CR42","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211\u2013252 (2015)","journal-title":"Int. J. Comput. Vis."},{"key":"56_CR43","doi-asserted-by":"crossref","unstructured":"Tang, Z., Liu, X., Shen, G., Yang, B.: PENet: object detection using points estimation in aerial images. arXiv preprint arXiv:2001.08247 (2020)","DOI":"10.1109\/ICMLA51294.2020.00069"},{"key":"56_CR44","doi-asserted-by":"crossref","unstructured":"Jadhav, A., Mukherjee, P., Kaushik, V., Lall, B.: Aerial multi-object tracking by detection using deep association networks. In: 2020 National Conference on Communications (NCC). IEEE, pp. 
1\u20136 (2020)","DOI":"10.1109\/NCC48643.2020.9056035"},{"key":"56_CR45","unstructured":"Li, Z., Peng, C., Yu, G., Zhang, X., Deng, Y., Sun, J.: Light-head r-cnn: In defense of two-stage object detector. arXiv preprint arXiv:1711.07264 (2017)"},{"key":"56_CR46","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117\u20132125 (2017)","DOI":"10.1109\/CVPR.2017.106"}],"container-title":["International Journal of Computational Intelligence Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44196-021-00056-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44196-021-00056-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44196-021-00056-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,21]],"date-time":"2023-01-21T15:13:22Z","timestamp":1674314002000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44196-021-00056-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1,3]]},"references-count":46,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["56"],"URL":"https:\/\/doi.org\/10.1007\/s44196-021-00056-3","relation":{},"ISSN":["1875-6883"],"issn-type":[{"value":"1875-6883","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,3]]},"assertion":[{"value":"29 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article 
History"}},{"value":"10 December 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 January 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that there is no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"2"}}