{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T10:07:51Z","timestamp":1775210871373,"version":"3.50.1"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,8,2]],"date-time":"2016-08-02T00:00:00Z","timestamp":1470096000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,8,2]],"date-time":"2016-08-02T00:00:00Z","timestamp":1470096000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Ministry of the Environment, Japan"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["IPSJ T Comput Vis Appl"],"published-print":{"date-parts":[[2016,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Demand for automatic bird ecology investigation rises rapidly along with the widespread installation of wind energy plants to estimate their adverse environmental effect. While significant advance in general image recognition has been made by deep convolutional neural networks (CNNs), automatically recognizing birds at small scale together with large background regions is still an open problem in computer vision. To tackle object detection at various scales, we combine a deep detector with semantic segmentation methods; namely, we train a deep CNN detector, fully convolutional networks (FCNs), and the variant of FCNs, and integrate their results by the support vector machines to achieve high detection performance. Through experimental results on a bird image dataset, we show the effectiveness of the method for scale-aware object detection.<\/jats:p>","DOI":"10.1186\/s41074-016-0006-z","type":"journal-article","created":{"date-parts":[[2016,8,3]],"date-time":"2016-08-03T16:54:13Z","timestamp":1470243253000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":33,"title":["Combining deep features for object detection at various scales: finding small birds in landscape images"],"prefix":"10.1186","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7465-1752","authenticated-orcid":false,"given":"Akito","family":"Takeki","sequence":"first","affiliation":[]},{"given":"Tu Tuan","family":"Trinh","sequence":"additional","affiliation":[]},{"given":"Ryota","family":"Yoshihashi","sequence":"additional","affiliation":[]},{"given":"Rei","family":"Kawakami","sequence":"additional","affiliation":[]},{"given":"Makoto","family":"Iida","sequence":"additional","affiliation":[]},{"given":"Takeshi","family":"Naemura","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,8,2]]},"reference":[{"issue":"7","key":"6_CR1","doi-asserted-by":"publisher","first-page":"1082","DOI":"10.2193\/2008-555","volume":"73","author":"KS Smallwood","year":"2009","unstructured":"Smallwood KS, Rugge L, Morrison ML (2009) Influence of behavior on bird mortality in wind energy developments. J Wildl Manage 73(7): 1082\u20131098.","journal-title":"J Wildl Manage"},{"key":"6_CR2","unstructured":"Bassi S, Bowen A, Fankhauser S (2012) The case for and against onshore wind energy in the UK. Grantham Res. Inst. on Climate Change and Env. Policy Brief."},{"key":"6_CR3","doi-asserted-by":"crossref","unstructured":"Tighe J, Lazebnik S (2013) Finding things: image parsing with regions and per-exemplar detectors In: CVPR, 3001\u20133008.. IEEE.","DOI":"10.1109\/CVPR.2013.386"},{"key":"6_CR4","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition In: Proc. of Computer Vision and Pattern Recognition, 770\u2013778.. IEEE.","DOI":"10.1109\/CVPR.2016.90"},{"key":"6_CR5","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks In: NIPS, 1097\u20131105."},{"key":"6_CR6","doi-asserted-by":"crossref","unstructured":"Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation In: CVPR.. IEEE.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"6_CR7","unstructured":"Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2015) Semantic image segmentation with deep convolutional nets and fully connected CRFs In: ICLR. http:\/\/arxiv.org\/abs\/1412.7062."},{"issue":"3","key":"6_CR8","first-page":"421","volume":"20","author":"C Cortes","year":"1995","unstructured":"Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3): 421\u2013436.","journal-title":"Mach Learn"},{"key":"6_CR9","doi-asserted-by":"crossref","unstructured":"Takeki A, Tuan Trinh T, Yoshihashi R, Kawakami R, Iida M, Naemura T (2016) Detection of small birds in large images by combining a deep detector with semantic segmentation In: ICIP.. IEEE.","DOI":"10.1109\/ICIP.2016.7533106"},{"key":"6_CR10","unstructured":"Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition In: ICLR.. IEEE."},{"key":"6_CR11","doi-asserted-by":"crossref","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions In: CVPR.. IEEE.","DOI":"10.1109\/CVPR.2015.7298594"},{"issue":"1","key":"6_CR12","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. JMLR 15(1): 1929\u20131958.","journal-title":"JMLR"},{"key":"6_CR13","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift In: ICML."},{"key":"6_CR14","doi-asserted-by":"crossref","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation In: CVPR.","DOI":"10.1109\/CVPR.2014.81"},{"issue":"2","key":"6_CR15","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","volume":"104","author":"JR Uijlings","year":"2013","unstructured":"Uijlings JR, van de Sande KE, Gevers T, Smeulders AW (2013) Select search object recognition. IJCV 104(2): 154\u2013171.","journal-title":"IJCV"},{"key":"6_CR16","doi-asserted-by":"crossref","unstructured":"Dong J, Chen Q, Yan S, Yuille A (2014) Towards unified object detection and semantic segmentation In: ECCV, 299\u2013314.. Springer.","DOI":"10.1007\/978-3-319-10602-1_20"},{"key":"6_CR17","doi-asserted-by":"crossref","unstructured":"Hariharan B, Arbel\u00e1ez P, Girshick R, Malik J (2015) Hypercolumns for object segmentation and fine-grained localization In: CVPR.","DOI":"10.1109\/CVPR.2015.7298642"},{"key":"6_CR18","doi-asserted-by":"crossref","unstructured":"Mallat S (1999) A wavelet tour of signal processing. Academic press.","DOI":"10.1016\/B978-012466606-1\/50008-8"},{"key":"6_CR19","doi-asserted-by":"crossref","unstructured":"Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features In: CVPR, 511.. IEEE.","DOI":"10.1109\/CVPR.2001.990517"},{"issue":"1","key":"6_CR20","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1006\/jcss.1997.1504","volume":"55","author":"Y Freund","year":"1997","unstructured":"Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1): 119\u2013139.","journal-title":"J Comput Syst Sci"},{"issue":"2","key":"6_CR21","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1007\/s11263-012-0574-z","volume":"101","author":"J Tighe","year":"2013","unstructured":"Tighe J, Lazebnik S (2013) Superparsing. IJCV 101(2): 329\u2013349.","journal-title":"IJCV"},{"key":"6_CR22","doi-asserted-by":"crossref","unstructured":"Yoshihashi R, Kawakami R, Iida M, Naemura T (2015) Construction of a bird image dataset for ecological investigations In: ICIP, 4248\u20134252.. IEEE.","DOI":"10.1109\/ICIP.2015.7351607"},{"key":"6_CR23","doi-asserted-by":"crossref","unstructured":"Mottaghi R, Chen X, Liu X, Cho NG, Lee SW, Fidler S, Urtasun R, Yuille A (2014) The role of context for object detection and semantic segmentation in the wild In: CVPR, 891\u2013898.. IEEE.","DOI":"10.1109\/CVPR.2014.119"},{"issue":"2","key":"6_CR24","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","volume":"88","author":"M Everingham","year":"2010","unstructured":"Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) Pascal vis object class (VOC) challenge. IJCV 88(2): 303\u2013338.","journal-title":"IJCV"},{"key":"6_CR25","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification In: ICCV.. IEEE.","DOI":"10.1109\/ICCV.2015.123"},{"key":"6_CR26","doi-asserted-by":"crossref","unstructured":"Pepik B, Benenson R, Ritschel T, Schiele B (2015) What is holding back convnets for detection? In: Patt. Recog, 517\u2013528.. Springer.","DOI":"10.1007\/978-3-319-24947-6_43"}],"container-title":["IPSJ Transactions on Computer Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s41074-016-0006-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s41074-016-0006-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s41074-016-0006-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s41074-016-0006-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,4]],"date-time":"2025-06-04T12:13:18Z","timestamp":1749039198000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1186\/s41074-016-0006-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,2]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,12]]}},"alternative-id":["6"],"URL":"https:\/\/doi.org\/10.1186\/s41074-016-0006-z","relation":{},"ISSN":["1882-6695"],"issn-type":[{"value":"1882-6695","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,8,2]]},"assertion":[{"value":"21 April 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 June 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 August 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"5"}}