{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T12:55:11Z","timestamp":1773924911010,"version":"3.50.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2017,12,1]],"date-time":"2017-12-01T00:00:00Z","timestamp":1512086400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2017,12,11]],"date-time":"2017-12-11T00:00:00Z","timestamp":1512950400000},"content-version":"vor","delay-in-days":10,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["IPSJ T Comput Vis Appl"],"published-print":{"date-parts":[[2017,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Fine-grained visual categorization has recently received great attention as the volumes of labeled datasets for classification of specific objects, such as cars, bird species, and air-crafts, have been increasing. The availability of large datasets led to significant performance improvements in several vision-based classification tasks. Visual classification of maritime vessels is another important task, assisting naval security and surveillance applications. We introduced, MARVEL, a large-scale image dataset for maritime vessels, consisting of 2 million user-uploaded images and their various attributes, including vessel identity, type, category, year built, length, and tonnage, collected from a community website. The images were categorized into vessel type classes and also into superclasses defined by combining semantically similar classes, following a semi-automatic clustering scheme. For the analysis of the presented dataset, extensive experiments have been performed, involving several potentially useful applications: vessel type classification, identity verification, retrieval, and identity recognition with and without prior vessel type knowledge. Furthermore, we attempted interesting problems of visual marine surveillance such as predicting and classifying maritime vessel attributes such as length, summer deadweight, draught, and gross tonnage by solely interpreting the visual content in the wild, where no additional cues such as scale, orientation, or location are provided. By utilizing generic and attribute-specific deep representations for maritime vessels, we obtained promising results for the aforementioned applications.<\/jats:p>","DOI":"10.1186\/s41074-017-0033-4","type":"journal-article","created":{"date-parts":[[2017,12,11]],"date-time":"2017-12-11T12:42:06Z","timestamp":1512996126000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Generic and attribute-specific deep representations for maritime vessels"],"prefix":"10.1186","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2252-1478","authenticated-orcid":false,"given":"Berkan","family":"Solmaz","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erhan","family":"Gundogdu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Veysel","family":"Yucesoy","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aykut","family":"Koc","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2017,12,11]]},"reference":[{"issue":"3","key":"33_CR1","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis (IJCV) 115(3):211\u2013252. doi:10.1007\/s11263-015-0816-y.","journal-title":"Int J Comput Vis (IJCV)"},{"key":"33_CR2","first-page":"1097","volume-title":"Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1","author":"A Krizhevsky","year":"2012","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks In: Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1, 1097\u20131105.. Curran Associates Inc., Lake Tahoe, Nevada. http:\/\/dl.acm.org\/citation.cfm?id=2999134.2999257."},{"key":"33_CR3","doi-asserted-by":"crossref","unstructured":"Lin D, Shen X, Lu C, Jia J (2015) Deep lac: Deep localization, alignment and classification for fine-grained recognition In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1666\u20131674. doi:10.1109\/CVPR.2015.7298775.","DOI":"10.1109\/CVPR.2015.7298775"},{"key":"33_CR4","doi-asserted-by":"crossref","unstructured":"Xie S, Yang T, Wang X, Lin Y (2015) Hyper-class augmented and regularized deep learning for fine-grained image classification In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2645\u20132654. doi:10.1109\/CVPR.2015.7298880.","DOI":"10.1109\/CVPR.2015.7298880"},{"key":"33_CR5","doi-asserted-by":"crossref","unstructured":"Liu L, Shen C, van den Hengel A (2015) The treasure beneath convolutional layers: Cross-convolutional-layer pooling for image classification In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4749\u20134757. doi:10.1109\/CVPR.2015.7299107.","DOI":"10.1109\/CVPR.2015.7299107"},{"key":"33_CR6","unstructured":"Maji S, Rahtu E, Kannala J, Blaschko M, Vedaldi A (2013) Fine-grained visual classification of aircraft. arXiv preprint arXiv:1306.5151."},{"key":"33_CR7","first-page":"3622","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"A Vedaldi","year":"2014","unstructured":"Vedaldi A, Mahendran S, Tsogkas S, Maji S, Girshick R, Kannala J, Rahtu E, Kokkinos I, Blaschko MB, Weiss D, Taskar B, Simonyan K, Saphra N, Mohamed S (2014) Understanding objects in detail with fine-grained attributes In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3622\u20133629.. Institute of Electrical and Electronics Engineers, USA."},{"key":"33_CR8","unstructured":"Wah C, Branson S, Welinder P, Perona P, Belongie S (2011) The Caltech-UCSD Birds-200-2011 Dataset. Technical Report CNS-TR-2011-001. California Institute of Technology."},{"key":"33_CR9","doi-asserted-by":"crossref","unstructured":"Krause J, Stark M, Deng J, Fei-Fei L (2013) 3d object representations for fine-grained categorization In: Computer Vision Workshops (ICCVW), 2013 IEEE International Conference On, 554\u2013561. doi:10.1109\/ICCVW.2013.77.","DOI":"10.1109\/ICCVW.2013.77"},{"key":"33_CR10","doi-asserted-by":"crossref","unstructured":"Yang L, Luo P, Loy CC, Tang X (2015) A large-scale car dataset for fine-grained categorization and verification In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3973\u20133981. doi:10.1109\/CVPR.2015.7299023.","DOI":"10.1109\/CVPR.2015.7299023"},{"key":"33_CR11","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1145\/2833258.2833266","volume-title":"Proceedings of the Sixth International Symposium on Information and Communication Technology. SoICT 2015","author":"C Dao-Duc","year":"2015","unstructured":"Dao-Duc C, Xiaohui H, Mor\u00e8re O (2015) Maritime vessel images classification using deep convolutional neural networks In: Proceedings of the Sixth International Symposium on Information and Communication Technology. SoICT 2015, 276\u2013281.. ACM, New York. doi:10.1145\/2833258.2833266. http:\/\/doi.acm.org\/10.1145\/2833258.2833266."},{"key":"33_CR12","unstructured":"Ship Photos and Ship Tracker. www.shipspotting.com. Accessed 1 May 2017."},{"key":"33_CR13","first-page":"165","volume-title":"Asian Conference on Computer Vision","author":"E Gundogdu","year":"2016","unstructured":"Gundogdu E, Solmaz B, Y\u00fccesoy V, Ko\u00e7 A (2016) MARVEL: a large-scale image dataset for maritime vessels In: Asian Conference on Computer Vision, 165\u2013180.. Springer International Publishing, Cham."},{"key":"33_CR14","first-page":"104340A","volume-title":"Electro-Optical Remote Sensing XI. vol. 10434","author":"B Solmaz","year":"2017","unstructured":"Solmaz B, Gundogdu E, Karaman K, Ko\u00e7 A, et al (2017) Fine-grained visual marine vessel classification for coastal surveillance and defense applications In: Electro-Optical Remote Sensing XI. vol. 10434, 104340A.. International Society for Optics and Photonics, USA."},{"key":"33_CR15","first-page":"1114","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"X Zhang","year":"2016","unstructured":"Zhang X, Zhou F, Lin Y, Zhang S (2016) Embedding label structures for fine-grained feature representation In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1114\u20131123.. Institute of Electrical and Electronics Engineers, USA."},{"key":"33_CR16","doi-asserted-by":"publisher","first-page":"1778","DOI":"10.1109\/CVPR.2009.5206772","volume-title":"2009 IEEE Conference on Computer Vision and Pattern Recognition","author":"A Farhadi","year":"2009","unstructured":"Farhadi A, Endres I, Hoiem D, Forsyth D (2009) Describing objects by their attributes In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, 1778\u20131785.. Institute of Electrical and Electronics Engineers, USA. doi:10.1109\/CVPR.2009.5206772."},{"key":"33_CR17","doi-asserted-by":"crossref","unstructured":"Lampert CH, Nickisch H, Harmeling S (2009) Learning to detect unseen object classes by between-class attribute transfer In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference On, 951\u2013958. doi:10.1109\/CVPR.2009.5206594.","DOI":"10.1109\/CVPRW.2009.5206594"},{"key":"33_CR18","doi-asserted-by":"crossref","unstructured":"Sun Y, Bo L, Fox D (2013) Attribute based object identification In: 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, May 6-10, 2013, 2096\u20132103. doi:10.1109\/ICRA.2013.6630858.","DOI":"10.1109\/ICRA.2013.6630858"},{"key":"33_CR19","doi-asserted-by":"crossref","unstructured":"Vedaldi A, Lenc K (2015) In: Proceedings of the 23rd ACM international conference on Multimedia, 689\u2013692.. ACM.","DOI":"10.1145\/2733373.2807412"},{"key":"33_CR20","doi-asserted-by":"crossref","unstructured":"Chatfield K, Simonyan K, Vedaldi A, Zisserman A (2014) Return of the devil in the details: delving deep into convolutional nets. arXiv preprint arXiv:1405.3531.","DOI":"10.5244\/C.28.6"},{"issue":"2","key":"33_CR21","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1023\/A:1013637720281","volume":"47","author":"K Crammer","year":"2002","unstructured":"Crammer K, Singer Y (2002) On the learnability and design of output codes for multiclass problems. Mach Learn 47(2):201\u2013233. doi:10.1023\/A:1013637720281.","journal-title":"Mach Learn"},{"key":"33_CR22","doi-asserted-by":"publisher","first-page":"408","DOI":"10.1145\/1401890.1401942","volume-title":"Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD \u201908","author":"SS Keerthi","year":"2008","unstructured":"Keerthi SS, Sundararajan S, Chang KW, Hsieh CJ, Lin CJ (2008) A sequential dual method for large scale multi-class linear svms In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD \u201908, 408\u2013416.. ACM, New York. doi:10.1145\/1401890.1401942. http:\/\/doi.acm.org\/10.1145\/1401890.1401942."},{"key":"33_CR23","first-page":"1871","volume":"9","author":"RE Fan","year":"2008","unstructured":"Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871\u20131874.","journal-title":"J Mach Learn Res"},{"key":"33_CR24","doi-asserted-by":"publisher","first-page":"1891","DOI":"10.1109\/CVPR.2014.244","volume-title":"2014 IEEE Conference on Computer Vision and Pattern Recognition","author":"Y Sun","year":"2014","unstructured":"Sun Y, Wang X, Tang X (2014) Deep learning face representation from predicting 10,000 classes In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, 1891\u20131898.. Institute of Electrical and Electronics Engineers, USA. doi:10.1109\/CVPR.2014.244."},{"key":"33_CR25","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1145\/1961189.1961199","volume":"2","author":"CC Chang","year":"2011","unstructured":"Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:27\u201312727. Software available at http:\/\/www.csie.ntu.edu.tw\/~cjlin\/libsvm.","journal-title":"ACM Trans Intell Syst Technol"},{"key":"33_CR26","first-page":"1735","volume-title":"Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference On. vol. 2","author":"R Hadsell","year":"2006","unstructured":"Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping In: Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference On. vol. 2, 1735\u20131742.. IEEE, USA."},{"issue":"3","key":"33_CR27","doi-asserted-by":"publisher","first-page":"1010","DOI":"10.1109\/TIP.2014.2372619","volume":"24","author":"JM Guo","year":"2015","unstructured":"Guo JM, Prasetyo H (2015) Content-based image retrieval using features extracted from halftoning-based block truncation coding. IEEE Trans Image Process 24(3):1010\u20131024. doi:10.1109\/TIP.2014.2372619.","journal-title":"IEEE Trans Image Process"},{"issue":"1","key":"33_CR28","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1109\/TIP.2002.807356","volume":"12","author":"G Qiu","year":"2003","unstructured":"Qiu G (2003) Color image indexing using btc. IEEE Trans Image Process 12(1):93\u2013101.","journal-title":"IEEE Trans Image Process"},{"issue":"10","key":"33_CR29","doi-asserted-by":"publisher","first-page":"3318","DOI":"10.1109\/TIM.2011.2135010","volume":"60","author":"CC Lai","year":"2011","unstructured":"Lai CC, Chen YC (2011) A user-oriented image retrieval system based on interactive genetic algorithm. IEEE Trans Instrum Meas 60(10):3318\u20133325. doi:10.1109\/TIM.2011.2135010.","journal-title":"IEEE Trans Instrum Meas"},{"issue":"2","key":"33_CR30","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1007\/s11263-017-1016-8","volume":"124","author":"A Gordo","year":"2017","unstructured":"Gordo A, Almazan J, Revaud J, Larlus D (2017) End-to-end learning of deep visual representations for image retrieval. Int J Comput Vis 124(2):237\u2013254.","journal-title":"Int J Comput Vis"},{"issue":"7","key":"33_CR31","doi-asserted-by":"publisher","first-page":"3261","DOI":"10.1109\/TIP.2016.2545249","volume":"25","author":"J Lai","year":"2016","unstructured":"Lai J, Jiang X (2016) Classwise sparse and collaborative patch representation for face recognition. IEEE Trans Image Process 25(7):3261\u20133272. doi:10.1109\/TIP.2016.2545249.","journal-title":"IEEE Trans Image Process"},{"key":"33_CR32","doi-asserted-by":"publisher","first-page":"5289","DOI":"10.1109\/CVPR.2015.7299166","volume-title":"2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"D Gong","year":"2015","unstructured":"Gong D, Li Z, Tao D, Liu J, Li X (2015) A maximum entropy feature descriptor for age invariant face recognition In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5289\u20135297.. Institute of Electrical and Electronics Engineers, USA. doi:10.1109\/CVPR.2015.7299166."},{"issue":"5","key":"33_CR33","doi-asserted-by":"publisher","first-page":"684","DOI":"10.1109\/TPAMI.2005.92","volume":"27","author":"KC Lee","year":"2005","unstructured":"Lee KC, Ho J, Kriegman DJ (2005) Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans Pattern Anal Mach Intell 27(5):684\u2013698. doi:10.1109\/TPAMI.2005.92.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"12","key":"33_CR34","doi-asserted-by":"publisher","first-page":"1615","DOI":"10.1109\/TPAMI.2003.1251154","volume":"25","author":"T Sim","year":"2003","unstructured":"Sim T, Baker S, Bsat M (2003) The cmu pose, illumination, and expression database. IEEE Trans Pattern Anal Mach Intell 25(12):1615\u20131618. doi:10.1109\/TPAMI.2003.1251154.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"33_CR35","doi-asserted-by":"crossref","unstructured":"Ricanek K, Tesafaye T (2006) Morph: a longitudinal image database of normal adult age-progression In: 7th International Conference on Automatic Face and Gesture Recognition (FGR06), 341\u2013345. doi:10.1109\/FGR.2006.78.","DOI":"10.1109\/FGR.2006.78"},{"key":"33_CR36","volume-title":"Merchant Marine Officers\u2019 Handbook","author":"EA Turpin","year":"1980","unstructured":"Turpin EA, McEwen WA (1980) Merchant Marine Officers\u2019 Handbook. 4th edn.. Cornell Maritime Press, Centreville, Maryland."},{"issue":"5","key":"33_CR37","doi-asserted-by":"publisher","first-page":"1207","DOI":"10.1162\/089976600300015565","volume":"12","author":"B Sch\u00f6lkopf","year":"2000","unstructured":"Sch\u00f6lkopf B, Smola AJ, Williamson RC, Bartlett PL (2000) New support vector algorithms. Neural computation 12(5):1207\u20131245.","journal-title":"Neural computation"}],"container-title":["IPSJ Transactions on Computer Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s41074-017-0033-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s41074-017-0033-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s41074-017-0033-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,28]],"date-time":"2025-06-28T13:30:10Z","timestamp":1751117410000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1186\/s41074-017-0033-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,12]]},"references-count":37,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2017,12]]}},"alternative-id":["33"],"URL":"https:\/\/doi.org\/10.1186\/s41074-017-0033-4","relation":{},"ISSN":["1882-6695"],"issn-type":[{"value":"1882-6695","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,12]]},"assertion":[{"value":"21 June 2017","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 November 2017","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 December 2017","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}},{"value":"Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Publisher\u2019s Note"}}],"article-number":"22"}}