{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T05:45:42Z","timestamp":1772516742075,"version":"3.50.1"},"reference-count":28,"publisher":"Association for Computing Machinery (ACM)","issue":"3s","license":[{"start":{"date-parts":[[2017,8,10]],"date-time":"2017-08-10T00:00:00Z","timestamp":1502323200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2017,8,31]]},"abstract":"<jats:p>In this article, we propose a mobile food recognition system that uses the picture of the food, taken by the user\u2019s mobile device, to recognize multiple food items in the same meal, such as steak and potatoes on the same plate, to estimate the calorie and nutrition of the meal. To speed up and make the process more accurate, the user is asked to quickly identify the general area of the food by drawing a bounding circle on the food picture by touching the screen. The system then uses image processing and computational intelligence for food item recognition. The advantage of recognizing items, instead of the whole meal, is that the system can be trained with only single item food images. At the training stage, we first use region proposal algorithms to generate candidate regions and extract the convolutional neural network (CNN) features of all regions. Second, we perform region mining to select positive regions for each food category using maximum cover by our proposed submodular optimization method. At the testing stage, we first generate a set of candidate regions. For each region, a classification score is computed based on its extracted CNN features and predicted food names of the selected regions. Since fast response is one of the important parameters for the user who wants to eat the meal, certain heavy computational parts of the application are offloaded to the cloud. Hence, the processes of food recognition and calorie estimation are performed in cloud server. Our experiments, conducted with the FooDD dataset, show an average recall rate of 90.98%, precision rate of 93.05%, and accuracy of 94.11% compared to 50.8% to 88% accuracy of other existing food recognition systems.<\/jats:p>","DOI":"10.1145\/3063592","type":"journal-article","created":{"date-parts":[[2017,8,11]],"date-time":"2017-08-11T12:17:17Z","timestamp":1502453837000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":64,"title":["Mobile Multi-Food Recognition Using Deep Learning"],"prefix":"10.1145","volume":"13","author":[{"given":"Parisa","family":"Pouladzadeh","sequence":"first","affiliation":[{"name":"University of Ottawa, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shervin","family":"Shirmohammadi","sequence":"additional","affiliation":[{"name":"University of Ottawa, Canada, and Istanbul Sehir Univesrity, Trukey"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,8,10]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"www.obesitynetwork.ca.  www.obesitynetwork.ca."},{"key":"e_1_2_1_2_1","unstructured":"http:\/\/www.who.int.  http:\/\/www.who.int."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIM.2016.7384954"},{"key":"e_1_2_1_4_1","volume-title":"Abdulsalam Yassine, and Shervin Shirmohammadi.","author":"Pouladzadeh Parisa","year":"2016","unstructured":"Parisa Pouladzadeh , Pallavi Kuhad , Sri Vijay Bharat Peddi , Abdulsalam Yassine, and Shervin Shirmohammadi. 2016 . Calorie measurement and food classification using deep learning neural network In Proceedings of the IEEE International Conference on Instrumentation and Measurement Technology (I 2MTC\u201916). Parisa Pouladzadeh, Pallavi Kuhad, Sri Vijay Bharat Peddi, Abdulsalam Yassine, and Shervin Shirmohammadi. 2016. Calorie measurement and food classification using deep learning neural network In Proceedings of the IEEE International Conference on Instrumentation and Measurement Technology (I2MTC\u201916)."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-23222-5_54"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMEW.2015.7169853"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10599-4_29"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2012.157"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2013.5"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1177\/1932296815582222"},{"key":"e_1_2_1_11_1","volume-title":"Submodular Functions and Optimization","author":"Fujishige Satoru","unstructured":"Satoru Fujishige . 2005. Submodular Functions and Optimization , Vol. 58 . Elsevier . Satoru Fujishige. 2005. Submodular Functions and Optimization, Vol. 58. Elsevier."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654889"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02579435"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2014.2304925"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654970"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/MMUL.2015.39"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2015.54"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2008.39"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-016-1642-6"},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","unstructured":"Morteza Akbari Fard Hamed Haddadi and Alireza Tavakoli Targhi. 2016. Fruits and vegetables calorie counter using convolutional neural networks. In ACM Dig. Health 121--122.  Morteza Akbari Fard Hamed Haddadi and Alireza Tavakoli Targhi. 2016. Fruits and vegetables calorie counter using convolutional neural networks. In ACM Dig. Health 121--122.","DOI":"10.1145\/2896338.2896355"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2986035.2986039"},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21","author":"Keiji Yanai Wataru Shimoda","year":"2016","unstructured":"Wataru Shimoda Keiji Yanai . 2016 . Foodness proposal for multiple food detection by training of single food images . In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21 . Wataru Shimoda Keiji Yanai. 2016. Foodness proposal for multiple food detection by training of single food images. In Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management. 13--21."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2986035.2986047"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of Neural Information Processing Systems (NIPS\u201912)","author":"Krizhevsky A.","unstructured":"A. Krizhevsky , I. Sutskever , and G. Hinton , 2012. Imagenet classification with deep convolutional neural networks . In Proceedings of Neural Information Processing Systems (NIPS\u201912) . A. Krizhevsky, I. Sutskever, and G. Hinton, 2012. Imagenet classification with deep convolutional neural networks. In Proceedings of Neural Information Processing Systems (NIPS\u201912)."},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201913)","author":"Zeiler M. D.","unstructured":"M. D. Zeiler , M. Ranzato , R. Monga , M. Mao , K. Yang , Q. V. Le , P. Nguyen , A. Senior , V. Vanhoucke , J. Dean and G. E. Hinton . 2013. Onrectied linear units for speech processing . In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201913) . M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean and G. E. Hinton. 2013. Onrectied linear units for speech processing. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP\u201913)."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIM.2014.2303533"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2566584"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3063592","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3063592","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:36:34Z","timestamp":1750217794000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3063592"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,10]]},"references-count":28,"journal-issue":{"issue":"3s","published-print":{"date-parts":[[2017,8,31]]}},"alternative-id":["10.1145\/3063592"],"URL":"https:\/\/doi.org\/10.1145\/3063592","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,8,10]]},"assertion":[{"value":"2016-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-08-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}