Human Visual System vs Convolution Neural Networks in food recognition task: An empirical comparison

作者:

Highlights:

摘要

Automated food recognition from food plate is useful for smartphone-based applications promoting healthy lifestyles and for automated carbohydrate counting, e.g. targeted at type I diabetic patients, but the variation of appearance of food items makes it a difficult task. Convolution Neural Networks (CNNs) raised to prominence in recent years, and they will enable those applications if they are able to match HVS accuracy at least in meal classification. In this work we run an experimental comparison of accuracy between CNNs and HVS based on a simple meal recognition task. We set up a survey for humans with two phases, training and testing, and also give the food dataset to state-of-the-art CNNs. The results, considering some relevant variations in the setup, allow us to reach conclusions regarding the comparison, characteristics and limitations of CNNs, which are relevant for future improvements.

论文关键词:

论文评审过程:Received 25 October 2018, Revised 4 November 2019, Accepted 19 November 2019, Available online 30 November 2019, Version of Record 31 January 2020.

论文官网地址:https://doi.org/10.1016/j.cviu.2019.102878