Invited Speakers

Kiyoharu Aizawa

Multimedia Processing Lab, The University of Tokyo, Japan

Kiyoharu Aizawa received the B.E., the M.E. and the Dr.Eng. degrees in Electrical Engineering all from the University of Tokyo, in 1983, 1985, 1988, respectively. He is currently a Professor at Department of Information and Communication Engineering of the University of Tokyo. He was a Visiting Assistant Professor at University of Illinois from 1990 to 1992. His research interest is in multimedia applications, image processing and computer vision. He has pioneered FoodLog which assists users to record their food intake by image recognition and retrieval techniques.
He received the 1987 Young Engineer Award and the 1990, 1998 Best Paper Awards, the 1991 Achievement Award, 1999 Electronics Society Award from IEICE Japan and the 1998 Fujio Frontier Award, the 2002 and 2009 Best Paper Award and 2013 Achievement award from ITE Japan. He received the IBM Japan Science Prize in 2002.
He is on Editorial Boards of IEEE MultiMedia, ACM TOMM, APSIPA Transactions on Signal and Information Processing and International Journal of Multimedia Information Retrieval. He served as the Editor in Chief of Journal of ITE Japan, an Associate Editor of IEEE Trans. Image Processing, IEEE Trans. CSVT and IEEE Trans. Multimedia. He is/was a president of ITE and ISS society of IEICE, 2019 and 2018, respectively. He has served a number of international and domestic conferences; he was a General co-Chair of ACM Multimedia 2012 and ACM ICMR2018. He is a Fellow of IEEE, IEICE, ITE and a council member of Science Council of Japan.

FoodLog: Multimedia Food Recording Platform and its Application for Athletes’ Nutrition Management


Ramesh Jain

Institute for Future Health, Department of Computer Science, University of California, USA and Peng Cheng Lab, Shen Zhen, China

Ramesh Jain is an entrepreneur, researcher, and educator.
He is a Donald Bren Professor in Information & Computer Sciences at University of California, Irvine. His research interests covered Control Systems, Computer Vision, Artificial Intelligence, and Multimedia Computing. His current research passion is in addressing health issues using cybernetic principles building on the progress in sensors, mobile, processing, artificial intelligence, computer vision, and storage technologies. He is founding director of the Institute for Future Health at UCI. He is a Fellow of AAAS, ACM, IEEE, AAAI, IAPR, and SPIE.
Ramesh co-founded several companies, managed them in initial stages, and then turned them over to professional management. He enjoys new challenges and likes to use technology to solve them. He is participating in addressing the biggest challenge for us all: how to live long in good health.

Eat, Drink and be Happy

Food is essential for human life and it is fundamental to the human experience. Selecting right food at a right place in right situation is very important, but a challenging problem. Food recommendation must consider the personal food model, analyzing unique food characteristics, incorporating various social as well as personal contexts, and food related health and domain knowledge. Each of these is a challenge for multimedia computing. In this presentation, we will discuss technical challenges and opportunities for addressing food recommendation to make people happy and healthy. Recent progress in multimodal computing makes it possible to start addressing this important problem for all of us at personal level. We will discuss our ideas considering concrete emerging approaches for building a multimodal personal food recommendation system using FoodLogs and health context.


Frederic Ronga, PhD

Nestlé Research Center, Lausanne, Switzerland

Frederic is leading the Digital Nutrition & Health group in the department of Nutrition & Dietary Recommendations at the Nestlé Research Center in Lausanne, Switzerland. The mission of his group is to develop impactful digital solutions for personalized nutrition. Frederic’s research interest spans innovative technologies for data capture and data processing in the field of nutrition and health. At Nestlé Research, Frederic has worked on developing new algorithms for dietary assessment and recommendations, meal planning and dietary constraint management, feeding into holistic digital platforms that contribute to Nestlé’s purpose of enhancing quality of life and contributing to a healthier future. Frederic holds a PhD in Physics. Before joining Nestlé, he worked on extracting new scientific insights from large datasets, at CERN (Switzerland) and KEK (Japan).

Personalized Nutrition in the connected era: challenges and opportunities

Today’s consumer lives in a connected world, with trends in self-quantification and personalization driving preferences and habits. This talk will discuss the mission of Nestlé Research’s Digital Nutrition & Health team, as well as its challenges and approaches on trying to feed into holistic digital platforms that contribute to Nestlé Purpose of enhancing quality of life and contributing to a healthier future, with a special focus on dietary intake capture, assessment, and recommendations.


Karan Sikka, PhD

Center for Vision Technologies, SRI International, Princeton, USA

Karan Sikka is an Advanced Computer Scientist at Center for Vision Technologies, SRI International in Princeton, USA. He graduated with a PhD degree in 2016 from Machine Perception Lab at UCSD and was advised by Dr. Marian Bartlett. Before joining UCSD, he completed his bachelor’s in ECE at Indian Institute of Technology Guwahati in 2010. At SRI he is a co-PI for several Govt. funded programs (ONR CEROSS, DARPA M3I, and AFRL Mesa) related to understanding and analysing social media structures with multimodal content. His research is focused on solving several fundamental problems in Computer Vision such as multimodal learning, weakly supervised learning, few/zero-shot learning, action recognition etc. He has won a best paper honorable mention award at IEEE Face and Gesture 2013, and a best paper award at the Emotion Recognition in the Wild Workshop at ICMI 2013. He serves as a reviewer/program-committee for venues such as CVPR, ACCV, IJCV, IEEE TAC, IEEE TM, ICMI, IEEE AFGR etc. 

Learning User Preferences from Social Multimedia Analysis and Overview of the iFood2019 Challenge

There has been an explosion of multimodal content generated on social media networks in the last few years, which has necessitated a deeper understanding of social media content and user behavior. In the first part of the task, I will present a novel method that provides a unified framework for understanding content as well as modeling user preferences from noisy social media posts. I will discuss some applications in understanding food preferences and trends using this algorithm.
I will then give an overview of the second large-scale food classification challenge in images (iFood challenge) held as part of the sixth Fine Grained Visual Classification Workshop at CVPR19. We introduce a new dataset of 251 fine-grained (prepared) food categories with 118K training images collected from the web, and human verified labels for both validation set (11K images) and the test set (12K images). 40 teams from academia and industry competed in this challenge with the top team obtaining a 5.6% top-3 error percentage, which is almost 2-points better than previous year challenge.