Dr. Yuki M. Asano
Towards Pixel-level Self-supervised Learning
In this talk I will talk about self-supervised learning and its current development of moving from image-level to pixel-level representations. We will first dive into how clustering can be combined with representation learning using optimal transport, a paradigm still relevant in current SoTA models like SwAV/DINO/MSN/DINOv2. Next, I will show how self-supervised clustering can be used for unsupervised segmentation in images and how videos can be leveraged to obtain even better image encoders.
Dr. Aulo Gelli
An innovative passive dietary monitoring system
Objectives: To validate FRANI (Food Recognition Assistance and Nudging Insights), a mobile phone application with computer vision assisted dietary assessment, and multi-pass 24-hour recalls (24HR), against weighed records (WR) in female youth aged 18-24y in Ghana.
Methods: Dietary intake was assessed on two non-consecutive days using FRANI, WRs and 24HRs. Equivalence of nutrient intake was tested using mixed effect models adjusted for repeated measures, by comparing ratios (FRANI/WR and 24HR/WR) with equivalence margins at 10%, 15% and 20% error bounds. Agreement between methods was assessed using the concordance correlation coefficient (CCC).
Results: Equivalence for FRANI and WR was determined at the 10% bound for fibre and folate, at 15% bound for energy, protein, fat, iron, riboflavin, thiamine, and vitamin B6 and zinc, while intake of calcium was equivalent at the 20% bound. Comparisons between 24HR and WR found protein and riboflavin intake estimates falling within a 10% bound. Iron, niacin, thiamine, and zinc intakes were equivalent at the 15% bound; and folate and vitamin B6 were equivalent at the 20% bound. The CCCs between FRANI and WR ranged between 0.44 and 0.72 (mean=0.57), and between 0.45 and 0.76 (mean=0.62) for 24HR and WR.
Conclusions: FRANI-assisted dietary assessment and 24HRs were found to accurately estimate nutrient intake in female youth in urban Ghana.
Funding Sources: CGIAR.