Score: 0

Food Image Generation on Multi-Noun Categories

Published: December 9, 2025 | arXiv ID: 2512.09095v1

By: Xinyue Pan , Yuhao Chen , Jiangpeng He and more

Generating realistic food images for categories with multiple nouns is surprisingly challenging. For instance, the prompt "egg noodle" may result in images that incorrectly contain both eggs and noodles as separate entities. Multi-noun food categories are common in real-world datasets and account for a large portion of entries in benchmarks such as UEC-256. These compound names often cause generative models to misinterpret the semantics, producing unintended ingredients or objects. This is due to insufficient multi-noun category related knowledge in the text encoder and misinterpretation of multi-noun relationships, leading to incorrect spatial layouts. To overcome these challenges, we propose FoCULR (Food Category Understanding and Layout Refinement) which incorporates food domain knowledge and introduces core concepts early in the generation process. Experimental results demonstrate that the integration of these techniques improves image generation performance in the food domain.

LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets

CV and Pattern Recognition

Lets phones know what food you're eating.

20 Nov 2025 0

86%

Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval

CV and Pattern Recognition

Find recipes from food pictures better.

19 Nov 2025 1

86%

Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition

CV and Pattern Recognition

Lets phones guess what you ate from pictures.

9 Apr 2025 1

View PDF Login to Bookmark

Food Image Generation on Multi-Noun Categories

Technical Abstract

LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets

Towards Unbiased Cross-Modal Representation Learning for Food Image-to-Recipe Retrieval

Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition