Losses that Cook: Topological Optimal Transport for Structured Recipe Generation
By: Mattia Ottoborgo, Daniele Rege Cambrin, Paolo Garza
Potential Business Impact:
Makes recipes more accurate and easier to follow.
Cooking recipes are complex procedures that require not only a fluent and factual text, but also accurate timing, temperature, and procedural coherence, as well as the correct composition of ingredients. Standard training procedures are primarily based on cross-entropy and focus solely on fluency. Building on RECIPE-NLG, we investigate the use of several composite objectives and present a new topological loss that represents ingredient lists as point clouds in embedding space, minimizing the divergence between predicted and gold ingredients. Using both standard NLG metrics and recipe-specific metrics, we find that our loss significantly improves ingredient- and action-level metrics. Meanwhile, the Dice loss excels in time/temperature precision, and the mixed loss yields competitive trade-offs with synergistic gains in quantity and time. A human preference analysis supports our finding, showing our model is preferred in 62% of the cases.
Similar Papers
The Digital Sous Chef -- A Comparative Study on Fine-Tuning Language Models for Recipe Generation
Computation and Language
Makes computers write better recipes with exact amounts.
Real-Time Cooked Food Image Synthesis and Visual Cooking Progress Monitoring on Edge Devices
CV and Pattern Recognition
Makes cooking food look real on phones.
Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs
Artificial Intelligence
Helps robots cook any meal perfectly.