CulinaryCut-VLAP: A Vision-Language-Action-Physics Framework for Food Cutting via a Force-Aware Material Point Method

Published: January 10, 2026 | arXiv ID: 2601.06451v1

By: Hyunseo Koh, Chang-Yong Song, Youngjae Choi and more

Food cutting is a highly practical yet underexplored application at the intersection of vision and robotic manipulation. The task remains challenging because interactions between the knife and deformable materials are highly nonlinear and often entail large deformations, frequent contact, and topological change, which in turn hinder stable and safe large-scale data collection. To address these challenges, we propose a unified framework that couples a vision-language-action (VLA) dataset with a physically realistic cutting simulator built on the material point method (MPM). Our simulator adopts MLS-MPM as its computational core, reducing numerical dissipation and energy drift while preserving rotational and shear responses even under topology-changing cuts. During cutting, forces and stress distributions are estimated from impulse exchanges between particles and the grid, enabling stable tracking of transient contact forces and energy transfer. We also provide a benchmark dataset that integrates diverse cutting trajectories, multi-view visual observations, and fine-grained language instructions, together with force-torque and tool-pose labels to provide physically consistent training signals. These components realize a learning-evaluation loop that respects the core physics of cutting and establishes a safe, reproducible, and scalable foundation for advancing VLA models in deformable object manipulation.
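
The abstract does not spell out how the force estimate is computed. As a rough illustration only, and not the authors' implementation, the idea of reading a contact force off an impulse exchange can be sketched as the momentum change of grid nodes across one contact update, divided by the time step. The function name, argument layout, and the choice to apply Newton's third law to obtain the force on the tool are all assumptions made for this sketch.

```python
import numpy as np


def contact_force_from_impulse(node_mass, vel_before, vel_after, dt):
    """Estimate the net force on the tool from grid momentum change.

    Hypothetical helper, not taken from the paper.
    node_mass:  (N,) masses of the grid nodes touched by the tool, in kg
    vel_before: (N, 3) node velocities before the contact update, in m/s
    vel_after:  (N, 3) node velocities after the contact update, in m/s
    dt:         time step in seconds
    """
    node_mass = np.asarray(node_mass, dtype=float)
    dv = np.asarray(vel_after, dtype=float) - np.asarray(vel_before, dtype=float)
    # Impulse the tool applied to the material at each node: m * delta_v.
    impulse_on_material = node_mass[:, None] * dv
    # Assumption: the tool receives the equal and opposite impulse.
    return -impulse_on_material.sum(axis=0) / dt


# Two 1 g nodes pushed downward at 2 m/s by the blade within a 1 ms step.
force_on_tool = contact_force_from_impulse(
    node_mass=[0.001, 0.001],
    vel_before=[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]],
    vel_after=[[0.0, 0.0, -2.0], [0.0, 0.0, -2.0]],
    dt=1e-3,
)
```

In this toy case the material gains a downward impulse of 0.004 N·s, so the sketch reports an upward reaction of about 4 N on the tool. A full MLS-MPM simulator would accumulate such impulses inside its grid update and would also need stresses from particle deformation gradients, which this sketch omits.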

ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation

Robotics

Robots feel and move better with touch.

28 May 2025

86%

Value Vision-Language-Action Planning & Search

Robotics

Helps robots learn to do tasks better and faster.

2 Jan 2026

86%

Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing

Robotics

Robots learn to make cocktails by seeing, hearing, and pouring.

12 Jan 2025
