A Safety Benchmark with Damage-Aware Simulation for Robot Manipulation
What if your home robot finishes the job but breaks your kitchen in the process?
Robots keep getting better at handling everyday objects, but finishing the task isn't the whole story. A robot that picks up an egg and cracks it, or pours a glass of water and spills half of it, still isn't ready for a real home. Safety is the part that's missing, and today's simulators barely measure it.
OopsieVerse is a unified, damage-aware simulation framework for household manipulation. At its core, DamageSim turns physical signals like contact forces, heat, and liquid into measurable mechanical, thermal, and fluid damage, so the benchmark can score not just whether a robot finished the task, but whether it did so safely. It runs in both BEHAVIOR-1K and RoboCasa, and supports safer data collection, damage-aware imitation and reinforcement learning, and Vision-Language-Action safety evaluation.
Explore the work
A simulator-agnostic plugin that turns contact forces, heat, and liquid exposure into measurable mechanical, thermal, and fluid damage.
Explore → 02 OopsieBenchA suite of 32 household tasks that contrast easy-but-risky strategies with safer, more careful ones.
Explore → 03 Use CasesSafer data collection, damage-aware imitation and reinforcement learning, VLA safety evals, and sim-to-real transfer.
Explore →@inproceedings{balaji2026oopsieverse,
title={OopsieVerse: A Safety Benchmark with Damage-Aware Simulation for Robot Manipulation},
author={Balaji, Arnav and Bahety, Arpit and Ambatipudi, Sriniket and Lam, Daniel and Xu, Junhong and Mart{\'\i}n-Mart{\'\i}n, Roberto},
booktitle={Robotics: Science and Systems (RSS), 2026},
year={2026}
}
DamageSim is our simulator-agnostic plugin that makes physical safety measurable by tracking object-centric “health.” It monitors simulator signals—such as contact forces, temperature, and liquid exposure—and converts them into mechanical (e.g., impact or compression), thermal, and fluid damage, which can be used as observations, rewards, or termination conditions. We instantiate it in RoboCasa (MuJoCo) and BEHAVIOR-1K (Omniverse) showcasing its consistency across simulators.
Mechanical Damage
Thermal Damage
Fluid Damage
DamageSim is simulator-agnostic. we instantiate it in BEHAVIOR-1k (Nvidia Omniverse) and RoboCasa (MuJoCo) to demonstrate consistent safety measurement across different physics backends.
MuJoCoRoboCasa
OmniverseBEHAVIOR-1k
OopsieBench is a suite of 32 household tasks in total (15 tasks shown in the grid below; hover a tile to see its name). The suite is designed to (i) expose policies to realistic, physically damaging failure modes in household manipulation, and (ii) make safety measurable by contrasting easy but risky strategies with safer ones that require more careful interaction (e.g., gentler contact, safer approaches, or avoiding hazards). The benchmark spans diverse scenes, objects, and damage modalities, is cross-platform (BEHAVIOR-1k and RoboCasa), and includes a dataset of safe and unsafe human teleop demonstrations for five tasks.
Click a tile to enlarge.
Pour Glass
Add Firewood
Lift Egg
Shelve Item
Wipe Countertop
Attach Camera
Open Microwave Door
Pick up Scrubber
Ignite Wood
Open Single Door
Turn on Microwave
Counter to Microwave
Turn on Stove
Pour Water (Safe + Unsafe Data)
Pour Water (Safety-filtered Data)
Shelve Cereal Box (Safe + Unsafe Data)
Shelve Cereal Box (Safety-filtered Data)