Dataset catalog

Find robotics datasets by task, embodiment, modality, license, and format.

The catalog starts with high-value open robotics datasets and treats license awareness, format metadata, enrichment readiness, and source review as first-class fields.

8 matching datasets
License-aware catalog
Enrichment-ready metadata

DROID

DROID research consortium · v1.0.0

Commercial use likely permitted

Large-scale in-the-wild robot manipulation dataset.

Type

In-the-wild robot manipulation

Size

76K+ trajectories, 350h

Format

TFDS / RLDS / LeRobot

License

CC-BY 4.0 or source-verified

RGB-D
proprioception
actions
language
manipulation
household
teleoperation
Attribution required
Source link placeholder until final verification
Enrichment recommended
2 files135.4 MB4.6/5 · 2 comments

BridgeData V2

UC Berkeley · v1.0.0

Commercial use likely permitted

Low-cost robot manipulation dataset.

Type

Low-cost robot manipulation

Size

60K trajectories

Format

TFDS / raw

License

CC-BY 4.0

RGB-D
actions
proprioception
language
manipulation
household
grasping
Attribution required
Source link placeholder
Enrichment available
2 files135.4 MB4.6/5 · 2 comments

Open X-Embodiment

Open X-Embodiment collaboration · v1.0.0

Check subset license

Cross-robot dataset across many robot embodiments.

Type

Cross-robot embodied dataset

Size

1M+ episodes, 22 robot types, 500+ skills

Format

RLDS

License

Mixed

RGB
actions
proprioception
language
manipulation
grasping
household
tool use
Check subset license
Link-only until subset licensing is verified
Enrichment custom review
2 files135.4 MB4.6/5 · 2 comments

ALOHA

ALOHA project community · v1.0.0

Commercial use likely permitted

Bimanual teleoperation and mobile manipulation datasets.

Type

Bimanual teleoperation

Size

Varies by subset

Format

HDF5 / LeRobot

License

Apache 2.0 for selected LeRobot-hosted subsets

video
actions
proprioception
bimanual
teleoperation
manipulation
tool use
Non-commercial subsets excluded
Source link placeholder
Enrichment available
2 files135.4 MB4.6/5 · 2 comments

LIBERO

LIBERO research project · v1.0.0

Link-only until verified

Lifelong robot learning benchmark.

Type

Lifelong robot learning benchmark

Size

130 tasks, 65K demos

Format

benchmark / simulation / HDF5

License

Open benchmark, verify before mirroring

simulation
images
actions
states
manipulation
benchmarking
household
Link-only until verified
Link-only until license review is complete
Enrichment limited
2 files135.4 MB4.6/5 · 2 comments

RoboNet

RoboNet project · v1.0.0

Link-only until verified

Multi-robot manipulation dataset.

Type

Multi-robot manipulation

Size

15M frames, 7 robot platforms

Format

custom

License

Verify before mirroring

video
actions
manipulation
grasping
Link-only until verified
Link-only until verified
Enrichment custom review
2 files135.4 MB4.6/5 · 2 comments

RoboMimic / MimicGen

RoboMimic and MimicGen communities · v1.0.0

Commercial use likely permitted

Imitation learning framework and generated demonstration datasets.

Type

Imitation learning and generated demos

Size

50K+ demos

Format

HDF5

License

MIT for framework / verify dataset subset

simulation
actions
images
states
manipulation
benchmarking
assembly
Check subset license
Source link placeholder
Enrichment available
2 files135.4 MB4.6/5 · 2 comments

Egocentric-100K

Egocentric data project · v1.0.0

Commercial use likely permitted

Large-scale egocentric manual labor video dataset.

Type

Egocentric manual labor video

Size

100K+ hours, 10.8B frames

Format

WebDataset / MP4

License

Apache 2.0

video
metadata
camera intrinsics
egocentric
tool use
warehouse
inspection
Attribution required
Source link placeholder
Enrichment recommended
2 files135.4 MB4.6/5 · 2 comments
License and ownership notice

HumanoidLayer does not claim ownership of third-party open datasets. We index, curate, normalize metadata, and provide access workflows according to each dataset's license. Some datasets may be link-only until licensing is verified. Commercial use depends on the original license and subset restrictions.