What's the meaning of modalities in MUJOCO PUSH dataset? #20

mrbeann · 2022-05-26T13:17:16Z

Hi, I recently tried the MUJOCO PUSH dataset, but I cannot figure out the concrete meaning of the modalities. The paper mentioned

The multimodal inputs are gray-scaled images (1 × 32 × 32) from an RGB camera, forces (and binary contact information) from a force/torque sensor, and the 3D position of the robot end-effector.

I found the modality in the dataset are "control", "image", "sensor", "pos". What are the correspondences between these modalities and the paper? (i.e. what's the meaning of these modalities?).

arav-agarwal2 · 2022-05-27T18:06:51Z

Someone else can confirm, but here's how I think of things:
-> The "image" modality refers to the gray-scale images.
-> The "pos" modality refers to the 3d position of the end-effector.
-> The "sensor" refers to the forces/binary contact information.
-> The "control" refers to what the controller is sending the arm itself. ( This one I'm the least sure about ).

mrbeann · 2022-05-28T02:30:40Z

I agree with your ideas, but this does not seem to correspond to the paper? For example, Figure 8.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's the meaning of modalities in MUJOCO PUSH dataset? #20

What's the meaning of modalities in MUJOCO PUSH dataset? #20

mrbeann commented May 26, 2022

arav-agarwal2 commented May 27, 2022

mrbeann commented May 28, 2022

What's the meaning of modalities in MUJOCO PUSH dataset? #20

What's the meaning of modalities in MUJOCO PUSH dataset? #20

Comments

mrbeann commented May 26, 2022

arav-agarwal2 commented May 27, 2022

mrbeann commented May 28, 2022