RT 1/2: Translating Vision and Language into Robotic Actions