Prompt Engineering in Practice · Multimodality

Vision Prompting

Introduction

How to design prompts for images: screenshots, diagrams, charts, and UI. Covers detail mode, region-of-interest (ROI) cropping, hallucinations on small text, visual chain-of-thought (CoT), and grounding.