image_url content block and a response_format with a JSON schema; the model returns JSON that conforms to the schema.
For example, you could extract a project name and a column count from a screenshot of a Trello board:
JSON
Combine image input with a JSON schema to extract typed data from screenshots, documents, and photos.
image_url content block and a response_format with a JSON schema; the model returns JSON that conforms to the schema.
For example, you could extract a project name and a column count from a screenshot of a Trello board: