The Claude 3 and 4 families of models comes with new vision capabilities that allow Claude to understand and analyze images, opening up exciting possibilities for multimodal interaction.
Aspect ratio | Image size |
---|---|
1:1 | 1092x1092 px |
3:4 | 951x1268 px |
2:3 | 896x1344 px |
9:16 | 819x1456 px |
1:2 | 784x1568 px |
tokens = (width px * height px)/750
Here are examples of approximate tokenization and costs for different image sizes within our API’s size constraints based on Claude Sonnet 3.7 per-token price of $3 per million input tokens:
Image size | # of Tokens | Cost / image | Cost / 1K images |
---|---|---|---|
200x200 px(0.04 megapixels) | ~54 | ~$0.00016 | ~$0.16 |
1000x1000 px(1 megapixel) | ~1334 | ~$0.004 | ~$4.00 |
1092x1092 px(1.19 megapixels) | ~1590 | ~$0.0048 | ~$4.80 |
image
content blocksExample: One image
Role | Content |
---|---|
User | [Image] Describe this image. |
Example: Multiple images
Image 1:
and Image 2:
and so on. You don’t need newlines between images or between images and the prompt.Ask Claude to describe the differences between multiple images.Role | Content |
---|---|
User | Image 1: [Image 1] Image 2: [Image 2] How are these images different? |
Example: Multiple images with a system prompt
Content | |
---|---|
System | Respond only in Spanish. |
User | Image 1: [Image 1] Image 2: [Image 2] How are these images different? |
Example: Four images across two conversation turns
Role | Content |
---|---|
User | Image 1: [Image 1] Image 2: [Image 2] How are these images different? |
Assistant | [Claude’s response] |
User | Image 1: [Image 3] Image 2: [Image 4] Are these images similar to the first two? |
Assistant | [Claude’s response] |
user
role as part of any standard multiturn conversation structure.What image file types does Claude support?
image/jpeg
image/png
image/gif
image/webp
Can Claude read image URLs?
Is there a limit to the image file size I can upload?
How many images can I include in one request?
Does Claude read image metadata?
Can I delete images I've uploaded?
Where can I find details on data privacy for image uploads?
What if Claude's image interpretation seems wrong?
Can Claude generate or edit images?