Learn about configuring Claude Code through Google Vertex AI, including setup, IAM configuration, and troubleshooting.
gcloud
) installed and configuredus-east5
regions. Ensure you are using us-east5
and have quota allocated, or switch to supported models.cache_control
ephemeral flag. To disable it, set DISABLE_PROMPT_CACHING=1
. For heightened rate limits, contact Google Cloud support./login
and /logout
commands are disabled since authentication is handled through Google Cloud credentials.Model type | Default value |
---|---|
Primary model | claude-sonnet-4@20250514 |
Small/fast model | claude-3-5-haiku@20241022 |
roles/aiplatform.user
role includes the required permissions:
aiplatform.endpoints.predict
- Required for model invocationaiplatform.endpoints.computeTokens
- Required for token counting