Different tiers of service allow you to balance availability, performance, and predictable costs based on your application’s needs.
service_tier
parameter:
service_tier
parameter accepts the following values:
"auto"
(default) - Uses the Priority Tier capacity if available, falling back to your other capacity if not"standard_only"
- Only use standard tier capacity, useful if you don’t want to use your Priority Tier capacityusage
object also includes the service tier assigned to the request:
service_tier="auto"
with a model with a Priority Tier commitment, these response headers provide insights:
service_tier
parameter to auto