Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.
Modalities
Input Price
$2.50/M
Output Price
$12.50/M
Context
1M
Released
Oct 31, 2025
Weekly Tokens
40.9M
Create an API key from your OpenRouter dashboard and set it as an environment variable:
Use amazon/nova-premier-v1 with the OpenRouter API:
OpenRouter provides an OpenAI-compatible completion API to 400+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.
In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.
For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.
Add "stream": true to your request body to receive responses as server-sent events:
Sends a request for a model response for the given chat conversation. Supports both streaming and non-streaming modes.
https://openrouter.ai/api/v1/chat/completionsBearer $OPENROUTER_API_KEYapplication/jsonoptional — your site URL, for rankingsoptional — your site name, for rankingsamazon/nova-premier-v1Creates a streaming or non-streaming response using the OpenAI Responses API format.
Docshttps://openrouter.ai/api/v1/responsesBearer $OPENROUTER_API_KEYapplication/jsonoptional — your site URL, for rankingsoptional — your site name, for rankingsamazon/nova-premier-v1Creates a message using the Anthropic Messages API format. Supports text, images, PDFs, tools, and extended thinking.
Docshttps://openrouter.ai/api/v1/messagesBearer $OPENROUTER_API_KEYapplication/jsonoptional — your site URL, for rankingsoptional — your site name, for rankingsamazon/nova-premier-v1| Name | Type | Default | Description |
|---|---|---|---|
max_tokens | integer | — | This sets the upper limit for the number of tokens the model can generate in response. |
temperature | float | 1 | This setting influences the variety in the model's responses. |
top_p | float | 1 | This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P. |
top_k | integer | 0 | This limits the model's choice of tokens at each step, making it choose from a smaller set. |
stop | array | — | Stop generation immediately if the model encounter any token specified in the stop array. |
tools | array | — | Tool calling parameter, following OpenAI's tool calling request shape. |