Nova Lite 2 API: Pricing, Playground & Docs

About Nova Lite 2

Fast, cost-effective multimodal reasoning model for text, images, documents, and video on a 1M context (long docs and ~90 min clips).

Also known as Amazon Nova Lite 2, Nova-Lite-2

visionfunction callingreasoning

Nova Lite 2 specs

Model ID: nova-lite-2
Provider: Amazon
Category: Text Generation
Released: Dec 2, 2025
Context window: 1M tokens
Max output: 32,000 tokens
Input: TextImageVideoDocument
Output: Text
Structured output: JSON Mode
Batch API: Available, 35% off list price
Endpoints: POST/v1/chat/completionsPOST/v1/responsesPOST/v1/messagesPOST/v1beta/models/nova-lite-2:generateContent
Alternate model IDs: amazon-nova-lite-2amazon/nova-lite-2us.amazon.nova-2-lite-v1:0

Nova Lite 2 API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type

Spec

Rate

Input

per 1M prompt tokens

$0.38

Output

per 1M generated tokens

$3.16

Cached input

per 1M tokens

$0.2128

Web Search (Linkup)

per call when invoked

$0.013

Compare on the full pricing page

How to call the Nova Lite 2 API

Nova Lite 2 serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id nova-lite-2. Get an API key from the EmpirioLabs dashboard.

cURL

curl https://api.empiriolabs.ai/v1/chat/completions \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nova-lite-2",
    "messages": [
      {"role": "user", "content": "Write a haiku about the ocean."}
    ]
  }'

Python (OpenAI SDK)

from openai import OpenAI

client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key="YOUR_EMPIRIOLABS_API_KEY",
)

response = client.chat.completions.create(
    model="nova-lite-2",
    messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)

Full Nova Lite 2 API reference

Nova Lite 2 API parameters

Request parameters supported by the Nova Lite 2 API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter	Type	Default	Range / values	Description
temperature	number	0.7	0 to 2	Sampling temperature. 0 = deterministic, 2 = maximum randomness.
top_p	number	0.9	0 to 1	Nucleus sampling probability mass. Lower = more focused.
max_tokens	number	4096	1 to 65536	Maximum tokens in the response.
stop	string	-	-	Up to 4 strings where the model will stop generating further tokens.
enable_reasoning	boolean	true	-	Enable the model's reasoning mode. Slower but improves multi-step problems.
enable_thinking	boolean	true	-	Enable extended reasoning before the final answer. Alias of enable_reasoning.
reasoning_effort	enum	medium	low, medium, high	Reasoning effort level (low \| medium \| high). Higher = more thinking time.
reasoning	string	-	-	Responses API reasoning object: {"effort":"low\|medium\|high"}
response_format	enum	-	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
web_search_linkup	boolean	false	-	Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. Adds $0.013 per call when invoked on top of the model's normal token cost. Disabled by default.
disable_formatting	boolean	false	-	When enabled, the gateway will not append the "Sources" footer to assistant responses that used Linkup web search. Useful when the model output is piped to another...

Good to know

Reasoning traces are NOT exposed from AWS. Video uploads up to ~1 GB.

Nova Lite 2 API: common questions

How much does the Nova Lite 2 API cost?

On EmpirioLabs, Nova Lite 2 is billed pay as you go: Input $0.38 per 1M prompt tokens; Output $3.16 per 1M generated tokens; Cached input $0.2128 per 1M tokens. The live rate card on this page always matches what the API charges.

What is the context window of Nova Lite 2?

Nova Lite 2 supports a 1M-token context window with up to 32,000 output tokens per response.

Is the Nova Lite 2 API OpenAI-compatible?

Yes. Nova Lite 2 serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to nova-lite-2.

Can I try Nova Lite 2 in the browser before integrating?

Yes. The EmpirioLabs playground runs Nova Lite 2 in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a Nova Lite 2 API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Nova Lite 2 API

About Nova Lite 2

Nova Lite 2 specs

Nova Lite 2 API pricing

How to call the Nova Lite 2 API

Nova Lite 2 API parameters

Good to know

Nova Lite 2 API: common questions

How much does the Nova Lite 2 API cost?

What is the context window of Nova Lite 2?

Is the Nova Lite 2 API OpenAI-compatible?

Can I try Nova Lite 2 in the browser before integrating?

How do I get a Nova Lite 2 API key?

More Text Generation model APIs

GLM 5.2

Kimi K3

Kimi K2.7 Code

Muse Spark 1.1

Fugu Ultra v1.1

Qwen3.7 Plus

Ready to use better endpoints?