Gemma 4 E4B API: Pricing, Playground & Docs

About Gemma 4 E4B

Gemma 4 E4B is a Google open multimodal chat model with image input, function calling, structured output, and efficient instruction following.

Supports text and image input, streaming, function tools, structured JSON output, seed control, and thinking mode on by default. Billing is per successful message.

Also known as Google Gemma 4 E4B, Gemma-4-E4B

visionfunction callingstructured outputreasoning

Gemma 4 E4B specs

Model ID: gemma-4-e4b
Provider: Google
Category: Text Generation
Released: Mar 31, 2026
Context window: 8K tokens
Max output: 4,096 tokens
Input: TextImage
Output: Text
Endpoints: POST /v1/chat/completions
POST /v1/responses
POST /v1/messages

Gemma 4 E4B API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type

Spec

Rate

Per Message

fixed

$0.03

Web Search (Linkup)

per call when invoked

$0.013

Compare on the full pricing page

How to call the Gemma 4 E4B API

Gemma 4 E4B serves the OpenAI-compatible Chat Completions API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 with your EmpirioLabs API key and use the model id gemma-4-e4b. Get an API key from the EmpirioLabs dashboard.

cURL

curl https://api.empiriolabs.ai/v1/chat/completions \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemma-4-e4b",
    "messages": [
      {"role": "user", "content": "Write a haiku about the ocean."}
    ]
  }'

Python (OpenAI SDK)

from openai import OpenAI

client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key="YOUR_EMPIRIOLABS_API_KEY",
)

response = client.chat.completions.create(
    model="gemma-4-e4b",
    messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
)
print(response.choices[0].message.content)

Full Gemma 4 E4B API reference

Gemma 4 E4B API parameters

Request parameters supported by the Gemma 4 E4B API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter	Type	Default	Range / values	Description
temperature	number	0.7	0 to 2	Sampling temperature. Lower values are more deterministic.
top_p	number	0.95	0 to 1	Nucleus sampling probability mass.
min_p	number	0	0 to 1	Minimum token probability filter.
max_tokens	integer	4096	1 to 4096	Maximum output tokens.
stop	string	-	-	One or more stop strings.
seed	integer	-	0 to 2147483647	Optional deterministic sampling seed.
enable_thinking	boolean	true	-	Enable the model reasoning channel before final output.
reasoning_effort	enum	-	low, medium, high	OpenAI-compatible hint. Medium and high enable thinking mode.
presence_penalty	number	0	-2 to 2	Penalize tokens that already appeared.
frequency_penalty	number	0	-2 to 2	Penalize repeated tokens by frequency.
repetition_penalty	number	1	0.1 to 2	Discourage exact repeated text.
logprobs	boolean	false	-	Return token log probabilities when supported.
top_logprobs	integer	-	0 to 20	Number of alternate token log probabilities to return.
tools	array	-	-	OpenAI-compatible function tool definitions.

5 more parameters in the docs

Good to know

Supports text and image input, streaming, function tools, structured JSON output, seed control, and thinking mode on by default. Billing is per successful message.

Gemma 4 E4B API: common questions

How much does the Gemma 4 E4B API cost?

On EmpirioLabs, Gemma 4 E4B is billed pay as you go: Per Message $0.03 fixed; Web Search (Linkup) $0.013 per call when invoked. The live rate card on this page always matches what the API charges.

What is the context window of Gemma 4 E4B?

Gemma 4 E4B supports a 8K-token context window with up to 4,096 output tokens per response.

Is the Gemma 4 E4B API OpenAI-compatible?

Yes. Gemma 4 E4B serves the OpenAI-compatible Chat Completions API, so existing OpenAI SDKs work by pointing base_url at https://api.empiriolabs.ai/v1 and setting the model id to gemma-4-e4b.

Can I try Gemma 4 E4B in the browser before integrating?

Yes. The EmpirioLabs playground runs Gemma 4 E4B in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a Gemma 4 E4B API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Gemma 4 E4B API

About Gemma 4 E4B

Gemma 4 E4B specs

Gemma 4 E4B API pricing

How to call the Gemma 4 E4B API

Gemma 4 E4B API parameters

Good to know

Gemma 4 E4B API: common questions

How much does the Gemma 4 E4B API cost?

What is the context window of Gemma 4 E4B?

Is the Gemma 4 E4B API OpenAI-compatible?

Can I try Gemma 4 E4B in the browser before integrating?

How do I get a Gemma 4 E4B API key?

More Text Generation model APIs

GLM 5.2

Kimi K2.7 Code

Fugu Ultra

Qwen3.7 Plus

Kimi K2.7 Code Highspeed

MiniMax M3

Ready to use better endpoints?