Tongyi Embedding Vision Flash API: Pricing, Playground & Docs


About Tongyi Embedding Vision Flash

Speed-optimised multimodal embedding — same shape as Vision-Plus, 3× cheaper image/video tokens.

Also known as Alibaba Cloud Tongyi Embedding Vision Flash, Tongyi-Embedding-Vision-Flash

Tags: multimodal, independent vectors, low cost

Tongyi Embedding Vision Flash specs

Model ID: tongyi-embedding-vision-flash
Provider: Alibaba Cloud
Category: Embeddings
Released: Sep 23, 2025
Context window: 1024 tokens
Input: Text, Image, Video
Output: Embedding
Region: Singapore
Endpoints: POST /v1/embeddings

Tongyi Embedding Vision Flash API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type	Spec	Rate
Text input	per 1M tokens	$0.09
Image / video input	per 1M tokens	$0.03
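The two rates above translate directly into a per-request estimate. The sketch below is illustrative only: `estimate_cost_usd` is a hypothetical helper, the token counts are made-up inputs, and it assumes both rates are applied linearly with no rounding or minimum charge, which this page does not confirm.

```python
from decimal import Decimal

# Rates copied from the rate card above, in USD per 1M tokens.
TEXT_RATE_PER_M = Decimal("0.09")
MEDIA_RATE_PER_M = Decimal("0.03")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost_usd(text_tokens: int, media_tokens: int) -> Decimal:
    """Estimate spend, assuming simple linear per-token billing."""
    if text_tokens < 0 or media_tokens < 0:
        raise ValueError("token counts must be non-negative")
    text_cost = TEXT_RATE_PER_M * text_tokens / ONE_MILLION
    media_cost = MEDIA_RATE_PER_M * media_tokens / ONE_MILLION
    return text_cost + media_cost


if __name__ == "__main__":
    # Hypothetical workload: 2M text tokens and 5M image/video tokens.
    print(estimate_cost_usd(2_000_000, 5_000_000))
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise when summed.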


How to call the Tongyi Embedding Vision Flash API

Tongyi Embedding Vision Flash serves the OpenAI-compatible Embeddings API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 and use the model id tongyi-embedding-vision-flash. Get an API key from the EmpirioLabs dashboard.

cURL

curl https://api.empiriolabs.ai/v1/embeddings \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tongyi-embedding-vision-flash",
    "input": "The quick brown fox jumps over the lazy dog."
  }'

Python (OpenAI SDK)

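import os

from openai import OpenAI

# Read the key from the environment (same variable as the cURL example)
# rather than hard-coding it in source.
client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key=os.environ["EMPIRIOLABS_API_KEY"],
)

# Request an embedding for a single text input.
response = client.embeddings.create(
    model="tongyi-embedding-vision-flash",
    input="The quick brown fox jumps over the lazy dog.",
)
# Print the first 8 values of the returned vector.
print(response.data[0].embedding[:8])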


Tongyi Embedding Vision Flash API parameters

Request parameters supported by the Tongyi Embedding Vision Flash API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter	Type	Default	Range / values	Description
input	string	-	-	Either an OpenAI-style part array [{type:'image',url:...},{type:'text',text:...}] or a native part list {contents:[{image:'...'},{text:'...'}]}. Up to 8 images at 3...
user	string	-	-	Optional caller identifier.
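As a rough illustration of the OpenAI-style part array described for `input`, the sketch below assembles a request body that mixes images and text. `build_part_array_payload` is a hypothetical helper, the part keys (`type`, `url`, `text`) are taken from the description above, and the `example.com` URL is a placeholder. Whether the service expects a particular part order or a particular URL scheme is not stated on this page, so treat the shape as an assumption to verify against the full reference.

```python
import json

MAX_IMAGES_PER_REQUEST = 8  # per-request image limit stated on this page


def build_part_array_payload(text: str, image_urls: list[str]) -> dict:
    """Build a request body using the OpenAI-style part array for `input`."""
    if len(image_urls) > MAX_IMAGES_PER_REQUEST:
        raise ValueError(f"at most {MAX_IMAGES_PER_REQUEST} images per request")
    # Part shapes follow the parameter description: {type, url} and {type, text}.
    parts = [{"type": "image", "url": url} for url in image_urls]
    parts.append({"type": "text", "text": text})
    return {"model": "tongyi-embedding-vision-flash", "input": parts}


if __name__ == "__main__":
    # Placeholder URL, not a real asset.
    body = build_part_array_payload("a red bicycle", ["https://example.com/bike.jpg"])
    print(json.dumps(body, indent=2))
```

The resulting dictionary can be sent as the JSON body of the same POST /v1/embeddings request shown in the cURL example.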

Good to know

Output

Fixed 768-dim vector per input
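Because every input maps to a vector of the same length, two embeddings can be compared directly. The sketch below uses cosine similarity, which is a common choice for embedding comparison rather than something this page prescribes; `cosine_similarity` is a hypothetical helper written with the standard library only.

```python
import math

EMBEDDING_DIM = 768  # fixed output size stated above


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors of the expected size."""
    if len(a) != EMBEDDING_DIM or len(b) != EMBEDDING_DIM:
        raise ValueError(f"expected two {EMBEDDING_DIM}-dimensional vectors")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

In practice `a` and `b` would be the `embedding` lists returned for two inputs, for example a text query and an image.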

Per-input limits

Text: up to 1,024 tokens
Image: up to 8 per request, 3 MB each (JPG, PNG, BMP)
Video: up to 10 MB per file (MP4, MPEG, MOV, MPG, WEBM, AVI, FLV, MKV)
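The limits above can be checked client-side before uploading anything. The following is a minimal sketch, not part of any EmpirioLabs SDK: `check_media` is a hypothetical helper, it matches only the extensions listed above (so `.jpeg` is rejected), and it assumes 1 MB means 1,048,576 bytes, which this page does not specify.

```python
from pathlib import Path

# Assumption: the page does not say whether MB means 10^6 or 2^20 bytes.
MB = 1024 * 1024

IMAGE_EXTS = {".jpg", ".png", ".bmp"}
VIDEO_EXTS = {".mp4", ".mpeg", ".mov", ".mpg", ".webm", ".avi", ".flv", ".mkv"}
MAX_IMAGE_BYTES = 3 * MB
MAX_VIDEO_BYTES = 10 * MB


def check_media(filename: str, size_bytes: int) -> str:
    """Return 'image' or 'video' if the file fits the listed limits, else raise."""
    ext = Path(filename).suffix.lower()
    if ext in IMAGE_EXTS:
        kind, limit = "image", MAX_IMAGE_BYTES
    elif ext in VIDEO_EXTS:
        kind, limit = "video", MAX_VIDEO_BYTES
    else:
        raise ValueError(f"unsupported file type: {ext or filename}")
    if size_bytes > limit:
        raise ValueError(f"{kind} is {size_bytes} bytes, limit is {limit}")
    return kind
```

A caller could pass `os.path.getsize(path)` as `size_bytes` and skip files that raise, instead of waiting for the request to be rejected.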

Languages

Chinese, English

Tongyi Embedding Vision Flash API: common questions

How much does the Tongyi Embedding Vision Flash API cost?

On EmpirioLabs, Tongyi Embedding Vision Flash is billed pay-as-you-go: text input costs $0.09 per 1M tokens, and image / video input costs $0.03 per 1M tokens. The live rate card on this page always matches what the API charges.

What is the context window of Tongyi Embedding Vision Flash?

Tongyi Embedding Vision Flash supports a 1024-token context window.

Which endpoint does Tongyi Embedding Vision Flash use?

Tongyi Embedding Vision Flash is served through POST /v1/embeddings on api.empiriolabs.ai with standard bearer-token authentication.
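For callers that do not use an OpenAI SDK, the same POST request with a bearer token can be assembled by hand. The sketch below uses only the Python standard library; `build_embeddings_request` and `fetch_embedding` are hypothetical helper names, and the response is assumed to follow the OpenAI-compatible shape (`data[0].embedding`) used in the SDK example on this page.

```python
import json
import urllib.request

API_URL = "https://api.empiriolabs.ai/v1/embeddings"


def build_embeddings_request(api_key: str, text: str) -> urllib.request.Request:
    """Assemble the POST request without sending it."""
    body = json.dumps(
        {"model": "tongyi-embedding-vision-flash", "input": text}
    ).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # standard bearer-token auth
            "Content-Type": "application/json",
        },
    )


def fetch_embedding(api_key: str, text: str) -> list:
    """Send the request and return the first embedding (performs a network call)."""
    request = build_embeddings_request(api_key, text)
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Assumed OpenAI-compatible response shape: {"data": [{"embedding": [...]}]}.
    return payload["data"][0]["embedding"]
```

Keeping request construction separate from sending makes the headers and body easy to inspect or unit-test without touching the network.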

Can I try Tongyi Embedding Vision Flash in the browser before integrating?

Yes. The EmpirioLabs playground runs Tongyi Embedding Vision Flash in the browser with the same parameters the API exposes, so you can test inputs before writing code.

How do I get a Tongyi Embedding Vision Flash API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.
