Tongyi Embedding Vision Flash API

Speed-optimised multimodal embedding — same shape as Vision-Plus, 3× cheaper image/video tokens.

Alibaba Cloud · Embeddings · 1024 context · Singapore · Proprietary Endpoint · New

About Tongyi Embedding Vision Flash

Tongyi Embedding Vision Flash is a speed-optimised multimodal embedding model with the same shape as Vision-Plus and 3× cheaper image/video tokens.

multimodal · independent vectors · low cost

Tongyi Embedding Vision Flash specs

Model ID: tongyi-embedding-vision-flash
Provider: Alibaba Cloud
Category: Embeddings
Context window: 1024 tokens
Input: text, image, video
Output: embedding
Region: Singapore
Endpoints: POST /v1/embeddings

Tongyi Embedding Vision Flash API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type                 Spec            Rate
Text input           per 1M tokens   $0.09
Image / video input  per 1M tokens   $0.03
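As a rough illustration of the rate card above, the following sketch estimates the cost of a workload from its token counts. The function name estimate_cost_usd and the constant names are illustrative, not part of any EmpirioLabs SDK. The rates are copied from the table, and actual billing depends on the token counts the API reports.

```python
# Rates copied from the pricing table above (USD per 1M tokens).
TEXT_RATE_PER_M = 0.09
MEDIA_RATE_PER_M = 0.03


def estimate_cost_usd(text_tokens: int, media_tokens: int) -> float:
    """Estimate the pay-as-you-go cost for the given input token counts."""
    text_cost = text_tokens / 1_000_000 * TEXT_RATE_PER_M
    media_cost = media_tokens / 1_000_000 * MEDIA_RATE_PER_M
    return text_cost + media_cost


if __name__ == "__main__":
    # Example workload: 2M text tokens plus 10M image/video tokens.
    print(f"${estimate_cost_usd(2_000_000, 10_000_000):.2f}")
```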

How to call the Tongyi Embedding Vision Flash API

Tongyi Embedding Vision Flash serves the OpenAI-compatible Embeddings API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 and use the model id tongyi-embedding-vision-flash. Get an API key from the EmpirioLabs dashboard.

cURL
curl https://api.empiriolabs.ai/v1/embeddings \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tongyi-embedding-vision-flash",
    "input": "The quick brown fox jumps over the lazy dog."
  }'
Python (OpenAI SDK)
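import os

from openai import OpenAI

# Point the OpenAI SDK at the EmpirioLabs endpoint.
client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    # Read the key from the environment (same variable as the cURL example)
    # instead of hard-coding it.
    api_key=os.environ["EMPIRIOLABS_API_KEY"],
)

# Request an embedding for a single text input.
embedding = client.embeddings.create(
    model="tongyi-embedding-vision-flash",
    input="The quick brown fox jumps over the lazy dog.",
)
# Print the first 8 components of the returned vector.
print(embedding.data[0].embedding[:8])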

Tongyi Embedding Vision Flash API parameters

Request parameters supported by the Tongyi Embedding Vision Flash API on EmpirioLabs. Defaults apply when a field is omitted.

Parameter  Type    Default  Range / values  Description
input      string  -        -               Either an OpenAI-style part array [{type:'image',url:...},{type:'text',text:...}] or a native part list {contents:[{image:'...'},{text:'...'}]}. Up to 8 images at 3...
user       string  -        -               Optional caller identifier.
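To make the OpenAI-style part array more concrete, here is a minimal sketch that builds a request body mixing images and text. The part keys (type, url, text) are taken from the truncated description in the table above, so the exact schema is an assumption to check against the full API reference. The helper name build_multimodal_request and the example URL are hypothetical.

```python
import json

MODEL_ID = "tongyi-embedding-vision-flash"


def build_multimodal_request(text: str, image_urls: list[str]) -> dict:
    """Build a request body using the OpenAI-style part array.

    Assumption: part keys follow the shape quoted in the parameter table,
    i.e. {type: 'image', url: ...} and {type: 'text', text: ...}.
    """
    parts = [{"type": "image", "url": url} for url in image_urls]
    parts.append({"type": "text", "text": text})
    return {"model": MODEL_ID, "input": parts}


if __name__ == "__main__":
    body = build_multimodal_request(
        "A red bicycle leaning against a wall",
        ["https://example.com/bicycle.jpg"],  # placeholder URL
    )
    # The resulting JSON would be sent as the body of POST /v1/embeddings.
    print(json.dumps(body, indent=2))
```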

Good to know

Output

  • Fixed 768-dim vector per input
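Because every input maps to a vector of the same size, two embeddings can be compared directly. The sketch below checks the stated 768 dimensions and computes cosine similarity in plain Python. Cosine similarity is a common choice for embedding comparison rather than something this page prescribes, and the function name is illustrative.

```python
import math

EXPECTED_DIM = 768  # fixed output size stated above


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Return the cosine similarity of two 768-dim embedding vectors."""
    if len(a) != EXPECTED_DIM or len(b) != EXPECTED_DIM:
        raise ValueError(f"expected {EXPECTED_DIM}-dim vectors")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cannot compare a zero vector")
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Two toy unit vectors standing in for real API responses.
    u = [1.0] + [0.0] * (EXPECTED_DIM - 1)
    v = [0.0, 1.0] + [0.0] * (EXPECTED_DIM - 2)
    print(cosine_similarity(u, u), cosine_similarity(u, v))
```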

Per-input limits

  • Text: up to 1,024 tokens
  • Image: up to 8 per request, 3 MB each (JPG, PNG, BMP)
  • Video: up to 10 MB per file (MP4, MPEG, MOV, MPG, WEBM, AVI, FLV, MKV)
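The limits above can be checked on the client before a request is sent. The following is a sketch only: the helper name check_media is hypothetical, 1 MB is assumed to mean 1,048,576 bytes, and accepting the .jpeg extension as JPG is also an assumption. The API itself remains the authority on what it accepts.

```python
from pathlib import Path

MAX_IMAGES = 8
MAX_IMAGE_BYTES = 3 * 1024 * 1024   # assumes 1 MB = 1,048,576 bytes
MAX_VIDEO_BYTES = 10 * 1024 * 1024
# ".jpeg" is an assumed alias for the listed JPG format.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".bmp"}
VIDEO_EXTS = {".mp4", ".mpeg", ".mov", ".mpg", ".webm", ".avi", ".flv", ".mkv"}


def check_media(files: list[tuple[str, int]]) -> list[str]:
    """Return a list of problems for (filename, size_in_bytes) pairs."""
    problems = []
    image_count = sum(
        1 for name, _ in files if Path(name).suffix.lower() in IMAGE_EXTS
    )
    if image_count > MAX_IMAGES:
        problems.append(f"{image_count} images exceeds the limit of {MAX_IMAGES}")
    for name, size in files:
        ext = Path(name).suffix.lower()
        if ext in IMAGE_EXTS:
            if size > MAX_IMAGE_BYTES:
                problems.append(f"{name}: image larger than 3 MB")
        elif ext in VIDEO_EXTS:
            if size > MAX_VIDEO_BYTES:
                problems.append(f"{name}: video larger than 10 MB")
        else:
            problems.append(f"{name}: unsupported file type")
    return problems


if __name__ == "__main__":
    print(check_media([("cat.png", 200_000), ("clip.mp4", 12 * 1024 * 1024)]))
```

An empty list means no limit was exceeded under these assumptions.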

Languages

  • Chinese, English

Tongyi Embedding Vision Flash API: common questions

How much does the Tongyi Embedding Vision Flash API cost?

On EmpirioLabs, Tongyi Embedding Vision Flash is billed pay-as-you-go: text input costs $0.09 per 1M tokens, and image / video input costs $0.03 per 1M tokens. The live rate card on this page always matches what the API charges.

What is the context window of Tongyi Embedding Vision Flash?

Tongyi Embedding Vision Flash supports a 1024-token context window.

Which endpoint does Tongyi Embedding Vision Flash use?

Tongyi Embedding Vision Flash is served through POST /v1/embeddings on api.empiriolabs.ai with standard bearer-token authentication.

Can I try Tongyi Embedding Vision Flash in the browser before integrating?

Yes. The EmpirioLabs playground runs Tongyi Embedding Vision Flash in the browser with the same parameters the API exposes, so you can test inputs before writing code.

How do I get a Tongyi Embedding Vision Flash API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.
