Tongyi Embedding Vision Plus API

Multimodal embedding producing independent vectors for text, image, and video inputs.

Alibaba CloudEmbeddings1024 contextSingaporeProprietary EndpointNew

About Tongyi Embedding Vision Plus

Multimodal embedding producing independent vectors for text, image, and video inputs.

Also known as Tongyi Embedding Vision, Tongyi-Embedding-Vision-Plus

multimodalindependent vectors

Tongyi Embedding Vision Plus specs

Model ID
tongyi-embedding-vision-plus
Provider
Alibaba Cloud
Category
Embeddings
Context window
1024 tokens
Input
text, image, video
Output
embedding
Region
Singapore
Endpoints
POST /v1/embeddings

Tongyi Embedding Vision Plus API pricing

Live pay-as-you-go rates from the EmpirioLabs catalog. You are billed only for what you use, with no monthly minimum.

Type
Spec
Rate
Text input
per 1M tokens
$0.09
Image / video input
per 1M tokens
$0.09
Compare on the full pricing page

How to call the Tongyi Embedding Vision Plus API

Tongyi Embedding Vision Plus serves the OpenAI-compatible Embeddings API. Point any OpenAI SDK at https://api.empiriolabs.ai/v1 and use the model id tongyi-embedding-vision-plus. Get an API key from the EmpirioLabs dashboard.

cURL
curl https://api.empiriolabs.ai/v1/embeddings \
  -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tongyi-embedding-vision-plus",
    "input": "The quick brown fox jumps over the lazy dog."
  }'
Python (OpenAI SDK)
from openai import OpenAI

client = OpenAI(
    base_url="https://api.empiriolabs.ai/v1",
    api_key="YOUR_EMPIRIOLABS_API_KEY",
)

embedding = client.embeddings.create(
    model="tongyi-embedding-vision-plus",
    input="The quick brown fox jumps over the lazy dog.",
)
print(embedding.data[0].embedding[:8])
Full Tongyi Embedding Vision Plus API reference

Tongyi Embedding Vision Plus API parameters

Request parameters supported by the Tongyi Embedding Vision Plus API on EmpirioLabs. Defaults apply when a field is omitted.

ParameterTypeDefaultRange / valuesDescription
inputstring--Either an OpenAI-style part array [{type:'image',url:...},{type:'text',text:...}] or a native part list {contents:[{image:'...'},{text:'...'}]}. Up to 8 images at 3...
userstring--Optional caller identifier.

Good to know

Output

  • Fixed 1152-dim vector per input (no fusion across modalities)

Per-input limits

  • Text: up to 1,024 tokens
  • Image: up to 8 per request, 3 MB each (JPG, PNG, BMP)
  • Video: up to 10 MB per file (MP4, MPEG, MOV, MPG, WEBM, AVI, FLV, MKV)

Languages

  • Chinese, English

Tongyi Embedding Vision Plus API: common questions

How much does the Tongyi Embedding Vision Plus API cost?

On EmpirioLabs, Tongyi Embedding Vision Plus is billed pay as you go: Text input $0.09 per 1M tokens; Image / video input $0.09 per 1M tokens. The live rate card on this page always matches what the API charges.

What is the context window of Tongyi Embedding Vision Plus?

Tongyi Embedding Vision Plus supports a 1024-token context window.

Which endpoint does Tongyi Embedding Vision Plus use?

Tongyi Embedding Vision Plus is served through POST /v1/embeddings on api.empiriolabs.ai with standard bearer-token authentication.

Can I try Tongyi Embedding Vision Plus in the browser before integrating?

Yes. The EmpirioLabs playground runs Tongyi Embedding Vision Plus in the browser with the same parameters the API exposes, so you can test prompts before writing code.

How do I get a Tongyi Embedding Vision Plus API key?

Create an EmpirioLabs account, then generate a key under API Keys in the dashboard. Billing is pay-as-you-go credits, so you only pay for the requests you make.

Ready to use better endpoints?

Explore our models, or contact us about business inquiries, custom deployments, or anything else.