Explore our production ready AI Models

Browse the full catalog of models across text, image, audio, video, 3D, and more.

Model catalog

AI models on one OpenAI-compatible API.

Browse text, image, video, audio, 3D, search, and agent endpoints with pay-as-you-go pricing. The interactive catalog loads current availability from EmpirioLabs, and these model docs are crawlable without client JavaScript.

Open model docs

New & Featured

Proprietary EndpointNew

Text-to-video and image-to-video with synchronized native audio, at 720p or 1080p for 3 to 15 seconds, with aspect ratio and prompt control.

Released Jun 17, 2026
Save up to 21%
Proprietary EndpointNew

Reasoning and coding model with a 1M token context, 128K output, adjustable reasoning effort, native web search, and tool calling.

SingaporeReleased Jun 16, 20261M context
GermanyReleased Jun 16, 20261M context
Save up to 7%

Kimi K2.7 Code

Moonshot AI
Proprietary EndpointNew

Kimi K2.7 Code is Moonshot's trillion-parameter agentic coding model with 256K context, always-on reasoning, and text, image, and video inputs.

Released Jun 16, 2026256K context
GermanyReleased Jun 16, 2026256K context
Save up to 31%

Qwen3.7 Plus

Alibaba Cloud
Proprietary EndpointNew

Cost-effective Qwen3.7 vision-language model for text, image, video, coding, tool use, GUI understanding, and 1M-context workflows.

SingaporeReleased Jun 1, 20261M context
ChinaReleased Jun 1, 20261M context
Proprietary EndpointNew

Kimi K2.7 Code Highspeed is the faster-serving tier of Moonshot's agentic coding model, with 256K context, always-on reasoning, and image and video input.

Released Jun 16, 2026256K context
Save up to 25%

MiniMax M3

MiniMax
Proprietary EndpointNew

MiniMax M3 is a multimodal reasoning model for coding, agents, and long-context analysis with text, image, and video input.

SingaporeReleased Jun 1, 2026524K context

Text Generation55

Save up to 21%
Proprietary EndpointNew

Reasoning and coding model with a 1M token context, 128K output, adjustable reasoning effort, native web search, and tool calling.

SingaporeReleased Jun 16, 20261M context
GermanyReleased Jun 16, 20261M context
Save up to 7%

Kimi K2.7 Code

Moonshot AI
Proprietary EndpointNew

Kimi K2.7 Code is Moonshot's trillion-parameter agentic coding model with 256K context, always-on reasoning, and text, image, and video inputs.

Released Jun 16, 2026256K context
GermanyReleased Jun 16, 2026256K context
Save up to 31%

Qwen3.7 Plus

Alibaba Cloud
Proprietary EndpointNew

Cost-effective Qwen3.7 vision-language model for text, image, video, coding, tool use, GUI understanding, and 1M-context workflows.

SingaporeReleased Jun 1, 20261M context
ChinaReleased Jun 1, 20261M context
Proprietary EndpointNew

Kimi K2.7 Code Highspeed is the faster-serving tier of Moonshot's agentic coding model, with 256K context, always-on reasoning, and image and video input.

Released Jun 16, 2026256K context
Save up to 25%

MiniMax M3

MiniMax
Proprietary EndpointNew

MiniMax M3 is a multimodal reasoning model for coding, agents, and long-context analysis with text, image, and video input.

SingaporeReleased Jun 1, 2026524K context
Save up to 34%

Qwen3.7 Max

Alibaba Cloud
Proprietary EndpointNew

Qwen3.7 Max is a flagship text model for coding, productivity, long-running agents, deep thinking, tools, and 1M-token context.

SingaporeReleased May 21, 20261M context
ChinaReleased May 21, 20261M context

Image Generation7

Save up to 39%

FLUX.2 Klein 4B

Black Forest Labs
Native InferenceNew

Apache-licensed 4B FLUX.2 Klein image generation and editing model with text-to-image, reference-image editing, and creative workflow support.

Released Jan 15, 2026
Proprietary Endpoint

Image generation and editing model creating and modifying images from text or image inputs, with inpainting, virtual try-on, and style controls.

Released Dec 3, 2024
Proprietary Endpoint

Open-source text-to-image model on a multimodal Mixture-of-Experts architecture with photorealistic detail and strong multilingual text rendering.

Released Sep 28, 2025
Proprietary Endpoint

Autoregressive framework on the Janus Pro 7B model that unifies multimodal understanding and image generation in one architecture.

Released Jan 27, 2025
Save up to 8%

Qwen Image 2.0

Alibaba Cloud
Proprietary Endpoint

Unified image generation and editing model with class-leading complex Chinese/English text rendering, realistic textures, and multi-image fusion.

SingaporeReleased Mar 3, 2026
Proprietary EndpointNew

Unified multimodal image model that reasons through prompts before rendering, producing high-resolution and consistent edits and brand visuals.

MalaysiaReleased Feb 13, 2026

Video Generation15

Proprietary EndpointNew

Text-to-video and image-to-video with synchronized native audio, at 720p or 1080p for 3 to 15 seconds, with aspect ratio and prompt control.

Released Jun 17, 2026
Proprietary Endpoint

Video generation model producing up to 2-minute multi-shot videos from text and optional image prompts with improved quality and consistency.

Released Apr 7, 2025

HappyHorse 1.0

Alibaba Cloud
Proprietary EndpointNew

Video model offering Text-to-Video, Image-to-Video, Reference-to-Video, and Video Edit modes with high-fidelity, motion-smooth output.

SingaporeReleased May 6, 2026
Save up to 19%
Native Inference

8.3B-parameter video model with native 720p output (upscalable to 1080p), strong motion coherence, and bilingual prompt understanding up to 10s.

Released Nov 20, 2025

Kling O3

Kling AI
Proprietary Endpoint

Video model in Standard or Pro modes with Text-to-Video, Image-to-Video, Reference-to-Video, editing, native sound, and multi-scene transitions.

Released Feb 5, 2026
Proprietary Endpoint

Kling 3.0 model that transfers motion from a reference video onto a character from a reference image, with Standard 720p and Pro 1080p tiers.

Audio Generation10

Save up to 17%
Native InferenceNew

Open-source music generation model for text-to-song and lyric-guided audio, with fast 8-step XL Turbo inference for controllable song iteration.

Released Apr 2, 2026
Save up to 30%
Proprietary EndpointNew

Sub-130ms TTFB voice synthesis with 271+ voices across 15 languages, expressive prosody, and real-time SSE streaming for low-latency voice agents.

Released May 5, 2026
Save up to 15%
Proprietary EndpointNew

Broadcast-quality voice synthesis with rich expressive prosody, 271+ voices across 15 languages, and real-time SSE streaming with per-word timestamps.

Released May 5, 2026
Proprietary Endpoint

Low-latency text-to-speech with single- and multi-speaker voices and controllable style, accent, and expressive tone for production apps.

Released May 20, 2025
Proprietary Endpoint

High-quality TTS preview for podcasts, audiobooks, and customer support, with expressive multi-speaker voices across 23+ languages.

Released May 20, 2025
Proprietary EndpointNew

Highly controllable TTS with new Audio Tags for precise style, tone, pace, and delivery across narration, assistants, and voice apps.

Released Apr 13, 2026

Transcription3

Proprietary Endpoint

Speech-to-text transcription using the Nova-3 model with multi-language support and advanced customizable settings for production workloads.

Released Feb 12, 2025
Proprietary Endpoint

Whisper-1 speech-to-text transcription trained on multilingual supervised audio, with a 25 MB upload limit per file.

Released Sep 21, 2022
Save up to 17%
Native InferenceNew

Controlled Whisper Large v3 Turbo transcription with multilingual ASR, translation, VAD, timestamps, subtitles, hotwords, and decoder controls.

Released Oct 1, 2024

Research & Search14

Proprietary Endpoint

Quick LLM-style answer to a natural-language question, grounded in fresh Exa web search results with inline citations and source links.

Proprietary Endpoint

Asynchronous research task that explores the web, gathers sources, synthesizes findings, and returns cited answers for in-depth queries.

Proprietary Endpoint

AI-powered web search with detailed overviews and answers, faster than Deep Search. Ranks #1 on OpenAI SimpleQA benchmark.

100K context
Proprietary Endpoint

Institutional-grade research powered by Claude Opus 4.6 reasoning, with maximum depth, enhanced tool access, and extensive source coverage.

3D Generation1

Save up to 90%

TRELLIS.2 4B

Microsoft
Native InferenceNew

TRELLIS.2 image-to-3D model that turns a reference image into a textured GLB asset with resolution, seed, mesh, texture, and export controls.

Embeddings3

Text Embedding v4

Alibaba Cloud
Proprietary EndpointNew

Multilingual text embedding with selectable output dimensions (64–2048). Up to 8,192 tokens per input.

SingaporeReleased Jun 4, 20258192 context
Proprietary EndpointNew

Speed-optimised multimodal embedding — same shape as Vision-Plus, 3× cheaper image/video tokens.

SingaporeReleased Sep 23, 20251024 context
Proprietary EndpointNew

Multimodal embedding producing independent vectors for text, image, and video inputs.

SingaporeReleased Sep 23, 20251024 context

Rerankers1

Qwen3 Rerank

Alibaba Cloud
Proprietary EndpointNew

Semantic document reranker. Sorts up to 500 candidates per query by relevance, supports 100+ languages, and accepts a custom sorting instruction.

SingaporeReleased Jun 5, 20254000 context

Tools & Agents2

GPTZero

GPTZero
Proprietary Endpoint

Deep-learning detector that flags portions of text likely generated by AI versus human, classifying content as entirely human, AI, or mixed.

Manus

Manus
Proprietary Endpoint

Autonomous AI agent that turns a high-level prompt into subtasks, calls tools and APIs, and delivers end-to-end results without manual orchestration.

No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.
No items found.

Ready to use better endpoints?

Explore our models, or contact us about business inquiries, custom deployments, or anything else.