21 Models Available

Model Catalog

All open-source. All running on our GPU clusters. One API key for everything.

Image (4)

Image | Black Forest Labs

FLUX.1 Schnell

Image generation optimized for speed, producing high-quality images in under a second.

Pricing: $0.004 per image

Best for: Marketing visuals, Product mockups, Creative pipelines

Image | Black Forest Labs

FLUX.1 Dev

Development-grade image model with higher fidelity and more creative control.

Pricing: $0.025 per image

Best for: High-fidelity art, Design iteration, Creative direction

Image | Black Forest Labs

FLUX.1 Pro

Professional image generation with maximum quality and prompt adherence.

Pricing: $0.05 per image

Best for: Commercial assets, Professional design, Brand content

Image | Stability AI

Stable Diffusion XL

Industry-standard text-to-image model with extensive ecosystem and LoRA support.

Pricing: $0.004 per image

Best for: Customizable generation, LoRA workflows, Batch production

Vision (3)

Vision | Alibaba

Qwen2.5-VL-7B Instruct

Multimodal vision-language model that accepts text and images for understanding visual content.

Pricing: $0.10 / $0.20 per 1M tokens

Best for: Document understanding, Image Q&A, Visual extraction

Vision | Meta

Llama 3.2 11B Vision Instruct

Multimodal model supporting interleaved text and image inputs for visual reasoning.

Pricing: $0.18 / $0.18 per 1M tokens

Best for: Visual QA, Chart reading, Image captioning

Vision | Alibaba

Qwen2.5-VL-72B Instruct

Large-scale vision-language model with state-of-the-art visual understanding.

Pricing: $0.54 / $0.80 per 1M tokens

Best for: Complex visual reasoning, OCR, Multi-image analysis

STT (2)

STT | OpenAI

Whisper Large v3

Industry-standard speech-to-text supporting 90+ languages. Audio is submitted via multipart form upload.

Pricing: $0.006 per minute

Best for: Meeting transcription, Podcast indexing, Voice commands
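
The Whisper entry above mentions multipart form upload. As a rough illustration of what such a request could look like, the sketch below builds (but does not send) a multipart/form-data POST using only the Python standard library. The endpoint URL, the `file` and `model` field names, the model ID, and the Bearer authorization header are all assumptions made for this example; the catalog does not specify them.

```python
import uuid
import urllib.request

# Hypothetical values: the catalog does not publish the endpoint,
# the form field names, or the model identifier.
API_URL = "https://api.example.com/v1/audio/transcriptions"
MODEL_ID = "whisper-large-v3"


def build_multipart(fields, file_field, filename, file_bytes, content_type="audio/wav"):
    """Encode text fields plus one file as a multipart/form-data body."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="{file_field}"; filename="{filename}"\r\n'
        f'Content-Type: {content_type}\r\n\r\n'.encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    # The closing delimiter ends the body.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_transcription_request(api_key, audio_bytes, filename="audio.wav"):
    """Build (but do not send) a POST request carrying the audio file."""
    body, content_type = build_multipart({"model": MODEL_ID}, "file", filename, audio_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        # Bearer authentication is an assumption, not a documented scheme.
        headers={"Authorization": f"Bearer {api_key}", "Content-Type": content_type},
    )
```

Passing the returned request to `urllib.request.urlopen` would send it; the real endpoint and field names would need to come from the provider's API reference.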

STT | OpenAI

Whisper Large v3 Turbo

Faster variant of Whisper with near-identical accuracy at lower latency.

Pricing: $0.004 per minute

Best for: Real-time transcription, Live captions, Streaming ASR

TTS (1)

TTS | Hexgrad

Kokoro-82M

Lightweight text-to-speech with multiple voices. Outputs mp3, wav, or pcm audio.

Pricing: $0.015 per 1K characters

Best for: Voice assistants, Audiobook generation, Accessibility tools

Video (2)

Video | THUDM

CogVideoX-5B

Text-to-video generation. Submit a prompt and poll for the completed MP4 video.

Pricing: $0.05 per video

Best for: Social media content, Explainer animations, Rapid prototyping
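
The CogVideoX entry above describes a submit-then-poll workflow. A minimal sketch of the polling half might look like the following. The `fetch_status` callable, the `status`, `video_url`, and `error` field names, and the status values are hypothetical, since the catalog does not document the response format.

```python
import time


def wait_for_video(fetch_status, poll_interval=2.0, timeout=300.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll fetch_status() until the job completes, fails, or the timeout passes.

    fetch_status is a caller-supplied function that returns a dict such as
    {"status": "processing"} or {"status": "completed", "video_url": "..."}.
    These field names and status values are hypothetical.
    """
    deadline = clock() + timeout
    while True:
        job = fetch_status()
        status = job.get("status")
        if status == "completed":
            return job["video_url"]
        if status == "failed":
            raise RuntimeError(job.get("error", "video generation failed"))
        if clock() >= deadline:
            raise TimeoutError("video was not ready before the timeout")
        # Wait before asking again so the API is not hammered.
        sleep(poll_interval)
```

Keeping the HTTP call behind `fetch_status` means the loop can be exercised with a stub before it is wired to a real job-status endpoint.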

Video | Alibaba

Wan 2.1 T2V 14B

High-quality text-to-video model with consistent motion and scene understanding.

Pricing: $0.08 per video

Best for: Product demos, Marketing videos, Storytelling

Chat (9)

Text LLM | Meta

Llama 3.1 8B Instruct

High-quality open-source LLM for chat, instruction following, and general-purpose text generation.

Pricing: $0.10 / $0.20 per 1M tokens

Context: 131K

Best for: Chatbots, Customer support, RAG pipelines

Text LLM | Meta

Llama 3.1 70B Instruct

Large-scale LLM with superior reasoning, coding, and multilingual capabilities.

Pricing: $0.54 / $0.80 per 1M tokens

Context: 131K

Best for: Complex reasoning, Code generation, Enterprise apps

Text LLM | Meta

Llama 3.3 70B Instruct

Latest Llama generation with improved instruction following and safety.

Pricing: $0.54 / $0.80 per 1M tokens

Context: 131K

Best for: Agent workflows, Multi-turn dialogue, Tool use

Text LLM | Mistral AI

Mistral 7B Instruct v0.3

Efficient instruction-tuned model excelling at text generation and summarization.

Pricing: $0.10 / $0.20 per 1M tokens

Context: 32K

Best for: Summarization, Content generation, Classification

Text LLM | Mistral AI

Mixtral 8x7B Instruct

Sparse mixture-of-experts model delivering excellent quality with efficient inference.

Pricing: $0.24 / $0.50 per 1M tokens

Context: 32K

Best for: Multi-task, Long-form writing, Analysis

Text LLM | Alibaba

Qwen2.5 7B Instruct

Multilingual model with strong performance in Chinese and English tasks.

Pricing: $0.10 / $0.20 per 1M tokens

Context: 128K

Best for: Multilingual apps, Code assist, Math reasoning

Text LLM | Alibaba

Qwen2.5 72B Instruct

Large-scale multilingual model rivaling frontier closed-source models.

Pricing: $0.54 / $0.80 per 1M tokens

Context: 128K

Best for: Enterprise AI, Complex analysis, Multilingual

Text LLM | DeepSeek

DeepSeek V3

Efficient MoE model with excellent coding and math capabilities at lower cost.

Pricing: $0.30 / $0.60 per 1M tokens

Context: 128K

Best for: Code generation, Mathematical reasoning, Data analysis

Text LLM | Google

Gemma 2 9B Instruct

Compact yet powerful model from Google's Gemma family, optimized for helpful responses.

Pricing: $0.10 / $0.20 per 1M tokens

Context: 8K

Best for: On-device AI, Quick responses, Education
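
The per-token prices in the Chat entries can be turned into a rough per-request estimate. The sketch below assumes that each price pair reads as input price / output price per 1M tokens; the catalog does not label the two figures, so treat that reading as an assumption. Only three of the listed models are included to keep the example short.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the catalog entries.
# Reading each pair as (input, output) is an assumption.
PRICES = {
    "Llama 3.1 8B Instruct": (Decimal("0.10"), Decimal("0.20")),
    "Llama 3.3 70B Instruct": (Decimal("0.54"), Decimal("0.80")),
    "DeepSeek V3": (Decimal("0.30"), Decimal("0.60")),
}

TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(model, input_tokens, output_tokens):
    """Return the estimated USD cost of one request as a Decimal."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_UNIT
```

Under that reading, a request with 2,000 input tokens and 500 output tokens on Llama 3.1 8B Instruct would come to $0.0003. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.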

Start building with any model

Free $1 credit on signup. One API key for every model.