What AI models are available on ModelsLab?

ModelsLab offers 100,000+ AI models including text-to-image (Stable Diffusion, FLUX, SDXL), text-to-video, text-to-audio, 3D generation, voice cloning, and LLM APIs from providers like Alibaba, Google, Meta, and more.

How do I get started with ModelsLab APIs?

Sign up for a free account, get your API key from the dashboard, and start making API calls immediately. We provide SDKs for Python, JavaScript, and cURL examples for every model.

What are the pricing options for ModelsLab?

ModelsLab offers flexible pay-per-use pricing starting from $0.001 per API call, with free tier access available. Enterprise plans with custom pricing and dedicated support are also available.

What programming languages are supported?

ModelsLab APIs work with any language that can make HTTP requests — Python, JavaScript, PHP, Ruby, Go, Rust, Java, and more. We provide official SDKs, CLI tools, and MCP server integrations.

Do you offer API documentation?

Yes, every model on ModelsLab has comprehensive API documentation with code examples, parameter descriptions, and integration guides available at docs.modelslab.com and on each model's API Documentation tab.

Happy Horse 1.0 is now on ModelsLab

Try Now

ModelsLab

AI Models Pricing Enterprise

AI Workflows Library

Explore our collection of pre-built AI workflows for various use cases

Create Workflow

Create custom AI workflows tailored to your specific needs

API Docs

llms.txt

API details in LLM-friendly format

CLI

Command-line interface for API

Skills

Skills for coding agents to use APIs

Agents Control Plane

Agents Control Plane for API

SDKs

SDKs for ModelsLab API

MCP Server

MCP Server for API

Affiliate

Book a Call

AI APIs for Developers

Type /to search

All Image APIs Audio APIs Video APIs 3D APIs LLM APIs Train Models CivitAI Models

Popular Models

View all

happyhorse-1.0-v2v

HappyHorse Text2Video

HappyHorse-I2V

GPT Image 2.0 Text To Image

GPT Image 2.0 - Image Edit

Seedance 2.0 Multi Reference To Video

Filters

Sort by

LatestFree firstPrice: Low to HighPrice: High to Low

Provider

Alibaba Cloud

Bfl

Byteplus

Elevenlabs

Google

Inworld

Klingai

Ltx

Minimax

Modelslab

Openai

Runway Ml

Sonauto

Sync

Xai

Use Case

Closed Source Models

775

Open Source Models

Text to Image

Image to Video

AI Image Editing tools

Text to Video

Image to Image

Music Generation

AI Interior

Song Edit

Trainer

Text to Speech

AI Avatar/Headshot

Virtual try-on

Sound Effect

Image to Text

Video to Video

Voice Changer

3D Generations

Speech to Text

Dubbing

Lips Sync

AI Model APIs

166 models

Alibaba

happyhorse-1.0-v2v

HappyHorse-Video-Edit supports advanced video editing through natural language instructions. It allows for local or global editing of video elements using up to 5 reference images, precisely preserving original motion dynamics to achieve.

Closed SourceNew AddedBest SellingTrending On Reels

Alibaba

HappyHorse Text2Video

The HappyHorse Text-to-Video model features highly realistic dynamic generation capabilities. It accurately comprehends text semantics to output high-quality videos that are fluid, natural, and rich in detail.

Closed SourceNew AddedBest SellingTrending On Reels

Alibaba

HappyHorse-I2V

The HappyHorse Image-to-Video model features highly realistic dynamic generation capabilities. It maintains strict consistency with the source image, producing high-quality videos that are fluid, natural, and rich in detail.

Closed SourceNew AddedBest SellingTrending On Reels

Open Ai

GPT Image 2.0 Text To Image

GPT Image 2.0 is OpenAI's most advanced text-to-image model, launched April 2026. Describe any scene, concept, poster, product visual, or creative idea in plain language — and the model thinks through your brief, plans the layout.

Closed SourceNew AddedHigh Quality Output

Open Ai

GPT Image 2.0 - Image Edit

GPT Image 2.0 transforms how images get edited. Upload any photo — a product shot, portrait, marketing asset, or personal image — and simply describe what you want changed. Swap the background, adjust lighting, remove objects, add readable text.

Closed SourceNew AddedHigh Quality OutputBest Image Editing

Bytedance

Seedance 2.0 Multi Reference To Video

Seedance 2.0 Multi Reference to Video blends up to 9 images, 3 videos, and 3 audio tracks into a single cinematic generation. Inherit character likeness, camera movement, voice timbre, and music in one prompt — ideal for branded ads, character-consistent

Closed SourceNew AddedBest for CreatorsTop Selling Best for Filmmakers

Bytedance

Seedance 2.0 (Start/ End Frame) Image To Video

Seedance 2.0 Start/End Frame to Video lets creators lock both the opening and closing frame of a generated clip, producing fluid, physically accurate motion in between. Supporting 480p/720p output, 4–15 second durations, and all major aspect ratios

Closed SourceNew AddedBest for CreatorsTop Selling Best for Filmmakers

Alibaba

Wan 2.7 Image To Video

Discover WAN 2.7 Image-to-Video AI model with multi-modal support. Generate videos from images, text, audio, or frames with advanced video continuation and frame-based generation.

Closed SourceNew AddedBest for Creators

Google

Lyria 3 Text To Music

Lyria 3 is a cutting-edge AI music generator that creates original 30-second tracks from text prompts and images. Instantly transform your ideas into unique soundscapes.

Closed SourceNew AddedMusic ProductionBest song generationBest SFX30 second output

Alibaba

Wan 2.7 Text To Image

Wan 2.7 text to image converts natural language prompts into high-quality AI-generated images with superior text rendering, subject consistency, and complex instruction following.

Closed SourceNew AddedFastest Image Gen

Alibaba

Wan 2.7 - Image Edit

Wan 2.7 image edit delivers state-of-the-art AI image editing, text-to-image generation, interactive editing, and multi-image reference support — with superior text rendering and instruction following.

Closed SourceNew AddedFastest Image EditBest Image Editing

Google

Veo 3.1 Lite Text To Video

Create cinematic AI videos instantly with Veo 3.1 Lite Text-to-Video API. Convert text or images into high-quality videos with fast rendering, motion control, and scalable API.

Closed SourceNew AddedBest for CreatorsBest for Filmmakers

ModelsLab

ModelsLab
ML

Imagen

Text to Image API
Image Editing API
Inpainting API
Outpainting API
Image Upscaler API
Image Enhancer API
Image Extender API
Headshot Generator API
Avatar Generator API
Sketch to Image API
Flux Image Generator API
Image to Image API
Character Generator API
Image to Text API
Floor Planning API
Interior API
Fashion API

Audio Gen

Voices
Text to Speech API
AI Voice Generator API
AI Dubbing API
Voice Cloning API
Speech to Text API
Voice Changer API
Celebrity Voice Generator API
Text to Music API
Text to SFX API

Video Fusion

Text to Video API
AI Video Generator API
Image to Video API
AI Storyboard Generator
AI Film Studio

3D verse

Text to 3D API
Image to 3D API

Others

Train Master API
Showcase

For Agents

llms.txt
CLI
Skills
Documentation
MCP Server
Agents Control Plane
SDKs

Guides

Image Generation API
Stable Diffusion API
Grok Imagine
Flux LoRA Training
ComfyUI API
Midjourney Alternative
CivitAI Alternative
fal.ai Alternative
Replicate Alternative
ElevenLabs Alternative
Kling AI
Real-ESRGAN Upscaler

Resources

Plugins
M Studio — AI Filmmaking
Showcase
API Models Directory
Unlimited
Enterprise API Solutions
API Documentation
Developer Support

About

Support
Pricing
Blog
Changelog
LLM.txt
Comparison
Affiliate Program

Join Us

YouTube
Discord
Reddit User Profile
Twitter
Reddit Community

Support Terms of use API Status Refund Policy Privacy Policy