AI APIs for Developers
Filters
Sort by
Provider
Use Case
AI Model APIs
166 modelsAlibaba
happyhorse-1.0-v2v
HappyHorse-Video-Edit supports advanced video editing through natural language instructions. It allows for local or global editing of video elements using up to 5 reference images, precisely preserving original motion dynamics to achieve.
Alibaba
HappyHorse Text2Video
The HappyHorse Text-to-Video model features highly realistic dynamic generation capabilities. It accurately comprehends text semantics to output high-quality videos that are fluid, natural, and rich in detail.
Alibaba
HappyHorse-I2V
The HappyHorse Image-to-Video model features highly realistic dynamic generation capabilities. It maintains strict consistency with the source image, producing high-quality videos that are fluid, natural, and rich in detail.
Open Ai
GPT Image 2.0 Text To Image
GPT Image 2.0 is OpenAI's most advanced text-to-image model, launched April 2026. Describe any scene, concept, poster, product visual, or creative idea in plain language — and the model thinks through your brief, plans the layout.
Open Ai
GPT Image 2.0 - Image Edit
GPT Image 2.0 transforms how images get edited. Upload any photo — a product shot, portrait, marketing asset, or personal image — and simply describe what you want changed. Swap the background, adjust lighting, remove objects, add readable text.
Bytedance
Seedance 2.0 Multi Reference To Video
Seedance 2.0 Multi Reference to Video blends up to 9 images, 3 videos, and 3 audio tracks into a single cinematic generation. Inherit character likeness, camera movement, voice timbre, and music in one prompt — ideal for branded ads, character-consistent
Bytedance
Seedance 2.0 (Start/ End Frame) Image To Video
Seedance 2.0 Start/End Frame to Video lets creators lock both the opening and closing frame of a generated clip, producing fluid, physically accurate motion in between. Supporting 480p/720p output, 4–15 second durations, and all major aspect ratios
Alibaba
Wan 2.7 Image To Video
Discover WAN 2.7 Image-to-Video AI model with multi-modal support. Generate videos from images, text, audio, or frames with advanced video continuation and frame-based generation.
Lyria 3 Text To Music
Lyria 3 is a cutting-edge AI music generator that creates original 30-second tracks from text prompts and images. Instantly transform your ideas into unique soundscapes.
Alibaba
Wan 2.7 Text To Image
Wan 2.7 text to image converts natural language prompts into high-quality AI-generated images with superior text rendering, subject consistency, and complex instruction following.
Alibaba
Wan 2.7 - Image Edit
Wan 2.7 image edit delivers state-of-the-art AI image editing, text-to-image generation, interactive editing, and multi-image reference support — with superior text rendering and instruction following.
Veo 3.1 Lite Text To Video
Create cinematic AI videos instantly with Veo 3.1 Lite Text-to-Video API. Convert text or images into high-quality videos with fast rendering, motion control, and scalable API.











