Search Page 6

recraft/vectorize

Converts a given raster image to SVG format using Recraft model.

transform

flux-2-max

FLUX.2 [max] delivers state-of-the-art image generation and advanced image editing with exceptional realism, precision, and consistency.

Endpoint for Qwen's Image Editing 2511 model.

transform

wan/v2.2-a14b/image-to-video

fal-ai/wan/v2.2-A14B/image-to-video

elevenlabs/speech-to-text/scribe-v2

minimax-music/v2

Generate music from text prompts using the MiniMax Music 2.0 model, which leverages advanced AI techniques to create high-quality, diverse musical compositions.

music

audio

text-to-audio

Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!

Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!

speech-to-text

minimax-music/v2.6

MiniMax Music 2.6 creates complete tracks with singing, backing music, and detailed arrangements from lyrics and a style description.

kling-video/ai-avatar/v2/pro

Kling AI Avatar v2 Pro: The premium endpoint for creating avatar videos with realistic humans, animals, cartoons, or stylized characters

kling-video/v3/standard/motion-control

fashn/tryon/v1.6

FASHN v1.6 delivers precise virtual try-on capabilities, accurately rendering garment details like text and patterns at 864x1296 resolution from both on-model and flat-lay photo references.

Generate high-fidelity images from text in seconds with Krea 2 Turbo, the speed-optimized open-source version of Krea 2, preserving its aesthetic range for rapid ideation.

Transfer movements from a reference video to any character image. Cost-effective mode for motion transfer, perfect for portraits and simple animations.

veo3.1/first-last-frame-to-video

Generate videos from a first and last framed using Google's Veo 3.1

ffmpeg-api/extract-frame

ffmpeg endpoint for first, middle and last frame extraction from videos

utility

editing

kling-video/v2.6/pro/text-to-video

Kling 2.6 Pro: Top-tier text-to-video with cinematic visuals, fluid motion, and native audio generation.

Kling 2.6 Pro: Top-tier text-to-video with cinematic visuals, fluid motion, and native audio generation.

text-to-video

Image-to-image editing with FLUX.2 [dev] from Black Forest Labs. Precise modifications using natural language descriptions and hex color control—in a flash.

flux-2/flash/edit

Image-to-image editing with FLUX.2 [dev] from Black Forest Labs. Precise modifications using natural language descriptions and hex color control—in a flash.

veo3.1/fast/first-last-frame-to-video

Generate videos from a first/last frame using Google's Veo 3.1 Fast

Generate videos from a first/last frame using Google's Veo 3.1 Fast

flux-pro/kontext/text-to-image

The FLUX.1 Kontext [pro] text-to-image delivers state-of-the-art image generation results with unprecedented prompt following, photorealistic rendering, and flawless typography.

The FLUX.1 Kontext [pro] text-to-image delivers state-of-the-art image generation results with unprecedented prompt following, photorealistic rendering, and flawless typography.

text-to-image

bria/eraser

Bria Eraser enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use. Access the model's source code and weights: https://bria.ai/contact-us

image editing

object removal

recraft/v4/text-to-image

Recraft V4 was developed with designers to bring true visual taste to AI image generation. Built for brand systems and production-ready workflows, it goes beyond prompt accuracy delivering stronger composition, refined lighting, realistic materials, and a cohesive aesthetic. The result is imagery shaped by professional design judgment, ready for immediate real-world use without additional post-processing.

text-to-image

trellis

Generate 3D models from your images using Trellis. A native 3D generative model enabling versatile and high-quality 3D asset creation.

image-to-3d

trellis-2

Generate 3D models from your images using Trellis 2. A native 3D generative model enabling versatile and high-quality 3D asset creation.

image-to-3d

lyria2

Lyria 2 is Google's latest music generation model, you can generate any type of music with this model.

music

kling-video/v1.6/standard/text-to-video

text-to-audio

Generate video clips from your prompts using Kling 1.6 (std)

Generate video clips from your prompts using Kling 1.6 (std)

text-to-video

Generate video clips from your images using Kling 1.6 (pro)

kling-video/v1.6/pro/image-to-video

Generate video clips from your images using Kling 1.6 (pro)

bytedance/seedance/v1.5/pro/text-to-video

Generate videos with audio with Seedance 1.5

Generate videos with audio with Seedance 1.5

Qwen-Image is an image generation foundation model in the Qwen series that achieves significant advances in complex text rendering and precise image editing.

text-to-image

Qwen-Image-2.0 is a next-generation foundational unified generation-and-editing model

qwen-image-2/edit

Qwen-Image-2.0 is a next-generation foundational unified generation-and-editing model

transform