Search Page 21

Showing 28 of 1396 results

elevenlabs/dubbing

Generate dubbed videos or audios using ElevenLabs Dubbing feature!

dubbing

audio-to-audio

audio-to-video

Kling's Native 4K is a video generation model that directly outputs professional-grade 4K video in one step, eliminating the need for post-production upscaling

kling-video/o3/4k/text-to-video

Kling's Native 4K is a video generation model that directly outputs professional-grade 4K video in one step, eliminating the need for post-production upscaling

Get EBU R128 loudness normalization from audio files using FFmpeg API.

ffmpeg

json

Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.

flux-2/klein/9b/base/lora

Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.

text-to-image

Luma Ray 3.2 generates cinematic video from a text prompt, with control over resolution, duration, and seamless looping, plus reference images to lock in subject and style.

luma/agent/ray/v3.2/text-to-video

Luma Ray 3.2 generates cinematic video from a text prompt, with control over resolution, duration, and seamless looping, plus reference images to lock in subject and style.

luma-dream-machine/ray-2-flash/image-to-video

Ray2 Flash is a fast video generative model capable of creating realistic visuals with natural, coherent motion.

Generate images from your prompts using Luma Photon. Photon is the most creative, personalizable, and intelligent visual models for creatives, bringing a step-function change in the cost of high-quality image generation.

text-to-image

moondream3-preview/caption

Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.

vision

flux-krea-lora

Super fast endpoint for the FLUX.1 [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora

personalization

text-to-image

z-image-turbo-trainer-v2

Fast LoRA trainer for Z-Image-Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

minimax/video-01/image-to-video

Generate video clips from your images using MiniMax Video model

motion

transformation

image-to-video

Luma Uni-1 Max generates a single image at the model's highest fidelity, delivering richer detail and stronger prompt adherence than the base tier for hero-quality stills.

luma/agent/uni-1/v1/max

Luma Uni-1 Max generates a single image at the model's highest fidelity, delivering richer detail and stronger prompt adherence than the base tier for hero-quality stills.

luma-dream-machine/ray-2/modify

Ray2 Modify is a video generative model capable of restyling or retexturing the entire shot, from turning live-action into CG or stylized animation, to changing wardrobe, props, or the overall aesthetic and swap environments or time periods, giving you control over background, location, or even weather.

ltx-2.3-quality/render-to-real

Transform your 3D video render into realistic using first frame with Ltx 2.3

video

video-to-video

imagineart/imagineart-1.5-pro-preview/text-to-image

ImagineArt 1.5 Pro is an advanced text-to-image model that creates ultra-high-fidelity 4K visuals with lifelike realism, refined aesthetics, and powerful creative output suited for professional use.

State of the art Image to 3D Object generation

image-to-3d

Generate seamlessly tiling photorealistic images from text using Z-Image Turbo

z-image/turbo/tiling

Generate seamlessly tiling photorealistic images from text using Z-Image Turbo

florence-2-large/detailed-caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

captioning

multimodal

vision

hyper3d/rodin/v2.5/text-to-3d

Rodin V2.5 by Hyper3D generates realistic and production ready 3D models from text or images.

text-to-3d

deepfilternet3

Enhance speech audio by removing background noise and upsampling to 48KHz

speech-enhancement

audio-to-audio

dwpose

Predict poses from images.

pose

utility

image-to-image

hunyuan-image/v3/instruct/text-to-image

Instruct version of Hunyuan-Image 3.0, with internal reasoning capabilities.

hunyuan-image

instruct

text-to-image

image-apps-v2/hair-change

Change hairstyles and hair colors in photos realistically.

hair-edit

style-change

image-to-image

heygen/avatar5/digital-twin

Create natural HeyGen Avatar V digital twin videos from text or audio, with lip-sync, optional backgrounds, captions, and MP4/WebM output.

minimax/video-01-live/image-to-video

Generate video clips from your images using MiniMax Video model

Wan Effects generates high-quality videos with popular effects from images

motion

effects

image-to-video

stable-audio-3/small/music/text-to-audio

Stable Audio 3 Small Music is a 459 million parameter latent diffusion model that generates full stereo music compositions up to 2 minutes from text prompts, lightweight enough for on-device deployment.

Hunyuan World 1.0 turns a single image into a panorama or a 3D world. It creates realistic scenes from the image, allowing you to explore and view it from different angles.

image-to-image

Showing 561 to 588 of 1396 results