Search Page 27

Showing 28 of 1396 results

ovi/image-to-video

Ovi can generate videos with audio from image and text inputs.

image-to-audio-video

image-to-video

vidu/q3/reference-to-video/mix

Vidu's latest Q3 Reference to Video Mix model

image-to-video

vibevoice

Generate long, expressive multi-voice speech using Microsoft's powerful TTS

Text To Image Model using Boogu-Image

text-to-image

rundiffusion-fal/juggernaut-flux-lora

Juggernaut Base Flux LoRA by RunDiffusion is a drop-in replacement for Flux [Dev] that delivers sharper details, richer colors, and enhanced realism to all your LoRAs and LyCORIS with full compatibility.

image generation

text-to-image

hunyuan3d-v3/text-to-3d

Turn simple sketches into detailed, fully-textured 3D models. Instantly convert your concept designs into formats ready for Unity, Unreal, and Blender.

text-to-3d

ghiblify

Reimagine and transform your ordinary photos into enchanting Studio Ghibli style artwork

stylized

transform

image-to-image

flux/krea/image-to-image

FLUX.1 Krea [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.

image-to-image

aura-flow

AuraFlow v0.3 is an open-source flow-based text-to-image generation model that achieves state-of-the-art results on GenEval. The model is currently in beta.

typography

style

text-to-image

sdxl-controlnet-union

An efficient SDXL multi-controlnet text-to-image model.

image-editing/background-change

Replace your photo's background with any scene you desire, from beach sunsets to urban landscapes, with perfect lighting and shadows

stylized

transform

image-to-image

ltx-2.3-22b/distilled/image-to-video

Generate video with audio from images using LTX-2.3 Distilled

image-to-video

mirelo-ai/sfx1.6/text-to-audio

Generate ambient sounds for any text prompt. Now you can turn any SFX into a natural loop for ambient soundscapes.

sfx

text-to-audio

glm-image

Create high-quality images with accurate text rendering and rich knowledge details—supports editing, style transfer, and maintaining consistent characters across multiple images.

text-to-image

omnigen-v2

OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!

wan/v2.2-a14b/text-to-video/lora

Wan-2.2 text-to-video is a video model that generates videos with high visual quality and motion diversity from text prompts. This endpoint supports LoRAs made for Wan 2.2.

text-to-video

flux-lora-canny

Utilize Flux.1 [dev] Controlnet to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

Foley Control is a video-to-audio model that automatically generates synchronized sound effects for videos, using text prompts to shape the type of sound while matching the timing and action on screen.

Post Processing is an endpoint that can enhance images using a variety of techniques including grain, blur, sharpen, and more.

stylized

utility

image-to-image

flux-1/dev/image-to-image

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

image-to-image

hidream-o1-image/dev

Unified image generation with HiDream-O1-Image. Create, edit, and personalize high-resolution images up to 2K—single native model handles text-to-image, editing, and custom subjects without external components.

text-to-image

sana/sprint

Sana Sprint is a text-to-image model capable of generating 4K images with exceptional speed.

text to image

high-speed

text-to-image

kling-video/o1/standard/video-to-video/reference

Kling O1 Omni generates new shots guided by an input reference video, preserving cinematic language such as motion and camera style to produce seamless scene continuity.

video-to-video

flux-control-lora-canny/image-to-image

FLUX Control LoRA Canny is a high-performance endpoint that uses a Canny edge map as a control image to transfer structure to the generated image, and a separate initial image to guide color.

Text-to-image generation with FLUX.2 [klein] 4B Base from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities.

text-to-image

sam-3-1/video

SAM 3.1 comes with Object Multiplex, a shared-memory approach for joint multi-object tracking that delivers faster speeds with a larger number of objects tracked.

wan/v2.2-a14b/text-to-image

Wan 2.2's 14B model generates high-resolution, photorealistic images with powerful prompt understanding and fine-grained visual detail.

text-to-image

Showing 729 to 756 of 1396 results