Extend and reframe images with Luma Photon Reframe. This advanced tool intelligently expands your visuals, seamlessly blending new content to enhance creativity and adaptability, offering unmatched personalization and quality for creators at a fraction of the cost.
luma-photon/reframe
image-to-image

Extend and reframe images with Luma Photon Reframe. This advanced tool intelligently expands your visuals, seamlessly blending new content to enhance creativity and adaptability, offering unmatched personalization and quality for creators at a fraction of the cost.

outpainting
reframe
Use vidu Text-to-Image to turn your prompts into reality.
vidu/q2/text-to-image
text-to-image

Use vidu Text-to-Image to turn your prompts into reality.

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
florence-2-large/ocr
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

ocr
multimodal
Sana v1.5 1.6B is a lightweight text-to-image model that delivers 4K image generation with impressive efficiency.
sana/v1.5/1.6b
text-to-image

Sana v1.5 1.6B is a lightweight text-to-image model that delivers 4K image generation with impressive efficiency.

text to image
4k
lightweight
Run SDXL at the speed of light
fast-lcm-diffusion
text-to-image

Run SDXL at the speed of light

lcm
diffusion
turbo
Generate videos from images and prompts using CogVideoX-5B
cogvideox-5b/image-to-video
image-to-video

Generate videos from images and prompts using CogVideoX-5B

Train LTX-2.3 22B for custom styles and effects.
ltx23-video-trainer
training

Train LTX-2.3 22B for custom styles and effects.

ltx2.3-video
fine-tuning
Interpolate images with FILM - Frame Interpolation for Large Motion
film
image-to-image

Interpolate images with FILM - Frame Interpolation for Large Motion

interpolation
Blend products into backgrounds with automatic perspective and lighting correction
qwen-image-edit-plus-lora-gallery/integrate-product
image-to-image

Blend products into backgrounds with automatic perspective and lighting correction

stylized
transform
Clone voice of any person and speak anything in their voice using zonos' voice cloning.
zonos
text-to-audio

Clone voice of any person and speak anything in their voice using zonos' voice cloning.

voice cloning
Generate video with audio from audio, text and images using LTX-2 Distilled
ltx-2-19b/distilled/audio-to-video
audio-to-video

Generate video with audio from audio, text and images using LTX-2 Distilled

Qwen Image LoRA training
qwen-image-trainer
training

Qwen Image LoRA training

lora
personalization
Audio reasoning variant of NVIDIA's Nemotron 3 Nano Omni. 30B A3B hybrid Transformer-Mamba MoE - accepts audio plus a prompt and returns text.
new
nvidia/nemotron-3-nano-omni/audio
audio-to-text

Audio reasoning variant of NVIDIA's Nemotron 3 Nano Omni. 30B A3B hybrid Transformer-Mamba MoE - accepts audio plus a prompt and returns text.

nemotron
nvidia
audio-understanding
Generate long videos from images using LongCat Video
longcat-video/image-to-video/480p
image-to-video

Generate long videos from images using LongCat Video

Generate professional product photography with realistic lighting and backgrounds.
image-apps-v2/product-photography
image-to-image

Generate professional product photography with realistic lighting and backgrounds.

product
marketing
See how you or others might look at different ages, from younger to older, while preserving core facial features.
image-editing/age-progression
image-to-image

See how you or others might look at different ages, from younger to older, while preserving core facial features.

stylized
transform
Transform your photos into cool plushies while keeping the original characters likeness
image-editing/plushie-style
image-to-image

Transform your photos into cool plushies while keeping the original characters likeness

stylized
transform
PersonaPlex is a real-time, full-duplex speech-to-speech conversational model that enables persona control through text-based role prompts and audio-based voice conditioning.
personaplex
audio-to-audio

PersonaPlex is a real-time, full-duplex speech-to-speech conversational model that enables persona control through text-based role prompts and audio-based voice conditioning.

audio
Add details to faces, enhance face features, remove blur.
image-editing/realism
image-to-image

Add details to faces, enhance face features, remove blur.

stylized
transform
realism
Choose the Nth image from an image URL list for workflows.
new
workflow-utilities/pick-image-by-index
workflow

Choose the Nth image from an image URL list for workflows.

Generate profiles using 30-50 images of a subject with Phota.
phota/create-profile
training

Generate profiles using 30-50 images of a subject with Phota.

stylized
transform
typography
Sana v1.5 4.8B is a powerful text-to-image model that generates ultra-high quality 4K images with remarkable detail.
sana/v1.5/4.8b
text-to-image

Sana v1.5 4.8B is a powerful text-to-image model that generates ultra-high quality 4K images with remarkable detail.

text to image
4k
high-quality
Generate video clips from your prompts using Kling 1.0
kling-video/v1/standard/effects
text-to-video

Generate video clips from your prompts using Kling 1.0

motion
Audio separation with SAM Audio. Isolate any sound using natural language—professional-grade audio editing made simple for creators, researchers, and accessibility applications.
sam-audio/visual-separate
video-to-audio

Audio separation with SAM Audio. Isolate any sound using natural language—professional-grade audio editing made simple for creators, researchers, and accessibility applications.

sam-audio
Removes objects and their visual effects using natural language, replacing them with contextually appropriate content
object-removal
image-to-image

Removes objects and their visual effects using natural language, replacing them with contextually appropriate content

utility
editing
Pika Scenes v2.2 creates videos from a images with high quality output.
pika/v2.2/pikascenes
image-to-video

Pika Scenes v2.2 creates videos from a images with high quality output.

editing
effects
animation
Framepack is an efficient Image-to-video model that autoregressively generates videos.
framepack/f1
image-to-video

Framepack is an efficient Image-to-video model that autoregressively generates videos.

image to video
motion
Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpaingting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.
flux-krea-lora/inpainting
image-to-image

Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpaingting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
Showing 841 to 868 of 1354 results