Search Page 32

Showing 28 of 1396 results

Juggernaut Pro Flux by RunDiffusion is the flagship Juggernaut model rivaling some of the most advanced image models available, often surpassing them in realism. It combines Juggernaut Base with RunDiffusion Photo and features enhancements like reduced background blurriness.

image generation

text-to-image

fast-lcm-diffusion/image-to-image

Run SDXL at the speed of light

heygen/v3/lipsync/speed

Replace or dub audio on an existing video with fast audio-only lip-sync.

Automatically generates text captions for your videos from the audio as per text colour/font specifications

OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!

Interpolate between video frames

interpolation

editing

video-to-video

Extends a face into a full body portrait

flux-2-lora-gallery/face-to-full-portrait

Extends a face into a full body portrait

stylized

transform

image-to-image

Generate video clips from your prompts using Kling 1.6 (pro)

kling-video/v1.6/pro/effects

Generate video clips from your prompts using Kling 1.6 (pro)

text-to-video

image-editing/retouch

Retouch photos of faces. Remove blemishes and improve the skin.

image-to-image

stable-audio-3/medium/audio-to-audio

Stable Audio 3 Medium audio-to-audio is a 1.4 billion parameter latent diffusion model that transforms an input audio clip into new stereo variations up to 6 minutes guided by a text prompt.

Get waveform data from audio files using FFmpeg API.

ffmpeg

json

wan/v2.6/reference-to-video

Wan 2.6 reference-to-video model.

reference-to-video

video-to-video

Create seamless cinematic transitions between two images with PixVerse C1, with native audio and up to 1080p.

pixverse/c1/transition

Create seamless cinematic transitions between two images with PixVerse C1, with native audio and up to 1080p.

wan/v2.6/reference-to-video/flash

Wan 2.6 reference-to-video flash model.

reference-to-video

video-to-video

Ray2 Flash Modify is a video generative model capable of restyling or retexturing the entire shot, from turning live-action into CG or stylized animation, to changing wardrobe, props, or the overall aesthetic and swap environments or time periods, giving you control over background, location, or even weather.

luma-dream-machine/ray-2-flash/modify

Ray2 Flash Modify is a video generative model capable of restyling or retexturing the entire shot, from turning live-action into CG or stylized animation, to changing wardrobe, props, or the overall aesthetic and swap environments or time periods, giving you control over background, location, or even weather.

DeepSeek Janus-Pro is a novel text-to-image model that unifies multimodal understanding and generation through an autoregressive framework

stylized

text-to-image

diffrhythm

DiffRhythm is a blazing fast model for transforming lyrics into full songs. It boasts the capability to generate full songs in less than 30 seconds.

music

text-to-audio

Edit any image with a natural-language instruction using Bernini-R, changing the weather, materials, objects, or style while preserving the original composition.

bernini-r/edit-image

Edit any image with a natural-language instruction using Bernini-R, changing the weather, materials, objects, or style while preserving the original composition.

vidu/q2/image-to-video/pro

Use the latest Vidu Q2 models which much more better quality and control on your videos.

image-to-video

nafnet/denoise

Use NAFNet to fix issues like blurriness and noise in your images. This model specializes in image restoration and can help enhance the overall quality of your photography.

PersonaPlex is a real-time, full-duplex speech-to-speech conversational model that enables persona control through text-based role prompts and audio-based voice conditioning.

audio

audio-to-audio

LoRA inference endpoint for the Qwen Image Editing model.

qwen-image-edit-lora

LoRA inference endpoint for the Qwen Image Editing model.

image-editing

lora

image-to-image

Bring speech to your texts using Qwen3-TTS Custom-Voice model with pre-trained voices or use your custom voice with Qwen3-TTS Clone Voice model

qwen-3-tts/text-to-speech/0.6b

Bring speech to your texts using Qwen3-TTS Custom-Voice model with pre-trained voices or use your custom voice with Qwen3-TTS Clone Voice model

text-to-speech

joyai-image-edit

All-in-one image AI with JoyAI-Image. Understand, create, and edit images through natural language—the model's deep visual understanding powers more accurate generation and precise editing in a unified system.

image-editing

image-to-image

kokoro/japanese

A fast and natural-sounding Japanese text-to-speech model optimized for smooth pronunciation.

speech

text-to-audio

dwpose/video

Predict poses from videos.

pose

utility

video-to-video

Image-to-image editing with FLUX.2 [klein] 4B from Black Forest Labs and custom LoRA. Precise modifications using natural language descriptions and hex color control.

flux-2/klein/4b/edit/lora

Image-to-image editing with FLUX.2 [klein] 4B from Black Forest Labs and custom LoRA. Precise modifications using natural language descriptions and hex color control.

image-to-image

post-processing/sharpen

Apply sharpening effects with three modes: basic unsharp mask, smart sharpening with edge preservation, and Contrast Adaptive Sharpening (CAS).

stylized

transform

image-to-image

Showing 869 to 896 of 1396 results