Search Page 29

Showing 28 of 1396 results

image-apps-v2/headshot-photo

Generate professional headshot photos with customizable backgrounds.

headshot

profile-photo

image-to-image

stable-audio-3/medium/audio-to-audio

Stable Audio 3 Medium audio-to-audio is a 1.4 billion parameter latent diffusion model that transforms an input audio clip into new stereo variations up to 6 minutes long, guided by a text prompt.

flux-kontext-trainer

LoRA trainer for FLUX.1 Kontext [dev]

training

mirelo-ai/sfx1.6/text-to-audio

Generate ambient sounds for any text prompt. Now you can turn any SFX into a natural loop for ambient soundscapes.

sfx

text-to-audio

meshy/v5/remesh

Meshy-5 remesh lets you remesh existing 3D models and export them in various formats.

3d-to-3d

vidu/q2/reference-to-video/pro

Use the latest Vidu Q2 Pro models, which offer much better quality and control over your videos.

image-to-video

fast-svd-lcm

Generate short video clips from your images using SVD v1.1 at lightning speed.

turbo

image-to-video

ltx-2.3-quality/reference-video-to-video

Generate high-quality video with audio from reference video, text and images using LTX-2.3

video-to-video

flux-1/dev/image-to-image

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

image-to-image

decart/lucy-edit/pro

Edit outfits, objects, faces, or restyle your video - all with maximum detail retention.

video-edit

video-to-video

nvidia/cosmos-3-super/image-to-video

Cosmos3 is a collection of Omnimodal world models capable of generating dynamic, high-quality video, image, audio, and action commands from combinations of text, image, video, and action trajectory inputs.

Discover ultimate control with Pikaframes key frame interpolation, a stunning image-to-video feature that allows you to upload up to 5 keyframes, customize their transition length and prompt, and see their images come to life as seamless videos.

image-to-video

speech-to-text

Leverage the rapid processing capabilities of AI models to enable accurate and efficient real-time speech-to-text transcription.

speech-to-text

new

ltx-2.3-quality/extend-video

Extend high-quality video with audio from input video using LTX-2.3

extend

longer

video-to-video

workflow-utilities/reverse-video

FFmpeg utility to reverse videos

video-to-video

film

Interpolate images with FILM - Frame Interpolation for Large Motion

interpolation

image-to-image

stable-cascade

Stable Cascade: Image generation on a smaller & cheaper latent space.

FireRed Image Edit is FireRed's state-of-the-art open-source editing model, re-trained from Qwen Image Edit 2509.

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

optical character recognition

high-res

utility

vision

ltx-2-19b/audio-to-video

Generate video with audio from audio, text and images using LTX-2

audio-to-video

qwen-3-tts/text-to-speech/0.6b

Bring speech to your text using the Qwen3-TTS Custom-Voice model with pre-trained voices, or use your own voice with the Qwen3-TTS Clone Voice model.

text-to-speech

image2pixel

Turn images into pixel-perfect retro art

LongCat Image Edit is a 6B parameter image editing model excelling at multilingual text rendering, photorealism, and deployment efficiency.

image-to-image

qwen-image-2512-trainer

Qwen Image 2512 LoRA training

lora

personalization

training

ideogram/character/remix

Transform your consistent character into different art styles, settings, or scenarios while maintaining their distinctive appearance and identity

character-consistency

image-to-image

bria/replace-background

Generate professional, eCommerce-ready product shots by replacing backgrounds with realistic lighting and accurate perspective from a simple text prompt. Trained exclusively on licensed data for safe commercial use.

bria

replace-background

image-to-image

imagineart/imagineart-2.0-edit-preview/image-to-image

ImagineArt 2.0 Edit delivers precise prompt-guided image editing at 2K resolution, preserving fine detail and realism while accurately applying targeted changes across one or more reference images.

Lumina-Image-2.0 is a 2 billion parameter flow-based diffusion transformer that features improved image quality, typography, complex prompt understanding, and resource efficiency.

Showing 785 to 812 of 1396 results