Search Page 28

Showing 28 of 1396 results

Fine-tune FLUX.2 [klein] 9B from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific editing tasks.

training

nvidia/nemotron-3-nano-omni/video

Video reasoning variant of NVIDIA's Nemotron 3 Nano Omni. 30B A3B hybrid Transformer-Mamba MoE - accepts video plus a prompt and returns text.

FireRed Image Edit is FireRed's state of the art open source editing model, re-trained from Qwen Image Edit 2509.

Sana Sprint is a text-to-image model capable of generating 4K images with exceptional speed.

Generate short video clips from your images using SVD v1.1 at Lightning Speed

turbo

image-to-video

Kling O1 Omni generates new shots guided by an input reference video, preserving cinematic language such as motion, and camera style to produce seamless scene continuity.

kling-video/o1/standard/video-to-video/reference

Kling O1 Omni generates new shots guided by an input reference video, preserving cinematic language such as motion, and camera style to produce seamless scene continuity.

video-to-video

image-apps-v2/headshot-photo

Generate professional headshot photos with customizable backgrounds.

Leverage the rapid processing capabilities of AI models to enable accurate and efficient real-time speech-to-text transcription.

speech-to-text

Extend high-quality video with audio from input video using LTX-2.3

new

ltx-2.3-quality/extend-video

Extend high-quality video with audio from input video using LTX-2.3

Stable Diffusion v1.5

diffusion

text-to-image

flux-2/klein/4b/base

Text-to-image generation with FLUX.2 [klein] 4B Base from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities.

text-to-image

sam-3-1/video

SAM 3.1 builds comes with Object Multiplex, a shared-memory approach for joint multi-object tracking that delivers faster speeds with larger number of objects tracked.

wan/v2.2-a14b/video-to-video

Wan-2.2 video-to-video is a video model that generates high-quality videos with high visual quality and motion diversity from text prompts and source videos.

video-to-video

vecglypher/image-to-svg

Vector font generation with VecGlypher. Create custom glyphs from text descriptions or reference images—outputs clean SVG paths directly without raster-to-vector conversion.

image-to-image

hunyuan-video-v1.5/text-to-video

Hunyuan Video 1.5 is Tencent's latest and best video model

hunyuan-video

text-to-video

perceptron/isaac-01

Isaac-01 is a multimodal vision-language model from Perceptron for various vision language tasks.

multimodal

vision

Ray2 Flash is a fast video generative model capable of creating realistic visuals with natural, coherent motion.

luma-dream-machine/ray-2-flash

Ray2 Flash is a fast video generative model capable of creating realistic visuals with natural, coherent motion.

motion

transformation

text-to-video

recraft/v3/create-style

Recraft V3 Create Style is capable of creating unique styles for Recraft V3 based on your images.

Turn images into pixel-perfect retro art

Removes objects and their visual effects using natural language, replacing them with contextually appropriate content

Interpolate videos with FILM - Frame Interpolation for Large Motion

interpolation

video-to-video

Generate videos from prompts using LTX Video-0.9.5

ltx-video-v095

Generate videos from prompts using LTX Video-0.9.5

video

text-video

text-to-video

FLUX Control LoRA Canny is a high-performance endpoint that uses a control image using a Canny edge map to transfer structure to the generated image and another initial image to guide color.

flux-control-lora-canny/image-to-image

FLUX Control LoRA Canny is a high-performance endpoint that uses a control image using a Canny edge map to transfer structure to the generated image and another initial image to guide color.

lora

style transfer

image-to-image