Search Page 23

Fine-tune FLUX.2 [dev] from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific styles and domains.

flux-2-trainer-v2

Fine-tune FLUX.2 [dev] from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific styles and domains.

training

Create high-fidelity video with audio from text with LTX-2 Pro.

ltx-2/text-to-video

Create high-fidelity video with audio from text with LTX-2 Pro.

flux-2-lora-gallery/multiple-angles

ernie-image

High-quality text-to-image model by Baidu. Supports English, Chinese, and Japanese prompts with built-in prompt expansion.

Generates same object from different angles (azimuth/elevation)

qwen-image/image-to-image

SOTA open-source text-to-image model delivering high-fidelity outputs with accurate typography. JSON-structured prompts provide production-ready controllability for enterprise and agentic workflows. Trained exclusively on licensed data.

bria/fibo/generate

SOTA open-source text-to-image model delivering high-fidelity outputs with accurate typography. JSON-structured prompts provide production-ready controllability for enterprise and agentic workflows. Trained exclusively on licensed data.

Qwen-Image (Image-to-Image) transforms and edits input images with high fidelity, enabling precise style transfer, enhancement, and creative modification.

flux-2/klein/4b/base/edit/lora

zonos2

Zonos2 is a text-to-speech model that clones a voice from a short sample and speaks naturally across many languages.

tts

voice cloning

text-to-speech

Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.

Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.

hunyuan-3d/v3.1/rapid/text-to-3d

Create detailed, fully-textured 3D models with text

text-to-3d

moondream2

Moondream2 is a highly efficient open-source vision language model that combines powerful image understanding capabilities with a remarkably small footprint.

vision

Generate videos from prompts using LTX Video-0.9.7 13B Distilled and custom LoRA

ltx-video-13b-distilled

Generate videos from prompts using LTX Video-0.9.7 13B Distilled and custom LoRA

video

ltx-video

minimax/image-01/subject-reference

Generate images from text and a reference image using MiniMax Image-01 for consistent character appearance.

kling-video/o1/standard/reference-to-video

Transform images, elements, and text into consistent, high-quality video scenes, ensuring stable character identity, object details, and environments.

image-to-video

stable-audio-3/small/sfx/text-to-audio

Stable Audio 3 Small SFX is a 459 million parameter latent diffusion model that generates high-quality sound effects from text prompts, designed for on-device deployment on mobile phones and consumer laptops.

wan-25-preview/text-to-image

Wan 2.5 text-to-image model.

text-to-image

hunyuan-video-v1.5/image-to-video

Hunyuan Video 1.5 is Tencent's latest and best video model

image-to-video

Remove unwanted elements (objects, people, text) while maintaining image consistency

qwen-image-edit-plus-lora-gallery/remove-element

Remove unwanted elements (objects, people, text) while maintaining image consistency

pixverse/v4.5/text-to-video

Generate high quality video clips from text and image prompts using PixVerse v4.5

Generate high quality video clips from text and image prompts using PixVerse v4.5

glm-image/image-to-image

Create high-quality images with accurate text rendering and rich knowledge details—supports editing, style transfer, and maintaining consistent characters across multiple images.

pixverse/v5.5/text-to-video

Generate high quality video clips from text and image prompts using PixVerse v5.5

Generate high quality video clips from text and image prompts using PixVerse v5.5

clarityai/crystal-video-upscaler

Do high precision video upscaling that respects the original video perfectly using Crystal Upscaler's new video upscaling method!

upscale

video-to-video

Generate high-quality images from depth maps using Flux.1 [dev] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

flux-lora-depth

Generate high-quality images from depth maps using Flux.1 [dev] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

image-apps-v2/relighting

Adjust and enhance images with different lighting styles.

relighting

Image editing endpoint for Qwen-Image-Max. Qwen Image Max improves upon the Qwen Image Plus series by enhancing the realism and naturalness of images.

qwen-image-max/edit

Image editing endpoint for Qwen-Image-Max. Qwen Image Max improves upon the Qwen Image Plus series by enhancing the realism and naturalness of images.

qwen-image

max