Search Page 26

Showing 28 of 1396 results

Use the latest pixverse v5.6 model to turn your texts and images into amazing videos.

Generate video prompts using a variety of techniques including camera direction, style, pacing, special effects and more.

openrouter/router/audio

Run any audio capable LLM with fal. Process audio files — transcription, analysis, understanding, understand— using Gemini (Google) models. Supports wav, mp3, aiff, aac, ogg, flac, m4a. Powered by OpenRouter.

unknown

hunyuan_world/image-to-world

Hunyuan World 1.0 turns a single image into a panorama or a 3D world. It creates realistic scenes from the image, allowing you to explore and view it from different angles.

image-to-3d

kokoro/brazilian-portuguese

A natural and expressive Brazilian Portuguese text-to-speech model optimized for clarity and fluency.

speech

text-to-audio

FLUX.1 [schnell] Redux is a high-performance endpoint for the FLUX.1 [schnell] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux/schnell/redux

FLUX.1 [schnell] Redux is a high-performance endpoint for the FLUX.1 [schnell] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer

image-to-image

hunyuan3d/v2/turbo

Generate 3D models from your images using Hunyuan 3D. A native 3D generative model enabling versatile and high-quality 3D asset creation.

stylized

image-to-3d

Generate high-quality video with audio from reference, character sheet, storyboard using LTX-2.3

new

ltx-2.3-quality/ingredient

Generate high-quality video with audio from reference, character sheet, storyboard using LTX-2.3

ideogram/upscale

Ideogram Upscale enhances the resolution of the reference image by up to 2X and might enhance the reference image too. Optionally refine outputs with a prompt for guided improvements.

upscaling

high-res

image-to-image

FLUX Control LoRA Depth is a high-performance endpoint that uses a control image using a depth map to transfer structure to the generated image and another initial image to guide color.

flux-control-lora-depth/image-to-image

FLUX Control LoRA Depth is a high-performance endpoint that uses a control image using a depth map to transfer structure to the generated image and another initial image to guide color.

Vision

Cohere Transcribe turns your business audio into accurate text, ready for search, analytics, and automation

Nucleus-Image is a text-to-image generation model built on a sparse mixture-of-experts (MoE) diffusion transformer architecture.

HiDream-I1 dev is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.

text-to-image

recraft/v4.1/utility/text-to-image

Recraft V4.1 Utility is a faster, lighter variant of V4.1 made for high-volume creative workflows. Ideal for ideation, A/B exploration, and content pipelines, it keeps Recraft's design sensibility while optimizing for throughput and cost.

void-video-inpainting

VOID removes objects from videos along with all interactions they induce on the scene

utility

editing

video-to-video

moondream3-preview/segment

Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.

mask

segmentation

image-to-image

Adjust and enhance videos with Ray-2 Reframe. This advanced tool seamlessly reframes videos to your desired aspect ratio, intelligently inpainting missing regions to ensure realistic visuals and coherent motion, delivering exceptional quality and creative flexibility.

luma-dream-machine/ray-2-flash/reframe

Adjust and enhance videos with Ray-2 Reframe. This advanced tool seamlessly reframes videos to your desired aspect ratio, intelligently inpainting missing regions to ensure realistic visuals and coherent motion, delivering exceptional quality and creative flexibility.

florence-2-large/caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

captioning

multimodal

vision

Professional-grade creative upscaler that doubles resolution up to 10MP, regenerating sharper textures, refined details, and cleaner faces. Trained exclusively on licensed data for risk-free commercial use.

bria/upscale/creative

Professional-grade creative upscaler that doubles resolution up to 10MP, regenerating sharper textures, refined details, and cleaner faces. Trained exclusively on licensed data for risk-free commercial use.

kandinsky5-pro/image-to-video

Kandinsky 5.0 Pro is a diffusion model for fast, high-quality image-to-video generation.

image-to-video

decart/lucy-restyle

Restyle videos up to 30 min long - maintaining maximum detail quality.

video-edit

video-to-video

kokoro/spanish

A natural-sounding Spanish text-to-speech model optimized for Latin American and European Spanish.

speech

text-to-audio

thinksound

Generate realistic audio for a video with an optional text prompt and combine

Generate videos from prompts using CogVideoX-5B

text-to-video

image-editing/realism

Add details to faces, enhance face features, remove blur.

wan-pro/image-to-video

Wan-2.1 Pro is a premium image-to-video model that generates high-quality 1080p videos at 30fps with up to 6 seconds duration, delivering exceptional visual quality and motion diversity from images

Use React-1 from SyncLabs to refine human emotions and do realistic lip-sync without losing details!

lipsync

video-to-video

Showing 701 to 728 of 1396 results