Skip to main content
This page lists all available models for Runpod Public Endpoints. Select a model below to view its parameters, pricing, and usage examples. You can also browse and test models in the Runpod Hub playground.

Image models

Generate and edit images with text prompts or reference images.
ModelDescriptionPrice
Flux DevHigh-quality image generation with exceptional prompt adherence.$0.02/megapixel
Flux SchnellFast, lightweight generation for prototyping.$0.0024/megapixel
Flux Kontext DevEdit images based on text instructions.$0.025/image
P-Image T2IUltra-fast text-to-image with automatic prompt enhancement.$0.005/image
P-Image EditPremium image editing with complex compositions.$0.01/image
Qwen ImageImage generation with advanced text rendering.$0.02/image
Qwen Image LoRAImage generation with LoRA customization.$0.025/image
Qwen Image EditImage editing with text rendering capabilities.$0.02/image
Qwen Image Edit 2511Enhanced image editing with improved consistency.$0.02/image
Qwen Image Edit 2511 LoRAAdvanced editing with LoRA support.$0.025/image
Seedream 4.0 T2INew-generation text-to-image creation.$0.027/image
Seedream 4.0 EditNew-generation image editing.$0.027/image
Seedream 3.0Bilingual image generation (Chinese-English).$0.03/image
WAN 2.6 T2IOpen-source text-to-image at 1024x1024.$0.03/image
Z-Image TurboFast 6B parameter image generation.$0.005/image
Nano Banana EditGoogle’s model for combining multiple images.$0.038/image
Nano Banana Pro EditAdvanced multi-image editing with resolution options.$0.14–$0.24/image

Video models

Create videos from images or text prompts. Pricing varies by resolution and duration.
ModelDescriptionPrice
InfiniteTalkAudio-driven talking/singing video generation.$0.25 (480p), $0.50 (720p)
Kling v2.1 I2V ProProfessional image-to-video with enhanced fidelity.$0.45/5s, $0.90/10s
Kling v2.6 Motion ControlMotion transfer from reference videos.$0.21/3s, $0.63/10s
Kling Video O1 R2VCreative video with multi-reference images.$0.112/second
Seedance 1.0 ProHigh-performance video with multi-shot storytelling.From $0.12/5s
Seedance 1.5 Pro I2VCinematic image-to-video with expressive motion.$0.024–$0.052/second
SORA 2 I2VOpenAI’s video and audio generation.$0.40 (4s), $0.80 (8s), $1.20 (12s)
SORA 2 Pro I2VProfessional-grade SORA video generation.From $1.20 (720p/4s)
WAN 2.6 T2VText-to-video with resolution options.$0.50/5s (480p), $2.25/10s (720p)
WAN 2.5Image-to-video with prompt expansion.From $0.25/5s
WAN 2.2 I2V LoRAImage-to-video with LoRA camera controls.$0.35/5s, $0.56/8s
WAN 2.2 I2VOpen-source image-to-video at 720p.$0.30/5s
WAN 2.2 T2VOpen-source text-to-video at 720p.$0.30/5s
WAN 2.1 I2VImage-to-video at 720p.$0.30/5s
WAN 2.1 T2VText-to-video at 720p.$0.30/5s

Text models

Generate text with large language models.
ModelDescriptionPrice
Cogito 671B v2.1671B MoE model with FP8 dynamic quantization.$0.50/1M tokens
GPT-OSS 120BOpenAI’s open-weight 120B parameter model.$10.00/1M tokens
IBM Granite 4.032B parameter long-context instruct model.$10.00/1M tokens
Qwen3 32B AWQAdvanced reasoning and multilingual support. OpenAI-compatible.$10.00/1M tokens

Audio models

Transcribe speech or generate audio from text.
ModelDescriptionPrice
Chatterbox TurboFast TTS with 20 preset voices and voice cloning.$0.001/second
Whisper V3 LargeState-of-the-art speech recognition.$0.05/1K chars
Minimax Speech 02 HDText-to-speech with emotional control.$0.05/1K chars

Next steps