Welcome to Runpod - Runpod Documentation

Runpod is a cloud computing platform built for AI, machine learning, and general compute needs. Whether you’re running deep learning models, training AI, or deploying cloud-based applications, Runpod provides scalable, high-performance GPU and CPU resources to power your workloads.

Get started

If you’re new to Runpod, start here to learn the essentials and deploy your first GPU.

Quickstart

Create an account, deploy your first GPU Pod, and use it to execute code.

Manage accounts

Learn how to manage your personal and team accounts and set up permissions.

Create an API key

Create API keys to manage your access to Runpod resources.

Connection options

Learn about different methods for connecting to Runpod and managing resources.

Serverless

Serverless offers pay-per-second serverless computing with built-in autoscaling for production workloads.

Introduction

Learn how Serverless works and how to deploy pre-configured endpoints.

Pricing

Learn how Serverless billing works and how to optimize your costs.

vLLM quickstart

Deploy a large language model for text or image generation in minutes using vLLM.

Build your first worker

Build a custom worker and deploy it as a Serverless endpoint.

Pods

Pods allow you to run containerized workloads on dedicated GPU or CPU instances.

Introduction

Understand the components of a Pod and options for configuration.

Choose a Pod

Learn how to choose the right Pod for your workload.

Support

Contact

Submit a support request using our contact page.

Status page

Check the status of Runpod services and infrastructure.

Discord

Join the Runpod community on Discord.

API Reference

​Get started

Quickstart

Manage accounts

Create an API key

Connection options

​Serverless

Introduction

Pricing

vLLM quickstart

Build your first worker

​Pods

Introduction

Choose a Pod

​Support

Contact

Status page

Discord

Get started

Serverless

Pods

Support