> ## Documentation Index
> Fetch the complete documentation index at: https://docs.runpod.io/llms.txt
> Use this file to discover all available pages before exploring further.

# Pricing

> Learn how Serverless billing works to optimize your costs.

<div className="overview-page-wrapper" />

<Tip>
  Runpod offers custom pricing plans for large scale and enterprise workloads. [Contact our sales team](https://ecykq.share.hsforms.com/2MZdZATC3Rb62Dgci7knjbA) to learn more.
</Tip>

Serverless offers pay-per-second pricing with no upfront costs. You're billed from when a worker starts until it fully stops, rounded up to the nearest second.

## Worker types

|              | Flex workers                          | Active workers                               |
| ------------ | ------------------------------------- | -------------------------------------------- |
| **Behavior** | Scale to zero when idle               | Always running (24/7)                        |
| **Pricing**  | Standard per-second rate              | Discounts available through sales inquiry    |
| **Best for** | Variable workloads, cost optimization | Consistent traffic, low-latency requirements |

## What you're billed for

Your total cost includes compute time and storage:

| Cost component     | Description                      | Rate                                                         |
| ------------------ | -------------------------------- | ------------------------------------------------------------ |
| **Compute**        | GPU time while workers run       | See the [Runpod pricing page](https://www.runpod.io/pricing) |
| **Container disk** | Worker storage (5-min intervals) | \~\$0.10/GB/month                                            |
| **Network volume** | Shared persistent storage        | \$0.07/GB/month (\< 1TB), \$0.05/GB/month (> 1TB)            |

### Compute cost breakdown

Workers incur charges during three phases:

1. **Start time**: Initializing the container and loading models into GPU memory. Minimize with [FlashBoot](/serverless/endpoints/endpoint-configurations#flashboot) or [model caching](/serverless/endpoints/model-caching).

2. **Execution time**: Processing requests. Set [execution timeouts](/serverless/endpoints/endpoint-configurations#execution-timeout) to prevent runaway jobs.

3. **Idle timeout duration**: The time a worker remains active (running) after completing a request, waiting for additional requests before scaling down (default: 5 seconds). Configure in [endpoint settings](/serverless/endpoints/endpoint-configurations#idle-timeout).

<Tip>
  For high-volume workloads with significant storage needs, use [network volumes](/storage/network-volumes) to share data across workers and reduce per-worker storage costs.
</Tip>

## Account limits

**Spend limit**: Default limit of \$80/hour across all resources. [Contact support](https://www.runpod.io/contact) to increase.

## Billing support

If you believe you've been billed incorrectly, [contact support](https://www.runpod.io/contact), including the following information in your ticket:

* Endpoint ID
* Request ID (if applicable)
* Approximate time of the issue