Manage Endpoints
Learn to manage Serverless Endpoints.
Create an Endpoint
You can create an Endpoint in the Web interface.
- Navigate to Serverless Endpoints.
- Select + New Endpoint and provide the following:
  - An Endpoint Name.
  - Your GPU selection.
  - Your worker configuration.
  - A container image.
- Select Deploy.
Delete an Endpoint
You can delete an Endpoint in the Web interface. Before an Endpoint can be deleted, all workers must be removed.
- Navigate to Serverless Endpoints.
- Select the Endpoint you'd like to remove.
- Select Edit Endpoint and set Max Workers to 0.
- Choose Update and then Delete Endpoint.
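The ordering constraint in these steps (scale to zero workers first, then delete) can be illustrated with a toy model. This is only a sketch: the `Endpoint` class, its fields, and its methods are hypothetical names invented for illustration and are not part of any RunPod API or SDK.

```python
from dataclasses import dataclass


@dataclass
class Endpoint:
    """Hypothetical in-memory stand-in for an Endpoint (not a real API object)."""
    name: str
    max_workers: int
    deleted: bool = False

    def update(self, max_workers: int) -> None:
        # Mirrors "Edit Endpoint" followed by "Update" in the Web interface.
        if max_workers < 0:
            raise ValueError("max_workers must be zero or greater")
        self.max_workers = max_workers

    def delete(self) -> None:
        # Deletion is only allowed once all workers have been removed.
        if self.max_workers != 0:
            raise RuntimeError("set Max Workers to 0 before deleting")
        self.deleted = True


endpoint = Endpoint(name="example-endpoint", max_workers=3)
endpoint.update(max_workers=0)  # step 1: remove all workers
endpoint.delete()               # step 2: delete now succeeds
print(endpoint.deleted)
```

Calling `delete()` while `max_workers` is still above zero raises an error in this model, which mirrors why the Web interface steps set Max Workers to 0 first.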
Edit an Endpoint
You can edit a running Endpoint in the Web interface after you've deployed it.
- Navigate to Serverless Endpoints.
- Select the Endpoint you'd like to edit.
- Select Edit Endpoint and make your changes.
- Choose Update.
Set GPU prioritization for an Endpoint
When creating or modifying a Worker Endpoint, specify your GPU preferences in descending order of priority. This allows you to configure the desired GPU models for your Worker Endpoints.
RunPod attempts to allocate your first choice if it's available. If your preferred GPU isn't available, the system automatically defaults to the next available GPU in your priority list.
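The fallback behavior described above can be sketched as a first-match search over an ordered list. This is a minimal illustration, not RunPod's actual scheduler: the `pick_gpu` function and the GPU labels are assumptions chosen for the example.

```python
from typing import Optional, Sequence, Set


def pick_gpu(priority: Sequence[str], available: Set[str]) -> Optional[str]:
    """Return the first GPU type in `priority` that is currently available.

    `priority` is ordered from most to least preferred. Returns None when
    none of the listed GPU types is available.
    """
    for gpu in priority:
        if gpu in available:
            return gpu
    return None


# Hypothetical GPU labels, listed in descending order of preference.
priority = ["A100 80GB", "A6000", "RTX 4090"]
# Hypothetical snapshot of what is available right now.
available = {"RTX 4090", "A6000"}

print(pick_gpu(priority, available))  # prints "A6000"
```

Here the first choice is unavailable, so the second entry in the priority list is selected even though a third option is also available.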
- Navigate to Serverless Endpoints.
- Select the Endpoint you'd like to update.
- Select the priority of the GPUs you'd like to use.
- Choose Update.
You can force a configuration update by setting Max Workers to 0, selecting Update, and then setting Max Workers back to the value you need.
Add a Network Volume
Network volumes let Workers share data: the volume is mounted at the same path on each Worker. For example, if your Workers use a large language model, you can store the model on a network volume so that every Worker reads the same copy.
- Navigate to Serverless Endpoints.
- Select the Endpoint you'd like to edit.
- Select Edit Endpoint.
- Under Advanced, choose Select Network Volume.
- Select the storage device and then choose Update to continue.
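Because the volume appears at the same path on every Worker, worker code can look for a shared file before fetching its own copy. The sketch below illustrates that pattern under stated assumptions: `volume_root` stands in for wherever the volume is mounted on your Workers (a temporary directory is used here so the example runs anywhere), and `ensure_model` and the `models/` layout are hypothetical names, not part of RunPod.

```python
import tempfile
from pathlib import Path


def ensure_model(volume_root: Path, model_name: str) -> Path:
    """Return the shared path for `model_name`, creating the file if missing.

    The write below is a placeholder for a real download step. This sketch
    does not guard against two Workers writing at the same time.
    """
    path = volume_root / "models" / model_name
    if not path.exists():
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(b"placeholder weights")  # stand-in for a download
    return path


# Assumption: a temporary directory plays the role of the mounted volume.
volume_root = Path(tempfile.mkdtemp())

first = ensure_model(volume_root, "example-model.bin")   # first Worker populates the volume
second = ensure_model(volume_root, "example-model.bin")  # later Workers reuse the same file

print(first == second)
```

In a real deployment you would replace the temporary directory with the volume's mount path and the placeholder write with your actual model download.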