Skip to main content

Job operations

This page provides instructions on job operations using the Runpod Endpoint. You can invoke a job to run Endpoints the way you would interact with an API, get a status of a job, purge your job queue, and more with operations.

The following guide demonstrates how to use cURL to interact with an Endpoint. You can also use the following SDK to interact with Endpoints programmatically:

For information on sending requests, see Send a request.

Asynchronous Endpoints

Asynchronous endpoints are designed for long-running tasks. When you submit a job through these endpoints, you receive a Job ID in response. You can use this Job ID to check the status of your job at a later time, allowing your application to continue processing without waiting for the job to complete immediately. This approach is particularly useful for tasks that require significant processing time or when you want to manage multiple jobs concurrently.

curl -X POST https://api.runpod.ai/v2/{endpoint_id}/run \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer ${API_KEY}' \
-d '{"input": {"prompt": "Your prompt"}}'

Synchronous Endpoints

Synchronous endpoints are ideal for short-lived tasks where immediate results are necessary. Unlike asynchronous calls, synchronous endpoints wait for the job to complete and return the result directly in the response. This method is suitable for operations that are expected to complete quickly and where the client can afford to wait for the result.

curl -X POST https://api.runpod.ai/v2/{endpoint_id}/runsync \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer ${API_KEY}' \
-d '{"input": {"prompt": "Your prompt"}}'

Health Endpoint

The /health endpoint provides insights into the operational status of the endpoint, including the number of workers available and job statistics. This information can be used to monitor the health and performance of the API, helping you manage workload and troubleshoot issues more effectively.

curl --request GET \
--url https://api.runpod.ai/v2/{endpoint_id}/health \
--header 'accept: application/json' \
--header 'Authorization: Bearer ${API_KEY}'

Cancel Job

To cancel a job in progress, specify the cancel parameter with the endpoint ID and the job ID.

curl -X POST https://api.runpod.ai/v2/{endpoint_id}/cancel/{job_id} \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer ${API_KEY}'

Purge Queue Endpoint

The /purge-queue endpoint allows you to clear all jobs that are currently in the queue. This operation does not affect jobs that are already in progress. It is a useful tool for managing your job queue, especially in situations where you need to reset or clear pending tasks due to operational changes or errors.

curl -X POST https://api.runpod.ai/v2/{endpoint_id}/purge-queue \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer ${API_KEY}'

Check Job Status

To track the progress or result of an asynchronous job, you can check its status using the Job ID. This endpoint provides detailed information about the job, including its current status, execution time, and the output if the job has completed.

curl https://api.runpod.ai/v2/{endpoint_id}/status/{job_id} \
-H 'Authorization: Bearer ${API_KEY}'

Stream results

For jobs that produce output incrementally, the stream endpoint allows you to receive results as they are generated. This is particularly useful for tasks that involve continuous data processing or where immediate partial results are beneficial.

curl https://api.runpod.ai/v2/{endpoint_id}/stream/{job_id} \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer ${API_KEY}'
note

The maximum size for a payload that can be sent using yield to stream results is 1 MB.

Rate Limits

RunPod's Endpoints facilitate submitting jobs and retrieving outputs. Access these endpoints at: https://api.runpod.ai/v2/{endpoint_id}/{operation}

  • /run

    • 1000 requests per 10 seconds, 200 concurrent
  • /runsync

    • 2000 requests per 10 seconds, 400 concurrent
  • /status, /status-sync, /stream

    • 2000 requests per 10 seconds, 400 concurrent
  • /cancel

    • 100 requests per 10 seconds, 20 concurrent
  • /purge-queue

    • 2 requests per 10 seconds
  • /openai/*

    • 2000 requests per 10 seconds, 400 concurrent
  • /requests

    • 10 requests per 10 seconds, 2 concurrent
note

Retrieve results from /status within 30 minutes for privacy protection.

For reference information on Endpoints, see Endpoint Operations.