> ## Documentation Index
> Fetch the complete documentation index at: https://docs.runpod.io/llms.txt
> Use this file to discover all available pages before exploring further.

# Whisper V3 Large

> State-of-the-art automatic speech recognition for transcribing audio to text.

Whisper V3 Large is OpenAI's state-of-the-art automatic speech recognition model that transcribes audio to text. It supports multiple languages and can handle various audio formats with high accuracy.

<Card title="Try in playground" icon="play" href="https://console.runpod.io/hub/playground/audio/whisper-v3-large" horizontal>
  Test Whisper V3 Large in the Runpod Hub playground.
</Card>

|              |                                                     |
| ------------ | --------------------------------------------------- |
| **Endpoint** | `https://api.runpod.ai/v2/whisper-v3-large/runsync` |
| **Pricing**  | \$0.05 per 1000 characters                          |
| **Type**     | Audio transcription                                 |

## Request

All parameters are passed within the `input` object in the request body.

<ParamField body="input.audio" type="string" required>
  URL of the audio file to transcribe.
</ParamField>

<ParamField body="input.prompt" type="string">
  Optional context or prompt to guide transcription. Useful for domain-specific terminology or formatting hints.
</ParamField>

<RequestExample>
  ```bash cURL theme={"theme":{"light":"github-light","dark":"github-dark"}}
  curl -X POST "https://api.runpod.ai/v2/whisper-v3-large/runsync" \
    -H "Authorization: Bearer $RUNPOD_API_KEY" \
    -H "Content-Type: application/json" \
    -d '{
      "input": {
        "prompt": "",
        "audio": "https://example.com/audio-file.mp3"
      }
    }'
  ```

  ```python Python theme={"theme":{"light":"github-light","dark":"github-dark"}}
  import requests

  response = requests.post(
      "https://api.runpod.ai/v2/whisper-v3-large/runsync",
      headers={
          "Authorization": f"Bearer {RUNPOD_API_KEY}",
          "Content-Type": "application/json",
      },
      json={
          "input": {
              "prompt": "",
              "audio": "https://example.com/audio-file.mp3",
          }
      },
  )

  result = response.json()
  print(result["output"]["transcription"])
  ```

  ```javascript JavaScript theme={"theme":{"light":"github-light","dark":"github-dark"}}
  const response = await fetch(
    "https://api.runpod.ai/v2/whisper-v3-large/runsync",
    {
      method: "POST",
      headers: {
        Authorization: `Bearer ${RUNPOD_API_KEY}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        input: {
          prompt: "",
          audio: "https://example.com/audio-file.mp3",
        },
      }),
    }
  );

  const result = await response.json();
  console.log(result.output.transcription);
  ```
</RequestExample>

## Response

<ResponseField name="id" type="string">
  Unique identifier for the request.
</ResponseField>

<ResponseField name="status" type="string">
  Request status. Returns `COMPLETED` on success, `FAILED` on error.
</ResponseField>

<ResponseField name="delayTime" type="integer">
  Time in milliseconds the request spent in queue before processing began.
</ResponseField>

<ResponseField name="executionTime" type="integer">
  Time in milliseconds the model took to transcribe the audio.
</ResponseField>

<ResponseField name="workerId" type="string">
  Identifier of the worker that processed the request.
</ResponseField>

<ResponseField name="output" type="object">
  The transcription result.

  <ResponseField name="output.transcription" type="string">
    The transcribed text from the audio.
  </ResponseField>

  <ResponseField name="output.cost" type="float">
    Cost of the transcription in USD.
  </ResponseField>
</ResponseField>

<ResponseExample>
  ```json 200 theme={"theme":{"light":"github-light","dark":"github-dark"}}
  {
    "id": "sync-a1b2c3d4-e5f6-7890-abcd-ef1234567890-u1",
    "status": "COMPLETED",
    "delayTime": 18,
    "executionTime": 12345,
    "workerId": "oqk7ao1uomckye",
    "output": {
      "transcription": "Hello and welcome to this demonstration of Whisper V3. This is a sample transcription that shows the high accuracy of the model.",
      "cost": 0.0065
    }
  }
  ```

  ```json 400 theme={"theme":{"light":"github-light","dark":"github-dark"}}
  {
    "id": "sync-a1b2c3d4-e5f6-7890-abcd-ef1234567890-u1",
    "status": "FAILED",
    "error": "Invalid audio URL or unsupported format"
  }
  ```
</ResponseExample>

## Supported audio formats

Whisper V3 Large supports common audio formats including:

* MP3
* WAV
* FLAC
* M4A
* OGG

## Cost calculation

Whisper V3 Large charges \$0.05 per 1000 characters of transcribed audio. Example costs:

| Characters        | Cost    |
| ----------------- | ------- |
| 500 characters    | \$0.025 |
| 1,000 characters  | \$0.05  |
| 10,000 characters | \$0.50  |
