Skip to main content
SORA 2 I2V is OpenAI’s video and audio generation model that creates dynamic videos from static images. It excels at generating videos with complex actions, ambient sounds, and character dialogue based on detailed text prompts.

Try in playground

Test SORA 2 I2V in the Runpod Hub playground.
Endpointhttps://api.runpod.ai/v2/sora-2-i2v/runsync
Pricing$0.40–$1.20 per video
TypeVideo generation

Request

All parameters are passed within the input object in the request body.
input.prompt
string
required
Text description of the desired video, including action, ambient sound, and character dialogue.
input.image
string
required
URL of the source image to animate.
input.duration
integer
required
Video duration in seconds. Valid options: 4, 8, or 12.
curl -X POST "https://api.runpod.ai/v2/sora-2-i2v/runsync" \
  -H "Authorization: Bearer $RUNPOD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "input": {
      "prompt": "Action: The mech slowly pushes itself up with a damaged mechanical arm, sparks flying. Ambient Sound: Distant explosions, electrical sizzle. Character Dialogue: (Processed mechanical voice) No retreat.",
      "image": "https://example.com/mech.jpeg",
      "duration": 4
    }
  }'

Response

id
string
Unique identifier for the request.
status
string
Request status. Returns COMPLETED on success, FAILED on error.
delayTime
integer
Time in milliseconds the request spent in queue before processing began.
executionTime
integer
Time in milliseconds the model took to generate the video.
workerId
string
Identifier of the worker that processed the request.
output
object
The generation result containing the video URL and cost.
output.video_url
string
URL of the generated video. This URL expires after 7 days.
output.cost
float
Cost of the generation in USD.
{
  "id": "sync-a1b2c3d4-e5f6-7890-abcd-ef1234567890-u1",
  "status": "COMPLETED",
  "delayTime": 35,
  "executionTime": 120345,
  "workerId": "oqk7ao1uomckye",
  "output": {
    "video_url": "https://video.runpod.ai/abc123/output.mp4",
    "cost": 0.40
  }
}
Video URLs expire after 7 days. Download and store generated videos immediately if you need to keep them.

Prompt structure

For best results, structure your prompt with these components:
  • Action: Describe what happens in the scene
  • Ambient Sound: Describe background audio
  • Character Dialogue: Include spoken lines with voice descriptions

Cost calculation

SORA 2 I2V pricing varies by duration:
DurationCost
4 seconds$0.40
8 seconds$0.80
12 seconds$1.20