API Reference
API Credit Costs
Use this page as the source of truth for Subclip API credit usage.
| API | Cost |
|---|---|
| Viral Captions | 1 credit per rendered minute + 1 transcription credit + 1 caption analysis credit. Face tracking adds 1 credit minimum, or rendered minutes x 1 credit, whichever is higher. Minimum charge is 2 credits. |
| AI Dubbing | Duration preflight: transcription minutes when no transcript is supplied + dubbing minutes + 5 credits for voice clone/multi-speaker + 3 credits for video background preservation. |
| AI Clipping | Existing AI clipping pipeline credits plus 5 API orchestration credits for import/render orchestration. Viral Captions analysis cost is added when dynamicCaptions is enabled. YouTube transcript fetch adds 1 credit when fetched. |
| Text to Speech | 5 credits per generated audio minute, minimum 1 credit. Voice clone adds 5 credits. |
| Social Media Transcript | 1 credit per source media minute after import and duration probe, rounded up to a whole minute. |
| Video Studio API without AI analysis | 2 credits minimum, or rendered minutes x 1 credit, whichever is higher. Optional BGM/SFX adds 1 credit total. |
| Video Studio API with AI analysis | Base render cost + 2 AI planning credits. Vision analysis adds 0.25 credits per image and 1 credit per video when enabled. |
| Video Studio API voiceover | Adds 1 credit minimum, or rendered minutes x 5 credits, whichever is higher. |
| Enhance Audio API | 1 credit minimum, or duration seconds x 0.10816 credits, whichever is higher. Rounded up to a whole credit. |
Viral Captions API
Workflow: Viral Captions API docs.
Base render is 1 credit per rendered minute. Transcription adds 1 credit. Caption analysis adds 1 credit.
Face tracking adds 1 credit minimum, or rendered minutes x 1 credit, whichever is higher. Minimum charge is 2 credits.
The same formula applies whether captions are generated from ASR or supplied with SRT.
| Job | Credits |
|---|---|
| 1 minute, no face tracking | 3 |
| 1 minute, face tracking | 4 |
| 3 minutes, no face tracking | 5 |
| 3 minutes, face tracking | 8 |
AI Dubbing API
Workflow: AI Dubbing API docs.
Every AI Dubbing API request must include durationSeconds. Subclip checks the estimated credit requirement before creating the job.
The preflight formula is transcription minutes when no transcript is supplied, plus dubbing minutes, plus voice clone and background-preservation add-ons when requested.
| Job | Credits |
|---|---|
| 3 minutes, transcript supplied, catalog voice | 3 |
| 3 minutes, no transcript, catalog voice | 6 |
| 3 minutes, no transcript, clone voice, video background preserved | 14 |
| 3 minutes, multi-speaker | 11 |
AI Clipping API
Workflow: AI Clipping API docs.
AI Clipping API jobs add 5 API orchestration credits for source import, API packaging, and final render orchestration.
When you omit segments, the existing AI clipping pipeline costs still apply. When dynamicCaptions is true, Viral Captions analysis costs also apply.
When youtubeTranscript is true and the YouTube transcript is fetched, the completed job adds 1 credit. If transcript fetch fails and the job falls back to ASR, that extra credit is not counted.
| Job | Credits |
|---|---|
| Selected segments, dynamicCaptions false | +5 API credits |
| AI-selected clips | AI clipping pipeline credits + 5 API credits |
| AI-selected clips with dynamicCaptions true | AI clipping pipeline credits + Viral Captions analysis + 5 API credits |
| YouTube transcript fetched | Any AI Clipping job + 1 credit |
Text to Speech API
Workflow: Text to Speech API docs.
Credits are estimated before the job starts. Text to Speech costs 5 credits per generated audio minute with a 1 credit minimum. Voice clone adds 5 credits.
| Job | Credits |
|---|---|
| 30 seconds generated speech | 3 |
| 1 minute generated speech | 5 |
| 1 minute generated speech with voice clone | 10 |
| 3 minutes generated speech | 15 |
Video Studio API
Workflow: Video Studio API docs.
Without AI analysis or add-ons, cost is 2 credits minimum, or rendered minutes x 1 credit, whichever is higher.
BGM or SFX adds 1 credit total. bgmQuery has no separate search charge.
AI planning adds 2 credits when aiAnalysis is true. Vision analysis adds 0.25 credits per image and 1 credit per video. Audio analysis adds 1 credit per uploaded audio file.
Voiceover adds 1 credit minimum, or rendered minutes x 5 credits, whichever is higher. Stock B-roll search is not exposed in the Video Studio API right now.
| Job | Credits |
|---|---|
| 1 minute render, no AI or add-ons | 2 |
| 3 minute render, no AI or add-ons | 3 |
| 3 minute render with AI planning only | 5 |
| 3 minute render with AI planning, 4 images, 1 video | 7 |
| 3 minute render with BGM or SFX | 4 |
| 3 minute render with voiceover | 18 |
Enhance Audio API
Workflow: Enhance Audio API docs.
API background music separation is disabled. Provider choice does not change the credit formula.
Cost is 1 credit minimum, or duration seconds x 0.10816 credits, whichever is higher. The result is rounded up to the next whole credit.
That works out to about 6.49 credits per minute.
| Job | Credits |
|---|---|
| 1 minute | 7 |
| 3 minutes | 20 |
| 10 minutes | 65 |
Social Media Transcript API
Workflow: Social Media Transcript API docs.
Subclip imports the media first, probes duration, then checks credits before transcription starts.
Cost is 1 credit per source media minute, rounded up to a whole minute.