Introduction
Video Bridge is an AI co-pilot for Premiere Pro. Tell it what you want — it cuts silences, tags B-roll, transcribes interviews, and acts on your timeline.
What it helps you do
- Smart Cut — auto-detect silences and filler words
- B-roll classifier — AI tags every clip
- Captions — word-level, 10 animated styles
- Chat your edits — "cut everything before 0:30" → executes
- Bin organise — bulk rename and auto-bin by tag
Installation
Signed .zxp extension for Premiere Pro 2024+.
Quick install
- Download. Get
VideoBridge.zxpfrom your order receipt. - Close Premiere. Installer needs Premiere closed.
- Run installer. Double-click — ZXPInstaller signs and registers.
- Open Premiere.
Window → Extensions → Video Bridge.
System requirements
| Requirement | Minimum |
|---|---|
| Premiere Pro | 2024 (24.0+) |
| OS | macOS 12+, Windows 10+ |
| RAM | 16 GB |
| Disk | ~2 GB (incl. local Whisper) |
Setup & API keys
Video Bridge is BYOK — you pay providers directly.
Supported providers
Anthropic (Claude)
Best for transcript reasoning. Get key.
OpenAI (GPT & Whisper)
Required for Cloud Whisper. Get key.
Google (Gemini)
Strongest vision. Get key.
Adding a key
- Open Settings. Gear icon.
- Paste your key. Each provider has its own field.
- Pick defaults. Sonnet 4 for chat, Whisper-large-v3 for transcripts, Gemini Flash for B-roll.
Your first cut
Two minutes from panel-open to a cleaned-up timeline.
- Drop a clip on the timeline.
- Switch to Smart Cut tab. Default thresholds work for most footage.
- Click Preview. Cuts appear as markers — scrub to verify.
- Click Apply. Ripple deletes silences. Fully undoable.
- Iterate. Raise threshold for noisier audio.
Writing effective prompts
When chatting with the timeline, specificity wins.
Be specific about edit operations
tighten this up
No clear target.
remove silences over 0.5s between 00:00:30 and 00:02:00
Bounded, measurable.
Transcript-anchored prompts
# Find moments by what was said "Find moment where I say 'the secret is'" "Cut every section about pricing" "Add a marker on every laugh"
Bin queries
# Query via B-roll tags "Show all outdoor clips under 5s" "Find clips tagged interview + closeup"
Common workflows
Three recipes — copy them, adapt them.
Recipe 1 — Podcast in 10 minutes
- Smart Cut. Pull silences + fillers (saves 15–20%).
- Caption. Cloud Whisper → "Podcast-Bold".
- Chat. "Cut sections under 8 seconds".
- Organise. Auto-bin by interview / B-roll / titles.
Recipe 2 — Tag a 4-hour shoot
- Select all unclassified clips. 200+ at once is fine.
- B-roll → Classify. Gemini Flash, ~$0.20 for whole shoot.
- Search. "outdoor + person + warm" → 12 instant results.
Recipe 3 — Caption a 60-min interview
- Smart Cut first. Cuts Whisper bill by 10–20%.
- Cloud Whisper. Pick "Minimal-Mono".
- Adjust. Essential Graphics → bulk restyle.
Best practices
Pick the right model
| Task | Model | Why |
|---|---|---|
| Transcript reasoning | Sonnet 4 | Best at "find moment" |
| Quick chat ops | Haiku | 5× cheaper |
| B-roll classification | Gemini Flash | Cheapest vision |
| Captions | Whisper-large-v3 | Best on accents |
Smart Cut before transcribe
Removing silences first reduces Whisper costs by 10–20%. Always cut first.
Custom B-roll prompts
Default schema is generic. For wedding/vlog/doc — write a project-specific prompt in Settings → B-roll.
Troubleshooting
Smart Cut produces too many/few cuts
- Raise silence threshold for noisy recordings
- Increase padding (80ms → 150ms)
- Disable filler removal to keep pauses
"API key invalid"
- Re-paste — trailing space breaks it
- Confirm OpenAI key has Whisper access
- Switch to Local Whisper as fallback
B-roll returns weird tags
- Write a custom prompt for your genre
- Increase frames-sampled-per-clip
Panel crashes Premiere on open
CEP cache issue. Delete ~/Library/Caches/CSXS/ (macOS) or %APPDATA%\Adobe\CEP\extensions\cache\ (Win).
Smart Cut
Auto-detect and ripple-delete silences, fillers, bad takes.
Workflow
- Select clip(s). Multi-select supported.
- Configure thresholds. Silence (-40dB), padding (80ms), fillers.
- Preview cuts. Markers appear — scrub to verify.
- Apply. Ripple-deletes. Undoable.
Filler word list
Settings → Smart Cut. Add language-specific fillers (Hindi: matlab, haan) or your own tics.
B-roll classifier
AI scans your bin and tags every clip — searchable instantly.
How it works
- Select clips (or "all unclassified")
- Choose vision model (Gemini Flash recommended)
- Click Classify — samples frames, asks AI to tag
- Tags appear as metadata, searchable
Default schema
type: "interview" | "broll" | "screen" location: "indoor" | "outdoor" | "studio" subjects: ["person", "laptop"] mood: "warm" | "tense" | "energetic"
Custom prompts
Settings → B-roll → Custom prompt. Paste your own schema for project-specific tagging.
Caption panel
Word-level captions with animated styles, built on Whisper.
Workflow
- Select footage clip. Any audio-bearing video.
- Choose transcription. Local (offline, ~1.5GB model) or Cloud.
- Pick a style. 10 presets.
- Generate. Renders as Essential Graphics on V2.
Languages
Whisper supports 99 languages. Force language for code-mixed audio.
AI to Premiere
Chat with your timeline. Plain English → Premiere operations.
Example prompts
# Edit ops "Cut everything before 00:00:30" "Move clip 3 to V2" "Add a 0.5s dip-to-black between cuts 4 & 5" # Transcript-based "Find moment where I say 'the secret is'" "Cut every section about pricing" # Bin queries "Find outdoor B-roll under 5s"
Bin organise
Rename, recolor, re-bin clips in bulk.
Smart rename
AI suggests names from B-roll tags: MVI_4382.MOV → interview_sagar_closeup_01.mov.
Auto-bin
Create bins by tag: Interviews / B-roll / Screen / Titles. Clips moved, not duplicated.
Shortcuts
| Action | Shortcut |
|---|---|
| Send chat prompt | ⌘ Enter |
| Run Smart Cut | ⌘ Shift X |
| Generate captions | ⌘ Shift C |
| Classify clips | ⌘ Shift B |
| Switch tab | ⌘ ] / ⌘ [ |
Cost & usage
| Task | Model | Cost |
|---|---|---|
| Smart Cut on 30-min | Local | $0.00 |
| Chat edit op | Sonnet 4 | $0.01–0.05 |
| Caption 1-min | Whisper Cloud | $0.006 |
| Caption 60-min | Whisper Cloud | $0.36 |
| Classify per clip | Gemini Flash | $0.001 |
Privacy & data
What's sent to providers
- Audio file (Whisper transcription)
- Sampled 720p JPEG frames (B-roll)
- Transcript text (chat)
What stays local
- Video files (never uploaded)
- API keys (OS keychain)
- Chat history, classification cache
Local Whisper
Settings → Captions → Local. First run downloads ~1.5 GB. Then fully offline.
FAQ
Does it work on DaVinci Resolve?
Not yet. Resolve uses a different API. Roadmap for 2026.
Can I run Whisper fully offline?
Yes — Local Whisper. ~1.5 GB model.
vs Descript / AutoEdit?
Runs inside Premiere — no round-tripping. $49 one-time vs monthly subs.
Per machine or user?
Per user. Up to 2 machines.
Are video files uploaded?
No. Only audio or sampled JPEGs.
Refund policy?
14-day no-questions-asked refund.
Changelog
What's new, version by version.
Public launch
5 days ago- Smart Cut with custom thresholds and filler list
- B-roll classifier with custom prompts
- Whisper local + cloud captions, 10 styles
- AI to Premiere multi-turn chat
- Bin auto-organise
Roadmap
What's coming next.
In progress
Target: next month- Color-grade-aware classification (LUT detection)
- Smart Cut for music-bed sections
- Custom caption styles
Considering
Not committed- DaVinci Resolve port
- Team licenses with shared bin tags
- Multi-speaker diarization