Method 1: Use a web transcript tool (fastest)
Paste the YouTube video URL into a transcript extraction tool. The tool fetches captions from YouTube's data, formats them, and returns clean text.
Best for:
- One-off extractions
- Multiple videos
- SRT export with timestamps
- Translation
- Bulk research workflows
Steps:
- Copy the video URL from your browser
- Paste into SubExtract (or similar)
- Click Extract
- Copy the output, or download as SRT or TXT
Method 2: Use YouTube's native transcript view
YouTube has a built-in "Show transcript" feature on most videos.
Steps:
- Open the video on YouTube (web only — mobile app doesn't have this)
- Click the three-dot menu below the video → Show transcript
- The transcript appears in a sidebar with timestamps
- Manually select all text and copy
Best for: quick one-off reads when you're already on YouTube.
Limitations: copy-paste only, no SRT export, no batch processing, mobile app missing.
Method 3: Chrome extensions
Install a transcript extension and click its icon while watching a video.
Best for: users who watch a lot of YouTube and want one-click access.
Limitations: browser-bound (only works in Chrome), may break with YouTube interface changes, often comes with login walls or in-app upsells.
Comparison
| Method | Speed | SRT export | Bulk | Mobile | Cost | |---|---|---|---|---|---| | Web tool | Fast | Yes | Yes | Yes | Free tier | | Native transcript | Slow (manual copy) | No | No | No | Free | | Chrome extension | Fast (one-click) | Some | No | No | Often paid |
For most use cases, the web tool is fastest and most flexible.
Common issues and fixes
"This video has no transcript" — the video has no captions in any language. Auto-generation may have failed (very common with low audio quality). Nothing can extract what doesn't exist.
Transcript is in the wrong language — YouTube auto-generates in the detected speech language. Use the language picker on the extraction tool to switch, or translate after extraction.
Music or sound effects in transcript — auto-captions sometimes include [Music] or [Applause] markers. Most tools preserve these; you can find-and-replace if you want plain prose.
Frequently asked questions
Is the transcript accurate? For clear English speech, auto-generated transcripts are ~95% accurate. Heavy accents, technical jargon, and overlapping speakers reduce accuracy. Always proofread for important uses.
Can I get transcripts for live streams? Once a live stream ends and the video is archived, captions are usually available within a few hours. During a live stream, captions may be live-generated but extraction tools typically wait for the archive.
What format should I save in? For reading or research: plain text (.txt). For video editing or accessibility work: SRT (with timestamps). For LLM context: plain text without timestamps.