Convert SRT subtitle files to plain text by stripping sequence numbers and timestamps. Options to preserve or remove blank lines, strip HTML tags from subtitles, and keep or remove speaker labels. Live preview with line count.
Drop an .srt file or paste SRT content into the input. SRT files contain subtitle text with sequence numbers and timestamps in format "00:01:23,456 --> 00:01:26,789". These are stripped leaving only the dialogue text.
Remove blank lines: collapses multiple blank lines for clean paragraphs. Strip HTML tags: removes formatting tags like <i>, <b>, <font> common in subtitle files. Remove speaker labels: strips labels like [SPEAKER] or (music) from transcripts.
The clean text appears in the output panel. Copy to clipboard or download as a .txt file. Useful for creating transcripts, extracting quotes, searching subtitle content, or feeding dialogue to AI tools.
SRT (SubRip Text) is the most widely used subtitle format. Each subtitle block has three parts: a sequence number (1, 2, 3...), a timestamp line (00:01:23,456 --> 00:01:26,789), and the subtitle text (one or more lines). Blocks are separated by blank lines. SRT files are plain text with .srt extension, readable by VLC, YouTube, Netflix, and virtually all video players.
Transcript creation: remove timestamps to get readable dialogue for articles, blog posts, or documentation. Content search: plain text is easier to search than timestamped SRT. AI input: paste transcripts into ChatGPT, Claude, or other AI tools for summarisation, translation, or analysis. Quote extraction: find specific lines without timestamp noise. Subtitle editing: clean up the text in a text editor before re-importing as SRT.
SRT supports a subset of HTML for formatting: <i>italic text</i> for italics (common for foreign language dialogue), <b>bold text</b> for emphasis, <u>underlined text</u>, and <font color="#ff0000">coloured text</font>. Some subtitle tools also add <c> tags for WebVTT-style colour classes. Enable "Strip HTML tags" to remove all these and get plain dialogue text.
Speaker labels identify who is speaking: [JOHN]: Hello there. Or (narrator): Once upon a time... Or >> for new speaker in SDH (Subtitles for the Deaf and Hard-of-Hearing) subtitles. These are useful in subtitle files but may clutter a plain text transcript. The strip speaker labels option removes square-bracket [labels], parenthetical (labels), and >> markers.
WebVTT is the modern web subtitle format used by HTML5 video. Its timestamp format is slightly different (00:01:23.456 --> 00:01:26.789 with a period instead of comma). This tool primarily targets SRT format. For WebVTT, the same approach works — paste the VTT content and the tool strips the WEBVTT header and timestamps leaving the dialogue text.
Yes — plain text transcripts are ideal for AI analysis. Remove timestamps and blank lines for the most compact input. The resulting clean dialogue text can be pasted directly into ChatGPT, Claude, or other AI tools for: summarising video content, extracting key points, translating dialogue, identifying speakers, checking factual claims, or generating show notes.