Tero Subtitler (maintained by URUWorks): Offers a user-friendly multilingual interface with comprehensive subtitle editing capabilities.
It supports multiple subtitle formats and has powerful editing tools, including timeline waveform visualization, automatic backup, translation memory, audio and video preview, and other functions, allowing for easy subtitle creation, editing, and exporting.
It also integrates a variety of advanced technologies, such as automatic transcription, automatic translation, and video dubbing, which greatly improves the efficiency and quality of subtitle production.
1. Project introduction
As you can see from the README, TeroSubtitler is a “subtitle editing software” that supports multi-platform (Windows/macOS/Linux) use. It is positioned as a more comprehensive subtitle tool, not just simple subtitle format conversion or timeline editing.
It is an open-source project of URUWorks and follows the MPL-2.0 license.
2. Main functions
It supports the following functions/features as seen from the README and project documentation:
| Functional categories | Specifics |
|---|---|
| Interface & Usability | User-friendly interface, multilingual interface support (i.e. interface can be localized) |
| Editors | Support for subtitle editing in SMPTE and Media mode (i.e. different timeline/timecode handling) |
| Format support | Support import/export of multiple subtitle formats |
| Undo/Redo | Multi-level undo/redo operations |
| Search / Replace | Support searching and replacing subtitle text |
| Automatic backups | Automatic saving and backup mechanism during the editing process |
| Mode / Assist | There is Source mode, Transcription mode, etc. (may show original subtitles/transcription/originals, etc.) |
| Translation assistance | There is support for “Translation Memory” to facilitate the reuse of sentences or templates |
| Audiovisual preview | You can preview the video/audio and subtitle synchronization effect in the software (“Audiovisual preview”). |
| Timeline / Waveform | There is a timeline interface with waveform visualization, which is convenient for viewing audio levels and aligning subtitle time points |
| Tools / Actions | Includes frame/video rate conversion, formatting (fonts, alignment, etc.) |
| Quality Control & Analysis | Tools like spell checking, subtitle proofreading, comparing multiple subtitle versions, and more |
| Automation / Correction | Automatically detect errors, violations (e.g., timeouts, overlaps, etc.) and correct functions |
| Special export/generation | Export subtitles to MP3, generate hardcoded subtitles, support Blu-ray SUP format, generate blank videos, and even video dubbing/text-to-speech (TTS) capabilities |
| Automatic transcription / recognition | Integrate whisper.cpp or Faster-Whisper for automatic subtitle generation/transcription from audio |
| Online / URL support | With yt-dlp support, open video feeds from web URLs |
| OCR support | Integration of the Tesseract engine for text recognition (OCR) from images |
In addition, it uses mpv as the internal video playback engine and FFmpeg for audio and video processing (transcoding, frame capture, etc.).
3. Technical architecture/stack
From the project description it can be inferred (and partly indicated by the README) its technical architecture features:
- Language/Framework: The main code of the project is written in Pascal (Free Pascal / Lazarus). The README mentions “Lazarus IDE” as the compilation environment.
- External Dependencies / Integration Tools:
- MPV: Used as video/audio playback and synchronized preview.
- FFmpeg: Used for underlying audio and video processing, transcoding, and other operations.
- yt-dlp: Used to support opening videos via URL (download or streaming support)
- whisper.cpp or Faster-Whisper: for speech recognition / audio-to-text (automatic transcription)
- Tesseract: OCR engine for identifying text from video frames/images (possibly for hard subtitle image recognition or text extraction from frames)
- Several internal packages (UW Common units, UW Subtitle API, UW Tero Controls, etc.) are used as its core modules.
- UI components: Use the commonly used component sets of Lazarus (e.g., ATSynEdit, ATFlatControls, BGRABitmap, FPSpreadsheet, etc.)
Because with Pascal / Lazarus, this means that it is a native desktop app and not based on Electron, Qt, . NET. In this way, the architecture may have advantages/disadvantages in terms of UI responsiveness, resource usage, and startup speed.
4. Advantages and limitations
Pros:
- Comprehensive features – Compared to simple subtitle editors, TeroSubtitler is a “more “all-round” tool with functions ranging from editing, verification, transcription, translation, previewing, format conversion, and even generating hard subtitle videos and dubbing.
- Open source & free — Under the MPL 2.0 license, you can view, modify, extend the source code, and make it easy for the community to participate.
- Cross-platform support — While implemented with Pascal, the goal is to support mainstream desktop platforms, not limited to one platform.
- Integrate modern tools — Integrate modern AI/tools like whisper / TTS / OCR to enhance automation.
- Strong detail control — quality control, correction tools, multi-level undo, search replacement, timeline waveform display, etc., suitable for professional/semi-professional users.
Limitations / Risks / Challenges
- Learning curve – Feature-rich means the interface and operation can be complex, and new users need time to get started.
- Pascal / Lazarus ecosystem is smaller — there may be fewer community resources, third-party libraries, maintainers than more popular languages/frameworks.
- Relying on multiple external tools – mpv, FFmpeg, whisper, Tesseract, etc. can cause trouble with configuration and compatibility on different systems.
- Performance & Resource Consumption — If you’re dealing with long videos, complex subtitles, multitrack audio, OCR/transcription, etc., it can be CPU/memory demanding.
- Stability/bug risk – Open source projects, with many features, can easily introduce bugs, incompatibilities, or platform differences in interaction.
- Accuracy limitations for automation – Features like automatic transcription and OCR rely on external models/engines, which can lead to errors due to factors such as recognition errors, language/accent/sound quality, etc.
5. Usage scenarios
This project is suitable for the following types of people/scenarios:
- Subtitle production/translation workers want to complete the process of editing, translating, proofreading, and previewing in one tool.
- Users with fine requirements for subtitle timelines, formats, and quality are not satisfied with simple tools.
- Users who have certain technical skills (such as being able to debug dependencies, compile) or are willing to try open source tools.
- Education/Research/Open Source Community Users who want to customize, extend, and redevelop subtitle tools.
- Subtitle workflows that require cross-platform support (Windows/macOS/Linux).
Github:https://github.com/URUWorks/TeroSubtitler
Tubing: