Complete Version history / Release notes / Changelog / What's New for VideOCR
VideOCR v1.5.1
Hotfix release:
Fixes an issue where the program did not correctly track the start time of subtitles
Removes stitching of images in the recognition step due to degraded accuracy in comparison to v1.4.2
Fixes a crash during ETA calculation using Google Lens as the OCR engine
Main Highlight of the v1.5.x release:
Adds support for Google Lens OCR as a second OCR engine for highly accurate text recognition
--> 200+ languages are now supported
VideOCR v1.5.0
Highlights:
Adds support for Google Lens OCR as a second OCR engine for highly accurate text recognition
--> 200+ languages are now supported
Rework of OCR pipeline step
--> Up to 2.5x speed increase over previous version
Add multi-select support to the batch queue
Notes:
Utilizes standalone version of PaddleOCR from https://github.com/timminator/PaddleOCR-Standalone
Utilizes standalone version of Chrome-Lens-OCR from https://github.com/timminator/Chrome-Lens-OCR
Linux version requires glibc 2.35 or newer (can be checked by running ldd --version in a terminal)
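The glibc requirement above can also be checked from Python rather than with ldd --version. The following is a minimal sketch, not part of VideOCR itself: the helper names parse_version and meets_minimum are made up for illustration, and platform.libc_ver() may return empty strings on non-glibc or non-Linux systems.

```python
import platform

MIN_GLIBC = (2, 35)  # minimum version stated in the release notes


def parse_version(text):
    """Turn a string such as '2.35' into a tuple of ints, e.g. (2, 35)."""
    return tuple(int(part) for part in text.split(".") if part.isdigit())


def meets_minimum(version_text, minimum=MIN_GLIBC):
    """Return True if the given glibc version string is at least `minimum`."""
    return parse_version(version_text) >= minimum


if __name__ == "__main__":
    # libc_ver() inspects the running Python executable; the result is
    # ('glibc', '<version>') on typical glibc-based Linux distributions.
    lib, version = platform.libc_ver()
    if lib != "glibc" or not version:
        print("Could not detect glibc (non-glibc or non-Linux system).")
    elif meets_minimum(version):
        print(f"glibc {version} meets the 2.35 requirement.")
    else:
        print(f"glibc {version} is older than 2.35.")
```

Comparing versions as integer tuples avoids the string-comparison trap where "2.9" would sort after "2.35".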
VideOCR v1.4.2
Highlights:
Switch to time-based processing and complete rework of the pipeline
--> 2x Speed increase for Step 1
--> Preprocessing step "Mapping" for VFR videos no longer necessary
--> Enables instantaneous seeking
Implements an interactive crop box with resizing and dragging
Lowered default ocr_image_max_width value to speed up high-res video processing
Add support for subtitle alignment tags (@quentin954)
Adds a GUI scaling feature to manually increase font and button size
Adds a preview of the specified brightness threshold to the Video Tab
Optimized seeking in GUI
Installed disk footprint reduced by 250MB
Fixes:
Some buttons were not correctly updated with the selected language
Fixes wrongly applying an already deleted crop box
Adds a check to not allow editing of already running jobs
Adds validation when adding jobs to queue
Main Highlights of the v1.4.x release:
Integrates the latest PaddleOCR 3.4 version, which extends support to 110 languages and provides increased accuracy for Latin, Cyrillic, Arabic etc.
VideOCR v1.4.1
Highlights:
Lowered default ocr_image_max_width value to speed up high-res video processing
GUI is now properly resized and positioned if its size exceeds the work area
Improve progress update rate to inform the user about the current status
Drop AVX2/FMA requirement
Fixes:
Fixed output for right-to-left languages
Fixed post completion action commands
Fixed a bug where the GUI would not minimize when clicking its taskbar icon
VideOCR v1.4.0
Highlights:
Integrates the latest PaddleOCR 3.4 version, which extends support to 110 languages and provides increased accuracy for Latin, Cyrillic, Arabic etc.
Adds support for batch processing
Preprocessing step for VFR videos (Mapping) and Step 1 are now multithreaded (e.g. up to 7x performance increase on a 6-core 5600X).
Adds a progress bar and a status message for better progress indication
Adds ETA calculation for every step
Tasks can now be paused and resumed
Adds support for post-completion tasks, e.g. shutdown, sleep, lock...
The operating system is now prevented from going to sleep while the operation is ongoing
The CLI version can now be run in parallel to process multiple videos at the same time (thanks @Phr33d0m)
ISO 639 Language codes are now added to the subtitle file names. This allows video players to identify the correct subtitle language
Command Line Arguments for the CLI version are now properly validated to provide a user-friendly error message
Adds an option to normalize Traditional Chinese to Simplified Chinese for "Chinese & English" for better results
Fixes:
Fixed an issue where subtitles could merge incorrectly due to substring matches
Fixed an issue where subtitles were not correctly merged for VFR videos depending on the specified merge gap
Fixed use of a wrongly sized image in the SSIM calculation for subtitle position "any", which reduced performance in previous versions
Post Processing is now disabled by default due to increased accuracy of PaddleOCR
Notes:
Utilizes standalone version of PaddleOCR from https://github.com/timminator/PaddleOCR-Standalone
Linux version requires glibc 2.35 or newer (can be checked by running ldd --version in a terminal)
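The v1.4.0 highlights mention that ISO 639 language codes are added to subtitle file names. The exact naming scheme VideOCR uses is not stated here, so the sketch below only illustrates a common convention (video name, language code, then the subtitle extension). The function name subtitle_path and the .srt extension are assumptions for illustration.

```python
from pathlib import Path


def subtitle_path(video_file, lang_code, extension="srt"):
    """Build a subtitle file name next to the video, e.g. movie.en.srt.

    This follows a widespread naming convention; it is an assumption,
    not VideOCR's documented behavior.
    """
    video = Path(video_file)
    # Replace the video extension with "<lang>.<ext>", keeping the directory.
    return video.with_name(f"{video.stem}.{lang_code.lower()}.{extension}")


if __name__ == "__main__":
    print(subtitle_path("clips/movie.mkv", "en"))
```

Keeping the language code as a separate dot-delimited component lets tools that follow this convention read the language without opening the file.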