Releases: carlosacchi/captiocr
CaptiOCR v0.17.5
v0.17.5: ROVER + TF-IDF post-processing pipeline (consolidated 0.17.x release)
The headline change of the 0.17 line is a new post-processing pipeline
that replaces the old binary consensus / hysteresis deduplicator with a
continuous ROVER + TF-IDF novelty-scoring algorithm. The pipeline
landed in 0.17.0 and was refined through 0.17.4. This release ships
the documentation and release-publishing fixes that finalize the cycle
so the new behaviour is correctly described and the supporting policy
files are actually shipped to the public repository.
Documentation
- Rewrote README around the new pipeline: an end-to-end walkthrough of
the capture loop (region screenshot -> local Tesseract -> raw frame
buffer -> post-processing -> save) and a dedicated "How CaptiOCR
Captures" section detailing ROVER frame-presence weighting,
sliding-window TF-IDF novelty, dual novel-word tracking (suffix
difflib alignment for output, set-difference scoring for mid-sentence
insertions), warmup scaling, end-of-stream flush using the most
recent frame, the v0.17.4 speaker-label cap and anchored
novel-word search, and gibberish-detection tuning.
- Documented the new dialog knobs (Emit Score Threshold, Frequency
Window, Frame Voting Window) and the scripts/reprocess_capture.py
utility for re-running the pipeline against raw capture files.
- Added a "Why CaptiOCR? (No paid AI add-on required)" positioning
section and a "Security Hardening" section summarising the 0.17.2
protections.
- Updated the Project Structure section to reflect scripts/,
requirements.lock, SECURITY.md, PRIVACY.md, and the per-user data
folder layout.
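To make the scoring idea concrete, here is a minimal sketch of how frame-presence weighting and sliding-window novelty could combine into a single emit score. It is not the CaptiOCR implementation: the function names, the smoothed-IDF formula, the averaging step, and the threshold value are all assumptions chosen for illustration. The two inputs are meant to loosely mirror the Frame Voting Window (recent OCR frames) and the Frequency Window (recently emitted lines), and the threshold mirrors the Emit Score Threshold knob.

```python
import math

EMIT_SCORE_THRESHOLD = 0.5  # illustrative value, not the shipped default


def presence_weights(frames):
    """Fraction of frames in the voting window that contain each word."""
    counts = {}
    for frame in frames:
        for word in set(frame.lower().split()):
            counts[word] = counts.get(word, 0) + 1
    return {word: n / len(frames) for word, n in counts.items()}


def novelty(word, recent_lines):
    """Smoothed IDF of `word` over recently emitted lines, scaled to [0, 1]."""
    n = len(recent_lines)
    if n == 0:
        return 1.0  # nothing emitted yet, so every word counts as new
    df = sum(1 for line in recent_lines if word in line.lower().split())
    return math.log((1 + n) / (1 + df)) / math.log(1 + n)


def score_candidate(frames, recent_lines):
    """Mean of presence * novelty over the words of the latest frame."""
    if not frames:
        return 0.0
    weights = presence_weights(frames)
    words = frames[-1].lower().split()
    if not words:
        return 0.0
    total = sum(weights[w] * novelty(w, recent_lines) for w in words)
    return total / len(words)


def should_emit(frames, recent_lines, threshold=EMIT_SCORE_THRESHOLD):
    """Emit only when the continuous score clears the threshold."""
    return score_candidate(frames, recent_lines) >= threshold
```

In this sketch a line seen in every frame and never emitted before scores 1.0, while a line identical to one already emitted scores 0.0. That replaces a binary keep/drop decision with a continuous score.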
Release publishing
- publish-release.yml now copies SECURITY.md, PRIVACY.md, and
requirements.lock to the public repository, so the README links and
the documented hashed-install flow no longer 404 on github.com.
- version.txt bumped to 0.17.5 (25/04/2026).
CaptiOCR v0.17.4
v0.17.4: Post-processing fidelity fixes
- Cap speaker-label qualifier regex to <=30 chars and reject candidates
with sentence punctuation; prevents OCR-mangled frames (e.g. missing
closing paren) from being treated as a giant "full name" and replacing
every occurrence of the real speaker label with corrupted text.
- Anchor _find_novel_words to a match block ending near the end of the
previous text instead of the last match anywhere; preserves middle
utterances bracketed by repeating speaker labels (e.g. "Pause", "OK",
"enabled Workspace") that were silently dropped before.
- Add scripts/reprocess_capture.py utility to re-run the post-processing
pipeline against any raw capture file.
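The anchoring fix can be illustrated with a small difflib-based sketch. The real _find_novel_words is not shown in these notes, so the function below is a simplified, hypothetical stand-in: it works on whitespace-split words, and the `slack` parameter (how far from the end of the previous text a match may end and still count as anchored) is an assumption.

```python
from difflib import SequenceMatcher


def find_novel_words(previous, current, slack=1):
    """Return the words of `current` that follow the overlap with `previous`.

    Only a matching block that ends within `slack` words of the end of
    `previous` is accepted as the anchor. If no block qualifies, the whole
    of `current` is treated as new instead of trusting a stray match
    (for example a repeated speaker label) in the middle of the text.
    """
    prev_words = previous.split()
    curr_words = current.split()
    matcher = SequenceMatcher(None, prev_words, curr_words, autojunk=False)
    # get_matching_blocks() ends with a zero-length sentinel; drop it.
    blocks = [b for b in matcher.get_matching_blocks() if b.size > 0]
    anchored = [b for b in blocks if b.a + b.size >= len(prev_words) - slack]
    if not anchored:
        return curr_words
    last = anchored[-1]
    return curr_words[last.b + last.size:]
```

With previous text "Alice: Pause Bob: hello there everyone" and current text "Alice: OK Bob: enabled Workspace", the only matches are the repeated labels "Alice:" and "Bob:". Neither ends near the end of the previous text, so the sketch keeps the whole current text, including "OK". Taking the last match anywhere ("Bob:") would return only "enabled Workspace".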
CaptiOCR v0.16.3
Bump version to 0.16.3 and update release date
CaptiOCR v0.16.1
v0.16.1: End-of-stream flush for trailing captured frames
Add end-of-stream flush logic to the fidelity pipeline so that
trailing frames still in the buffer at the end of capture are
emitted rather than silently dropped. The flush candidate goes
through the same no-downgrade, dedup, overlap, and sentence-split
steps (4-7) as regular consensus blocks, ensuring consistent
output quality.
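A minimal sketch of the flush idea follows. It assumes a toy consensus rule (a frame is emitted once it repeats a fixed number of times) and a single hypothetical `finalize` helper standing in for the shared post-steps, of which only de-duplication is modelled here. The real pipeline's steps and data structures are not reproduced.

```python
def finalize(candidate, emitted):
    """Hypothetical stand-in for the shared post-steps (dedup only)."""
    text = candidate.strip()
    if text and text not in emitted:
        emitted.append(text)


def run_pipeline(frames, consensus=2):
    """Emit a frame once it repeats `consensus` times; flush leftovers."""
    emitted = []
    buffer = []
    for frame in frames:
        if buffer and frame != buffer[-1]:
            buffer.clear()  # text changed before consensus; start a new run
        buffer.append(frame)
        if len(buffer) >= consensus:
            finalize(buffer[-1], emitted)
            buffer.clear()
    # End-of-stream flush: frames still buffered never reached consensus,
    # but they take the same finalize() path instead of being dropped.
    if buffer:
        finalize(buffer[-1], emitted)
    return emitted
```

For the frames "hello", "hello", "goodbye", the last frame never reaches consensus. Without the final `if buffer:` branch it would be lost. Because the flush candidate goes through the same `finalize` path, a trailing duplicate is still suppressed.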
CaptiOCR v0.16.0
Merge feature/v0.16-fidelity-fixes: Fidelity-first capture and post-processing
CaptiOCR v0.15.4
Enhance release workflow with sensitive content scan and improved commit message handling
CaptiOCR v0.14.9
Merge feature/safer-post-processing: Release v0.14.8 through v0.14.9
CaptiOCR v0.14.0
Merge feature/security-and-quality-fixes: Release v0.14.0
CaptiOCR v0.13.4
Release v0.13.4: Improved sensitivity configuration dialog
Enhancements:
- Increased dialog height from 450px to 650px for better control visibility
- Added Reset to Defaults button for easy parameter restoration
- Improved button layout with visual separation between Reset and Save/Cancel
- Added settings persistence validation with error feedback
- Fixed spinbox value capture issue with update_idletasks()
- Added sensitivity parameters to capture file header for traceability
Bug Fixes:
- Fixed settings not being saved to disk correctly
- Fixed dialog truncation hiding controls
- Fixed spinbox values not updating when quickly clicking Save
All four sensitivity parameters are now properly saved to JSON preferences
and included in capture file headers for full traceability.
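As an illustration of persistence validation and header traceability, the sketch below writes parameters to a JSON file, reads them back to confirm the save, and renders them as header lines. The function names, the read-back check, and the "# name: value" header format are assumptions for illustration. The actual parameter names and file layout used by CaptiOCR are not given in these notes.

```python
import json
from pathlib import Path


def save_preferences(path, params):
    """Write params as JSON, then read them back to confirm the save."""
    path = Path(path)
    path.write_text(json.dumps(params, indent=2), encoding="utf-8")
    saved = json.loads(path.read_text(encoding="utf-8"))
    if saved != params:
        # Surface the failure so the caller can show error feedback.
        raise ValueError(f"preferences at {path} do not match requested values")
    return saved


def capture_header(params):
    """Render the parameters as comment lines for the top of a capture file."""
    return [f"# {name}: {value}" for name, value in sorted(params.items())]
```

A dialog's Save handler could call `save_preferences` inside a try/except and report any `OSError` or `ValueError` to the user, then prepend `capture_header(params)` to each new capture file.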
Version: 0.13.4
CaptiOCR v0.13.3
Remove obsolete capture files from the repository