Skip to content

Releases: carlosacchi/captiocr

CaptiOCR v0.17.5

25 Apr 09:48

Choose a tag to compare

v0.17.5: ROVER + TF-IDF post-processing pipeline (consolidated 0.17.x release)

The headline change of the 0.17 line is a new post-processing pipeline
that replaces the old binary consensus / hysteresis deduplicator with a
continuous ROVER + TF-IDF novelty-scoring algorithm. The pipeline
landed in 0.17.0 and was refined through 0.17.4. This release ships
the documentation and release-publishing fixes that finalize the cycle
so the new behaviour is correctly described and the supporting policy
files are actually shipped to the public repository.

Documentation

  • Rewrote README around the new pipeline: an end-to-end walkthrough of
    the capture loop (region screenshot -> local Tesseract -> raw frame
    buffer -> post-processing -> save) and a dedicated "How CaptiOCR
    Captures" section detailing ROVER frame-presence weighting,
    sliding-window TF-IDF novelty, dual novel-word tracking (suffix
    difflib alignment for output, set-difference scoring for mid-sentence
    insertions), warmup scaling, end-of-stream flush using the most
    recent frame, the v0.17.4 speaker-label cap and anchored
    novel-word search, and gibberish-detection tuning.
  • Documented the new dialog knobs (Emit Score Threshold, Frequency
    Window, Frame Voting Window) and the scripts/reprocess_capture.py
    utility for re-running the pipeline against raw capture files.
  • Added a "Why CaptiOCR? (No paid AI add-on required)" positioning
    section and a "Security Hardening" section summarising the 0.17.2
    protections.
  • Project Structure section updated to reflect scripts/,
    requirements.lock, SECURITY.md, PRIVACY.md, and the per-user data
    folder layout.

Release publishing

  • publish-release.yml now copies SECURITY.md, PRIVACY.md, and
    requirements.lock to the public repository, so the README links and
    the documented hashed-install flow no longer 404 on github.com.
  • version.txt bumped to 0.17.5 (25/04/2026).

CaptiOCR v0.17.4

24 Apr 20:36

Choose a tag to compare

v0.17.4: Post-processing fidelity fixes

  • Cap speaker-label qualifier regex to <=30 chars and reject candidates
    with sentence punctuation; prevents OCR-mangled frames (e.g. missing
    closing paren) from being treated as a giant "full name" and replacing
    every occurrence of the real speaker label with corrupted text.
  • Anchor _find_novel_words to a match block ending near the end of the
    previous text instead of the last match anywhere; preserves middle
    utterances bracketed by repeating speaker labels (e.g. "Pause", "OK",
    "enabled Workspace") that were silently dropped before.
  • Add scripts/reprocess_capture.py utility to re-run the post-processing
    pipeline against any raw capture file.

CaptiOCR v0.16.3

16 Mar 08:04

Choose a tag to compare

Bump version to 0.16.3 and update release date

CaptiOCR v0.16.1

03 Mar 14:18

Choose a tag to compare

v0.16.1: End-of-stream flush for trailing captured frames

Add end-of-stream flush logic to the fidelity pipeline so that
trailing frames still in the buffer at the end of capture are
emitted rather than silently dropped. The flush candidate goes
through the same no-downgrade, dedup, overlap, and sentence-split
steps (4-7) as regular consensus blocks, ensuring consistent
output quality.

CaptiOCR v0.16.0

25 Feb 15:07

Choose a tag to compare

Merge feature/v0.16-fidelity-fixes: Fidelity-first capture and post-processing

CaptiOCR v0.15.4

24 Feb 14:57

Choose a tag to compare

Enhance release workflow with sensitive content scan and improved commit message handling

CaptiOCR v0.14.9

20 Feb 15:46

Choose a tag to compare

Merge feature/safer-post-processing: Release v0.14.8 through v0.14.9

CaptiOCR v0.14.0

17 Feb 20:40

Choose a tag to compare

Merge feature/security-and-quality-fixes: Release v0.14.0

CaptiOCR v0.13.4

11 Feb 09:07

Choose a tag to compare

Release v0.13.4: Improved sensitivity configuration dialog

Enhancements:

  • Increased dialog height from 450px to 650px for better control visibility
  • Added Reset to Defaults button for easy parameter restoration
  • Improved button layout with visual separation between Reset and Save/Cancel
  • Added settings persistence validation with error feedback
  • Fixed spinbox value capture issue with update_idletasks()
  • Added sensitivity parameters to capture file header for traceability

Bug Fixes:

  • Fixed settings not being saved to disk correctly
  • Fixed dialog truncation hiding controls
  • Fixed spinbox values not updating when quickly clicking Save

All four sensitivity parameters are now properly saved to JSON preferences
and included in capture file headers for full traceability.

Version: 0.13.4

CaptiOCR v0.13.3

10 Feb 16:29

Choose a tag to compare

Remove obsolete capture files from the repository