Text / OCR Cleaner

Fix common OCR issues: invisible and Unicode spaces, line breaks and paragraph reflow, punctuation, hyphenation, repeated lines, blank lines, optional line regex, and ordered find/replace. Everything runs in your browser.

Input

Paste raw OCR text or upload a .txt / .md file.

Options
Remove lines (regex)

Runs after the options above. Each line is tested against the pattern; matching lines are removed. Invalid patterns are ignored until fixed.

Presets:
Find and replace

Literal text only (not regex). Multiline strings are allowed. Rules run top to bottom; each step sees the result of the previous one.

  • Rule 1

Cleaned output

Updates as you type when options are enabled.

15 lines (213 characters) → 10 lines (184 characters)

Checkbox options mostly keep line boundaries unless you use reflow or line removal. Find/replace can change line count if your text includes newlines.

Free image & PDF OCR →