Loading...
Loading...

Document scanning and OCR web app that detects a page inside a noisy phone photo, corrects perspective, generates a clean scan-style result, and extracts text in French, English, or both.
Timeline
2026-04 — 2026-04
Technologies
Overview
ClearSheet is a full-stack document digitization product built to take a raw phone photo of a page, detect the document automatically, flatten it, enhance it into a scan-style image, and extract text with OCR. The project combines a modular OpenCV pipeline, a FastAPI backend, and a polished Next.js frontend for upload, review, and export.
4-stage
Pipeline
Preprocess -> detect -> warp -> OCR
FR + EN
OCR
French, English, or bilingual extraction
3
Scan modes
clean, balanced, ocr-optimized
PNG + PDF
Export
PNG and searchable PDF

Processing Pipeline
The backend is intentionally split into small stages so each part of the workflow is testable and easy to evolve. Preprocessing prepares edges and a document mask, contour detection searches for a stable four-point page boundary, the transformer warps and auto-rotates the sheet, and post-processing builds the final scan before OCR ranking chooses the strongest text result.
minAreaRect when the page edges are imperfect.clean, balanced, ocr-optimized) tune CLAHE, sharpening, thresholding, and text expansion differently.
User Experience
The frontend wraps the backend with a practical workspace: upload a source image, choose OCR languages and scan mode, inspect optional debug snapshots, review the transcript, and export the result without leaving the page.
fra, eng, or fra+eng plus the scan profile that best fits readability or OCR.Searchable PDF export
The PDF exporter does more than place an image on a page: it overlays invisible OCR tokens on top of the scan so copied text stays selectable and searchable.
Delivery & QA
docker compose up --build starts the frontend and backend together for a reproducible full-stack local environment.More like this
ai
Research platform for automatic 3D object segmentation and reconstruction from HoloLens 2 data. Combines YOLO, Mask R-CNN, and SAM achieving 80% precision for mixed reality applications.
ai
AI-powered web application for detecting pulmonary diseases from chest X-rays using deep learning. YOLO-based detection with 80% accuracy on the NIH dataset.