Drop a script here, or click to browse
Supports PDF, DOCX, PNG, JPG, WebP
Advanced: high-precision transcription (optional)
When on, SayCaps sends the image or scanned PDF through our hosted transcription service (powered by Google Gemini) for a character-accurate result. Uncertain words are flagged as [unclear] for review. Requires sign-in. The default Tesseract path stays free and fully local — this is an opt-in upgrade.
When high-precision is enabled, image data leaves your browser and is sent to the SayCaps transcription service (which forwards it to Google). Sign in with your SayCaps account to enable.
How it works
- PDFs: If the PDF has a text layer, we extract it directly. Scanned PDFs go through OCR.
- DOCX: Read directly from the file structure.
- Images: Preprocessed before OCR for best results.
- Paste: Skip file handling entirely and paste script text directly.
- Flagging: Low-confidence OCR words are highlighted in yellow.
- Script parsing: Detects character names and strips noise like page numbers.
- Export: Drag the text box in the preview to position it, then export. The preview matches the export exactly.