Question 1

Is OCRtoMD free?

Accepted Answer

Yes. Converting documents that already have a text layer (PDF, Word, PowerPoint, Excel, HTML, CSV) is free and requires no signup. OCR of scanned pages will come with a generous free allowance, plus a paid tier for large volumes.

Question 2

What file types are supported?

Accepted Answer

PDF, DOCX, PPTX, XLSX/XLS, HTML, CSV, and common image formats (PNG, JPG, GIF, BMP, TIFF, WEBP). The maximum file size is 50 MB.
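
If you want to screen files before uploading, the list and size limit above can be turned into a small local check. This is only a sketch, not part of OCRtoMD: the function name `check_upload` is hypothetical, the `.jpeg`, `.htm`, and `.tif` aliases are assumptions, and "50 MB" is assumed to mean 50 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Extensions from the answer above; .jpeg, .htm and .tif are assumed aliases.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".docx", ".pptx", ".xlsx", ".xls", ".html", ".htm", ".csv",
    ".png", ".jpg", ".jpeg", ".gif", ".bmp", ".tiff", ".tif", ".webp",
}

# Assumption: "50 MB" is read as 50 MiB; the service may count differently.
MAX_BYTES = 50 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    ext = Path(filename).suffix.lower()  # case-insensitive extension match
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, limit is {MAX_BYTES}")
    return problems
```

For example, `check_upload("report.PDF", 1024)` returns an empty list, while a `.txt` file or a 60 MB image returns one problem each. The check only looks at the name and size, so it cannot tell whether a file's contents actually match its extension.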

Question 3

Why convert documents to Markdown for LLMs?

Accepted Answer

Markdown keeps structure (headings, lists, and tables) in a compact, plain-text form that language models parse reliably. Headings give natural chunk boundaries, which improves RAG retrieval quality, and Markdown typically uses fewer tokens than the equivalent HTML or raw PDF text.
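
The token claim can be sanity-checked with a rough comparison of the same small table written in HTML and in Markdown. This is a sketch under stated assumptions: the sample table is made up for illustration, and `rough_token_count` is a crude stand-in that counts words and punctuation marks, not a real model tokenizer, so actual token counts will differ.

```python
import re

# The same two-row table, written in HTML and in Markdown (illustrative sample).
html = """<table>
<thead><tr><th>Format</th><th>Type</th></tr></thead>
<tbody>
<tr><td>PDF</td><td>Document</td></tr>
<tr><td>PNG</td><td>Image</td></tr>
</tbody>
</table>"""

markdown = """| Format | Type |
| --- | --- |
| PDF | Document |
| PNG | Image |"""


def rough_token_count(text: str) -> int:
    """Crude proxy: each run of word characters and each punctuation mark counts as one."""
    return len(re.findall(r"\w+|[^\w\s]", text))


html_count = rough_token_count(html)
markdown_count = rough_token_count(markdown)
print(f"HTML: {html_count} rough tokens, Markdown: {markdown_count} rough tokens")
```

On this sample the Markdown version comes out smaller, because the HTML spends most of its length on opening and closing tags. To measure real savings, swap in the tokenizer of the model you actually use.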

Question 4

Are my uploaded files stored?

Accepted Answer

No long-term storage. Uploads are encrypted, never public, and auto-deleted within about a day; converted output is removed within a week.

Question 5

Does it handle scanned PDFs and images?

Accepted Answer

Born-digital pages convert today. OCR for scanned pages and images (powered by our own document OCR model) is rolling out; until it is available, those pages return a placeholder.

Convert any document into clean, LLM-ready Markdown

How it works

Upload

Parse & OCR

Get Markdown

Supported formats

Why Markdown for LLMs?

Structure survives

Fewer tokens

Better retrieval

Frequently asked questions