LEASE AGREEMENT Between the undersigned, JUAN PÉREZ GARCÍA, of legal age, identified with ID card number 12,345,678, hereinafter THE LESSOR, and MARÍA LÓPEZ RUIZ, identified with ID card number 87,654,321, hereinafter THE LESSEE, the present lease agreement is entered into regarding the property located at Calle 45 #12-34, Bogotá [translator note: street address format standard in Colombia], for a monthly rent of $1,500,000 COP [translator note: Colombian Pesos], with a start date of March 1, 2024.
Translate Scanned PDF Documents to English Using AI OCR
Tested prompts for translating scanned PDFs to English, compared across four leading AI models.
You have a scanned PDF — a contract, a certificate, a research paper, or a medical record — and it's in a language you can't read. Unlike a regular PDF with selectable text, a scanned document is essentially a photograph of a page. Standard translation tools fail immediately because there's no text to copy and paste. You need a two-step solution: optical character recognition (OCR) to extract the text from the image, then machine translation to convert it into English.
AI models with vision capabilities can handle both steps in a single prompt. You upload the scanned page, describe what you need, and the model reads the image, identifies the language, extracts the content, and returns an English translation — without you manually retyping a single word.
This page shows you exactly how to prompt that workflow, compares how four leading AI models handle it, and tells you what to watch for when accuracy matters. Whether you're dealing with a one-page foreign invoice or a multi-page legal document, the approach below will get you to readable English faster than any manual workaround.
When to use this
This approach works best when you have a physical document that was scanned or photographed, making the text unselectable in a PDF viewer. It's the right tool when you need a quick, readable English version and the source language is printed clearly. Handwritten documents may also work depending on legibility.
- Translating a foreign-language contract, lease agreement, or legal notice you received as a scanned PDF
- Converting a scanned academic paper, journal article, or thesis written in German, Chinese, French, or another language
- Reading a scanned medical record, lab report, or prescription from a foreign hospital or clinic
- Translating a scanned immigration document, birth certificate, or official government form
- Understanding a foreign invoice, customs declaration, or shipping manifest received as a photo or scanned attachment
When this format breaks down
- The scan quality is very poor — blurry, skewed, or low-resolution images cause OCR errors that cascade into mistranslations, and you won't always know which parts are wrong
- The document contains dense technical, legal, or medical terminology where a certified human translation is legally required; AI output is generally not accepted for immigration filings, court submissions, or other uses that call for a certified translation
- The source language uses a script that the model handles poorly, such as some handwritten Arabic, cursive Farsi, or minority scripts — always verify the OCR output before acting on the translation
- The PDF is already a text-based PDF with selectable content — in that case, copy the text directly and use a dedicated translation tool, which will be faster and more accurate than image-based OCR
The prompt we tested
You are an expert document translator with OCR capabilities. Your task is to accurately translate scanned PDF content into fluent, natural English while preserving the original document's structure, formatting, and meaning.
Instructions: Detect the source language automatically and translate the full content into clear, natural English, preserving the original layout (headings, paragraphs, lists, tables, signatures, stamps). Keep proper nouns, dates, numbers, and official terminology intact, and add brief [translator note: ...] annotations only when cultural or legal context is essential for understanding.
Scanned PDF content to translate:
CONTRATO DE ARRENDAMIENTO
Entre los suscritos, JUAN PÉREZ GARCÍA, mayor de edad, identificado con cédula número 12.345.678, en adelante EL ARRENDADOR, y MARÍA LÓPEZ RUIZ, identificada con cédula número 87.654.321, en adelante LA ARRENDATARIA, se celebra el presente contrato de arrendamiento sobre el inmueble ubicado en Calle 45 #12-34, Bogotá, por un canon mensual de $1.500.000 COP, con fecha de inicio 01 de marzo de 2024.
Provide the English translation below, preserving headings, paragraphs, bullet points, tables, and any structural elements from the original document. If any text is illegible or unclear due to scan quality, mark it as [illegible] and continue with the rest.
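To reuse the tested prompt across documents, it can help to keep the fixed instructions in a template and drop in the page content. The following is a minimal sketch, not part of the original test setup: the wording is copied from the prompt above, while the helper name `build_translation_prompt`, the `$content` placeholder, and the use of `string.Template` are assumptions made for illustration.

```python
from string import Template

# Wording copied from the tested prompt above. The helper name, the $content
# placeholder, and the choice of string.Template are illustrative assumptions.
PROMPT_TEMPLATE = Template(
    "You are an expert document translator with OCR capabilities. "
    "Your task is to accurately translate scanned PDF content into fluent, "
    "natural English while preserving the original document's structure, "
    "formatting, and meaning.\n\n"
    "Instructions: Detect the source language automatically and translate the "
    "full content into clear, natural English, preserving the original layout "
    "(headings, paragraphs, lists, tables, signatures, stamps). Keep proper "
    "nouns, dates, numbers, and official terminology intact, and add brief "
    "[translator note: ...] annotations only when cultural or legal context is "
    "essential for understanding.\n\n"
    "Scanned PDF content to translate:\n$content\n\n"
    "Provide the English translation below, preserving headings, paragraphs, "
    "bullet points, tables, and any structural elements from the original "
    "document. If any text is illegible or unclear due to scan quality, mark "
    "it as [illegible] and continue with the rest."
)


def build_translation_prompt(content: str) -> str:
    """Insert the page content into the fixed instructions."""
    cleaned = content.strip()
    if not cleaned:
        raise ValueError("content must not be empty")
    # substitute() does not re-scan the inserted value, so currency amounts
    # such as "$1.500.000 COP" inside the content are left untouched.
    return PROMPT_TEMPLATE.substitute(content=cleaned)


if __name__ == "__main__":
    sample = "CONTRATO DE ARRENDAMIENTO ... por un canon mensual de $1.500.000 COP"
    print(build_translation_prompt(sample))
```

When the model receives the scanned page as an image rather than as pasted text, the same instructions apply and the content slot can simply point to the attached image.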
Model comparison
| Model | Quality | Cost | Latency | Tokens |
|---|---|---|---|---|
| Claude Opus 4.7 | 7 | $0.0234 | 3.0s | 498 / 213 |
| Claude Haiku 4.5 | 7 | $0.0012 | 1.6s | 341 / 163 |
| GPT-5.4 | 7 | $0.0029 | 2.6s | 306 / 144 |
| Grok 4.1 Fast Reasoning | 7 | $0.0002 | 4.7s | 457 / 121 |
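To get a rough feel for what these per-request figures mean at volume, the arithmetic can be sketched as below. This rests on a loud assumption: that every page costs and takes about as much as the one-paragraph test prompt, which a full scanned page (with image input and more text) is unlikely to match. Treat the result as an order-of-magnitude comparison between models, not a quote. The `project` helper is hypothetical.

```python
# Per-request cost (USD) and latency (seconds), copied from the table above.
RESULTS = {
    "Claude Opus 4.7": (0.0234, 3.0),
    "Claude Haiku 4.5": (0.0012, 1.6),
    "GPT-5.4": (0.0029, 2.6),
    "Grok 4.1 Fast Reasoning": (0.0002, 4.7),
}


def project(pages: int) -> dict[str, tuple[float, float]]:
    """Scale the table figures linearly to `pages` requests.

    Assumes one request per page, identical cost per page, and pages
    processed one after another (no parallel requests).
    Returns {model: (total_cost_usd, total_seconds)}.
    """
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return {
        model: (round(cost * pages, 4), round(latency * pages, 1))
        for model, (cost, latency) in RESULTS.items()
    }


if __name__ == "__main__":
    for model, (usd, seconds) in project(1000).items():
        print(f"{model}: ${usd:.2f}, {seconds / 60:.1f} min")
```

Since all four models received the same quality score in this test, the cost and latency columns are what separate them here.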
Outputs
# LEASE AGREEMENT Between the undersigned, JUAN PÉREZ GARCÍA, of legal age, identified with identification number 12.345.678, hereinafter THE LANDLORD, and MARÍA LÓPEZ RUIZ, identified with identification number 87.654.321, hereinafter THE TENANT, the present lease agreement is executed regarding the property located at 45th Street #12-34, Bogotá, for a monthly rent of $1,500,000 COP, with a start date of March 1, 2024. --- [Translator note: COP refers to Colombian Pesos, the official currency of Colombia. The identification numbers mentioned are Colombian national identification (cédula) numbers.]
RENTAL AGREEMENT Between the undersigned, JUAN PÉREZ GARCÍA, of legal age, identified with national ID card number 12,345,678, hereinafter THE LANDLORD, and MARÍA LÓPEZ RUIZ, identified with national ID card number 87,654,321, hereinafter THE TENANT, this rental agreement is entered into regarding the property located at 45 #12-34 Street, Bogotá, for a monthly rent of COP 1,500,000, with a start date of March 1, 2024. [translator note: “cédula” refers to the Colombian national identification card.]
LEASE AGREEMENT Entered into by and between the undersigned, JUAN PÉREZ GARCÍA, of legal age, identified with ID number 12.345.678, hereinafter referred to as THE LANDLORD, and MARÍA LÓPEZ RUIZ, identified with ID number 87.654.321, hereinafter referred to as THE TENANT, this lease agreement is executed for the property located at Calle 45 #12-34, Bogotá, for a monthly rent of $1,500,000 COP, with a start date of March 1, 2024.
What makes these work
01. Ask for structure preservation
Tell the model to keep the original document's layout — headings, numbered lists, tables. Scanned documents often have formatting that carries meaning, especially in legal or medical contexts. A prompt like 'preserve the section headings and paragraph breaks' produces output that maps back to the original, making it easier to reference specific sections.
02. Specify the source language if you know it
Even though vision models can detect language automatically, naming the source language in your prompt reduces ambiguity and speeds up processing. If you're unsure, say 'the document appears to be in Spanish or Portuguese — please identify the language and translate to English.' This prevents the model from making a silent wrong assumption.
03. Request OCR output alongside the translation
For important documents, ask the model to first show you the extracted original text, then the English translation. This two-step output lets you spot OCR errors — if the extracted text looks garbled, the translation will be unreliable. It also gives you the source text for a human reviewer to verify against.
04. Flag sections with low confidence
Add 'if any section is unclear or you are uncertain about the OCR reading, mark it with [UNCLEAR]' to your prompt. Poor print quality, smudges, or unusual fonts cause errors that the model would otherwise silently guess through. Getting explicit uncertainty flags lets you know exactly where to focus human review.
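Once the model returns flagged output, the flags still have to be found. A small sketch of that step follows, assuming the model used the literal markers `[UNCLEAR]` or `[illegible]` mentioned on this page; the function name `find_flags` and the returned dictionary layout are made up for illustration.

```python
import re

# Matches the two markers this page asks the model to use. Case-insensitive,
# because models do not always reproduce the exact capitalization.
FLAG_PATTERN = re.compile(r"\[(?:UNCLEAR|illegible)\]", re.IGNORECASE)


def find_flags(translation: str, context: int = 30) -> list[dict]:
    """Return one entry per uncertainty flag, with its line number and
    a few characters of surrounding text to help locate it in the scan."""
    hits = []
    for line_no, line in enumerate(translation.splitlines(), start=1):
        for match in FLAG_PATTERN.finditer(line):
            start = max(0, match.start() - context)
            end = min(len(line), match.end() + context)
            hits.append(
                {"line": line_no, "flag": match.group(0), "context": line[start:end]}
            )
    return hits


if __name__ == "__main__":
    sample = (
        "Monthly rent: [UNCLEAR] euros\n"
        "Notice period: three months\n"
        "Signed by [illegible]"
    )
    for hit in find_flags(sample):
        print(hit)
```

An empty result only means the model raised no flags. It does not show that the reading was correct, so key figures still need a check against the image.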
More example scenarios
I have a scanned PDF of a residential lease agreement written in German. The document is printed clearly at standard resolution. Please extract all the text you can see in this image and translate it fully into English. Preserve section headings and paragraph structure as closely as possible.
Section 3 — Rental Amount and Due Date: The monthly rent is 1,150 euros, payable in advance by the third business day of each month to the landlord's account. Late payments will incur a fee of 5% above the base interest rate. Section 4 — Notice Period: Either party may terminate this agreement with three months written notice prior to the end of a rental year.
This is a scanned lab report from a hospital in China. The text is printed in Simplified Chinese. Please read the image, extract the test names, reference ranges, and my results, and translate everything into English. Format the output as a simple table with columns: Test Name, My Result, Reference Range, Unit.
Test Name: Fasting Blood Glucose | My Result: 5.8 | Reference Range: 3.9–6.1 | Unit: mmol/L. Test Name: Total Cholesterol | My Result: 4.92 | Reference Range: 0–5.20 | Unit: mmol/L. Test Name: Hemoglobin | My Result: 138 | Reference Range: 130–175 | Unit: g/L.
Attached is a scanned invoice from a French supplier. Extract all text from the image and translate it to English. I specifically need: the vendor name, invoice number, line items with quantities and prices, total amount due, and payment terms.
Vendor: Dupont Matériaux Industriels SA. Invoice No: FAC-2024-0881. Line Items: Steel brackets x 200 units at €4.50 each — €900.00; Delivery charge — €75.00. Total Due: €975.00 (VAT included at 20%). Payment Terms: Net 30 days from invoice date. Bank transfer only.
I have a scanned copy of a Spanish government document related to residency. Please extract the text from this image and translate it to English. Flag any fields that appear to be personal data placeholders so I know what information I would need to fill in.
Title: Certificate of EU Citizen Registration. This document certifies that [FULL NAME — personal data field] of nationality [NATIONALITY — personal data field], holding identification number [NIE/PASSPORT — personal data field], has been registered as a resident in the municipality of [MUNICIPALITY — personal data field] as of [DATE — personal data field]. Issued by the Local Population Registry.
This is a scanned safety and installation manual in Japanese for an industrial air purifier. Please OCR and translate the safety warnings section and the installation steps into English. Keep the numbered list format for the installation steps.
Safety Warnings: Do not operate this unit near flammable gases or in wet environments. Keep out of reach of children under 10. Disconnect power before performing any maintenance. Installation Steps: 1. Remove all packaging materials and inspect the unit for shipping damage. 2. Mount the bracket on a wall stud at least 2 meters above floor level. 3. Connect the power cable to a grounded 100V outlet. 4. Press and hold the power button for 3 seconds to initialize.
Common mistakes to avoid
- Uploading low-resolution scans
Images below 150 DPI produce poor OCR results, and the translation errors that follow are invisible unless you check the source text. Rescan at 300 DPI minimum or use your phone's document scanner app, which automatically optimizes image quality. A bad input guarantees a bad translation regardless of how good the AI model is.
- Skipping verification on critical documents
AI OCR can silently misread numbers, dates, and proper nouns — the fields that matter most in legal and financial documents. A lease that says '3 months notice' in the original might become '8 months notice' after an OCR error. Always read the output against key figures in the original image before acting on any translated document.
- Using this for certified translation needs
Many legal processes, such as USCIS immigration filings, court evidence submissions, and academic credential evaluations, require a translation certified by a human translator, often with a signature and seal. An AI-generated translation, no matter how accurate, is generally not accepted for these purposes. Using it as a substitute can result in rejected applications or legal complications.
- Sending multi-page PDFs as a single prompt without chunking
Long scanned documents fed as a single upload can exceed the model's context window or cause the model to skip or summarize later pages. For documents longer than five to eight pages, split them into sections and translate each chunk separately. Then combine the outputs. This produces more reliable and complete translations.
- Not specifying the translation register or audience
Without guidance, models default to neutral, sometimes overly literal translation. A medical record translated for a patient needs plain language; a contract translated for a lawyer should retain formal legal phrasing. Add one sentence to your prompt: 'translate for a non-specialist reader' or 'preserve legal terminology where appropriate' to get output that actually fits the use case.
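Of the mistakes above, skipped verification is the easiest to automate in part. The sketch below compares the numbers in the extracted source text with the numbers in the translation, ignoring thousands separators and leading zeros so that `12.345.678` and `12,345,678` count as the same value. It is an illustrative check under stated assumptions, with hypothetical function names. It only catches numbers that changed between extraction and translation: a digit misread during OCR appears on both sides, so key figures still need a look against the original image. Numbers written as words are not covered.

```python
import re
from collections import Counter

# A run of digits, optionally grouped by "." or "," (e.g. 1.500.000 or 4,92).
NUMBER = re.compile(r"\d+(?:[.,]\d+)*")


def numeric_tokens(text: str) -> Counter:
    """Collect the numbers in `text`, normalized to bare digit strings."""
    tokens = []
    for raw in NUMBER.findall(text):
        # Drop separators and leading zeros: "01" -> "1", "1.500.000" -> "1500000".
        digits = re.sub(r"[.,]", "", raw).lstrip("0") or "0"
        tokens.append(digits)
    return Counter(tokens)


def compare_numbers(source: str, translation: str) -> dict[str, list[str]]:
    """Report numbers that appear on one side but not the other."""
    src, dst = numeric_tokens(source), numeric_tokens(translation)
    return {
        "missing_in_translation": sorted((src - dst).elements()),
        "unexpected_in_translation": sorted((dst - src).elements()),
    }


if __name__ == "__main__":
    source = (
        "cédula número 12.345.678, Calle 45 #12-34, canon mensual de "
        "$1.500.000 COP, con fecha de inicio 01 de marzo de 2024"
    )
    translation = (
        "ID card number 12,345,678, Calle 45 #12-34, monthly rent of "
        "$1,500,000 COP, with a start date of March 1, 2024"
    )
    print(compare_numbers(source, translation))
```

Any entry in either list is a prompt to reread that figure in the scan before relying on the translation.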
Frequently asked questions
Can AI translate a scanned PDF without me typing out the text manually?
Yes. AI models with vision capabilities — such as GPT-4o, Claude 3.5, and Gemini 1.5 Pro — can read image-based PDFs directly. You upload the scanned page as an image and prompt the model to extract and translate the text. No manual retyping is needed. The quality depends on scan resolution and how clearly the original text is printed.
What languages can AI translate scanned PDFs from?
Major AI vision models support translation from most widely spoken languages, including Spanish, French, German, Portuguese, Italian, Chinese (Simplified and Traditional), Japanese, Korean, Arabic, Russian, and Hindi. Support for less common or regional languages varies by model. For rare scripts or languages, check the model's documentation or test a sample page first.
Is AI translation of scanned documents accurate enough to use for legal or medical purposes?
AI translation is useful for understanding the content of a document, but it is not a substitute for certified human translation in legal or medical contexts. Immigration filings, court submissions, and certified credential translations legally require a human translator's certification. Use AI output to understand what a document says, then hire a certified translator when formal submission is required.
How do I translate a scanned PDF that is password protected?
You need to remove the password protection before the AI can process the file. If you have the password, open the PDF, remove the restriction in your PDF software, and save an unprotected copy. If you do not have the password, do not try to bypass the protection, which may be unlawful; ask the document owner to provide an unlocked version or a translation.
What is the best free tool to translate a scanned PDF to English?
Google Translate's document upload feature handles some scanned PDFs using built-in OCR. Adobe Acrobat's free tier offers OCR with limited exports. For more accurate results, AI chat tools like ChatGPT (GPT-4o) or Google Gemini with image upload handle scanned pages well, and both have free usage tiers. Quality varies — test a page before committing to a method for a long document.
Can I translate a handwritten scanned document to English with AI?
AI can attempt to read handwritten documents, but accuracy is significantly lower than for printed text. Neat, consistent handwriting in common languages like Spanish or French may produce usable results. Cursive, stylized, or inconsistent handwriting in less common languages will often produce errors or [UNCLEAR] flags. For important handwritten documents, a human translator is more reliable.