Build Language Learning Quizzes with AI Prompts

Tested prompts for ai quiz generator for language learning compared across 5 leading AI models.

If you're building a language learning app, tutoring platform, or self-study curriculum, generating quiz questions by hand is the bottleneck that slows everything down. Writing 20 vocabulary questions for Spanish B1 learners, checking them for accuracy, balancing difficulty, and formatting them consistently can take hours. An AI quiz generator cuts that to minutes.

The core problem is specificity. Generic quiz tools give you generic questions. What you actually need is questions calibrated to a target language, a proficiency level, a grammar point, or a vocabulary set. AI models can do that when you prompt them correctly, but the prompt structure matters more than most people expect.

This page shows you the exact prompt format that works, compares how four major AI models handle it, and gives you enough context to adapt the approach for your own language learning use case, whether that's CEFR-aligned vocabulary drills, fill-in-the-blank grammar practice, reading comprehension checks, or audio-script MCQs.

When to use this

This approach fits best when you need structured, repeatable quiz output across a defined language scope. If you know your target language, proficiency band, and question type going in, AI generation is faster and more consistent than manual authoring and scales without adding headcount.

Building a vocabulary quiz bank for a specific CEFR level (A1 through C2) in any language
Generating grammar-focused questions around a single structure, such as subjunctive mood or verb aspect
Creating reading comprehension questions from a passage or script you already have
Producing spaced-repetition flashcard questions for a language learning app at volume
Drafting placement test items across multiple proficiency levels quickly for a tutoring platform

When this format breaks down

When the target language is low-resource or minority and the model has limited training data in that language, output accuracy drops significantly and errors can be hard to catch without an expert reviewer
When you need fully validated, psychometrically calibrated test items for high-stakes assessments like official certifications, AI output requires expert linguistic review before use
When the quiz must reflect a very specific proprietary curriculum, syllabus scope, or lesson sequence the model has no access to, outputs will drift outside your intended content boundaries
When your learners are at beginner level and native-speaker nuance in distractors matters critically, the model may produce plausible-sounding wrong answers that are actually correct in some contexts

The prompt we tested

You are an expert language learning quiz designer. Generate a set of quiz questions based on the learner's input below.

Follow these rules strictly:
Create 10 varied questions (mix of multiple choice, fill-in-the-blank, and translation) targeting the specified language, level, and topic; number each question, provide 4 options for multiple choice (A-D), and include a separate 'Answer Key' section with the correct answer and a one-sentence explanation for each. Keep vocabulary and grammar appropriate to the stated CEFR level, and write all instructions in English while keeping target-language content in the target language.

Learner input:
Target language: Spanish. Learner level: A2 (beginner-intermediate). Topic: past tense (preterite) with common travel vocabulary like airport, hotel, and restaurant. I want to practice recognizing correct verb conjugations and translating short sentences about a recent trip.

Produce a complete, ready-to-use quiz with a clear answer key at the end.

Model comparison

Model	Quality	Cost	Latency	Tokens
Claude Opus 4.7	Not run yet
Claude Haiku 4.5	Not run yet
GPT-5.4	Not run yet
Gemini 2.5 Pro	—	$0.0044	19.8s	211 / 831
Grok 4.1 Fast Reasoning	—	$0.0005	12.3s	353 / 887

Outputs

Gemini 2.5 Pro

Of course! Here is a Spanish quiz for an A2 level learner, focusing on the preterite tense and travel vocabulary.

---

### **Spanish A2 Quiz: My Recent Trip (Preterite Tense)**

**Instructions:** Answer the following 10 questions to test your knowledge of the preterite past tense and common travel vocabulary.

**1. (Multiple Choice)** Choose the correct verb conjugation to complete the sentence.
El avión ________ tarde al aeropuerto de Madrid.
A. llegamos
B. llegaste
C. llegó
D. llegué

**2. (Fill-in-the-Blank)** Complete the sentence with the correct preterite form of the verb in parentheses.
Anoche, mi amigo y yo ________ (comer) tapas en un restaurante excelente.

**3. (Translation)** Translate the following sentence into Spanish.
I booked a room for two nights.

**4. (Multiple Choice)** Choose the correct verb conjugation to complete the sentence.
¿Por qué no ________ (tú) la maleta?
A. abriste
B. abrí
C. abrió
D. abrieron

**5. (Fill-in-the-Blank)** Complete the sentence with the correct preterite form of the verb in parentheses.
El año pasado, nosotros ________ (ir) a Colombia de vacaciones.

**6. (Multiple Choice)** Choose the correct verb to complete the sentence.
Ayer, yo ________ la cuenta con mi tarjeta de crédito en el hotel.
A. comí
B. pagué
C. viví
D. bebí

**7. (Translation)** Translate the following sentence into English.
El botones subió el equipaje a la habitación.

**8. (Fill-in-the-Blank)** Complete the sentence with the correct preterite form of the verb in parentheses.
Ustedes ________ (tener) un problema con el vuelo.

**9. (Multiple Choice)** Choose the correct sentence that describes a past action.
A. Yo hago la reserva para el hotel.
B. Yo hice la reserva para el hotel.
C. Yo hacía la reserva para el hotel.
D. Yo voy a hacer la reserva para el hotel.

**10. (Translation)** Translate the following sentence into Spanish.
They saw the monuments and took many photos.

---

### **Answer Key**

**1. C. llegó**
**Explanation:** The subject is "el avión" (the plane), which uses the third-person singular (él/ella/usted) form of the verb.

**2. comimos**
**Explanation:** The subject is "mi amigo y yo," which is "nosotros" (we), and the correct preterite form for -er verbs is "-imos."

**3. Reservé una habitación para dos noches.**
**Explanation:** This is the correct "yo" (I) form of the regular -ar verb "reservar" in the preterite tense.

**4. A. abriste**
**Explanation:** The subject is "tú" (you, informal), so the correct preterite ending for the -ir verb "abrir" is "-iste."

**5. fuimos**
**Explanation:** The verb "ir" (to go) is irregular in the preterite, and "fuimos" is the correct form for "nosotros" (we).

**6. B. pagué**
**Explanation:** "Pagué" (I paid) is the logical verb for the context of "la cuenta" (the bill), and it is the correct preterite "yo" form of "pagar."

**7. The bellhop took the luggage up to the room.**
**Explanation:** This is a direct translation of the sentence, using the past tense verb "subió" (he/she took up).

**8. tuvieron

Grok 4.1 Fast Reasoning

# Spanish A2 Quiz: Preterite Tense with Travel Vocabulary

Practice recognizing correct preterite conjugations and translating short sentences about a recent trip. Use common travel words like *aeropuerto* (airport), *hotel*, and *restaurante* (restaurant).

### Questions

1. **Multiple Choice**: Choose the correct preterite form of *llegar* to complete the sentence: Yo _____ al aeropuerto ayer.  
   A) llegué  
   B) llegaste  
   C) llegó  
   D) llegamos  

2. **Fill-in-the-Blank**: Conjugate *viajar* in the preterite tense: Nosotros _____(viajar) a España el mes pasado.  

3. **Translation (English to Spanish)**: Translate this sentence about a trip: "I took a taxi to the hotel."  

4. **Multiple Choice**: Which sentence has the correct preterite conjugation of *comer*?  
   A) Yo comí en el restaurante.  
   B) Yo comió en el restaurante.  
   C) Yo come en el restaurante.  
   D) Yo comemos en el restaurante.  

5. **Fill-in-the-Blank**: Conjugate *dormir* in the preterite tense: Ellos _____(dormir) en el hotel.  

6. **Translation (Spanish to English)**: Translate this sentence: "Fuimos al aeropuerto temprano."  

7. **Multiple Choice**: Choose the correct preterite form of *ir* to complete: Mi amigo _____ al restaurante conmigo.  
   A) voy  
   B) fui  
   C) iba  
   D) iré  

8. **Fill-in-the-Blank**: Conjugate *reservar* in the preterite tense: ¿Tú _____(reservar) una habitación en el hotel?  

9. **Multiple Choice**: Which is the correct translation of "She paid the bill at the restaurant"?  
   A) Ella pagó la cuenta en el restaurante.  
   B) Ella paga la cuenta en el restaurante.  
   C) Ella pagaba la cuenta en el restaurante.  
   D) Ella pagará la cuenta en el restaurante.  

10. **Translation (English to Spanish)**: Translate this sentence about a trip: "We arrived at the airport at 10 a.m."  

## Answer Key

1. **A) llegué**  
   *Llegué* is the correct first-person singular preterite form of the regular -ar verb *llegar*.  

2. **viajamos**  
   *Viajamos* is the first-person plural preterite form of the regular -ar verb *viajar*.  

3. **Tomé un taxi al hotel.** (or *Yo tomé un taxi al hotel.*)  
   *Tomé* is the first-person singular preterite form of *tomar*, used for taking a taxi.  

4. **A) Yo comí en el restaurante.**  
   *Comí* is the correct first-person singular preterite form of the regular -er verb *comer*.  

5. **durmieron**  
   *Durmieron* is the third-person plural preterite form of the -ir stem-changing verb *dormir*.  

6. **We went to the airport early.** (or *We went to the airport in the morning.*)  
   *Fuimos* is the first-person plural preterite of the irregular verb *ir*.  

7. **B) fui**  
   *Fui* is the correct third-person singular preterite form of the irregular verb *ir*.  

8. **reservaste**  
   *Reservaste* is the second-person singular preterite form of the regular -ar verb *reservar*.  

9. **A) Ella pagó la cuenta en el restaurante.**  
   *Pagó* is the correct third-person singular preterite form of the regular -ar verb *pagar*.  

10. **Llegamos al aeropuerto a las 10 de la mañana.** (or *Llegamos al aeropuerto a las 10.*)  
   *Llegamos* is the first-person plural preterite form of the regular -ar verb *llegar*.

What makes these work

01

Lock the proficiency level first
Always specify CEFR level (A1-C2) or an equivalent framework like HSK or JLPT before anything else in your prompt. Models default to an ambiguous intermediate difficulty when no level is stated, which makes outputs inconsistent and hard to reuse in a structured curriculum. Named frameworks also help the model calibrate vocabulary range and grammar complexity accurately.
02

Specify question type and format explicitly
MCQ, fill-in-the-blank, true/false, matching, and short-answer all require different output structures. State the format, the number of options if applicable, and whether you need an answer key and explanation. Vague prompts produce vague formats that require reformatting before use, which defeats the time-saving purpose.
03

Constrain the grammar or vocabulary scope
Narrow prompts outperform broad ones. Asking for questions about 'the subjunctive' produces better output than asking for 'grammar questions.' If you have a specific word list or grammar table, paste it into the prompt and instruct the model to draw only from that input. This keeps quiz banks aligned to your actual lesson content.
04

Request distractors that target real learner errors
Ask the model to make wrong answer options reflect common mistakes made by speakers of the learner's native language. A Spanish speaker learning English makes different errors than a Japanese speaker. This instruction dramatically improves quiz diagnostic value and separates AI-generated quizzes from random distractor noise.

More example scenarios

#01 · Spanish B1 vocabulary MCQ bank

Input

Generate 5 multiple-choice vocabulary questions for Spanish B1 learners. Each question should test a single word in context using a sentence with a blank. Provide 4 options (A-D), one correct answer, and a one-line explanation. Topic: daily routines. Format as a numbered list.

Expected output

1. Todas las mananas, Maria ______ a las siete. A) despierta B) duerme C) estudia D) cocina. Answer: A. 'Despertarse' means to wake up, fitting the morning routine context. (Plus 4 more items in the same format covering words like ducharse, desayunar, vestirse, and salir.)

#02 · French grammar drill on passe compose vs imparfait

Input

Create 6 fill-in-the-blank questions for B2 French learners practicing the difference between passe compose and imparfait. Each sentence should require the learner to choose the correct tense and form of the verb given in parentheses. Include the answer key and a brief grammatical reason for each.

Expected output

1. Quand j'etais enfant, je (jouer) au foot tous les weekends. Answer: jouais (imparfait, habitual past action). 2. Hier soir, elle (finir) son rapport a minuit. Answer: a fini (passe compose, completed single event). (Pattern continues for 6 items with varied contexts.)

#03 · Mandarin tonal pronunciation awareness quiz

Input

Write 4 multiple-choice questions testing HSK 2 learners on distinguishing Mandarin tones in minimal pairs. Each question presents a pinyin word with tone mark and asks for the correct English meaning from 4 options. Include words commonly confused by English speakers.

Expected output

1. What does 'mai3' (with falling-rising tone) mean? A) to sell B) to buy C) to come D) to go. Answer: B, to buy. Note: mai4 (falling tone) means to sell, a common confusion point for HSK 2 learners.

#04 · ESL reading comprehension questions from a provided passage

Input

Based on the following A2-level English passage about a visit to a supermarket (150 words, provided below), write 4 comprehension questions: 2 literal recall questions and 2 inference questions. Format with the question, question type label, and correct answer. [Passage inserted here]

Expected output

Q1 (Literal): Where did Tom go after work? Answer: He went to the supermarket. Q2 (Literal): What did Tom forget to buy? Answer: Milk. Q3 (Inference): Why did Tom check his phone before leaving the store? Answer: Likely to review his shopping list, as the passage implies he was worried about forgetting items. Q4 (Inference): How does Tom feel about cooking? Answer: He enjoys it, suggested by his careful ingredient selection.

#05 · Italian placement test items across two levels

Input

Generate 3 grammar questions for A2 Italian learners and 3 for B1 Italian learners. A2 should focus on present tense regular verbs. B1 should focus on the congiuntivo presente. Each question is a sentence completion. Label the level and include an answer key.

Expected output

A2 Q1: Io ______ (parlare) italiano ogni giorno. Answer: parlo. A2 Q2: Lei ______ (scrivere) una lettera. Answer: scrive. B1 Q1: E importante che tu ______ (venire) in orario. Answer: venga. B1 Q2: Voglio che lui ______ (sapere) la verita. Answer: sappia. (Format continues for all 6 items.)

Common mistakes to avoid

Skipping proficiency calibration
Generating questions without specifying a level produces a mix of difficulty that is useless for structured learning. A quiz that randomly jumps from A1 to C1 vocabulary in the same set cannot be placed in a curriculum or used for level-appropriate assessment. Always anchor to a recognized framework.
Trusting output without spot-checking target language
AI models occasionally produce grammatically incorrect sentences in the target language, especially in less-common languages or when generating distractor options. A learner encountering a wrong answer that is actually grammatically valid will learn incorrect patterns. Spot-check every generated set before publishing.
Generating too many questions in a single prompt
Asking for 50 questions at once causes quality to degrade in the second half of the output. Models start recycling structures, reusing vocabulary, or drifting from the original constraints. Generate in batches of 5-10 and concatenate results for better consistency across a large question bank.
Ignoring native language interference in distractor design
Generic distractors do not test for the errors real learners make. If you do not specify the learner's L1 background, the model writes distractors that are obviously wrong to any speaker, making questions too easy and reducing diagnostic usefulness. State the learner's first language in the prompt.
Using the output format directly without templating
If your platform or LMS requires a specific import format like QTI, CSV, or Aiken, ask the model to output in that format from the start. Reformatting 100 questions after generation costs as much time as writing them manually. Define the output schema in the prompt and validate one batch before generating at scale.

Related queries

Frequently asked questions

Which AI model is best for generating language learning quiz questions?

GPT-4o and Claude 3.5 Sonnet both perform well for major European languages and handle CEFR-level calibration reliably when prompted correctly. For Asian languages like Mandarin, Japanese, or Korean, GPT-4o tends to produce more accurate characters and tone notations. For lower-resource languages, verify outputs with a native speaker regardless of which model you use.

Can I use AI to generate quiz questions in a language I don't speak?

Yes, but with a critical caveat: you cannot verify the output yourself. Models do make grammatical and lexical errors in non-English languages. If you are building content for learners in a language you do not know, budget for at least a spot-check review by a qualified speaker before the questions go live. For high-stakes use, full expert review is necessary.

How do I make AI quiz questions align with a specific textbook or curriculum?

Paste the relevant vocabulary list, grammar table, or lesson scope directly into the prompt and instruct the model to draw exclusively from that material. You can also paste example questions from the textbook and ask the model to generate new questions in the same style and difficulty. This grounding significantly improves curriculum alignment.

What question types work best for vocabulary acquisition in AI-generated quizzes?

Context-sentence fill-in-the-blank outperforms isolated word-definition matching for retention because it forces learners to process meaning in use. MCQ with plausible distractors (near-synonyms or false cognates) is effective for recognition-stage vocabulary. For production practice, short-answer prompts where learners must use the target word in a sentence yield the strongest learning signal.

Can AI generate audio-ready scripts for listening comprehension quizzes?

Yes. Prompt the model to write a dialogue or monologue at a specified level and register, then write comprehension questions based on the script. You can pair this with a text-to-speech tool for the audio component. Specify natural spoken register rather than written register in the prompt to avoid stiff, textbook-sounding dialogue that sounds unnatural when read aloud.

How many quiz questions can I realistically generate per hour using this approach?

With a well-structured prompt template you can generate and lightly review 50-100 calibrated questions per hour, compared to 10-20 questions per hour written manually by an experienced content author. The bottleneck shifts from drafting to quality review. Build a review checklist for your target language to make spot-checking systematic rather than ad hoc.