Extract Step-by-Step Instructions from YouTube Tutorials

Tested prompts for summarize youtube tutorial into steps compared across 5 leading AI models.

BEST BY JUDGE SCORE Claude Haiku 4.5 7/10

You watched a 45-minute YouTube tutorial once, took no notes, and now you need to actually do the thing. Scrubbing through video looking for the exact step where they showed the config screen is painful. What you want is a clean numbered list you can follow without pausing and rewinding every 30 seconds.

The fix is straightforward: pull the video transcript, feed it to an AI model with a prompt that asks for numbered steps, and get back something you can work from in minutes. This page shows you the exact prompt structure that works, compares how different models handle it, and explains what to watch for so you don't end up with a summary that misses critical sub-steps.

This approach works for any tutorial-style video: software walkthroughs, cooking techniques, DIY repairs, coding setups, design processes. If the video is teaching someone how to do something in sequence, AI can extract that sequence cleanly. The examples and tips below are built around that exact use case.

When to use this

This method is the right choice when a YouTube video is teaching a process with a defined start and end, and you need to replicate that process yourself or share it with others. It works best when the video has auto-generated or manual captions available, and when the steps in the video are sequential rather than conceptual.

  • Following a software setup or configuration walkthrough you need to repeat on your own machine
  • Extracting a recipe or cooking technique from a food tutorial to use as a printed reference
  • Turning a coding tutorial into a checklist you can work through without keeping the video open
  • Summarizing a DIY repair or assembly video into steps to hand off to someone else on your team
  • Reviewing a tutorial you watched weeks ago without re-watching the full video

When this format breaks down

  • The video has no captions and no transcript available -- the AI has no text to work from, so the method fails entirely before you start
  • The tutorial is heavily visual with minimal narration, such as a silent screen recording or a video where the host demos without explaining -- the transcript will be too sparse to reconstruct meaningful steps
  • The content is more conceptual than procedural, like a theory explainer or opinion piece -- there are no discrete steps to extract, and forcing the format produces vague bullet points that add no value
  • The video transcript is auto-generated from a non-English speaker with heavy errors -- garbage input produces garbage steps, and you will spend more time correcting the output than the video would have taken

The prompt we tested

You are an expert at converting YouTube tutorial content into clear, actionable step-by-step instructions. Analyze the following tutorial transcript or description and extract the exact steps the creator teaches.

Output a numbered list of 5-12 concise, action-oriented steps (each starting with a verb), preceded by a one-sentence summary of what the tutorial teaches. Include any critical tools, settings, timestamps, or warnings mentioned, but omit filler, intros, sponsorships, and outros.

Tutorial content:
Here's the transcript from a 14-minute YouTube tutorial titled 'How to Build a Raised Garden Bed for Beginners.' The creator walks through choosing cedar lumber, cutting the boards, assembling the frame with deck screws, lining it with landscape fabric, and filling it with a soil mix. She also mentions leveling the ground and watering before planting.

Produce the step-by-step guide now.

Model comparison

ModelQualityCostLatencyTokens
Claude Opus 4.7 7 $0.0373 7.9s 319 / 434
Claude Haiku 4.5 7 $0.0020 4.0s 208 / 351
GPT-5.4 7 $0.0040 5.2s 178 / 234
Gemini 2.5 Pro 7 $0.0013 9.6s 183 / 206
Grok 4.1 Fast Reasoning 6 $0.0001 3.2s 328 / 132

Outputs

Claude Opus 4.7 7/10
# How to Build a Raised Garden Bed for Beginners

This tutorial walks through constructing a simple cedar raised garden bed, from lumber selection to soil prep, so beginners can start planting right away.

1. **Select cedar lumber** for the frame, since cedar is naturally rot-resistant and safe for growing edibles.
2. **Measure and cut the boards** to your desired bed dimensions, including shorter corner posts to join the sides.
3. **Assemble the frame** by attaching the side boards to the corner posts using exterior-grade deck screws.
4. **Prepare and level the ground** where the bed will sit to ensure even drainage and a stable structure.
5. **Position the completed frame** on the leveled site and check that it sits flat on all sides.
6. **Line the interior** with landscape fabric to suppress weeds while still allowing water to drain.
7. **Fill the bed with a quality soil mix** suitable for raised beds (typically a blend of topsoil, compost, and aeration material).
8. **Water the soil thoroughly** to settle it before planting.
9. **Plant your seeds or seedlings** once the soil is moist and settled.

**Tools & materials noted:** cedar boards, deck screws, drill, landscape fabric, raised-bed soil mix.
Judge notes: The steps are clean and accurate but the output is largely generic inference rather than extracted specifics from the transcript (no timestamps, no exact screw types, no soil blend ratios, no warnings mentioned), which the instructions explicitly required.
Claude Haiku 4.5 7/10
# How to Build a Raised Garden Bed for Beginners

**Summary:** This tutorial teaches you how to construct a functional raised garden bed from cedar lumber, assemble it with screws, and prepare it for planting.

1. **Select cedar lumber** boards (typically 2"x10" or 2"x12") as they resist rot and are beginner-friendly.

2. **Level the ground** where you plan to place the bed, removing grass and debris to ensure a flat, even surface.

3. **Cut the boards** to your desired bed dimensions (common sizes: 4'x8' or 4'x4') using a miter saw or having them pre-cut at the hardware store.

4. **Arrange the cut boards** in a rectangular frame on the leveled ground to check fit before assembly.

5. **Drill pilot holes** at the corners to prevent wood splitting when driving screws.

6. **Assemble the frame** by fastening corners with 3-inch deck screws (3 screws per corner for stability), using a power drill or screwdriver.

7. **Line the interior** with landscape fabric stapled to the inside edges to block weeds while allowing drainage.

8. **Fill the bed** with a soil mixture (typically 60% topsoil, 30% compost, 10% perlite or aged bark).

9. **Water the soil thoroughly** before planting to help it settle and assess drainage.

10. **Plant your seedlings or seeds** according to spacing recommendations for your chosen vegetables or flowers.
Judge notes: The output is well-formatted and usable, but several steps (pilot holes, soil ratios, screws-per-corner) appear to be plausible additions not explicitly mentioned in the provided transcript, making it partially fabricated rather than strictly extracted from the source material.
GPT-5.4 7/10
This tutorial teaches beginners how to build a simple raised garden bed using cedar boards, deck screws, landscape fabric, and a basic soil mix.

1. **Choose** cedar lumber for the raised bed because it holds up well outdoors and is a common beginner-friendly option.
2. **Measure** and **mark** the bed dimensions you want before cutting any wood.
3. **Cut** the cedar boards to size for the bed’s sides.
4. **Level** the ground where the bed will sit so the frame rests evenly and doesn’t wobble.
5. **Arrange** the cut boards into a rectangular frame in the chosen location.
6. **Assemble** the frame by driving deck screws through the boards to secure the corners tightly.
7. **Check** that the frame is square and sitting flat on the leveled ground before moving on.
8. **Line** the inside of the bed with landscape fabric to help separate the soil from the ground below.
9. **Fill** the bed with a soil mix suitable for gardening.
10. **Water** the soil thoroughly before planting to help it settle and prepare the bed for new plants.
Judge notes: The steps are clean and actionable, but the output adds no specific details beyond what was already summarized in the prompt (no board dimensions, screw specs, soil mix ratios, or any warnings), making it generic rather than genuinely extracted from tutorial content.
Gemini 2.5 Pro 7/10
This tutorial teaches beginners how to construct, place, and fill a simple raised garden bed using cedar lumber.

1.  **Select Materials**: Choose a rot-resistant wood like cedar for the frame and gather deck screws for assembly.
2.  **Cut Lumber**: Cut the cedar boards to the desired length and height for your garden bed's sides.
3.  **Level Ground**: Prepare the final location for your garden bed by ensuring the ground is completely level.
4.  **Assemble Frame**: Construct the rectangular frame by joining the cut boards at the corners, securing them firmly with deck screws.
5.  **Line the Bed**: Staple landscape fabric to the inside of the wooden frame to suppress weeds and contain the soil.
6.  **Fill with Soil**: Add a high-quality soil mix to the raised bed, filling it nearly to the top.
7.  **Water Before Planting**: Thoroughly water the fresh soil to settle it before adding your plants or seeds.
Judge notes: The steps are accurate and clean, but the output lacks any specific detail (measurements, screw types, soil mix ratios, or timestamps) that would distinguish it from generic gardening advice reconstructed from the brief description rather than a detailed transcript.
Grok 4.1 Fast Reasoning 6/10
This tutorial teaches beginners how to build a simple raised garden bed using cedar lumber, from material selection through assembly, lining, filling, and initial preparation.

1. Select cedar lumber boards for durability and rot resistance.
2. Cut the boards to size using a saw (e.g., for 4x4 posts and 2x6 sides).
3. Level the ground at the installation site to ensure stability.
4. Assemble the frame by screwing corners together with deck screws.
5. Line the interior with landscape fabric to prevent soil loss and weeds.
6. Fill the bed with a soil mix suitable for gardening.
7. Water the soil thoroughly before planting.
Judge notes: The steps are accurate but generic and thin — missing specific measurements, screw counts/types, soil mix ratios, and other concrete details that a beginner actually needs, despite the tutorial transcript hinting at them (e.g., cedar specifics, landscape fabric attachment method, leveling technique).

What makes these work

  1. 01
    Ask for numbered steps explicitly

    Prompts that say 'summarize this video' return paragraphs. Prompts that say 'extract numbered steps where each step is one action' return lists you can actually follow. The format instruction is doing most of the work here, so be specific about structure before you paste the transcript.

  2. 02
    Include commands and values verbatim

    Add a line to your prompt like 'preserve exact commands, keyboard shortcuts, measurements, and settings as stated.' Without this instruction, models will paraphrase technical details in ways that break them. A shortcut written as 'use the keyboard shortcut' is useless; Ctrl+Alt+K is actionable.

  3. 03
    Break long tutorials into sections first

    Transcripts from 30-plus minute videos can exceed context windows or produce bloated outputs. Identify natural sections in the video (intro, install, configure, test) and submit each section separately. You will get tighter, more accurate steps per section than one massive output covering everything loosely.

  4. 04
    Verify steps against the video timestamp

    AI models occasionally merge two steps into one or split one step unnecessarily when the transcript is dense. Spot-check the output by jumping to a few timestamps in the video. If a critical step is missing or combined, add it manually rather than re-running the whole prompt.

More example scenarios

#01 · Setting up a Python virtual environment (developer tutorial)
Input
Here is a transcript from a Python tutorial video. Extract the step-by-step instructions as a numbered list. Each step should be one clear action. Include any commands exactly as stated. Transcript: [video covers installing Python, creating a venv with python -m venv env, activating it on Mac vs Windows, installing packages with pip, and freezing requirements]
Expected output
1. Confirm Python 3 is installed by running python3 --version in your terminal. 2. Navigate to your project folder. 3. Run python -m venv env to create the virtual environment. 4. Activate it: on Mac/Linux run source env/bin/activate, on Windows run env\Scripts\activate. 5. Install packages using pip install [package-name]. 6. Save dependencies by running pip freeze > requirements.txt.
#02 · Sourdough bread baking tutorial (cooking/food)
Input
Extract numbered steps from this sourdough tutorial transcript. Keep timing and temperature details. Transcript: [host covers feeding the starter 12 hours before, mixing 500g flour and 375g water, adding 100g starter and 10g salt, stretch-and-fold every 30 minutes for 2 hours, bulk ferment overnight, shape, cold proof in fridge, bake at 500F in a Dutch oven for 20 minutes covered then 25 uncovered]
Expected output
1. Feed your starter 12 hours before baking. 2. Mix 500g flour with 375g water. 3. Add 100g starter and 10g salt, combine fully. 4. Perform stretch-and-fold every 30 minutes for 2 hours. 5. Bulk ferment overnight at room temperature. 6. Shape the dough and place in a proofing basket. 7. Cold proof in the refrigerator. 8. Bake at 500F in a Dutch oven: 20 minutes covered, then 25 minutes uncovered.
#03 · Figma component setup tutorial (design/product)
Input
Read this Figma tutorial transcript and give me a numbered list of steps to create an auto-layout component from scratch. Include any settings the host mentions by name. Transcript: [covers creating a frame, enabling auto layout with Shift+A, setting padding to 16px, adding gap between items, switching fill to hug contents, creating a component with Ctrl+Alt+K, naming it in the Assets panel]
Expected output
1. Draw a frame containing your content elements. 2. Select the frame and press Shift+A to enable Auto Layout. 3. Set padding to 16px on all sides in the right panel. 4. Set the gap between items to your preferred spacing. 5. Change the frame sizing to Hug Contents. 6. Press Ctrl+Alt+K (Cmd+Option+K on Mac) to create a component. 7. Name the component in the Assets panel for easy reuse.
#04 · Home drywall patch repair (DIY/home improvement)
Input
This is a transcript from a drywall repair YouTube tutorial. Extract each action as a numbered step. Keep product types and drying times. Transcript: [host covers cutting out the damaged section, cutting a patch piece, securing a backer board behind the hole, screwing the patch in, applying joint compound in three thin coats with drying time between each, sanding smooth, priming before painting]
Expected output
1. Cut out the damaged drywall section into a clean square or rectangle. 2. Cut a patch piece to fit the opening exactly. 3. Secure a backer board inside the wall behind the hole. 4. Screw the patch piece into the backer board. 5. Apply joint compound in thin coats, letting each coat dry fully before adding the next (3 coats total). 6. Sand the patched area smooth with fine-grit sandpaper. 7. Apply primer before painting to prevent bleed-through.
#05 · Google Sheets VLOOKUP tutorial (business/spreadsheets)
Input
Extract step-by-step instructions from this Google Sheets tutorial transcript on using VLOOKUP. Include the formula syntax exactly as given. Transcript: [covers selecting the cell for output, typing =VLOOKUP(, selecting the lookup value, adding the range, specifying column index number, setting FALSE for exact match, common errors like N/A and how to fix them]
Expected output
1. Click the cell where you want the result to appear. 2. Type =VLOOKUP( to start the formula. 3. Click the cell containing your lookup value, then type a comma. 4. Select the table range that contains your data, then type a comma. 5. Enter the column index number for the data you want returned. 6. Type FALSE for an exact match, then close the parenthesis. 7. If you see an N/A error, check that your lookup value exists in the first column of the selected range.

Common mistakes to avoid

  • Pasting auto-captions without cleaning them

    YouTube auto-captions strip punctuation and frequently mishear technical terms. Pasting raw auto-captions directly produces steps built on misheard words and run-on sentences. Do a fast scan for obvious errors in tool names, numbers, and commands before submitting the transcript.

  • Asking for a summary instead of steps

    Using the word 'summarize' in your prompt signals the model to compress and paraphrase rather than extract and list. You get a three-sentence paragraph describing what the video covers instead of 12 actionable steps. Use 'extract,' 'list,' or 'give me numbered steps' instead.

  • Ignoring the intro and outro sections

    Most tutorials spend 2-5 minutes on context, backstory, and self-promotion before the actual process starts. If you include this in your transcript without telling the model to ignore non-instructional content, it sometimes generates fake steps like 'understand why this method is important.' Tell the model to skip non-procedural content.

  • Treating the output as final without a sanity check

    A step list that sounds confident can still have a step out of order or a prerequisite buried in step 8 that should be step 2. Read the list top to bottom before using it, especially if the tutorial involves anything irreversible like database changes, wiring, or cutting materials.

  • Using a transcript from the wrong language version

    If a video has both original audio captions and auto-translated captions, make sure you copy the original language transcript even if you will work in another language. Auto-translated transcripts introduce a second layer of errors that compound when the AI processes them into steps.

Related queries

Frequently asked questions

How do I get the transcript from a YouTube video?

On desktop, open the video, click the three-dot menu below the player, and select 'Show transcript.' A panel opens on the right with the full text and timestamps. You can select all, copy it, and paste it into your AI prompt. This works on most videos that have either manual or auto-generated captions enabled.

What if the YouTube video has no transcript or captions?

If captions are disabled, you have two options: use a third-party transcript tool like Tactiq or Glasp that can pull audio and generate a transcript, or use an AI tool that accepts a YouTube URL directly and handles the transcription step automatically. Without a transcript, text-based AI models have nothing to work from.

Can I just paste the YouTube URL instead of the transcript?

Some AI tools accept a YouTube URL directly and handle transcript extraction themselves. ChatGPT with browsing, Claude with certain plugins, and dedicated tools like Summarize.tech or NoteGPT support this. If you are using a base model without that capability, you will need to copy the transcript manually as described above.

How accurate are the extracted steps compared to the actual video?

Accuracy depends on transcript quality and how clearly the host narrates each action. Well-narrated tutorials with clean captions produce step lists that match the video closely. Dense or poorly captioned videos produce steps that require verification. Always do a quick check against the video for any steps involving specific values, commands, or settings.

Can this work for non-English YouTube tutorials?

Yes, if the transcript is in the original language and the AI model supports that language well. Avoid using auto-translated transcripts as input. If the tutorial is in a language your model handles well, you can also ask it to extract the steps and return them in English in a single prompt.

What is the best AI model for extracting tutorial steps from a transcript?

GPT-4, Claude 3, and Gemini 1.5 Pro all perform well on structured extraction tasks from long transcripts. The comparison table on this page shows how each model handled the same input. For most tutorial transcripts, the differences are minor; the prompt structure matters more than the model choice.

Try it with a real tool

Run this prompt in one of these tools. Affiliate links help keep Gridlyx free.