Abstract
This project explores how large language models (LLMs) can generate recipes from YouTube cooking videos. In this science project, you will learn how LLMs interpret language and how prompt design can affect results.
Summary
None
Readily available
No issues
Objective
To experiment with LLMs and prompt engineering to create, compare, and evaluate AI-generated recipes from video transcripts.
Introduction
Large language models (LLMs) are powerful AI systems that have been trained on huge amounts of text from the internet, books, and other sources. They can understand and create human-like language, which allows them to answer questions, write stories or essays, summarize information, and even generate recipes. Some popular examples of LLMs are ChatGPT, Gemini, Copilot, and Grok.
As LLMs become more common, a new skill called prompt engineering has become very useful. Prompt engineering means learning how to write clear and specific prompts (or instructions) to get the best results from an AI model. The way you ask a question can make a big difference in how the model responds–small changes in wording can lead to very different answers.
Watch this video to learn more about prompt engineering:
In this project, you will explore how to use LLMs to create text recipes from online cooking videos. Trying to follow along with these videos can be annoying if you have to pause the video at each step (Figure 1), so extracting the recipe in text form can be helpful. By using video transcripts and experimenting with different prompts, you can see how well LLMs can understand cooking steps, organize them into recipes, and even add details like ingredient substitutions or difficulty ratings.

Terms and Concepts
- Large language model (LLM)
- Prompt engineering
Questions
- What are large language models (LLMs), and what are some examples?
- Why might it be helpful to generate a text recipe from a cooking video?
- What does prompt engineering mean?
Bibliography
To learn more about prompt engineering:
- Simplilearn. (2023, Dec 16). What Is Prompt Engineering? | Introduction to Prompt Engineering In 6 Minutes | Simplilearn. YouTube. Retrieved October 22, 2025.
- Su, Jeff. (2023, Aug 1). Master the Perfect ChatGPT Prompt Formula (in just 8 minutes)!. YouTube. Retrieved October 22, 2025.
We are using OpenAI Whisper model for transcription generation:
- OpenAI. (n.d.) whisper-base [Model]. Hugging Face. Retrieved October 22, 2025.
Materials and Equipment
- Computer with internet access
Experimental Procedure

Generating YouTube Transcripts
- On YouTube, find 10 recipe videos that you would like to generate recipes from.
- Make sure the videos include spoken narration, since we will use their transcripts to create the recipe.
- Open the video description and scroll down to the bottom. You will see a button called “Show transcript.” If you click on that button, a new box with the transcript will appear.
- To get rid of the time stamps, click on the three dots on the top right corner of the transcript box and then select “Toggle timestamps.”
- Note: Not all YouTube videos will have transcripts.
- Copy and paste the transcript into a text editor like Google Docs or Notepad. Give each separate transcript its own title or header in the document, or save each one in a different file.
- Repeat step 4 for each of the 10 videos.
Experimenting with Recipe Generation and Prompt Engineering
- Watch one of the videos you selected yourself and write down the recipe.
- Think about the criteria you will use to evaluate the AI-generated recipe. For example:
- Does the recipe have any errors in ingredient quantities? If so, how many errors?
- Does the recipe have any steps out of order? If so, how many?
- Does the recipe have any hallucinations or seemingly fabricated information that cannot be verified?
- Is the recipe's format easy to follow? Is the language easy to read?
- Create a data table that you can use to evaluate the recipes based on your specific criteria.
Swipe left to see moreTable 1. Example data table. You can add more rows to the table for additional videos and new prompts. Video Prompt Criteria #1 Criteria #2 Criteria #3 ... Notes - Generate a recipe from the video transcript using a basic prompt and evaluate the recipe.
- Choose an LLM like ChatGPT.
- Enter a prompt like “Make a recipe from this video transcript.” and then paste in the video transcript.
- Record your notes and observations in the data table you created.
- Repeat step 4 for your remaining videos. Make sure you use the same LLM and prompt each time.
- Now, this is where prompt engineering starts. Try modifying or adding something to your prompt. Here are several ideas to help you start, but you should only change one thing at a time.
- “Add ingredient alternatives to make this recipe vegan."
- “Include a difficulty rating.”
- “Add estimated preparation and cooking times.”
- “Make the recipe more detailed for someone who is inexperienced in cooking.”
- “Make the recipe more concise for someone who is experienced in cooking.”
- “Write the recipe so that the ingredients and quantities are built into the steps instead of having a separate ingredient list before the recipe.”
- Repeat step 4 with your new prompt. If necessary, you should also rewrite the recipes yourself. For example, if you are going to prompt the AI to write a detailed recipe for an inexperienced chef, you should write the recipes in that style yourself so you can compare the results.
- Continue to iterate and experiment with different prompts.
- How accurate is the LLM overall in determining the correct basic recipe based on the video transcript?
- What prompt gives you the best results for the format and style of a text recipe?
Ask an Expert
Variations
Try exploring different ways to generate and compare recipes:
- Compare different language models for recipe generation: Test how various AI tools handle recipe generation, such as ChatGPT, Gemini, Copilot, and Grok.
- For videos without a transcript, you can use the OpenAI Whisper model to generate a transcript. You can download the Python notebook to do so here.
- Compare different transcription models for extracting video text: In the Python notebook, several speech-to-text (Whisper) model sizes are available (some are commented out). You can uncomment the model you want to use (and comment out the others) to switch between them. Try running the tiny, small, medium, and large models to see how the accuracy and detail of the video transcript change.
- Experiment with different languages: Generate recipes in languages other than English, or even translate between languages. For best results, set the model to large. You can find more information about available models here: Hugging Face – OpenAI Whisper Models.
- Ask an AI image model to generate a picture of the finished dish based on the text recipe.
- Give an LLM a picture of a finished dish, and ask it to generate a recipe just based on the picture.
Careers
If you like this project, you might enjoy exploring these related careers:







