Abstract
AI (artificial intelligence)-generated text is a hot topic for many reasons. Computers can now generate convincing paragraphs or even pages of text that look like they were written by a human. How do you know if a news article you are reading was written by a human or an AI? How does a teacher know if a student's essay was written by an AI? How do you know this text was not written by an AI? In this science project, you will conduct an experiment to see if volunteers can correctly identify whether different passages of text were written by a human or AI.
Summary
None
Readily available
No issues
Objective
Find out whether volunteers can identify which paragraphs of text were written by humans and which ones were generated by AI.
Introduction
Chatbots, or computer programs that can have a back-and-forth text conversation mimicking a human conversation, have been around for a long time. They have gotten better and better with the introduction of machine learning, which allows the programs to "learn" from massive amounts of training data, or text written by real humans. Machine learning programs are typically called models. Scientists "train" the model to improve its performance using the training data. ChatGPT is one model that has received a lot of public attention, but many others exist, like Microsoft's Copilot AI and Google's Gemini.
ChatGPT is a computer program that is really good at understanding and answering questions. It was taught by a company called OpenAI using a special technique called "deep learning." This means that the program learned from a huge amount of information about how people talk and write in different ways. Now, when you type a question or a message, ChatGPT uses that information to try and come up with the best answer it can. It's like having a very smart friend who knows a lot about many different things and can talk to you about them!
Did you think that previous paragraph was written by a human? It was actually written by ChatGPT! That was its answer in response to the prompt "Write a paragraph about how ChatGPT works that is appropriate for a 5th grader to understand." Figure 1 shows a screenshot of the conversation.

Prompt: Write a paragraph about how ChatGPT works that is appropriate for a 5th grader to understand. Response: ChatGPT is a computer program that is really good at understanding and answering questions. It was taught by a company called OpenAI using a special technique called 'deep learning.' This means that the program learned from a huge amount of information about how people talk and write in different ways. Now, when you type a question or a message, ChatGPT uses that information to try and come up with the best answer it can. It's like having a very smart friend who knows a lot about many different things and can talk to you about them!
Figure 1. Screenshot of a ChatGPT conversation.
ChatGPT can do many things—for example, offering the same explanation at a different reading level. Here is its response to "Write a paragraph about how ChatGPT works that is appropriate for a 1st grade to understand.":
ChatGPT is a helper that can talk to you. When you ask it a question, it tries its best to find the answer and tell it to you. It's like having a talking dictionary that can answer your questions!
You see how, in both cases, ChatGPT can produce text that makes sense and is free from grammatical or punctuation errors. This can make it difficult for a human reader to distinguish text written by a human from text written by ChatGPT. Can people tell the difference at all? Are some passages of text easier to spot than others? In this science project, you will conduct an experiment to find out. You will collect passages of text from different human-generated sources, and generate text on similar topics using ChatGPT or another AI chatbot of your choice. You will then present each passage of text to volunteers and ask them to mark whether they think the text was written by a human or AI. How good to you think the volunteers will be at this task?
Terms and Concepts
- Chatbot
- Machine learning
- Training data
- Model
- ChatGPT
- Search engine
- Natural language processing
- Virtual assistant
Questions
- How do AI chatbots work?
- Do you think humans can easily identify text written by AI?
- Do you think AI-generated text is a good or a bad thing? Can it be both? What benefits could it have? What problems could it cause?
Bibliography
Read the information on this page to learn more about how ChatGPT works:
- OpenAI (2022, November 30). ChatGPT: Optimizing Language Models for Dialogue. Retrieved February 1, 2023.
Materials and Equipment
- Computer with internet access
- Human volunteers
- Optional: printer and paper, if you plan to print out your passages of text
- Lab notebook
Experimental Procedure

Working with Human Test Subjects
There are special considerations when designing an experiment involving human subjects. Fairs affiliated with Regeneron International Science and Engineering Fair (ISEF) often require an Informed Consent Form (permission sheet) for every participant who is questioned. Consult the rules and regulations of the science fair that you are entering, prior to performing experiments or surveys. Please refer to the Science Buddies documents Projects Involving Human Subjects and Scientific Review Committee for additional important requirements. If you are working with minors, you must get advance permission from the children's parents or guardians (and teachers if you are performing the test while they are in school) to make sure that it is all right for the children to participate in the science fair project. Here are suggested guidelines for obtaining permission for working with minors:
- Write a clear description of your science fair project, what you are studying, and what you hope to learn. Include how the child will be tested. Include a paragraph where you get a parent's or guardian's and/or teacher's signature.
- Print out as many copies as you need for each child you will be surveying.
- Pass out the permission sheet to the children or to the teachers of the children to give to the parents. You must have permission for all the children in order to be able to use them as test subjects.
Important reminder: taking work written by someone else (whether a human or AI), and trying to pass it off as your own work, is plagiarism. In general, you should not copy and paste entire pages of text from a book or website. However, it is OK to provide short quotes as long as you cite your source. The same applies to AI-generated text from programs like ChatGPT.
- Plan out your experiment. You will need to decide:
- How many human volunteers you need. Try to have at least five.
- The general topic for your passages of text, such as "animals." You could choose something slightly more specific, like "dogs."
- How many passages of text you will test. 20 is a good start, 10 each for human-generated and AI-generated.
- What length you want your passages of text to be. A paragraph is a good place to start, although you could test shorter or longer lengths like sentences or essays. For this experiment, you should keep the length fairly constant (passages do not need to be the exact same number of words or characters). Comparing lengths is also an option for a separate experiment (see Variations section).
- How you will have your volunteers read the text (on a computer or printed paper).
- What sources you will use for samples of human-generated text.
- It is a good idea to stick to books and reputable websites with known human authors. Do not just copy and paste content from random websites, since it could have been written by an AI!
- You should try to choose samples at roughly the same level of complexity or reading difficulty. For example, you do not want to have some paragraphs from children's books and some from novels for adults. Different reading levels are another interesting variable you could explore in an experiment (see Variations section).
- What prompts you will use to generate text from an AI chatbot of your choice. Make sure the prompts are related to the general topic you picked. For example, if you choose dogs as your topic, you could say "Tell me about golden retrievers in one paragraph at a 10th grade reading level."
- Create a data table for your experiment. Table 1 shows an example.
| Original passage # | Re-ordered passage # | Human or AI? | Volunteer 1 | Volunteer 2 | ... | % correct for each passage |
|---|---|---|---|---|---|---|
| 1 | Human | |||||
| 2 | Human | |||||
| ... | ... | |||||
| 10 | Human | |||||
| % human correct for each volunteer | ||||||
| 11 | AI | |||||
| 12 | AI | |||||
| ... | ... | |||||
| 20 | AI | |||||
| % AI correct for each volunteer | ||||||
| Total % correct for each volunteer |
Table 1. Example data table. Add columns for as many volunteers as you have.
- Find your 10 passages of human-generated text. Put them all into a single document, label each one with a number 1–10, and that it is human-generated. This document will only be visible to you.
- Choose an AI chatbot like ChatGPT, Gemini, or Copilot. Create an account if needed and experiment with the interface a bit so you understand how it works. Some services like ChatGPT may have their own separate website, and others may be built directly into search engines like Google or Bing.
- Generate 10 passages of text with the AI chatbot. You can specify requirements to make them similar to your passages of human-generated text, for example by specifying a length and reading level. Again, paste these passages of text into your document, and label them as "AI-generated" numbers 11–20.
- Create a separate version of the document for the volunteers to see. Delete the numbers and labels, then randomly move the passages of text around to mix them up. Label them 1–20 in the new order.
- In your data table, write the re-ordered passage number next to the original passage number. For example, if what was originally the first passage of text is now the fifth in the volunteer version of the document, you would write a 5 in the second column of the first row. This will make it easier to keep track of your results later.
- Optional: print the document if you prefer to have your volunteers read from paper copies.
- Have your first volunteer read each passage and tell you whether they think it was written by a human or an AI. Record your results in your data table. Be careful: remember that the volunteer document follows the "re-ordered passage #" column, not the "original passage #" column. Make sure you write each response down in the correct row. Try not to have any reactions or make any facial expressions that would indicate to the volunteer whether they are right or wrong. Make sure you do not give the volunteers any other information (for example, do not tell them that there are 10 of each type).
- Repeat step 9 for each volunteer.
- Now, calculate the following percentages and fill in your data table.
- For each passage of text, calculate the percentage of volunteers who identified it correctly. For example, if passage 1 was written by a human, you have 10 volunteers, and 6 of them said it was written by a human, the percentage is 6/10 = 60%.
- For each volunteer, calculate three percentages:
- The percentage of human-generated passages they correctly identified (out of 10).
- The percentage of AI-generated passages they correctly identified (out of 10).
- The total percentage of passages they correctly identified (out of 20).
- Analyze your data so far.
- Were certain passages of text easier for volunteers to correctly identify than others?
- Were certain volunteers better at correctly identifying passages of text than others? If so, were they better at identifying the human-written text, AI-generated text, or both?
- Calculate an overall average percentage correct for both the human-written and AI-generated passages. Do this by averaging the percentages in the last column for "original passage #" 1–10 (human) and "original passage #" 11–20 (AI).
- Analyze your final data. Overall, are humans good at identifying text that is written by an AI? Do you have any "false positives," where humans incorrectly identify text that was written by a human as written by an AI? Think back to the questions posed in the Background section. Overall, what benefits do you think AI-generated text could have? What problems do you think it could cause?
Extra: the procedure above was written by a human employee at Science Buddies. For the record, we asked ChatGPT to write the procedure for this project with the following prompt: "Write a step by step procedure for a science project where students ask volunteers to say whether they think paragraphs of text were written by a human or by ChatGPT. The students have access to ChatGPT to generate text and they can pull human-written text from other sources (books, the internet, etc.)." This is what it came up with. What are the differences between this procedure and the one above?
- Choose a topic: Choose a topic that you would like to create text on, for example, "Animals in the Rainforest."
- Generate Text using ChatGPT: Use the ChatGPT API to generate a paragraph of text about the chosen topic.
- Collect Human-written text: Gather several paragraphs of human-written text on the same topic from various sources such as books, the internet, etc.
- Create a Text Pool: Combine the ChatGPT-generated text and the human-written text into a single pool of text.
- Choose Volunteers: Ask classmates or friends to volunteer to participate in the experiment. Make sure to have a good mix of people with different levels of reading and writing abilities.
- Randomize the Text Pool: Randomly choose a paragraph from the pool of text and present it to each volunteer.
- Ask for Responses: Ask the volunteer to read the paragraph and say whether they think it was written by a human or ChatGPT.
- Record Responses: Record the volunteer's response for each paragraph in a spreadsheet or on paper.
- Repeat: Repeat steps 6 through 8 for each volunteer and for multiple paragraphs of text.
- Analyze Results: Once you have recorded the responses from all the volunteers, analyze the results to see how accurately they were able to determine whether the text was written by a human or by ChatGPT.
- Report Results: Write a report of your findings, including any trends or patterns you observed and any conclusions you can draw from your experiment.
Ask an Expert
Variations
- Re-run the experiment with passages of text of different lengths (for example, sentences and essays). Does the length of the passage affect whether humans can correctly identify it as human or AI-generated?
- Re-run the experiment with passages of text at different reading levels. Does the reading level affect whether humans can correctly identify text as human or AI-generated?
- There are other websites online that can generate text using AI. Can you run an experiment comparing different websites or programs? Are some better than others at generating text that looks like it was written by a human?
Careers
If you like this project, you might enjoy exploring these related careers:










