Jump to main content

Model How Environment and Genetics Change the Chances of Getting Cancer

1
2
3
4
5
72 reviews

Abstract

What is cancer? How does it develop? Doctors and scientists have asked these questions for hundreds of years to understand cancer and find treatments. In this science project, you can investigate these questions too by building a simple model and exploring how environmental and genetic changes affect the development of cancer.

Summary

Areas of Science
Difficulty
Method
Time Required
Very Short (≤ 1 day)
Prerequisites

Basic knowledge about what DNA mutations are.

Material Availability

Readily available

Cost
Very Low (under $20)
Safety

No issues

Credits
Sandra Slutz, PhD, Science Buddies Alumni
Science Buddies is committed to creating content authored by scientists and educators. Learn more about our process and how we use AI.

Objective

Build a simple cancer model to investigate how the chances of developing cancer change depending on environmental and genetic factors.

Introduction

Cancer has impacted humans for many generations. Scientists and doctors have spent their lives trying to understand what cancer is and find new ways to treat and prevent it. The earliest records of cancer date back to descriptions of tumors from an ancient Egyptian textbook on surgery where it was stated, "There is no treatment." However, new immunotherapy treatments hold promise for cancer treatment. 

Our understanding of cancer and our ability to treat it has evolved greatly since ancient times. Today, we know that cancer is cells' unregulated division and growth due to changes in the deoxyribonucleic acid (DNA). DNA is the instructions for making a cell. Let's break down what that means. Even under normal, healthy conditions, the cells in our bodies are constantly growing and dividing to replace older cells. Each cell division takes place from a parent cell that divides into two identical daughter cells in a process of cell division called mitosis. This process takes place often, in fact, we replace approximately 330 billion cells per day! 

Despite the high number of cell divisions, the process for each division is highly regulated by the cell cycle. The cell cycle provides checks and balances to ensure that cells divide only when they get the right signals to do so and that the DNA in the cell is copied and packaged correctly during division. Mistakes are often noticed during cell division, and the problematic cells are carefully killed off in a process known as apoptosis. If a mis-divided cell escapes apoptosis, it is often noticed by the immune system and destroyed.

Despite all these checks and balances, things occasionally go wrong, and a cell with a mutation (change in its DNA) escapes both apoptosis and the immune system. Everyone has cells in their body with mutations, but with aging, the immune system gets less effective at flagging these cells for apoptosis. Therefore, the older you are, the more mutated cells you are likely to have, which can become cancerous. 

Some mutations do not matter because they do not change gene's function. However, some mutations do become problematic, particularly in genes associated with cancer, such as oncogenes (cancerous genes) and tumor suppressor genes. As more cell divisions occur, more mutations can accumulate, and the chances of a mutation impacting one of these cancerous genes are higher. Additional gene mutations that control how the cell divides and grows can lead to the cells growing uncontrollably. This uncontrolled growth of mutated cells is what we call cancer.

To understand in detail how diseases like cancer develop and explore treatment options, scientists and doctors create mathematical disease models. A disease model aims to represent what is happening in the patient population and can continually be updated based on new research findings. There are four steps to mathematical disease modeling:

  1. Build the model from existing data. That data could be from real-world observations, data from experiments, or a combination of both. The model describes what scientists and doctors think happens with the disease on a biological level.
  2. Validate the model. The model is tested to see if it makes sense. If the model outputs results that match up with what we observe about the disease in the real world, then the model is considered valid (representative of the truth). If not, the model is revised until it matches the real-world observations.
  3. Use the model. The model is used to make predictions or run experiments related to the disease, its spread, and/or how to treat it.
  4. Revise or retire the model. As new observations and data are gathered, the model may need to be revised or retired altogether in favor of a new model.

In this science project, you will build a simple model of how cancer develops using data from published scientific papers and basic arithmetic (addition, subtraction, multiplication, and division). You will use the model to compare how many cell cycles of mutated cells it takes for cancer to develop with and without additional genetic and environmental factors. Then, you will compare the information from your simple model with real-world data to see if your model represents what occurs in everyday life.

Terms and Concepts

Questions

Bibliography

Genes, Environment, and Cancer

Disease Modeling of Cancer

Materials and Equipment

Experimental Procedure

This project follows the Scientific Method. Review the steps before you begin.

Experimental Background: Using Scratch and Scientific Data to Model Cancer 

In this experiment, you will build your cancer disease model using a simple programming language called Scratch. To keep track of your experiment, you can easily create a free Scratch account.

If you have never used Scratch before, here are some resources to help get you started: 

To create a basic cancer disease model, the factors that cause cancer need to be represented. From previous research, we know that when the parent cell divides, the daughter cell can accumulate mutations (Figure 1). In each cell division, more mutations can accumulate in each new daughter cell until there are finally enough mutations that a cancer cell develops. It takes many cell divisions to get a cancer cell. 

A mother cell goes through four cell cycles. Each cell cycle a mutation accumulates and a new cell line is born. After four or more cell cycles enough mutations have accumulated to create a cancer cell which multiplies and gives rise to a tumor.  Image Credit: Sandra Slutz, Science Buddies / Science Buddies
Figure 1. Cancer comes from the accumulation of random mutations over many cell divisions.
 
Another factor contributing to cancer development is the ability of the mutated daughter cell to avoid apoptosis or the immune system (Figure 2). Most daughter cells with mutations will trigger apoptosis and die. If the mutated daughter cell does not trigger apoptosis, it may be detected by the immune system and destroyed. If the mutated daughter cell is able to avoid both of these checks, it will survive. This avoidance of these important checks can be considered as escaping cell death in our model. After escaping these checks, the mutated daughter cell can continue to mutate with each cell division. In our cancer model, we will need to assign a mutation threshold. If the number of mutations in a daughter cell is greater than the threshold, it will be considered a cancer cell.
During cell division, it is possible for a daughter cell to gain a mutation. If so, it is likely that it undergoes apoptosis or is killed by the immune system. If neither of these happens, the mutation is fixed in that cell line. There is a chance that a cell accumulates enough mutations, over many cell divisions, that it becomes cancerous.  Image Credit: Sandra Slutz, Science Buddies / Science Buddies
Figure 2. This diagram shows how cells can avoid cell death mechanisms and accumulate mutations through multiple cell divisions to become cancerous.

Now that we have outlined what our model does and the data we need to plug into it, we can look into the literature to see if that data exists and use that information to choose our variable thresholds. We have provided references for you and shared the references for you to check and find the key data for each threshold in Table 1. We have also added information on how to use the data to make data-informed threshold choices. Additional research is necessary to build a more complex model. The variations offer some suggestions for this.

Swipe left to see more
Swipe left to see more
Variable Definition Reference Making Data-informed Threshold Choice Key Data for Threshold Choice

Mutation Rate

The rate at which daughter cells accumulate mutations every cell division for normal cells (human somatic, diploid cells). 

Lee-Six, H., et al. (2018, September 5). Population dynamics of normal human blood inferred from somatic mutations. Retrieved August 27, 2024.

  • See paragraph 2 under the heading, "Somatic mutations acquired during embryonic development"
Choose the number of mutations in the genome per every cell division. 

Cancer Mutation Threshold

The number of mutations a daughter cell must accumulate to become a cancer cell.

Martincorena I., et al.(2017, November 6). Universal Patterns of Selection in Cancer and Somatic Tissues. Retrieved August 27, 2024.

  • See paragraph 5 under the header, "Number of Driver Mutations per Tumor"

When developing a disease model, thresholds are typically the top or cut-off of a range. This is the case for this variable because the higher the number of mutations, the more likely a cell will become cancerous, so choose the maximum number in this range as the threshold. 

Cell death 

The chance of a mutated daughter cell escapes cell death or apoptosis or evades the immune system.

Zhang, W., et al. (2024, March 7) Pan-cancer evaluation of regulated cell death to predict overall survival and immune checkpoint inhibitor response. Retrieved August 27, 2024. 

  • See the first paragraph of the results. 

Cancer cells can avoid cell death in multiple ways. In this model, assume that mutated cells will escape a very small proportion of the time. Choose the minimum percentage from the range presented for the related cell death (RCD) and round down to the nearest whole number. 

Smoking The mutation rate due to smoking cigarettes.

Klein, A. (2016, November 3). Every 50 cigarettes smoked cause one DNA mutation per lung cell. Retrieved August, 27, 2024.

  • See the second paragraph of the article.
Choose the number of mutations per lung cell per every 50 cigarettes (2.5 packs per day)
BRCA1 or BRCA2 mutation The mutation rate is due to an inherited mutation (pathogenic variant) in the BRCA1 or BRCA2 gene.

Zámborszky, J. et al. (2016, July 25). Loss of BRCA1 or BRCA2 markedly increases the rate of base substitution mutagenesis and has distinct effects on genomic deletions. Retrieved August 27, 2024.

  • See paragraph 4 of the results section.
Choose the number of times the mutation rate in breast tissue is higher when the BRAC1 or BRCA2 gene mutation is present. 

Table 1. Primary research data you will use to build a simplified mathematical disease model of cancer. 

Building a Cancer Disease Model

Now that we've researched thresholds for mutation rates and apoptosis evasion, you are ready to start building your cancer model. The following steps can be used to write the starting model yourself. Alternatively, you can use the premade Science Buddies Scratch Project program and learn about modifying the disease model with additional risk factors below. 

  1. To create the project from the bottom up, open Scratch.
  2. Select "Create" at the top of the screen to create a new project.
  3. Select the variables tab to create new variables.
    1. Select the "Make a Variable" box.
    2. Type the new variable names in the pop-up window that says "New Variable." The variable names are:
      1. Mutation Rate
      2. Cancer Mutation Threshold
      3. Total Number of Mutations: How many mutations are in the DNA of the cells in your model.
      4. Number of Cell Cycles: How many cell cycles have passed before a cancer cell appears.
      5. Cell death
    3. Select ok to add the variable to the checklist, like in Figure 3. 
 Scratch blocks for the five variables: cancer mutation threshold, cell death, mutation rate, number of cell cycles, and total number of mutations. Image Credit: Sandra Slutz, Science Buddies / Science Buddies
Figure 3. Scratch blocks for the five variables needed to build the starting model.
  1. Select the "Events" tab.
    1. Drag and drop the "when the green flag is clicked" icon to the main screen.
  2. Select the "Variables" tab.
    1. Select your variable from the drop-down menu in the set variable function.
    2. Drag and drop each pre-set variable to the program. Place and attach them under the "when the green flag is clicked" icon, like Figure 4. 
    3. Based on the data collected in Table 1, set the starting values for each of the 5 variables to their threshold rates. 
    4. The remaining thresholds of the variables not shown in Table 1 should be set to zero.
       Scratch code blocks telling the computer to, when the green flag is clicked, set the mutation rate to 1.2, set the cancer mutation threshold to 10, set the number of cell cycles to 0, set the total number of mutations to 0 and set cell death to 0.  Image Credit: Sandra Slutz, Science Buddies / Science Buddies
      Figure 4. Scratch code showing the starting values of the variables based on research findings in Table 1.
  3. Create a separate program for what happens during a single-cell division. Figure 5 shows one possible solution.
    1. Select the change function to increase the Number of Cell Cycles by one after each cell division.
      1. Select the Number of Cell Cycles variable in the drop-down menu with the change text out front.
      2. Double-check to make sure the change number is by increments of 1. 
    2. Select the change function to increase the Total Number of Mutations in the daughter cell by the Mutation Rate.
      1. Select the Total Number of Mutations variable in the drop-down menu with the change text out front. Drag and drop it under the previously changed variable. 
      2. Then, drag and drop the Mutation Rate variable over the number box to insert the variable. 
    3. Add a statement to indicate whether the cell survives or if there is Cell Death.
      1. Select and set the Cell Death variable under the previous variables. Drag and drop it under the previously changed variable. 
      2. Go to the operator tab, select the pick random command with two number boxes, and drag and drop it over the number box to create a range.
      3. Change the cell death range to reflect the chance of escaping cell death (0.1%) to the range from 0.000 to 1.000 for the Cell Death variable.
    4. Add an if-then statement to indicate what happens when a daughter cell escapes Cell Death.  
      1. Go to the control tab to select an if, then statement. Drag, drop, and attach it under the previous variable.
      2. Go to the operator's tab and select a greater than statement 
      3. Add the Cell Death variable on the left of the greater than symbol. 
      4. Update the threshold to 0.001. The daughter cells' chance of escaping Cell Death is low (>0.001). Therefore, if the value is greater than 0.001, the daughter cell will die.
      5. Add a change statement with the Total Number of Mutations variable to the insert of the then statement. If the daughter cell dies, this will decrease the Total Number of Mutations variable.
      6. Go to the operator tab and add a multiplication operator to the number box.
      7. Update the number box in the change statement to -1 multiplied by the Mutation Rate since a decrease in this variable indicates that there are fewer of the Total Number of Mutations since the daughter cell died, but the mutated parental cell remains.    
    Scratch blocks telling the computer to change the number of cell cycles by 1, change the total number of mutations by the mutation rate, set cell death to a random number between 0.000 and 1.000, and if cell death is greater than 0.001 then change the total number of mutations by negative one times the mutation rate.  Image Credit: Sandra Slutz, Science Buddies / Science Buddies
    Figure 5. Scratch code for what should happen during a single cell cycle.
  4. Now that we have the code for a single cell division, we need to add code to make the cancer model continuously repeat cell cycles until the conditions for cancer are met. One possible solution is shown in Figure 6.
    1. To create a repeating loop to model what happens to mutated cancer cells over time:
      1. Select the control tab. Drag and drop the repeat until control under your wait 1-second function. 
      2. Go to the operator tab. Drag and drop the greater than operator into the hexagon of the repeat until function.
        1. Add the Total Number of Mutations variable to the left number box.
        2. Drag and drop the Cancer Mutation Threshold variable to the right number box. 
      3. Move the second set of programs representing a single cell cycle from Figure 5 into the repeat until function to represent multiple cell cycles. 
        1. Optional: You can also add graphics to make the program visually appealing and clear to users when the model is finished running. To do this, go to the Looks tab and insert the switch costume to function before and after the program, like in the example model.  
          scratch code Image Credit: Sandra Slutz, Science Buddies / Science Buddies
          Figure 6. The scratch code for a single cell cycle will be repeated until the cancer mutation threshold is met.
  5. Add a wait 1-second control function between the single cancer cell variable thresholds (Figure 4) and the repeating cell cycles step (Figure 6). 
  6. Run your model at least ten times. Each time, record in your lab notebook how many cell cycles (Number of Cell Cycles) it takes for a cancer to appear. You can also run the model integrated into the webpage here:

Modifying the Cancer Disease Model

  1. Once you have your starting cancer disease model built, you are ready to modify it to see what happens to the chances of developing cancer due to aging with additional specific environmental and genetic factors. Keep in mind that your original cancer model still contributes to the risk of cancer because all humans age. However, additional environmental and genetic factors can increase or decrease your risk. 
  2. Environmental Factor:
    1. Smoking cigarettes is an environmental factor that changes a person's chances of getting lung cancer. Use Table 1 for information about how smoking affects the mutation rate of cells. Use this information to modify your cancer disease model to include the impact of smoking a pack of cigarettes a day. 
      1. Example: If the mutation rate for smoking a specific number of cigarettes is 2 per cell for every 50 cigarettes (2.5 packs), and the average number of cigarettes smoked per day is 1 pack, then divide the number of mutations by the number of packs per day, like this: 2/2.5 = 0.8. This will give you the average number of mutations per cigarette pack per day. Then, add that to the previous mutation rate like this: 1.2 + 0.8 = 2. 
    2. Based on your own calculations, update the Mutation Rate variable. 
    3. Run your model at least ten times for people who smoke 1 pack a day. 
    4. Record each trial in your lab notebook, similar to Table 2, to keep track of the Number of Cell Cycles it takes for a cancer cell to appear.
  3. Genetic Factor:
    1. Some families with a history of breast cancer have mutations that change the function of the BRCA1 or BRCA2 genes. Using the information in Table 1, modify your starting cancer disease model to see what happens to the chances of getting breast cancer with a BRCA1 or BRCA2 mutation.
      1. Example: If the mutation rate is five times higher than the normal rate, then you will need to multiply the previous mutation rate from the original model by five, so 1.2 x 5 = 6. 
    2. Based on your own calculations, update the Mutation Rate variable. 
    3. Run your model at least ten times for people with BRCA1 or BRCA2 mutations.
    4. Record each trial in your lab notebook, similar to Table 2, to keep track of the number of cell cycles it takes to create a cancerous cell. 
  4. Compare your results for the three cancer disease models you built (cancer model, cancer model + smoking, and cancer model + genetic mutation in BRCA1 or BRCA2).
    1. What is the range of the Number of Cell Cycles for each model? Remember, the range is the minimum and maximum number.
    2. What is the average Number of Cell Cycles (of the 10 trials) before cancer appears in each model?
    3. Calculate the years it will take for the cell to become cancerous. To do this, use the primary literature to inform your calculation that, on average, the cell cycle takes 1 full day to complete.
      1. Example: If the Number of Cell Cycles = 9426 cycles / 365 cell cycles per year = 26 years
        1. Note: Depending on the cell type, this may take more or less time; we are oversimplifying here to be able to make comparisons. To make more specific and realistic comparisons, check out the variations!
  5. According to the NCI, the average age of cancer diagnosis is 66. However, the increased rate of mutated daughter cells that escape apoptosis and the immune system takes place approximately 25 years before this! Add the years until the cells become cancerous to 40 to get the predicted age of diagnosis for each model.
      1. Example: If the number of years until the cell becomes cancerous is 9.8, add 9.8 years + 40 years to get a predicted age of diagnosis of 49.8 years. 
Swipe left to see more
Swipe left to see more
Model Number of Cell Cycles Range of Number of Cell Cycles Average Number of Cell Cycles Years Until the Cell Becomes Cancerous Predicated Age of Diagnosis (years)
Cancer model

1.

2.

3.

4.

5.

6.

7.

8.

9.

10.

Cancer model + smoking ...
Cancer model + Genetic mutation (BRCA1 or BRCA2)

Table 2. Example data table for comparing the results of three cancer disease models. 

Validating Your Model

  1. Use the references in Table 3 and any additional resources to find the average age of cancer diagnosis due to aging, smoking, and BRAC1 or BRAC2 status from real-world data. Fill in your research findings in Table 3. 
  2. Then, compare your model's results to see if they are valid (make sense) compared to the real-world data. 
    1. Compare your original model to the average age of diagnosis of cancer. 
    2. Compare your models to the average age at which smokers versus non-smokers are diagnosed with lung cancer.
    3. Compare your models to the average age of incidence of people with breast cancer with and without mutations that change the function of BRCA1 or BRCA2
  3. Do your models match the real-world data? Explain why or why not.
    1. Optional: Check out the Variations section to see how you can keep improving your models.
Swipe left to see more
Swipe left to see more
Cancer Risk Factor Age of Diagnosis Reference Average Age of Diagnosis (real-world data)
Age impact on cancer risk 

NCI. (2021, March 5) Age and Cancer Risk. Retrieved August 28, 2024. 

  • Read paragraph 2

Smoking's impact on lung cancer

LoPiccolo, J., et al. (2024, January 9). Lung cancer in patients who have never smoked - an emerging disease. Retrieved August 28, 2024. 

  • Review paragraph two under the Epidemiology heading (LCINS = Lung cancer in individuals who have never smoked)

BRCA1 or BRCA2 impact on breast cancer

NCI Staff. (July 12, 2017). Large Study Verifies Cancer Risk for Women Carrying BRCA1 or BRCA2 Mutations. Retrieved August 27, 2024. 

  • Review paragraph 2 under the Necessary Confirmation heading

Table 3. Primary literature research on the average age of diagnosis due to different cancer risk factors.

Conclusions

  • How does the range of Number of Cell Cycles compare to each other in each model? Do they overlap, and what does this mean?
  • Which specific risk factors (smoking or BRCA1 or BRCA2) increased the rate at which the cells became cancerous compared to the original model? Put the years until the cell becomes cancerous for each model from least to greatest to better compare them. 
  • How accurate do you think each of your models is? Are there other variables that are not accounted for in each model based on reading the research articles, such as family history, viral infections, or cumulative risk over a lifetime?
  • How could you improve each of your models? Remember, no model is perfect! Check out the variations for some ideas. 
icon scientific method

Ask an Expert

Do you have specific questions about your science project? Our team of volunteer scientists can help. Our Experts won't do the work for you, but they will make suggestions, offer guidance, and help you troubleshoot.

Global Goals

The United Nations Sustainable Development Goals (UNSDGs) are a blueprint to achieve a better and more sustainable future for all.

This project explores topics key to Good Health and Well-Being: Ensure healthy lives and promote well-being for all at all ages.

Variations

There are many ways to make this model more sophisticated. Try one or more of these suggestions:

  • How does smoking more or less than 1 pack of cigarettes a day impact the age of diagnosis of cancer? Modify your calculation to account for this change and re-run your model compared to the 1 pack a day results to find out. 
  • What other modifiable risk factors besides smoking are there that impact cancer risk? Check out this article to learn more and find research article data to incorporate these environmental factors into new models. 
  • Program your cancer disease model in a different language, like Python, with more capabilities than Scratch. Consider upgrading your model to use a Monte Carlo Simulation. The Monte Carlo Simulation will replace the need to run your model by hand multiple times and will more accurately help you determine the number of cell cycles for each condition (normal, smoking, or BRAC1 or BRCA2 mutations). To get started, watch this video introduction to the Monte Carlo simulation and consult this python tutorial for Monte Carlo simulations.
  • Create a more specific model with Python. Do some research to find research articles (like this one) to adjust the cell division time of specific cell types affected by the type of cancer and adjust the models accordingly. How does changing the cell cycle time impact the years until the cell becomes cancerous? Does it make it more specific? How does each compare to the real-world data?
  • Different cancers have different gene drivers. Model a single type of cancer and the chances of accumulating mutations in the specific genetic drivers for that cancer. This write-up on genetic drivers of cancer is a good place to start your research.

Careers

If you like this project, you might enjoy exploring these related careers:

Career Profile
Computers are essential tools in the modern world, handling everything from traffic control, car welding, movie animation, shipping, aircraft design, and social networking to book publishing, business management, music mixing, health care, agriculture, and online shopping. Computer programmers are the people who write the instructions that tell computers what to do. Read more
Career Profile
Many aspects of peoples' daily lives can be summarized using data, from what is the most popular new video game to where people like to go for a summer vacation. Data scientists (sometimes called data analysts) are experts at organizing and analyzing large sets of data (often called "big data"). By doing this, data scientists make conclusions that help other people or companies. For example, data scientists could help a video game company make a more profitable video game based on players'… Read more
Career Profile
Mathematicians are part of an ancient tradition of searching for patterns, conjecturing, and figuring out truths based on rigorous deduction. Some mathematicians focus on purely theoretical problems, with no obvious or immediate applications, except to advance our understanding of mathematics, while others focus on applied mathematics, where they try to solve problems in economics, business, science, physics, or engineering. Read more
Career Profile
Physicians work to ease physical and mental suffering due to injury and disease. They diagnose medical conditions and then prescribe or administer appropriate treatments. Physicians also seek to prevent medical problems in their patients by advising preventative care. Ultimately, physicians try to help people live and feel better at every age. Read more
Career Profile
Life is all around you in beauty, abundance, and complexity. Biologists are the scientists who study life in all its forms and try to understand fundamental life processes, and how life relates to its environment. They answer basic questions, like how do fireflies create light? Why do grunion fish lay their eggs based on the moon and tides? What genes control deafness? Why don't cancer cells die? How do plants respond to ultraviolet light? Beyond basic research, biologists might also apply… Read more

News Feed on This Topic

 
, ,

Cite This Page

General citation information is provided here. Be sure to check the formatting, including capitalization, for the method you are using and update your citation, as needed.

MLA Style

Slutz, Sandra, and Laura Ohl. "Model How Environment and Genetics Change the Chances of Getting Cancer." Science Buddies, 3 Sep. 2024, https://www.sciencebuddies.org/science-fair-projects/project-ideas/HumBio_p058/human-biology-health/disease-modeling-cancer. Accessed 16 June 2026.

APA Style

Slutz, S., & Ohl, L. (2024, September 3). Model How Environment and Genetics Change the Chances of Getting Cancer. Retrieved from https://www.sciencebuddies.org/science-fair-projects/project-ideas/HumBio_p058/human-biology-health/disease-modeling-cancer


Last edit date: 2024-09-03
Top
Free science fair projects.