Related Links

  • Science Fair Project Guide

Project Summary

Difficulty  9 
Time required Average (about one week)
Prerequisites High school genetics
Material Availability Readily available
Cost Low ($20 - $50)
Safety No issues

Donate to Science Buddies

Abstract

You have probably seen figures showing how human beings are related to chimpanzees, gorillas, and other primates. In this genomics science fair project, you will use bioinformatics tools to generate your own primate family tree.

Objective

The objective of this genomics science fair project is to build a family tree for the primates, including Homo sapiens, based on the similarity of mitochondrial proteins. The method involves BLASTing a mitochondrial protein against a sequence database and analyzing the percent identity between hits.

Introduction

"Nothing in biology makes sense except in the light of evolution." This quote is from a 1973 essay by the famous geneticist Theodosius Dobzhansky. One of the key issues in biology is how we, as a species, are related to other life forms. Based on anatomy and physiology, biologists have argued for decades that human beings are closely related to chimpanzees, with close, but somewhat more distant, relatedness to other primates, such as gorillas and orangutans.

Despite the best efforts by generations of biologists, there are still many skeptics concerning our place on the primate family tree. In this genomics science fair project, you will make up your own mind by using raw protein sequence data and simple software tools to build a tree depicting Homo sapiens and our nearest non-human relatives.

How can you use a protein sequence to build a family tree? When two populations stop interbreeding, the genes in each population begin to accumulate mutations that are specific to each population. The longer the two populations are separated, the greater the number of genetic changes between the two populations. You can directly measure these genetic changes by comparing the sequences of particular proteins. If a protein mutates at a rate of 1 percent change per million years (say five amino acids change in a 500-amino-acid protein), then that protein in two species that split from a common ancestor 1 million years ago will be roughly 2 percent different. Why 2 percent? Because each species has accumulated 1 percent genetic change, independently of the other, resulting in 2 percent difference overall. Assuming a constant rate of mutation, a tree can be built showing the relationship of different species based on the observed genetic changes.

To start this science fair project, you will go to the website for the National Center for Biotechnology Information (NCBI), a part of the U.S. National Institutes of Health (NIH). At the NCBI website, you can look up the sequence of the protein you will use as a BLAST query. BLAST is a bioinformatics tool that allows the user to search a database of proteins for sequences that are similar to the query sequence. The query sequence is the sequence used to search the database.

You will also use your BLAST results to make a rough estimate of the rate of mutation by looking at the percent difference between a human protein and the same protein in different primates. Estimating the rate at which mutations accumulate is a way of looking at evolution in action.

You will use a protein sequence from the mitochondrial genome as a query. Mitochondrial protein sequences have the advantage that they are available for many more species (including Neanderthal) than chromosomal proteins are.

The BLAST output has three regions: a color diagram showing the alignments between the query and the database hits, a list of database hits, and the actual alignments of the query sequence with the database sequences. The figures below give more information about each of these outputs.

Once you have your BLAST results, you will make a table of the results, including the scientific names and common names of the species, the percent identity between the query sequence and the database hit, and the amount of time since Homo sapiens split from the other species. Based on this table, you can calculate a rough rate of mutation.

The BLAST page also has a tool for creating a tree of the BLAST results. You will use this tree to generate a figure showing how human beings are related to other primates. The numbers you'll see on the tree are estimates of the time since the various lines split, in millions of years ago (mya).

BLAST color key
Figure 1. Color key for the alignments. (NCBI, 2008.)


BLAST List of Seqs
Figure 2. List of BLAST hits. These are the proteins in the databse that are most related to the query protein. A high "Score" value and a low "E" value both indicate a high degree of similarity. (NCBI, 2008.)


Blast alignment
Figure 3. An example of a BLAST alignment. BLAST aligns the query sequence with the database sequence. A blank indicates a mismatch. A plus sign, "+," indicates a conservative substitution (the amino acid is replaced by an amino acid with similar chemical characteristics). (NCBI, 2008.)

Terms, Concepts and Questions to Start Background Research

Questions

Bibliography

Materials and Equipment

Experimental Procedure

  1. Before you start this science fair project, you should be familiar with the terms and concepts listed in the "Terms, Concepts, and Questions" area, above.
  2. Direct your browser to the NCBI website, at http://www.ncbi.nlm.nih.gov/.
    1. The NCBI website has links to all sorts of information, including a database of scientific articles related to medicine and biology (PubMed), and databases of protein and nucleic acid sequences.
  3. In the "search" box, select "Protein," near the top of the drop-down box.
  4. In the "for" box, type or copy and paste NP_536853. This is the accession number for the human mitochondrial protein NADH dehydrogenase subunit 5 (NADH5). An accession number is a unique identifier for a particular sequence.
    1. Other proteins can be used as well.
  5. Click on the link for NP_536853.
    1. The page for NP_536853 contains information about the human NADH5 protein, including the protein sequence.
  6. Return to the NCBI homepage at http://www.ncbi.nlm.nih.gov/.
  7. Click on the "BLAST" link, near the top of the page.
  8. Click on the "protein blast" link, under "Basic BLAST."
  9. In the box "Enter query sequence/ accession number" type or copy and paste NP_536853.
  10. In the box "Choose search set:"
    1. For "Database," select "Reference Proteins."
    2. For "Organisms," type " primates (taxid:9443)." This restricts the database search to primate sequences, which simplifies the output.
    3. For "Entrez query," type or copy and paste "595:605 [slen]." This restricts the output to full-length sequences, which can be compared to each other.
  11. For "Algorithm," select "blastp."
    1. This is for a protein query vs. a protein database.
  12. Click on "Show results in a new window."
  13. Now click on "BLAST."
  14. When the BLAST results page appears, look at the alignments of the proteins.
    1. Each alignment compares the human NADH5 protein against the NADH protein in a primate.
    2. How many full-length primate NADH5 proteins are in the database? This is the number of hits you got back in the BLAST search.
  15. Make a data table of the BLAST results. Include the following: scientific names for the animals, common names, and BLAST percent identity.
    1. In the "Alignment" section, percent identity is given above each alignment; for example, "Identities = 563/564 (99%)."
    2. This is the number of amino acids that are identical in each protein.
  16. Add a column for BLAST "% difference." You can calculate this as 100% identical (for example, 99% identity = 1% difference).
  17. Add a column for time, in millions of years ago (mya), that selected lines split from the human lineage. Use these numbers, from Richard Dawkins' book, The Ancestor's Tale:
    1. Neanderthal: 0.4 mya
    2. Chimpanzee: 6 mya
    3. Gorillas: 7 mya
    4. Orangutans: 14 mya
    5. Gibbons: 18 mya
    6. Colobus monkeys, macaques, etc: 25 mya
    7. Lemurs, tarsiers, etc, 63 mya
  18. Calculate the rate of mutation for NADH5 in the primate line.
    1. Divide the percent difference by millions of years ago for the time in the past at which the lineages split.
    2. Calculate the average mutation rate.
  19. Based on your results using the human NADH5 protein, list the primates from most- to least-related to man.
  20. Next, generate a tree that shows how the different species are related to each other.
  21. Click on the button "Distance tree of results."
    1. This will launch the "Distance tree widget."
    2. Click on the "slanted" tab to make a tree. Explore the other options for making trees.
  22. To copy the tree for your notes, take a screen shot and paste it into a graphics program, like Adobe Photoshop, Microsoft PowerPoint, etc.
    1. For a compelling figure, you might redraw the tree and add pictures, the times that particular lineages split, common animal names, etc.
  23. To simplify the tree, go back to the page with the alignments and select which organisms to include.
    1. First, click on "Deselect all."
    2. Then click on just the organisms that you want to focus on. For example, you could choose one species to represent larger groups, such as Homo sapiens, Homo sapiens neanderthalensis, Pan troglodytes (chimpanzee), Gorilla gorilla gorilla, Hylobates lar (gibbon), Pongo (orangutan), etc.

Variations

Credits

David Whyte, PhD, Science Buddies


Last edit date: 2008-12-04 10:30:00


Career Focus

If you like this project, you might enjoy exploring careers in Genetics & Genomics.

Genetic Counselor
Many decisions regarding a person's health depend on knowing the patient's genetic risk of having a disease. Genetic counselors help assess those risks, explain them to patients, and counsel individuals and families about their options.
 



Join Science Buddies

Become a Science Buddies member! It's free! As a member you will be the first to receive our new and innovative project ideas, news about upcoming science competitions, science fair tips, and information on other science related initiatives.


Support Science Buddies

If this website has helped you, won't you consider a small gift so we may continue developing resources to help teachers and students?

 



 

Science Buddies gratefully acknowledges its Presenting Sponsor
 
It's free! As a member you will be the first to receive our new and innovative project ideas, news about upcoming science competitions, science fair tips, and information on other science related initiatives.


Science Fair Project Home      Our Sponsors      Partners      About Us      Volunteer      Donate      Contact Us      Research Grants & Outreach      Site Map

Science Fair Project Ideas      Science Fair Project Guide      Ask an Expert      Blog      Teacher Resources      Parent Resources      Student Resources      Science Careers      Join Science Buddies     


Privacy Policy Science Buddies

Copyright © 2002-2010 Kenneth Lafferty Hess Family Charitable Foundation. All rights reserved.
Reproduction of material from this website without written permission is strictly prohibited.
Use of this site constitutes acceptance of our Terms and Conditions of Fair Use.