Related Links

  • Science Fair Project Guide

Project Summary

Difficulty  9  –  10 
Time required Average (about one week)
Prerequisites A good knowledge of basic concepts in genetics and good computer database searching skills
Material Availability Readily available
Cost Very Low (under $20)
Safety No issues

Donate to Science Buddies

Abstract

Find out the real explanation for why your parents are so weird! Here is a project that lets you explore "the net" to find out why your "DNA blueprint" is so important to health and disease. In this project you will use methods that bioinformatics and biotech scientists perform on a daily basis to decipher the human genome in their efforts to diagnose and treat genetic diseases.

Objective

In this project, you will use publicly available web-based bioinformatics resources to search for a disease of interest and SNPs, single-nucleotide polymorphisms, associated with that disease.

Introduction

Biomedical Informatics is a broad discipline that encompasses bioinformatics and computational biology. Online bioinformatics resources, such as the database Online Mendelian Inheritance in Man, or OMIM, allow bioscience researchers to search up-to-date information on human genes, genetic traits and disorders. This project will take you through the step-by-step process of researching a specific disease of interest and how a single base change in one's DNA could be associated with that disease. This project should take approximately one week to complete.

Scientists are on a constant quest to improve and lengthen the quality of human life. DNA, the blueprint of life, has hidden clues for this quest. Identifying these clues is analogous to the cliché often heard “finding a needle in a haystack.” The “haystack” for this project is the public bioinformatics databases, such as OMIM, containing a multitude of genetic information and the “needle” is the SNP, (pronounced snip), single-nucleotide polymorphism.

A Single Nucleotide Polymorphism, or SNP, is a small genetic change, or variation, that can occur within a person's DNA sequence. SNPs represent the most frequent type of DNA variation found in the human population. These variations can be used to study and track inheritance in families. Despite the fact that more than 99% of human DNA sequences are the same across the population, small variations in DNA sequence, such as SNPs, can have a major impact on how humans respond to disease, environmental factors, and medicines. Interestingly, SNPs are evolutionarily stable. This means they don't change much from generation to generation. That being said SNPs are of great interest and value for biomedical research. Development pharmaceutical products or medical diagnostics are being influenced by SNP data.

This cartoon depiction of a SNP (Wikipedia contributors, n.d.) shows how DNA strand 1 differs from DNA strand 2 at a single base-pair location (a C/T polymorphism):

Genomics Science Project Idea - SNP
Here you can see a single nucleotide polymorphism, or SNP, that results in a small genetic change between sequence 1 and 2 (Wikipedia contributors, n.d.).

DNA, deoxyribonucleic acid, supplies a set of instructions for each living organism. Every cell in each organism contains an entire copy of DNA. Genes are sets of nucleotide sequences encoded and stored in DNA. Each gene encodes for a certain protein. DNA is transcribed into mRNA, messenger ribonucleic acid, and then translated into protein. Proteins are defined by amino acid sequences. A single amino acid is encoded by three nucleotides called a codon. There are 64 possible codons and only 20 amino acids. Since there are only 20 amino acids, multiple codons encode for the same amino acid. This is known as degeneracy of the genetic code. Because of this degeneracy in the genetic code some SNPs do not result in changes in the protein sequence. This is called a synonymous change. If a SNP results in a change in the protein sequence this is termed a non-synonymous change. Finding single nucleotide changes in the human genome may be like “finding a needle in a haystack,” however, bioinformatics resources make it possible to do just that.

Genomics Science Project Idea - codon table
This codon table shows how the genetic code is converted into a sequence of amino acids that make up a protein (image courtesy of Schering-Plough).

Variations in the DNA sequences of humans can affect how humans develop diseases and respond to medicines. While SNPs do not cause disease, they can help determine the likelihood that someone will develop a particular disease. Computational Biology, the actual process of analyzing and interpreting data, combined with Bioinformatics is used to for the technology called database-mining. With the completion of The Human Genome Project in April of 2003, vast amounts of genomic data have been made available for database-mining, the process of generating hypotheses regarding function or structure of a gene or protein of interest by identifying similar or dissimilar sequences in DNA. The International HapMap Project is designed to provide information to researchers with the HapMap, a catalog of common genetic variants that occur in human beings as well as a description of the variants and where they are located in our DNA. This catalog provided information that researchers need to link genetic variants to the risk for specific illnesses.

How do scientists utilize computers for mining of biological data to study genetics and disease association? In this project, you will utilize the World Wide Web to access free bioinformatics resources to search for a disease of interest, identify SNP(s) associated with that disease, and make a hypothesis regarding the effect of the SNP(s). These public databases provide a vault of information that can be searched in many ways. We have provided one example; however, you may use your own method. With the availability of millions of SNPs, scientists now believe that exciting advances in medicine are in our near future. It is now your turn to mine databases for SNPs and make a hypothesis on the outcome on the human phenotype based on your research.

Terms, Concepts and Questions to Start Background Research

To do this type of experiment you should know what the following terms mean. Have an adult help you search the internet, or take you to your local library to find out more!

Questions

This project is based on research that provides often inconclusive but strongly correlative evidence that associates SNPs to risk of disease. The notion is that, with the availability of information about the complete human genome, we would be able to predict the risk of an individual contracting a disease or identify individuals with specific qualitative traits (‘smart’ genes, ‘criminal’ genes, ‘intuition’ genes etc.). One outcome of such advance would be personalized medicine where it is possible to treat each individual with a custom-made drug or even perform preventive therapy. However, on the flip side, ethical concerns need to be addressed with respect to individual human rights (The Minority Report movie debate).

Here are some questions that you will be thinking about while doing this project:

Bibliography

Materials and Equipment

Experimental Procedure

Searching for your disease

The OMIM database in NCBI is a catalog of human genes and genetic disorders authored and edited by Dr. Victor A. McKusick and his colleagues at Johns Hopkins and elsewhere, and developed for the World Wide Web by NCBI, the National Center for Biotechnology Information. The database contains textual information and references. It also contains copious links to MEDLINE and sequence records in the Entrez system, and links to additional related resources at NCBI and elsewhere.

  1. Search for the disease of your choice in OMIM at http://www.ncbi.nlm.nih.gov/sites/entrez?db=OMIM

    Genomics Science Project  Idea - NCBI

  2. Listed will be the genes associated with your disease. Choose one to further investigate:

    Genomics Project Idea - NCBI

    Genomics Project Idea - NCBI

  3. Click on Links and Choose GeneView in dbSNP:

    Genomics Project Idea - NCBI

  4. This screen lists all of the SNP’s associated with the gene you choose above:

    Genomics Project idea - NCBI
    Genomics Project idea - NCBI
    1. Region- Location on gene where SNP is found (Make sure you pick an Exon since these are the parts that code for protein.)
    2. Contig Position
    3. mRNA Position- Coordinates for where the SNP is found in the mRNA.
    4. dbSNP rs# cluster id- unique identifier in the database.
    5. Heterozygosity- Measure of genetic variation in a population.
    6. Validation-Other sources of information supporting this SNP.
    7. 3D- SNP mapped on a 3D structure
    8. OMIM-Links you back to OMIM page
    9. Function-The effect of the SNP on the protein sequence.
    10. dbSNP allele- What amino acid is effected by the SNP.
    11. Protein residue- Amino acid change
    12. Amino acid position- The coordinates within the protein sequence.

  5. Choose a SNP in an Exon that is missense. (Results in a change in amino acid.)

    Genomics Project idea - NCBI

    NCBI

  6. Click on your SNP of interest. This leads you to a screen with information on the SNP such as sequence, location of the mutation and population diversity.

    NCBI

    NCBI

    NCBI

  7. See if your SNP has an impact on protein structure and function by going to www.snps3d.org

    NCBI

    NCBI

    NCBI

  8. If your SNP is red in the model that SNP is predicted to have a damaging effect on the protein. If your SNP is yellow it is not predicted to be damaging, the SNP would be harmless.

    NCBI3

  9. Now try to establish a sequence-structure-function relationship for your SNP. First, search for the GENE in OMIM:

    NCBI

  10. Select your gene from the results:

    NCBI

    NCBI

  11. Follow the P link (Protein) on the left sidebar under the Entrez Gene Category to get to the protein sequence:

    NCBI

  12. Follow the link for the protein:

    NCBI

  13. Change the display to FASTA under the display dropdown menu:

    NCBI

  14. Copy the protein sequence. Go to the SMART site http://smart.embl-heidelberg.de/smart/set_mode.cgi?NORMAL=1 then paste the protein sequence into the Sequence box and press Sequence SMART:

    NCBI

    NCBI

  15. Rolling your mouse over a domain will show you more information about the protein domain. Find the domain where your mutation is located. In this case, it is the first AAA domain:

    NCBI

  16. Read the description of the domain and the Interpro abstract. Assuming that the SNP results in a mutation in this domain, what could be the biochemical effects of this mutation? How might these effects relate to the disease?
  17. Look at Pubmed for articles on effects of the mutation:

    NCBI

    NCBI

Variations

Credits

Science Buddies would like to thank the following volunteers from Schering-Plough who contributed towards writing this project:

Edited by Sara Agee, Ph.D., Science Buddies


Last edit date: 2008-07-24 20:00:00


Career Focus

If you like this project, you might enjoy exploring careers in Genetics & Genomics.

Genetic Counselor
Many decisions regarding a person's health depend on knowing the patient's genetic risk of having a disease. Genetic counselors help assess those risks, explain them to patients, and counsel individuals and families about their options.
 



Join Science Buddies

Become a Science Buddies member! It's free! As a member you will be the first to receive our new and innovative project ideas, news about upcoming science competitions, science fair tips, and information on other science related initiatives.


Support Science Buddies

If this website has helped you, won't you consider a small gift so we may continue developing resources to help teachers and students?

 



 

Science Buddies gratefully acknowledges its Presenting Sponsor
 
It's free! As a member you will be the first to receive our new and innovative project ideas, news about upcoming science competitions, science fair tips, and information on other science related initiatives.


Science Fair Project Home      Our Sponsors      Partners      About Us      Volunteer      Donate      Contact Us      Research Grants & Outreach      Site Map

Science Fair Project Ideas      Science Fair Project Guide      Ask an Expert      Blog      Teacher Resources      Parent Resources      Student Resources      Science Careers      Join Science Buddies     


Privacy Policy Science Buddies

Copyright © 2002-2010 Kenneth Lafferty Hess Family Charitable Foundation. All rights reserved.
Reproduction of material from this website without written permission is strictly prohibited.
Use of this site constitutes acceptance of our Terms and Conditions of Fair Use.