Hitting the Target: The Importance of Making Sure a Drug's Aim Is True
|Areas of Science||
Genetics & Genomics
Pandemics – COVID-19
|Time Required||Short (2-5 days)|
|Prerequisites||Basic understanding of what viruses, drugs, genes, and proteins are.|
|Material Availability||Readily available|
|Cost||Very Low (under $20)|
AbstractScientists recently found that some small drugs can stop infection by the deadly Ebola virus in its tracks. Lab researchers found that these drugs bind to a protein that the Ebola virus uses to enter our cells, and this is how infection is prevented. However, this also means that the bound protein no longer functions in our cells. How might these drugs accidentally disrupt important biological processes in our bodies? What other proteins might these drugs bind to? In this science project, you will explore how drugs that may someday be used to treat deadly diseases are tested to make sure that they do not unintentionally damage our bodies.
Determine how untested drugs may affect important biological processes that they were not intended to.
Teisha Rowland, PhD, Science Buddies
Cite This PageGeneral citation information is provided here. Be sure to check the formatting, including capitalization, for the method you are using and update your citation, as needed.
Last edit date: 2020-11-20
Recently, a breakthrough was made in our understanding of how the Ebola virus infects people. Many outbreaks of the Ebola virus have occurred in Africa, and infection is often deadly. You can see what the virus looks like in Figure 1. In August 2011, two different groups of researchers reported that in order to enter our cells and infect our bodies, the Ebola virus must bind to a protein called Niemann-Pick C1 ("NPC1" for short). To figure this out, one of the research groups (Carette et al.) first took a large number of cells and randomly disrupted different genes in the different cells. Eventually, the number of genes disrupted, or mutated, in all of the different cells combined was in the millions. The researchers exposed these mutated cells to the Ebola virus and then checked to see if any cells were resistant to infection. They analyzed the resistant cells to see what genes were mutated in these specific cells. They realized that the genes, especially NPC1, that were mutated in the resistant cells probably play key roles in infection. This is how the researchers found that NPC1 is essential for the Ebola virus to infect cells.
Figure 1. This is a picture of the deadly Ebola virus, taken with a very high magnification microscope (a transmission electron microscope). Recent research found that it can infect our cells using a protein called Niemann-Pick C1, which has become the target of anti-viral drugs.
NPC1 is found along the membrane of endosomes, which are small compartments in our cells that transport molecules from the outside of the cell to the inside. Inside the cell, molecules in endosomes are carried to lysosomes, which are compartments that break down molecules and cell debris. Normally, NPC1 is important for transporting cholesterol in cells, but the Ebola virus uses NPC1 to gain entry into the endosomes and causes the endosomes to burst, releasing the virus into the cell.
One of the research groups (Cóté et al) already developed two anti-viral drugs that can block infection by binding the NPC1 protein. To find these successful drugs, the researchers first tested a large number of small molecules on cells exposed to the Ebola virus to see if any of the small molecules could prevent infection. Sure enough, one small molecule, labeled 3.0, stopped infection. The researchers made 50 small molecules similar to this one and found that one of these 50, labeled 3.47, worked even better at preventing Ebola infection. (3.0 is technically a benzylpiperazine adamantine diamide molecule, and 3.47 is just like this molecule but has a methoxycarbonyl benzyl group added to it.)
While the researchers (Cóté et al) found that their drugs bind NPC1, and that this can block Ebola infection, extensive pharmaceutical testing still needs to be done before doctors can use these drugs to fight Ebola infection in people. For example, it needs to be determined whether the drugs bind other proteins that are similar to NPC1. Additionally, because researchers had the goal to study how the drugs prevent infection, using only cells grown in a lab, they did not find out how the drugs affect the overall health and function of the cells. For example, the drugs might interfere with important signaling pathways (biochemical pathways). Or the drugs may even affect the body of an animal as a whole. In summary, currently researchers do not know whether these will be good clinical drugs. In this science project, using bioinformatics tools (computer tools used to explore biological processes), you will explore how these Ebola virus drugs could bind non-target proteins, that is, proteins other than NPC1, and how disrupting the normal function of NPC1 and these non-target proteins could interfere with normal cellular and bodily functions. The questions that you will tackle here are exactly the ones researchers will be addressing, with the only difference being that researchers will be able to do both the bioinformatics work, what is covered in this science project, and testing of the resulting hypothesis in the lab.
Terms and Concepts
- Ebola virus
- Small molecules
- Signaling pathways
- Expression of genes
- How did the researchers determine that NPC1 is necessary for the Ebola virus to infect cells?
- What does NPC1 normally do in the cell?
- How does the Ebola virus take advantage of the normal function of NPC1 to gain entry into the cell?
- How specific do you think newly developed drugs are? Do you think they could bind many non-target proteins?
- Knowing what you do about the normal function of NPC1, what important cellular processes do you think the drugs might disrupt?
- Why is it important to test new drugs in animals before using them in humans?
To do this science project you will need to use these databases:
- Amazonia! (n.d.). Explore the jungle of microarray results. Retrieved September 9, 2011.
- Kyoto Encyclopedia of Genes and Genomes (KEGG). (n.d.). KEGG Pathway Database. Retrieved September 9, 2011.
- NCBI. (n.d.). BLAST: Basic Local Alignment Search Tool. Retrieved September 9, 2011.
- NCBI. (n.d.). Gene. Retrieved September 9, 2011.
These resources are a good place to start gathering information about drugs, the Ebola virus, and NPC1:
- Centers for Disease Control and Prevention. (n.d.). Ebola Hemorrhagic Fever. Retrieved September 9, 2011.
- Genetics Home Reference (n.d.). NPC1. Retrieved September 9, 2011.
- The Naked Scientists. (2011, August 24). Ebola virus blocked. Retrieved September 7, 2011.
- Sullivan, N., Yang, Z., and Nabel, G. (2003, September). Ebola Virus Pathogenesis: Implications for Vaccines and Therapies. Journal of Virology. Vol 77, No 18, 9733-9737. Retrieved September 9, 2011.
These are the two recent studies that discovered that the Ebola virus enters cells using the NPC1 protein and developed small molecule drugs to bind NPC1 and block infection:
- Carette, J. et al. (2011, August 24). Ebola virus entry requires the cholesterol transporter Niemann-Pick C1. Nature. Published online. Retrieved September 7, 2011.
- Cóté, M. et al. (2011, August 24). Small molecule inhibitors reveal Niemann-Pick C1 is essential for Ebola virus infection. Nature. Published online. Retrieved September 7, 2011.
Materials and Equipment
- Computer with an Internet connection
- Lab notebook
Identifying Non-Target Proteins
How accurately does a drug bind to its target protein (the protein it is supposed to bind to)? Are there other proteins the drug might bind to that it should not? In this part of the science project you will be looking at what non-target proteins the Ebola drugs might interact with, based on how similar the target protein is to other proteins.
- Learn as much as you can about the Niemann-Pick CI gene and protein. You can do this using the
NCBI Gene Database.
- Use the Science Buddies NCBI Gene & SNP Tutorial to help you navigate the database. You will use the Niemann-Pick C1 gene name, NPC1, for your searches. Make sure to choose the top result that says "NPC1" and is in humans (Homo sapiens).
- What does the database tell you about the NPC1 gene and protein? What disease do defects in this gene cause? What part(s) of the cell is the NPC1 protein located in?
- Hint: You may also want to search through other databases, such as by using the Genetics Home Reference Tutorial, to find out more about NPC1 and the disease associated with it.
- Retrieve the amino acid sequence of the NPC1 protein.
- From within the NCBI Gene page about NPC1, navigate to the "Links" sidebar on the right-hand side of the page. Click on the "Protein" link.
- Click on the top result for the NPC1 protein, as shown in Figure 2, circled in green.
The search results on the webpage ncbi.nlm.nih.gov returns a list of proteins and variations of those proteins. The Niemann-Pick C1 protein has variations that may cause some to click on the wrong link. Each search result has a unique number under the link so the correct link can be identified. The number for this link is GI: 255652944.
Figure 2. Searching for the amino acid sequence for the Niemann-Pick C1 protein will generate many results. Many of these are variants of the NPC1 protein. Pick the top result that is in humans ([Homo sapiens]), circled in green.
- On this page, scroll down to the bottom, where the amino acid sequence for the protein is given. Copy the entire amino acid sequence (including the numbers at the start of each line), as shown in Figure 3, circled in green. It is a good idea to save this sequence in a word or text file in case you need it later.
Near the bottom of the NPC1 protein page on the NCBI webpage there is an amino acid sequence listed.
Figure 3. The amino acid sequence of the protein will be located at the bottom of the protein page for the NPC1 protein. Scroll down to the bottom and select and copy the amino acid sequence, circled in green, so that you can compare this sequence to other protein sequences in the following steps of the Experimental Procedure.
- Next, use the NCBI Basic Local Alignment Search Tool (BLAST) to identify any other human proteins that have a similar amino acid sequence. If there are any, they are candidates for proteins that the Ebola drugs might also bind to and interfere with.
- Under "BLAST Assembled RefSeq Genomes" click on "Human," as shown in Figure 4, circled in green.
Screenshot of the BLAST tool webpage includes a link for human genomes near the top of the page. It is the first link in the list of links under the heading 'BLAST Assembled RefSeq Genomes'
Figure 4. The NCBI BLAST website allows you to submit an amino acid sequence of a protein or a nucleotide (DNA or RNA) sequence of a gene and see what proteins, or genes, share similar sequences. To search the human genome, click on "Human," circled in green. In this science project we are interested in looking for proteins with similar amino acid sequences to NPC1.
- On the top of the page, click the "blastp" tab, as shown in Figure 5, circled in red.
- Paste the amino acid sequence you copied before in the box under "Enter Query Sequence", as shown in Figure 5, circled in green. Click the blue "BLAST" button on the bottom of the page, shown in Figure 5, circled in orange.
In the blastp tab in the BLAST tool there is a large textbox at the top of left of the page where amino acid sequences can be entered to search for related sequences. In the center of the page a drop down menu allows you to search through different databases.
Figure 5. Using the NCBI BLAST tool, you can search for proteins with similar amino acids by clicking on the "blastp" tab, circled in red. In this tab, enter the amino acid sequence you want to search for in the box circled in green. Hit the "BLAST" button, circled in orange.
- The next page may take several seconds to load. After a short wait, a page that looks like Figure 6 should appear. This page gives you a lot of information on how the amino acid sequence you submitted matches other sequences in the database.
- At the top of this results page, in the blue section, click on the "Distance tree of results" link, as shown in Figure 6, circled in green.
The results page in the BLAST tool on the NCBI webpage shows a graphic summary of protein sequences that match a search term. A number line in the center of the page can be clicked to view similar protein domains or regions. Above the graphic summary are links to additional reports such as a search summary, taxonomy report, distance tree of results, and multiple alignments.
Figure 6. Searching for similar amino acid sequences on the NCBI BLAST website will probably generate many results that match your sequence to varying degrees. Clicking on the "Distance tree of results" link, circled in green, will take you to a helpful visual representation of your results. For a visual representation of different protein domains, or regions, that your protein shares with other proteins, click on the image circled in yellow.
- On the distance tree of results page, hover over each green arrow at the end of the tree branches, and click on the "Expand/Collapse" link, as shown in Figure 7, circled in red, until you completely expand all of the branches, as shown in Figure 8.
The BLAST tool on the NCBI webpage can generate distance trees that map relationships between proteins based on shared sequences. Arrows at the end of branches are color coded to match a key on the right side of the page that distinguish the species the protein originates from. Clicking on the arrows also brings up a menu with additional options such as show subtree, expand/collapse, re-root and show alignment.
Figure 7. The "Distance tree of results" is a visual representation of how different BLAST results are related to each other, based on the sequences they share. The amino acid sequence you submitted is labeled "unnamed protein product" and is highlighted in yellow. Hovering over a green arrow at the end of each branch and clicking the "Expand/Collapse" link, circled in red, will expand the branches to show the names of the proteins in each branch, as shown in Figure 8.
The BLAST tool on the NCBI webpage can generate distance trees that map relationship between proteins based on shared sequences. Clicking on an arrow at the end of a branch allows results to be expanded and will list the names of proteins related to a specific amino acid.
Figure 8. By expanding the branches in the "Distance tree of results," originally shown in Figure 7, you will be able to see all of the names of the proteins related to your amino acid sequence, which is labeled "unnamed protein product" and highlighted in yellow.
- The amino acid sequence you submitted is labeled "unnamed protein product" and is highlighted yellow. Trace this branch to the left, toward the trunk of the tree, and look at how it connects to the other branches. The less branch and line distance between two proteins, the more closely related they are. For example, two proteins that are right next to each other on the same vertical branch are more closely related than two proteins that are separated by a horizontal branch, or multiple horizontal branches. The more branches/lines you have to trace to reach a node that two proteins have in common on the left side, or the more branches you have to trace to travel between the two proteins, the more distantly related the proteins are. Besides the Niemann-Pick C1 protein, which proteins are most similar to your sequence? Which proteins are the most unrelated?
- Find the two proteins that are the most closely related to your sequence, besides NPC1. Ignore whether they are "isoforms," and only look at the name before the word "isoform." For example, "protein patched homolog 1 isoform S [Homo sapiens]" is the protein called patched homolog 1, and all the isoforms of this protein are just the same protein for the purposes of this science project.
- Write these two protein names in your lab notebook.
- Just how similar are these two proteins to the NPC1 protein? To find out, go back to the NCBI BLAST results page, shown in Figure 6, and scroll down to the "Descriptions" section, as shown in Figure 9. Locate the names of your two proteins under "Description," circled in green in Figure 9, and see what their "Query coverage" is, which is circled in red in Figure 9.
- To see exactly how the amino acid sequences match each other, click on the "Max score" for any result, which is circled in yellow in Figure 9. In this section, the NPC1 amino acid sequence you searched for is shown as the "Query" and the protein it is being compared to is the "Sbjct." Amino acids that the two sequences have in common are listed by their letter between where these two sequences are aligned, while a "+" between the aligned sequences indicate that the two amino acids are similar. Dashes in either sequence indicate a gap in the match.
The results page in the BLAST tool on the NCBI webpage shows a list of proteins that are similar to the one searched. All proteins listed in the table have information on accession, description, max score, total score, % match of the query searched, F-value and links.
Figure 9. To see how similar proteins are to the amino acid sequence you submitted, scroll down to the "Descriptions" section of your NCBI BLAST results page. The "Description" column, circled in green, lists the names of the different matching proteins. For each matching protein you can see what percentage of amino acids it shares with your submitted sequence under the "Query coverage" column, circled in red. To see the match, click on the link in the "Max score" column, circled in yellow, for each protein.
- At the top of the NCBI BLAST results page, in the "Graphic Summary" click on the image immediately below where it says "Putative conserved domains have been detected, click on the image below for detailed results," as shown in Figure 6, circled in yellow.
- This page shows the amino acid sequence you submitted at the top and below it, the other amino acid sequences that matched it (in grey rectangles). Hover your mouse over the sequence matches to learn more about them.
- Where do the "Patched" proteins match your sequence? (The numbers given correspond to the amino acid sequence of the protein.) Researchers do not know where the drugs bind the NPC1 protein, but where could the drugs bind and probably not also bind the Patched proteins?
- Overall, how likely do you think it is that a drug that binds the NPC1 protein would also bind the Patched proteins?
- Under "BLAST Assembled RefSeq Genomes" click on "Human," as shown in Figure 4, circled in green.
- You have read about the NPC1 protein, but what do the two similar proteins you picked in step 3g do? Go back to the NCBI Gene website, which you explored in step 1, and instead of searching for NPC1, search for each of these two similar proteins (enter their names instead of their abbreviations, for example "patched homolog 1"). Click on the top search result that is a human gene.
- Read about the genes in the "Summary" section. What do they have in common with NPC1, and how are they different? Based on their function, how damaging do you think it might be if a drug blocked the proteins encoded by these genes?
Investigating Complicated Interactions
Next, you will investigate how even if a drug only binds its target protein, it may still disrupt delicate biological processes. To do this we will look at the signaling pathways (biochemical pathways) that NPC1 and related proteins are involved in. These intricate pathways may be disrupted if these proteins cannot function.
- You can learn about the signaling pathways that NPC1 and related proteins are involved in by using the Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway Database.
- In the search box under "Enter keywords," enter NPC1 and click "Go."
- Click on the "Thumbnail Image" for each pathway result.
- Look at the pathways for NPC1. NPC1 should be in red on the pathway diagrams, but if you cannot locate a particular protein on a pathway diagram, search for its name in the search box at the top of the page.
- Based on the pathway diagrams, how important does NPC1 seem? What cellular processes is it involved in, and is it involved in many or just a few? What downstream events (events that NPC1 leads to) would be affected if NPC1 could not function?
- To learn more about a protein in the pathway, click on its name (inside a box).
- Repeat step 1 using the two proteins you identified in the section titled "Identifying Non-Target Proteins," in step 3g.
- To find out more about what proteins NPC1 and potential non-target proteins interact with, go back to the NCBI Gene website for these proteins.
- As a reminder, in the section titled "Identifying Non-Target Proteins," you already looked at the NCBI Gene website for NPC1 (step 1) and for the two non-target proteins (step 4).
- Once on their NCBI Gene pages, for each gene of interest click on the "Interactions" link, in the "Table of contents" sidebar on the right-hand side of the page.
- The column labeled "Other Gene" lists the names of proteins that interact with the protein of interest. Click on an interacting protein's name and read its "Summary."
- After looking at the functions of the interacting proteins, what other biological processes do you think may be disrupted if NPC1 and/or the two non-target proteins are disrupted?
What Parts of the Body Might the Drugs Affect?
You now probably have a good idea of what biological processes may be disrupted by the drugs targeting NPC1, but what areas of the body may be particularly damaged, and what kinds of people should be most careful to avoid drugs like these? To answer this, we will look at gene expression data of NPC1 and other genes that encode for proteins that may be disrupted by the drug.
- You can learn about the expression of NPC1 and other genes in different human tissues by using amazonia!, which is a database of microarray data. A microarray is a tool that allows researchers to look at a certain group of cells or tissues and see what genes are being expressed (turned in to mRNA and thus likely into protein).
- At the bottom of the main amazonia! page, there is a search box next to where it says "QuickStart: Explore the expression profile of any gene." In the search box enter the gene's abbreviated name (such as NPC1) and click "Go!"
- Under the section titled "Result(s) for Gene & Aliases" click on the blue link for "NPC1."
- This will take you to a page with microarray data on the gene. There are two graphs of data on this page for NPC1. Click on a graph, as shown in Figure 10, circled in red.
An information page on the amazonia.transcriptome.eu webpage of the protein NPC1 has a chart on the levels of gene expression for cell types. This chart is near the center of the page and shows cells that express a high amount of NPC1.
Figure 10. Amazonia! is a microarray database that has a variety of expression data for a given gene. When looking at a page with data on a gene, clicking on a graph, circled in red, will enlarge the graph and allow you to see how much of that protein is made by different cells and tissues in the body.
- Click on the graph image to enlarge it. On the x-axis, there are cell types listed. ("HESC" and "HIPSC" are stem cells, and you can ignore them for this science project.) For every cell type, the levels of gene expression are shown on the y-axis.
- Which cell types express the highest amounts of NPC1, and which express the lowest? Are there any cell types that do not express it at all?
- If a person took a drug that binds NPC1, what tissues and organs may be most affected? What kinds of people should most avoid taking the drug?
- Repeat step 1 using the two proteins you identified in the section titled "Identifying Non-Target Proteins," in step 3g.
If you like this project, you might enjoy exploring these related careers:
- Drugs can be made to target specific tissues. Search online to find out what tissues the Ebola virus infects the most. A good place to start would be to read the paper titled "Ebola Virus Pathogenesis: Implications for Vaccines and Therapies" that is referenced in the Bibliography above. If anti-viral drugs could be made to only target the infected tissues, would this still be very damaging to the patient, based on the expression data you found in the Experimental Procedure section titled "What Parts of the Body Might the Drugs Affect?"
- Defects in the NPC1 gene are actually associated with a genetic disease. To read more about it, visit this website: http://ghr.nlm.nih.gov/gene/NPC1. How does this agree, or disagree, with what you have learned about the function of NPC1? Do you think taking Ebola virus drugs could cause similar symptoms? Why or why not? Make sure to tie your reasons to the data you found about the normal and disease functions of NPC1.
- You can investigate the 3D structure of the NPC1 protein using chemistry modeling programs online. Follow the steps below to look up the NPC1 protein and see how it binds cholesterol. Although it is not known where exactly the Ebola drugs bind the NPC1 protein, by looking at the 3D structure of NPC1 and seeing where it binds cholesterol, where do you think a drug could bind the NPC1 protein and still allow the protein to carry out its important role in transporting cholesterol? What areas could the drugs bind that would be the most damaging to the normal function of NPC1?
- First go to the RCSB Protein Data Bank (PDB), located here: http://www.rcsb.org/pdb
- In the search box at the top, next to "e.g., PDB ID, molecule name, author," search for "NPC1."
- Look at the different results for NPC1, specifically looking at what is listed in the "Citations" tab. Many of these results are the 3D structure of NPC1 bound to cholesterol molecules. Click on a result.
- On the right side of this page click on "Simple Viewer." Save the file and open it. You will need Java to run this program.
- Here you will see a 3D model of NPC1, possibly interacting with cholesterol. You can rotate the image using left click and drag. You can zoom by holding shift while using left click and drag.
- If shown, where is cholesterol binding NPC1?
- Hover your mouse over different parts of the protein to see which amino acid that part of the protein is made of. This is displayed in the bottom left corner of the window.
- For example, "Reside: Ser 852 Chain: A Confirmation: Helix" means that you are hovering over a serine amino acid that is at position 852 in the protein. This amino acid is part of an alpha helix in the protein.
- Can you see which amino acids are interacting with cholesterol the most, and can you determine what kind of bonds are being formed based on your knowledge of the structure of NPC1?
Recent Feedback Submissions
|Sort by Date||Sort by User Name|
What was the most important thing you learned?
Scientists recently found that some small drugs can stop infection by the deadly Ebola virus in its tracks. Lab researchers found that these drugs bind to a protein that the Ebola virus uses to enter our cells, and this is how infection is prevented. However, this also means that the bound protein no longer functions in our cells. How might these drugs accidentally disrupt important biological processes in our bodies? What other proteins might these drugs bind to? In this science project, you will explore how drugs that may someday be used to treat deadly diseases are tested to make sure that they do not unintentionally damage our bodies.
What problems did you encounter?
Can you suggest any improvements or ideas?
Science Buddies materials are free for everyone to use, thanks to the support of our sponsors. What would you tell our sponsors about how Science Buddies helped you with your project?
Overall, how would you rate the quality of this project?
What is your enthusiasm for science after doing your project?
Compared to a typical science class, please tell us how much you learned doing this project.
|Do you agree?||Report Inappropriate Comment|
Ask an ExpertThe Ask an Expert Forum is intended to be a place where students can go to find answers to science questions that they have been unable to find using other resources. If you have specific questions about your science fair project or science fair, our team of volunteer scientists can help. Our Experts won't do the work for you, but they will make suggestions, offer guidance, and help you troubleshoot.
Ask an Expert
- Science Fair Project Guide
- Other Ideas Like This
- Medical Biotechnology Project Ideas
- Genetics & Genomics Project Ideas
- Big Data Project Ideas
- Pandemics – COVID-19 Project Ideas
- My Favorites
- Genetics Home Reference Tutorial
- Science Buddies: NCBI Gene & SNP Tutorial
- PharmGKB Pharmacogenomics Knowledge Base Tutorial
Looking for more science fun?
Try one of our science activities for quick, anytime science explorations. The perfect thing to liven up a rainy day, school vacation, or moment of boredom.Find an Activity
Explore Our Science Videos
Physics and Chemistry of an Explosion Science Fair Project Idea
How to Build an ArtBot
Flower Dissection - STEM Activity