Bioinformatics Unit 5 Project

SSU Home | SSU Biology | CourseInfo | Forum | Home

Glossary   |   Tutorials   |   SelfTests   |   Objectives   |   Articles


UNIT 5 PROJECT DESCRIPTION

Path:  Home / Unit 5 Topic Page / Molecular Modeling Project


The problem of predicting how a protein sequence folds into its 3-D structure can be summarized as Levinthal's paradox: Assuming three possible states for every flexible dihedral angle in the backbone of a 100-residue protein, the number of possible backbone configurations is three to the 200th power.

Homology modeling approximates a protein's 3-D structure well enough for making inferences about function, inferences which can be experimentally verified or disproved.

Learning and Project Objectives
Project Background
Steps of Model Building
Model Validation
Project Documentation

Learning and Project Objectives

1. Learning Objectives

a. This unit project is designed to reinforce the underlying basis of the concept that protein structure determines protein function. The focus is on the physical and chemical properties of a protein, ways to discern patterns of secondary structure in a collection of related protein domains, search and retrieval of specific items from the protein structure databases, and the process of modeling a 3-D protein structure from aligned sequences and a template structure.

b. Bioinformatics tools to be evaluated include query software for the protein sequence and structure databases, viewers (such as Swiss-PdbViewer) for PDB format structural data files, sequence alignment packages, motif and profile methods, the SWISS-MODEL server, and the Verify 3D Crystal Structure Verification server.

2. Project Objectives

a. Develop a plan for modeling the 3-D structure of the death domain from the human 75 kDa neurotrophin receptor. The proposed model should be sufficient to address the following question: Would the deletion of a three residue loop, in rodent NTR death domains between helix I and helix II, interfere with the functioning of these rodent death domains?

b. Prepare an appropriate multiple sequence alignment and estimate the secondary structure of the query death domain.

c. Identify an appropriate template or templates and use it or them to develop a model. Validate your model, then propose an answer to the question above based on your model.

top

Project Background

1. Death Domains

a. Death domains come in two flavors One type is the interaction domain of adaptor proteins that form complexes with receptor death domains, eg., death domains in proteins called TRADD, FADD, etc. The other type is the collection of death domains in the cytoplasmic tails of members of the Tumor Necrosis Factor Receptor superfamily ("receptor death domains"). Some members of this superfamily are Fas (CD95), DR3, DR4, DR5, DR6, p75 NTR (NGFR, LANR, and other names), TNFR-I, etc. Many members of this family do not have death domains, for example CD40, LTbetaR, CD30, various "decoy" receptors, etc. You can ignore these death-domainless members.

b. There are two structures in the PDB for receptor death domains: the one for the rat p75 NTR (by Carlos Ibanez' group) and the one for human Fas (by Stephen Fesik's group).

2. Death domains are difficult to align

a. Making a good alignment of divergent sequences is challenging. In this project, helix I will be the most divergent. Warning: Plug and chug with CLUSTAL W will not yield a useful result. The following paper describes a method that is very helpful for collecting protein fragments and building material for a multiple alignment:
Gapped BLAST and PSI-BLAST:
a new generation of protein database search programs
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller and David J. Lipman
Nucleic Acids Research, 1997 25(17) 3389-3402

b. The best source of curated protein sequences is the SwissProtein Database. The NCBI server has a good PSI-BLAST utility.

top

Steps of Model Building

1. The steps in building a homology-based model

a. Listed on page 283 of Gibas and Jambeck, the first steps depend on searching for relatives of the query sequence, building a structurally-informative multiple sequence alignment, and locating an appropriate template structure in the protein structure database.

b. The next step is to achieve the best possible global (full length) alignment of the query sequence with the chosen template sequence or sequences, lining up every secondary structure element with its appropriate counterpart.

c. The next three steps are the actual modeling of the structure. Here the backbone trace is assembled (composed of secondary structure elements without sidechains (ie, amide nitrogen, alpha carbon, carbonyl carbon for each residue in the chain). Note that reconstructive surgery may (will) be required to work around gaps and insertions in the alignment.

d. Once the backbone has been constructed, add the sidechains of the query sequence. At this point, most models blow up. Don't panic, it's normal. Adjust the side chain torsion angles to eliminate major collisions. If there is a problem in one (or more) of the loops grafted on during the plastic surgery, change it and readjust the torsion angles.

e. Optimize the structure using energy minimization. This procedure is based on the idea that strained bonds are high-energy bonds, that repulsive electrostatic forces increase potential energy, and that Van der Waals attractions become energetic repulsions when atoms get too close. Relaxing the model should bring down the potential energy stored in all these unhappy situations. A low energy model is a happy model.

2. Software and services for the project

a. For background on Principles of Protein Structure, Comparative Protein Modelling and Visualisation see Protein modelling by Nicolas Guex and Manuel C. Peitsch, Part II Chapter 6: Comparative protein modelling.

b. For relatively easy to use free modeling software check out SwissModel Optimise Mode . SwissModel project files can be produced with Swiss-PDB Viewer (spdbv). These projects contain the sequence to be modelled and the superimposed template structures. This is the default file format, in which SwissModel will return the results. SPDBV allows you to manually adjust the sequence alignment that will be used to guide the model building. One major reason for failure of "First Aproach" attempts is badly placed gaps or insertions. If you have taken care of this problem already, the modeling should go efficiently.

c. SPDBV project files are the only way to prepare and submit "Optimize Mode" modelling requests to the SwissModel server. Swiss-PDB Viewer can be downloaded for free from Swiss-PdbViewer.

top

Model Validation

1. Background on model verification

a. For a tutorial on model validation see Model quality from Principles of Protein Structure, Comparative Protein Modelling and Visualisation by Nicolas Guex and Manuel C. Peitsch: Part III: Chapter 8: How to evaluate the quality of a model.

b. "The correctness of a model is essentially dictated by the quality of the sequence alignment used to guide the modelling process. If the sequence alignment is wrong in some regions, then the spatial arrangement of the residues in this portion of the model will be incorrect."

2. Server for access to validation software

a. You can run tests on your model at Verify 3D Crystal Structure Verification Server provided by the UCLA Bioinformatics Servers and Databases.

b. Good luck.

top

Project Documentation

1. Prepare a presentation [15 minutes maximum] of your findings, including a brief summary of methods, results, and conclusions. Include recommendations for further extensions of the research.

a. For visual aids, you may include overheads of results or use a computer presentation, such as PowerPoint.

b. Be prepared to field questions from other groups and from the presenter.

2. Submit the following as a typed report. As a guideline, a finished report on the search should be about three pages of text [12 pt standard font] and no more than four pages of appended graphs, tables, images, etc. [Final page length is to be determined by what the group identifies as appropriate*]:

a. Name of the project and names of the members in your group.

b. A brief report in scientific format, including abstract [100 word limit], introduction, methods, results, discussion/conclusions, and citations. Include figures as appropriate.

3. Grading will be on content, organization, spelling, & grammar.

top




Glossary   |   Tutorials   |   SelfTests   |   Objectives   |   Software

Updated on 11/7/01 by bchapman@classroomtools.com.