Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Search for study opportunitiesNEW

Connect with the world's best universities and choose your course of study

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

BCB 444/544 Exam 2: HMMs, Gene & Protein Structure, RNA Secondary Structure - Prof. Drena , Exams of Bioinformatics

Iowa State University (ISU)Bioinformatics

Prof. Drena Leigh Dobbs

Information from exam 2 in the bcb 444/545 course, covering topics such as hidden markov models (hmms) for cpg island identification, gene prediction in prokaryotic and eukaryotic organisms, protein structure prediction using homology modeling, threading, and ab initio methods, and rna secondary structure prediction using ab initio, comparative, and combined computational and experimental methods. Examples and calculations for each topic.

Typology: Exams

Pre 2010

Uploaded on 09/02/2009

koofers-user-6rf 🇺🇸

4.3

(3)

10 documents

1 / 7

Related documents

Protein Secondary and Tertiary Structure Prediction in BCB 444/544 Fall 07 - Prof. Drena L

Protein Structure: Classification, Databases, and Visualization for BCB 444/544X - Prof. D

Protein Structure Basics & Classification: Lecture 20 in BCB 444/544 Fall 07 - Prof. Drena

Protein Secondary Structure Prediction - Prof. Drena Leigh Dobbs

Profiles and Hidden Markov Models (HMMs) in BCB 444/544 Fall 07 - Prof. Drena Leigh Dobbs

Protein Structure & Function: Lecture Notes for BCB 444/544X at Iowa State University - Pr

Protein Structure Prediction: A Review by Ginalski et al. - Prof. Drena Leigh Dobbs

BCB 444/544 Exam 2: Protein Sequence Analysis & Structure Prediction

Gene Prediction: Lecture 26 in BCB 444/544 Fall 07 at ISU by Dobbs - Prof. Drena Leigh Dob

Biological Databases in BCB 444/544: Understanding Genomic Information - Prof. Drena Leigh

RNA Secondary Structure Prediction: A Lab Exercise for BCB 444/544X

RNA Structure Prediction: Understanding the Role of RNA in Genetics - Prof. Drena Leigh Do

Lecture Slides on Protein Structure Prediction | BCB 444

RNA and Protein Structure Prediction: A Focus on Rev Protein and RNA Secondary Structure -

Protein Tertiary Structure Prediction - Lecture Slides | BCB 444

Predicting Secondary Structure and Identifying Disordered Regions in Proteins - Prof. Chri

Lab 6 Answer Key for Microarray Analysis in BCB 444/544X

Threading in Protein Structure Recognition: Ho Algorithm and Rev Proteins - Prof. Drena Le

Protein Structure Databases: Prediction and Modeling - Prof. Drena Leigh Dobbs

Promoter Prediction: BCB 444/544X Lecture Notes - Prof. Drena Leigh Dobbs

Protein 2' Structure Prediction using Neural Networks and SVMs in Bioinformatics - Prof. D

Understanding Protein Structure: Secondary, Tertiary, and Quaternary Structures - Prof. Yo

Protein Structure: Principles, Secondary Structure, and Interactions - Prof. Thomas Sims

Multiple Sequence Alignment in BCB 444/544 Fall 07 - Lecture 11 - Prof. Drena Leigh Dobbs

Bioinformatics: Secondary Protein Structure Prediction and Characterization - Prof. Iosif

Protein Structure: Covalent Bonds, Polypeptides, and Secondary/Tertiary Structures

Protein Structure & Function: Overview of Amino Acids, Structures & Enzymes - Prof. Clint

BCB 444/544 Fall 07 Lecture 10: BLAST Details and Gene Jargon - Prof. Drena Leigh Dobbs

Protein Structure & Function: Turns, Loops, Fibrous Proteins, Tertiary Structure - Prof. M

(1)

BLAST Statistics & Scoring Matrices in BCB 444/544 Fall 07 - Prof. Drena Leigh Dobbs

Partial preview of the text

Download BCB 444/544 Exam 2: HMMs, Gene & Protein Structure, RNA Secondary Structure - Prof. Drena and more Exams Bioinformatics in PDF only on Docsity! BCB 444/544 Fall 08 Oct 31 Exam 2 p 1 of 7 BCB 444/544 Exam 2 (100 pts) Name_____________________________________ 1. HMM (20 pts TOTAL) Consider the simplified CpG island HMM example discussed in class. The system has 3 states: B denotes the start state In denotes the state when the sequence is in a CpG island Out denotes the state when a the sequence is out of a CpG island The transition probabilities between these states are shown in the diagram. 0.2B Out In0.5 0.5 0.8 0.6 0.4 The emission probabilities are: for state Out, eOut(A) = eOut(C) = eOut(G) = eOut(T) = 0.25 for state In: eIn(A) = eIn(T) = 0.1 eIn(C) = eIn(G) = 0.4 1. What is the most probable sequence of states, starting from state B, to produce the sequence of nucleotides CG? For full credit, you must show your work and fill in the table below. C G B 1 0 0 In 0 = 0.5 * 0.4 = 0.2 = 0.4 * max { 0.125 * 0.6 0.2 * 0.8 } = 0.25 * 0.2 * 0.8 = 0.064 Out 0 = 0.5 * 0.25 = 0.125 = 0.25 * max { 0.125 * 0.4 0.2 * 0.4 } = 0.25 * 0.2 * 0.4 = 0.0125 The most probable sequence of states is: B -> In -> In BCB 444/544 Fall 08 Oct 31 Exam 2 p 2 of 7 What is the total probability of the sequence CG? Show your work and fill in the table below. C G B 1 0 0 In 0 = 0.5 * 0.4 = 0.2 = 0.4 * sum { 0.2 * 0.4 0.125 * 0.2 } = 0.4 * 0.105 = 0.042 Out 0 = 0.5 * 0.25 = 0.125 = 0.25 * sum { 0.2 * 0.6 0.125 * 0.8 } = 0.25 * 0.3 = 0.075 The total probability of the sequence CG is: 0.042 + 0.075 = 0.117 BCB 444/544 Fall 08 Oct 31 Exam 2 p 5 of 7 4. RNA Secondary Structure Prediction (20 points total) Describe 3 general methods for predicting the secondary structure of RNA 1. Ab initio – find the structure of the RNA with the lowest free energy. The free energy calculations essentially search for the conformation that allows for maximal base pairing because base pairing lowers the free energy of the structure, mainly due to stacking interactions between adjacent base pairs but also due to the hydrogen bonds in the base pairs. 2. Comparative – use two or more related RNA sequences to find a common secondary structure. There are two ways to use the related RNAs. First by looking for covariation in the sequences – i.e., the secondary structure is likely to be more highly conserved than the sequence, so if one nucleotide involved in a base pair mutates, it is likely that a compensating mutation in the partner nucleotide will be selected for, preserving the ability to base pair. The second way to use multiple RNAs is to predict a structure for each, then identify the secondary structures in common between the two RNAs. 3. Combined computational and experimental – RNA secondary structures can be determined experimentally using various chemicals and enzymes that selectively act on either single stranded or double stranded nucleotides. Combined approaches for secondary structure prediction allow you to use experimentally determined constraints and only search for predicted secondary structures that fit the available experimental data. BCB 444/544 Fall 08 Oct 31 Exam 2 p 6 of 7 5. Short Answer (10 points total) (2 pts) What is the difference between a profile and a PSSM? The main difference is that profiles allow for gaps and PSSMs do not. (2 pts) What is the difference between a protein motif and a protein domain? Domains are longer and form independent structural or functional regions of a protein. (2 pts) Why is a HMM a more accurate representation of a motif or domain than a regular expression? HMMs are a full probabilistic model for each position in the motif or domain whereas regular expressions condense the information into a string. For example, a regular expression contains things like X for an unknown residue or [S,T] when the residue can be either an S or a T, but an HMM would contain probabilities for all 20 amino acids for the X and the probability of S and the probability of T instead of just S,T. (2 pts) What is meant by covariation in RNA secondary structure prediction? Covariation is when two positions in a set of related RNA sequences vary together to preserve a base pairing relationship. When we see covariation in a multiple sequence alignment of RNA sequences, we can be more confident of predicting that these two nucleotides are base paired to each other. (2 pts) Match the following terms: __B__ Local structural elements, such as an α-helix or β-sheet __D__ Multiple subunits (polypeptide chains) assembled into a single functional unit __C__ The fully “folded” 3-dimensional structure of a single polypeptide chain __A__ The sequence of amino acids in a protein a) Primary (1°) structure b) Secondary (2°) structure c) Tertiary (3°) structure d) Quaternary (4 °) structure BCB 444/544 Fall 08 Oct 31 Exam 2 p 7 of 7 6. Molecular Biology & Bioinformatics Terms (10 pts total) (1pt each) Fill in the box beside each definition with one term or acronym that corresponds to the definition provided. (Some have more than one correct answer). Term Definition 1. Mfold A program for predicting RNA secondary structure 2. Prof A program for predicting protein secondary structure 3. Prosite A database of protein domains and motifs 4. PDB A protein structure database 5. Pymol A program for visualizing protein structures 6. Motif A nucleotide or amino-acid sequence pattern that is often conserved and has, or is conjectured to have, functional significance 7. Pfam A database of protein families 8. GeneSeqer A program for predicting genes in eukaryotes 9. Domain Independent structural or functional unit of a protein 10 X-ray crystallography Experimental method for determining the 3-D structure of a macromolecule Most of these had more than one possible answer.

Documents

questions