Введение в эволюционную и медицинскую геномику, часть II ФББ МГУ, весна 2008 Лекция 4.

Презентация:



Advertisements
Похожие презентации
Полиморфизм генома человека Алма-Ата, Василий Раменский, Институт молекулярной биологии им. Энгельгардта РАН, Москва.
Advertisements

Сравнительная геномика Полиморфизм генома человека ФББ, 4 курс Василий Евгеньевич Раменский, Институт молекулярной биологии РАН.
Genetics Genetics (from Ancient Greek γενετικός genetikos, "genitive" and that from γένεσις genesis, "origin"),[1][2][3] a discipline of biology, is the.
© 2009 Avaya Inc. All rights reserved.1 Chapter Two, Voic Pro Components Module Two – Actions, Variables & Conditions.
© 2006 Cisco Systems, Inc. All rights reserved. HIPS v Administering Events and Generating Reports Managing Events.
1 Cutaneous Melanoma. 2 Equivalent Terms, Definitions and Illustrations Skin only C440-C449 Definitions identify reportable tumors –Evolving melanoma.
The reconstruction of coding scheme through errors distributions Lyakhovetskii V.A., Karpinskaya V.Ju*, Bobrova E.V. Pavlov Institute of Physiology of.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v Route Selection Using Policy Controls Using Multihomed BGP Networks.
Designing Network Management Services © 2004 Cisco Systems, Inc. All rights reserved. Designing the Network Management Architecture ARCH v
Workshop 11 Imprint - Assembly Meshing Wizard. WS11-2 Assembly Meshing Wizard Design goals One comprehensive user interface Intuitive approach for solid.
© 2006 Cisco Systems, Inc. All rights reserved. CVOICE v Configuring Voice Networks Configuring Dial Peers.
Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chap 1-1 Chapter 1 Why Study Statistics? Statistics for Business and Economics.
PAT312, Section 39, December 2006 S39-1 Copyright 2007 MSC.Software Corporation SECTION 39 LBCS USING FIELDS AND MICROFUNCTIONS.
© 2006 Cisco Systems, Inc. All rights reserved. BSCI v Configuring OSPF Introducing the OSPF Protocol.
Time-Series Analysis and Forecasting Lecture on the 5 th of October.
WS9-1 PAT328, Workshop 9, May 2005 Copyright 2005 MSC.Software Corporation WORKSHOP 9 PARAMETERIZED GEOMETRY SHAPES.
SIR model The SIR model Standard convention labels these three compartments S (for susceptible), I (for infectious) and R (for recovered). Therefore, this.
© 2006 Cisco Systems, Inc. All rights reserved. HIPS v Using CSA Analysis Generating Application Deployment Reports.
ХИГГС-БОЗОН В ЭКСПЕРИМЕНТАХ ATLAS и CMS НА БАК В.А.Щегельский Семинар ОФВЭ и ОТФ 30 мая 2013.
© 2006 Cisco Systems, Inc. All rights reserved. MPLS v Complex MPLS VPNs Introducing Central Services VPNs.
Транксрипт:

Введение в эволюционную и медицинскую геномику, часть II ФББ МГУ, весна 2008 Лекция 4

Предсказание эффекта замен аминокислот с помощью вычислительных методов Важная задача: Именно несинонимичные замены в кодирующих частях генов – основная причина существования различных фенотипов Значительная часть такой вариации – нейтральная задача отбора предположительно функциональных вариантов Осмысленная с точки зрения биоинформатики задача: Хорошо формализуема (данные «на входе») Большое количество контрольных данных (проверка результатов «на выходе») Большой выбор параметров, описывающих эффект замены («внутри» метода)

Botstein & Risch Nat Genet (2003) 33:228

1: Andersen MC, Engström PG, Lithwick S, Arenillas D, Eriksson P, Lenhard B, Wasserman WW, Odeberg J.Andersen MC, Engström PG, Lithwick S, Arenillas D, Eriksson P, Lenhard B, Wasserman WW, Odeberg J. In silico detection of sequence variations modifying transcriptional regulation. PLoS Comput Biol Jan;4(1):e5. Epub 2007 Nov 27. 2: Barnes MR, Deharo S, Grocock RJ, Brown JR, Sanseau P. Barnes MR, Deharo S, Grocock RJ, Brown JR, Sanseau P. The micro RNA target paradigm: a fundamental and polymorphic control layer of cellular expression. Expert Opin Biol Ther Sep;7(9): Review. 3: Chen K, Rajewsky N. Chen K, Rajewsky N. Natural selection on human microRNA binding sites inferred from SNP data. Nat Genet Dec;38(12): Epub 2006 Oct 29. 4: Chen JM, Férec C, Cooper DN. Chen JM, Férec C, Cooper DN. A systematic analysis of disease-associated variants in the 3' regulatory regions of human protein-coding genes II: the importance of mRNA secondary structure in assessing the functionality of 3' UTR variants. Hum Genet Oct;120(3): Epub 2006 Jun 29. Review. 5: Královicová J, Lei H, Vorechovský I. Královicová J, Lei H, Vorechovský I. Phenotypic consequences of branch point substitutions. Hum Mutat Aug;27(8): PubMed: prediction regulatory SNPs

Генетика сложных заболеваний: поиск факторов // Выбор функционального варианта из двух, которые сцеплены друг с другом и оба ассоциированы с фенотипом; различение редких полиморфизмов и этиологических мутаций Генетическое консультирование: анализ врожденных дефектов (birth defects) и выбор стратегии лечения Генетика модельных организмов: определение с помощью мутагенеза генов, участвующих в биологических процессах // Поиск кандидатов для мутагенеза Геномика и эволюционная генетика: оценка давления отбора // Определение доли функциональных вариантов в популяции Замены аминокислот: примеры приложений

J.Cohen et al., Science (2004) 305 : Генетика сложных заболеваний: пример

How genetic testing can help to diagnose an inherited disorder with certainty in patients X-linked lymphoproliferative disease (XLP) is a rare, life-threatening inherited disorder. In patients with XLP, infection with Epstein-Barr virus (EBV) - the virus that causes mononucleosis (known as "mono") - can lead to an out-of-control immune response, usually resulting in death within days. XLP can be cured by bone marrow transplantation. However, XLP can be difficult to diagnose with certainty and bone marrow transplantation is too risky a procedure to be performed without a definite diagnosis of XLP. Genetic testing can provide such a definitive diagnosis of XLP. How genetic testing can help to differentiate between different forms of an inherited disorder in patients Congenital hyperinsulinism (CH) is due to a malfunction of the parts of the pancreas that are involved in producing the hormone insulin. CH leads to dangerously low blood sugar levels in newborns and infants. This inherited disorder has been associated with genetic variations in any one of at least five different genes. Treatment depends on which gene is affected and on the nature of the genetic variation and can vary from (1) drug therapy to (2) partial or (3) complete removal of the pancreas. Генетическое консультироваие: пример

Оценка качества метода предсказания: Чувствительность (false negatives) Специфичность (false positives) Покрытие (coverage) Контрольные наборы данных: disease mutations mutagenesis data (??) neutral nsSNPs divergence substitutions Базовые типы информации: Тип замены (Val Ile хорошо, Asp Arg плохо) Позиция замены (Ядро/поверхность, и т.д.)

PolyPhen : prediction of amino acid substitution effect on protein function Prediction: benign (neutral), damaging (deleterious)

I. Sequence annotation Hereditary hemochromatosis protein precursor (HLA-H, Q30201) Features checked: * bond: DISULFID, THIOLEST, THIOETH * site: BINDING, ACT_SITE, LIPID, METAL, SITE, MOD_RES, SE_CYS * region: TRANSMEM, SIGNAL, PROPEP

II. PSIC: profile analysis of homologous sequences 1.Align with homologous proteins with seq. ide %

II. PSIC: profile analysis of homologous sequences 2. Calculate the profile matrix with PSIC algorithm Profile matrix: S a,j = ln[ p a,j / q a ], a = {1,..20}, j = {1,..N}, N = alignment length S Asn,4 S Cys,4

II. PSIC: profile analysis of homologous sequences 3. Analyse difference between profile scores for two a.a. variants: S Asn,4 S Cys,4 Asn Cys: = | S Asn,4 – S Cys,4 | = 1.591

III. 3D structure analysis 1. Residues that are in spatial contact with a ligand or other critical residues Zen 999 residues in 5Å contact with Zen 999 Bos Taurus trypsin [PDB ID :1ql7]

III. 3D structure analysis 2. Residues that form the hydrophobic core of the protein (buried residues) Bos Taurus trypsin [PDB ID :1ql7] Surface residues Buried residues

Structural parameters and contacts Secondary structure Phi-psi dihedral angles Solvent accessible surface area, normed s.a.s.a Change in accessible surface propensity Change in residue side chain volume Contacts with heteroatoms Interchain contacts Contacts with functional sites (BINDING, ACT_SITE, LIPID, and METAL) Region of the phi-psi map (Ramachandran map) Normalised B-factor (temperature factor)

RULES (connected with logical AND) PREDICTION PSIC score difference : Substitution site properties:Substitution type properties: arbitrary annotated as a functional* or bond formation** site arbitrary probably damaging not considered in a region annotated or predicted as transmembrane PHAT matrix difference resulting from substitution is negative possibly damaging 0.5 arbitrary benign >1.0 atoms are closer than 3.0Å to atoms of a ligand or residue annotated as BINDING, ACT_SITE, LIPID, METAL arbitraryprobably damaging 0.5< 1.5 normed accessibility ACC 15% absolute change of accessible surface propensity is 0.75 or absolute change of side chain volume is 60 possibly damaging normed accessibility ACC 5% absolute change of accessible surface propensity is 1.0 or absolute change of side chain volume is 80 probably damaging 1.5< 2.0 arbitrary possibly damaging >2.0 arbitrary probably damaging

PolyPhen, контрольные наборы данных: variants mutagens between PPH new alignment pipeline variants mutagens between PPH 1.14 dam_rate, %totalunknownbenigndamagingFile

PolyPhen predictions for dbSNP b.121 All: 9,502unknown 27,991benign % 7,905possibly damaging % 5,521probably damaging % 50,919total (44,005 unique rss) With structure: 42unknown 2,142benign % 531possibly damaging % 1,076probably damaging % 3,791total (,167 uniqe rss) [ Ivan Adzhubei, 2004 ]

PolyPhen predictions for dbSNP b.121 All: Filtered: 5 seq. in multiple alignment 16,813benign % 5,195possibly damaging % 4,168probably damaging % 26,176total (21,677 unique rss) With structure: Filtered: 5 seq. in multiple alignment 2,021benign % 499possibly damaging % 1,050probably damaging % 3,570total (2,983 unique rss) [ Ivan Adzhubei, 2004 ]

Hydrophobic core stability parameters are the best predictors Ramensky et al., Nucleic Acids Res. (2002) 30:

Validation: case studies APEX1 protein: 24 out of 26 substitutions predicted correctly (Xi et al.) Plasminogen activator inhibitor-2: 18 out of 20 (Di Guisto et al.) 3 HapMap populations and 10 primate species: analysis of ~27,000 nsSNPs with frequencies (Victoria Carlton, AFFYMETRIX, private communication)

Validation: allele frequency

Validation: nsSNPs vs. human-mouse interspecies variation

PolyPhen 2.0: New alignment pipeline: only PSIC score, FP=10%, TP: 71.6% (new ali) vs. 65.1% (old ali) IdPMax: proxy for var2, max i { S ( Mut i )*ident( i )} Nucleotide context: conserved CpG SVM (Support Vector Machines) vs. ADT (Alternating Decision Trees)

PolyPhen 2.x: что еще хорошо было бы сделать Интеграция с базами данных мутаций: проверять, нет ли уже такой замены в базах SwissProt, HGMD. PolyPhen + SDPPred: замены в позиции, определяющей специфичность Компенсаторные замены: принимать во внимание не только позицию замены

PolyPhen PolyPhen input : Protein identifier OR sequence Substitution position Substitution type

PolyPhen

PolyPhen: nsSNPs data collection

DAMAGING nsSNPs Transphyretin (PDB: 1tyr, SNP ) Thr118 Asn occurs at the ligand (REA) binding site Thr 118 REA 130

DAMAGING nsSNPs Trypsin (PDB: 1trn, SNP ) Ser142 Phe results in the strong side chain volume change at a buried position Ser 142

PolyPhen : дитя семи нянек ЦИКЛОП ПОЛИФЕМ ПРЕДСТАВЛЯЛ СОБОЙ УНИКАЛЬНЫЙ ПОДВИД КАРЛИКОВЫХ СЛОНОВ Известия-Наука, 18 ноября 2003 Вонзая заостренное бревно в единственный глаз свирепого циклопа Полифема, легендарный Одиссей истреблял уникальный вид карликовых слонов, обитавших на острове Сицилия. Древний миф об одноглазых человекообразных исполинах развеяли итальянские палеонтологи на научной экспозиции "Полифем в Модене". На выставке представлены черепа, обнаруженные исследователями на Сицилии, у которых одна фронтальная глазница. С первого взгляда она очень напоминает глаз во лбу. Найденные рядом с черепами кости действительно принадлежат немаленькому млекопитающему, которое имело габариты крупного медведя. Обладатель этих останков был не циклопом, а карликовым слоном. "Глаз" во лбу - отверстие для дыхательных путей, то есть для хобота.

Polyphenism : the ability of a single genome to produce two or more alternative morphologies within a single population in response to an environmental cue (such as temperature, photoperiod, or nutrition). [ Dr. Ehab Abouheif, McGill University, Montréal Québec ] The seasonal morphs of the buckeye butterfly, Precis coenia ( Nymphalidae ). The ventral surfaces are shown. The Summer morph ("linea") is on the left; the Fall morph ("rosa") is on the right. [ Scott F.Gilbert, A Companion to Developmental Biology. Chapter 22, Seasonal Polyphenism in Butterfly Wings ]