Escolar Documentos
Profissional Documentos
Cultura Documentos
Disclosures
Founder & Consultant, Personalis Inc (genome
sequencing for clinical applications). NextBio, Novartis.
Consultant current or recently: 23andme, Funding support: NIH, NSF, Microsoft, Oracle,
Lightspeed Ventures, PARSA Foundation. clinical pharmacology.
Goals
Provide an overview of the scientic trends Create a snapshot of what seems to be Marvel at the progress made and the
opportunities ahead. and publications in translational bioinformatics important in March, 2013 for the amusement of future generations.
Process
1. Follow literature through the year 2. Solicit nominations from colleagues 3. Search key journals and key topics on PubMed 4. Stress out a bit. 5. Select papers to highlight in ~2-3 slides
Translational bioinformatics = informatics Considered last ~14 months (to this week) Focused on human biology and clinical NOTE: Amazing biological papers with
Caveats
methods that link biological entities (genes, proteins, small molecules) to clinical entities (diseases, symptoms, drugs)--or vice versa.
implications: molecules, clinical data, informatics. straightforward informatics generally not included. link clinical to molecular generally not included.
Final list
350 Quarter nalists, 242 Semi nalists, 98 nalists 27 Presented here (briey) + 10 shout outs Apologies to those I misjudged. Mistakes are mine. This talk and bibliography will be made available on
the conference website and my blog on rbaltman.wordpress.com drugs, delivery.
Thanks!
Darrell Abernethy Andrea Califano Josh Denny Joel Dudley Mark Gerstein George Hripcsak Konrad
Karczewski
Isaac Kohane Lang Li Yong Li Tianyun Liu Yves Lussier Dan Masys Hua FanMinogue Alex Morgan Sandy Napel
Lucila OhnoMachado Raul Rabadan Dan Roden Nigam Shah David States Nick Tatonetti Jessie
Tenenbaum
Omics Medicine
The predictive capacity of personal genome sequencing. (Roberts et al, Science TM)
Method: Estimate clinical risk based on identical twin Result: For 23/24 most individuals negative, but for
19 diseases still signicant risk. 90% of individuals alerted to at least one increased risk. individuals.
10
Comparison of family history and SNPs for predicting risk of complex disease. (Do et al, PLOS Genetics)
Method: Compare risk assessment using FHx and Result: Family history most useful for common
commonly assumed in terms of clinical utility. disease and roughly equivalent to SNPs. SNPs more useful for rare disease (<4%).
Diverse types of genetic variation converge on functional gene networks involved in schizophrenia. (Gilman et al, Nature Neuro)
genetic data (CNVs, SNVs, GWAS associations). expressed in brain, especially prenatally. Pathways related, but mutations different from those seen in autism.
Result: Several cohesive networks identied. Genes Conclusion: Schizophrenia may begin to yield...
13
14
Tracking a hospital outbreak of carbapenemresistant Klebsiella pneumoniae with whole-genome sequencing. (Snitkin et al, Science TM)
Method: Integrate genomics & epidemiology to Result: Index patient transmitted to 3 others & was
discharged 3 weeks before next case!
16
16
Plasma HDL cholesterol and risk of myocardial infarction: a mendelian randomisation study. (Voight et al, Lancet)
Result: LDL is causal. HDL...not so much. Conclusion: Genetics provides a window for not
18
Shout outs
Goal: Explain why genetic associations seem to leave so much heritability unexplained. The mystery of missing heritability: Genetic interactions create phantom heritability. (Zuk et al, PNAS) Estimating genetic effects and quantifying missing heritability explained by identied rare-variant associations. (Liu & Leal, Amer J Hum Gen)
19
21
22
Cool methods
23
Bayesian ontology querying for accurate and noisetolerant semantic searches. (Bauer et al, Bioinformatics)
phenotypes tolerant to noise in data & input networks to infer diseases from input phenotypes
Method: Combine ontological analysis and Bayesian Result: Improved search performance (ROC) Conclusion: Bayesian reasoning on ontologies can
smooth them and make inference more tolerant to noise in input and in annotations.
24
25
Ultrafast genome-wide scan for SNP-SNP interactions in common complex disease. (Prabhu & Peer, Genome Research)
Result: On bipolar GWAS data set, nd signicant Conclusion: There is hope for nding SNPs that
26
27
CACNA2D4 + others
28
Utility of gene-specic algorithms for predicting pathogenicity of uncertain gene variants. (Crockett et al, JAMIA)
Result: Gene-specic often outperform generic Conclusion: Detailed biology matters, and it is
probably overly optimistic to expect variants to be triaged with general purpose tool.
29
30
Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory. (Li et al, JAMIA)
Method: Compute phenotype similarity based on Result: 177 disease traits connected, similarity
correlates with shortest protein interaction distance.
32
33
34
A vector space model approach to identify genetically related diseases. (Sarkar, JAMIA)
Result: A constellation of associated diseases which Conclusion: There is a continuing hunger to reconceptualize our taxonomy of disease.
35
36
37
A whole-cell computational model predicts phenotype from genotype (Karr et al, Cell)
Goal: Build the rst whole-cell model of a living cell. Method: 27 interacting subsystem simulations using
several simulation techniques.
bacterial are here, and eucaryotes are the next big challenge.
38
39
40
41
Cancer
42
An integrated approach to identify causal network modules of complex diseases with application to colorectal cancer. (Wen et al, JAMIA)
Result: DNA methylation of TFs may be causal. Conclusion: Regulation of expression is an emerging
method for understanding disease etiology.
43
44
45
Systematic identication of genomic markers of drug sensitivity in cancer cells. (Garnett et al, Nature)
Method: Screen cancer lines with 130 drugs, Result: Unexpected sensitivities, e.g. EWS
histology in determining best treatments.
associate drug sensitivity with genetic changes. translocation to poly(ADP-ribose) polymerase (PARP) inhibitors.
Top associations
black: expression red: mutation blue: copy # green: tissue
Whole-genome analysis informs breast cancer response to aromatase inhibition. (Ellis et al, Nature)
Goal: Correlate clinical response to aromatase Method: Sequence tumor/normal and assess
mutations, map to pathways.
Result: 18 genes identied, MAP3K1=low grade, Conclusion: Individualized cancer therapy will
become the norm.
TP53 = high grade. GATA3 = aromatase response. Distinct phenotypes associated with distinct somatic mutation patterns.
49
50
51
52
Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic proling. (Yuan et al, Science TM)
Result: Combined predictor outperforms individual Conclusion: Traditional pathology needs to integrate
genomic measurements into diagnosis and prognosis measures.
53
54
55
Conicting biomedical assumptions for mathematical modeling: the case of cancer metastasis. (Divoli et al, PLOS Comp Bio)
Result: Biggest disagreement: when cancer enters/ Conclusion: Expert opinion for modeling exercises is
divergent/incompatible. Modelers beware.
56
57
58
59
Drugs
60
Systematic identication of pharmacogenomics information from clinical trials. (Li & Lu, J Biomed Inf)
Method: NLP approach for identifying d-g-dz in Result: 74% accuracy by human review. Several
associations not in PharmGKB.
62
63
Use of genome-wide association studies for drug repositioning (Sanseau et al, Letter to Nature Biotech)
65
66
Analysis of functional and pathway association of differential co-expressed genes: a case study in drug addiction. (Li et al, J Biomed Inf)
synaptic transmission, cell migration, insulin, energy, dopamine, NGF signalling, locomotor behavior.
68
69
Shout outs...
Automatic ltering and substantiation of drug safety signals. (Bauer-Mehren et al, PLOS Comp Bio)
Literature based drug interaction prediction with clinical assessment using electronic medical records: novel myopathy associated drug interactions. (Duke et al, PLOS Comp Bio)
Delivery
71
A clinician-driven automated system for integration of pharmacogenetic interpretations into an electronic medical record. (Hicks et al, Clin Pharm & Ther) Incorporating personalized gene sequence variants, molecular genetics knowledge, and health knowledge into an EHR prototype based on the Continuity of Care Record standard. (Jing et al, J. Biomed. Inf.) Operational implementation of prospective genotyping for personalized medicine: the design of the Vanderbilt PREDICT project. (Pulley et al, Clin. Pharm & Ther)
72
73
74
75
Goal: Develop methods to reidentify study subjects. Method: Take advantage of coinheritance of Ychromosome & surname, combine with other public data sources. individuals who participate in public sequencing projects. similar in 2005. We need social mechanisms to disallow this.
76
Result: Demonstrated ability to identify specic Conclusion: 15 year old in U.K. did something
77
A novel, privacy-preserving cryptographic approach for sharing sequencing data. (Cassa et al, JAMIA)
Goal: Securely transmit genome sequence data. Method: Use subset of sequence as a shared secret
key to entire sequence.
79
Disclosing pathogenic genetic variants to research participants: quantifying an emerging ethical responsibility. (Cassa et al, Genome Research)
Result: 4000-18000 variants qualify for disclosure. Conclusion: The incidentalome is here, and it could
80
81
82
An Altered Treatment Plan Based on Direct to Consumer (DTC) Genetic Testing: Personalized Medicine from the Patient/Pin-cushion Perspective (Tenenbaum et al, J Pers Med)
Goal:
Can DTC information be used to predict and prevent disease? woman to predict high risk of clotting.
Method: 23andme DTC data used for pregnant Result: Anticoagulants offered to patient. No clots.
Emergency C-section for unrelated reasons. their providers to use it.
84
Pharmacogenomics in the pocket of every patient? A prototype based on quick response codes. (Samwald & Adlassnig, JAMIA)
Method: Create Medicine safety barcode (QR). Result: Can encode genotypes, and provide local
access to interpretation on web. No large scale infrastructure required.
86
Thanks.
russ.altman@stanford.edu
90