Escolar Documentos
Profissional Documentos
Cultura Documentos
Jeanette McCarthy,
MPH, PhD
UCSF Medical Genetics
The Lecture
oMODULE
1:
The
structure
of
the
human
genome
and
how
genes
work
oMODULE
2:
Structural
variants
oMODULE
3:
Single
nucleo8de
variants
oMODULE
4:
Consequences
of
single
nucleo8de
variants
in
genes
oMODULE
5:
Architecture
of
human
gene8c
varia8on
4 bp
ENCODE project
Genes
Promoter
exon
intron
exon
intron
exon
Protein Synthesis
DNA
Promoter
exon"
intron"
exon"
mRNA
exon"
Transcrip8on
exon" exon" exon"
mRNA
message
is
read
and
translated
into
amino
acid
sequence
(protein)
Protein
intron"
Transla8on
2" intron"
Alterna,ve
transcrip,onal
start
site
mRNA
Transcript
1
1" 2" 3" 4"
Transcript
2
2" 3" 4"
3" intron"
4"
Exon skipping
Transcript
3
1" 2" 4"
Protein 1
Protein 2
Protein 3
Insulin
Pigment
Re8na
Insulin
Pigment
Protein-DNA
Epigene8c
Question
What
allows
for
dierent
cell
types
to
express
dierent
func8onal
proteins?
A. Each
cell
type
contains
dierent
genes
that
encode
the
necessary
proteins
B. Each
cell
type
contains
the
same
genes,
but
dierent
genes
are
expressed
Answer
B.
Each
cell
type
contains
the
same
genes,
but
dierent
genes
are
expressed
For
the
most
part,
every
cell
in
your
body
has
the
same
genes.
What
diers
is
the
expression
of
those
genes,
with
some
being
turned
on
and
others
being
turned
o,
dierent
splice
forms
expressed,
etc.
Large
CNVs
14kb
dele8on
Many
small
CNVs
214 kb
7.2 kb
o Dele8on
of
7.2
kb
covers
part
of
2
genes
Large CNVs
o Less polymorphic (monosomy/trisomy instead of tandem)"
Normal
o
o
o
o
Dele4on
Monosomy
Single
duplica4on
Trisomy
hEp://dgv.tcag.ca/dgv/app/sta,s,cs?ref=NCBI36/hg18
Number of genes
60
50
40
Typical
person
carries
1000
CNVs
(down
to
500bp)
30
20
10
0
0
How
many
individuals
carry
one
of
these?
100
200
300
700
800
900
1000
65-80%
5-10%
1%
Question
The
pathogenicity
of
CNVs
is
a
func8on
of
which
of
the
following:
A. Size
B. Loca8on
C. Both
size
and
loca8on
Answer
C.
Both
size
and
loca8on
Size
large
CNVs
are
usually
more
pathogenic
than
small
CNVs
Loca8on
intergenic
CNVs
are
usually
not
pathogenic,
while
those
that
encompass
genes
are
C
C
C
A l l e l e f r e q u e n c i e s o f s i n gl e nu c l e o t i d e
va r i a n t s ( S N V s / S N P s ) i n t h e p o p u l a t i o n
Gene
ow
Natural
selec8on
Gene8c
drip
Polymorphisms
are
muta8ons
with
allele
freq
>1%
Muta8on
allele
frequency
Classica4on:
Minor
allele
freq.:
Rare variants
Common variants
<0.5%
0.5-5%
>5%
Rare
variants
1/50
bp
0.5-5%
Minor
allele
frequency
>5% (common)
SNPs
Common
variants
1/1000
bp
H ow m a ny g e n e t i c va r i a n t s i n t h e ave r a g e
person?
3700000
357000
100000
10000
1000
1000
100
10
1
SNVs
SNPs
InDels
InDels
CCNVs
NVs
R a c i a l / e t h n i c va r i a t i o n i n a l l e l e
frequencies
Asian"
African"
Native American"
European"
SNP 4"
SNP 3"
SNP 2"
SNP 1"
0"
0.1"
0.2"
0.3"
0.4"
0.5"
0.6"
0.7"
Minor allele frequency!
0.8"
0.9"
1"
Question
TRUE
or
FALSE
Common
variants
are
called
so
because
they
are
the
most
prevalent
type
of
varia8on
in
the
human
genome.
Answer
FALSE
Common
variants
are
called
so
because
the
minor
allele
frequency
is
high
in
the
popula8on.
Common
variants
are
actually
not
as
prevalent
as
rare
variants
are
in
the
popula8on.
intron
exon
intron
exon
Non-synonymous
(Missense)
Premature
stop
(Nonsense)
Protein
Fr a m e s h i f t m u t a t i o n s d u e t o I n D e l s
Inser8ons/dele8ons
of
one
or
more
nucleo8des
in
coding
region
of
gene
can
result
in
a
ship
in
the
reading
frame
that
can
drama8cally
alter
the
sequence
of
amino
acids
in
the
protein
Dele8on of T
Frameship
Poten8ally
func8onal
Amino
acid
change
~10K
non-
synonymous
~350
loss
of
func8on
Probably
func8onal
Frameship
Splice
site
Premature
stop
Question
TRUE
or
FALSE
Muta8ons
in
exons
are
always
deleterious.
Answer
FALSE
Muta8ons
in
exons
can
be
silent
(i.e.
not
change
the
amino
acid
sequence)
or
missense
(change
amino
acid)
with
minimum
impact.
H ow h a p l o t y p e s ( c o - i n h e r i t e d a l l e l e s ) a r i s e
Three
haplotypes
observed
in
the
popula8on
G
C
G
C
G
C
G
C
G
C
G
C
G
C
G
C
G
C
G
C
G
C
T
C
G
C
T
C
G
C
T
C
T
C
G
C
T
C
G
C
T
C
G
C
A
G
C
A
G
C
A
G
C
G
C
A
G
T
C
A
T
T
G
G
C
C
G
1st
muta8on
T
C
2nd
muta8on
Aper
recombina8on
3 haplotypes
4 haplotypes
Chromosomal
recombina,on
G
C
G
C
G
Linkage disequilibrium
Two
SNPs
in
strong
LD
G
Two
SNPs
with
no
LD
G
o Alleles
inherited
in
a
haplotype
are
said
to
be
in
linkage
disequilibrium
(LD)
o LD
is
stronger
when
distance
between
variants
is
short
o LD
is
shaped
by
recombina8on
Variants
with
no
LD
M a c r o - l e ve l h a p l o t y p e s t r u c t u r e i n t h e
human genome
ATTGCCGATACGGGACTTAACGACTAACCAACACTAGGCAGATCGACCAGATCGACGTAGCCAGCTTA
block1
84kb
Block 2
1 2 3 4 5 6 7 8 9 10
CDCA7L
SP4
DNAH11
Chr
7p21
Block 1
Block 2 Block 3
Block 4
Key points
o Haplotype
blocks
are
regions
within
which
there
is
strong
LD,
or
correla8on
between
variants
o One
variant
can
capture
informa8on
about
another
variant,
either
known
or
unknown
o Implica8ons
for
GWAS
(Lecture
4)
?
?