
BENEMÉRITA UNIVERSIDAD AUTÓNOMA DE PUEBLA

LIC. Químico Farmacobiólogo

Microbial Genetics NRC: 24513
Cecilia Berenice Méndez Navarro

Assignment 5
ORF Search
The following sequences were taken from the NCBI database:
Escherichia coli str. K-12 substr. MG1655, complete genome
NCBI Reference Sequence: NC_000913.3

For each of the following sequences, find:

a) The ORF
b) Gene size
c) mRNA sequence
d) mRNA size
e) Reverse complement
f) Amino acid sequence
g) Protein size
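
As an illustration of step a), here is a minimal sketch of a six-frame ORF search. It is not the tool used for this assignment, and the names `find_orfs`, `min_len` and `allow_partial` are hypothetical. It assumes an uppercase DNA string, ATG as the only start codon, and TAA/TAG/TGA as stop codons. ORFs that run off the end of the fragment without a stop codon are reported as partial when `allow_partial` is true.

```python
START = "ATG"
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def find_orfs(dna: str, min_len: int = 30, allow_partial: bool = True):
    """Yield (strand, start, end, orf) for every ORF found in the six frames.

    start and end are 0-based, end-exclusive indices on the scanned strand
    ("+" is the input, "-" is its reverse complement). A complete ORF
    includes its stop codon; a partial ORF ends at the last whole codon.
    """
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            start = None
            for i in range(frame, len(seq) - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == START:
                    start = i  # first ATG after the previous stop
                elif start is not None and codon in STOPS:
                    if i + 3 - start >= min_len:
                        yield strand, start, i + 3, seq[start:i + 3]
                    start = None
            if allow_partial and start is not None:
                # Open ORF at the end of the fragment: keep whole codons only.
                end = start + (len(seq) - start) // 3 * 3
                if end - start >= min_len:
                    yield strand, start, end, seq[start:end]


# Toy example (not one of the assignment sequences).
for hit in find_orfs("CCATGAAATTTGGGTAACC", min_len=9):
    print(hit)  # ('+', 2, 17, 'ATGAAATTTGGGTAA')
```

Scanning both strands matters because a gene can lie on the strand complementary to the one printed in the database record.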

Sequence 1

108061 ggatacgctc agcgcgctgc tgacccagga aggcacgccg tctgaaaagg gttatcgcat
108121 tgattatgcg cattttaccc cacaagcaaa attcagcacg cccgtctgga taagccaggc
108181 gcaaggcatc cgtgctggcc ctcaacgcct cacctaacaa caataaacct ttacttcatt
108241 ttattaactc cgcaacgcgg ggcgtttgag attttattat gctaatcaaa ttgttaacta
108301 aagttttcgg tagtcgtaac gatcgcaccc tgcgccggat gcgcaaagtg gtcaacatca
108361 tcaatgccat ggaaccggag atggaaaaac tctccgacga agaactgaaa gggaaaaccg
108421 cagagtttcg tgcacgtctg gaaaaaggcg aagtgctgga aaatctgatc ccggaagctt
108481 tcgccgtggt acgtgaggca agtaagcgcg tctttggtat gcgtcacttc gacgttcagt
108541 tactcggcgg tatggttctt aacgaacgct gcatcgccga aatgcgtacc ggtgaaggaa
108601 aaaccctgac cgcaacgctg cctgcttacc tgaacgcact aaccggtaaa ggcgtgcacg
108661 tagttaccgt caacgactac ctggcgcaac gtgacgccga aaacaaccgt ccgctgtttg
108721 aattccttgg cctgactgtc ggtatcaacc tgccgggcat gccagcaccg gcaaagcgcg
108781 aagcttacgc agctgacatc acttacggta cgaacaacga atacggcttt gactacctgc
108841 gcgacaacat ggcgttcagc cctgaagaac gtgtacagcg taaactgcac tatgcgctgg
108901 tggacgaagt ggactccatc ctgatcgatg aagcgcgtac accgctgatc atttccggcc
108961 cggcagaaga cagctcggaa atgtataaac gcgtgaataa aattattccg cacctgatcc
109021 gtcaggaaaa agaagactcc gaaaccttcc agggcgaagg ccacttctcg gtggacgaaa
109081 aatctcgcca ggtgaacctg accgaacgtg gtctggtgct gattgaagaa ctgctggtga
109141 aagagggcat catggatgaa ggggagtctc tgtactctcc ggccaacatc atgctgatgc
109201 accacgtaac ggcggcgctg cgcgctcatg cgctgtttac ccgtgacgtc gactacatcg
109261 ttaaagatgg tgaagttatc atcgttgacg aacacaccgg tcgtaccatg cagggccgtc
109321 gctggtccga tggtctgcac caggctgtgg aagcgaaaga aggtgtgcag atccagaacg
109381 aaaaccaaac gctggcttcg atcaccttcc agaactactt ccgtctgtat gaaaaactgg
109441 cggggatgac cggtactgct gataccgaag ctttcgaatt tagctcaatc tacaagctgg
109501 ataccgtcgt tgttccgacc aaccgtccaa tgattcgtaa agatctgccg gacctggtct
109561 acatgactga agcggaaaaa attcaggcga tcattgaaga tatcaaagaa cgtactgcga
109621 aaggccagcc ggtgctggtg ggtactatct ccatcgaaaa atcggagctg gtgtcaaacg
109681 aactgaccaa agccggtatt aagcacaacg tcctgaacgc caaattccac gccaacgaag
109741 cggcgattgt tgctcaggca ggttatccgg ctgcggtgac tatcgcgacc aatatggcgg
109801 gtcgtggtac agatattgtg ctcggtggta gctggcaggc agaagttgcc gcgctggaaa
109861 atccgaccgc agagcaaatt gaaaaaatta aagccgactg gcaggtacgt cacgatgcgg
109921 tactggaagc aggtggcctg catatcatcg gtaccgagcg tcacgaatcc cgtcgtatcg
109981 ataaccagtt gcgcggtcgt tctggtcgtc agggggatgc tggttcttcc cgtttctacc
110041 tgtcgatgga agatgcgctg atgcgtattt ttgcttccga ccgagtatcc ggcatgatgc
110101 gtaaactggg tatgaagcca ggcgaagcca ttgaacaccc gtgggtgact aaagcgattg
110161 ccaacgccca gcgtaaagtt gaaagccgta acttcgacat tcgtaagcaa ctgctggaat

FASTA SEQUENCE

GGATACGCTCAGCGCGCTGCTGACCCAGGAAGGCACGCCGTCTGAAAAGGGTTATCGCATTGATTATGCGCA
TTTTACCCCACAAGCAAAATTCAGCACGCCCGTCTGGATAAGCCAGGCGCAAGGCATCCGTGCTGGCCCTCA
ACGCCTCACCTAACAACAATAAACCTTTACTTCATTTTATTAACTCCGCAACGCGGGGCGTTTGAGATTTTA
TTATGCTAATCAAATTGTTAACTAAAGTTTTCGGTAGTCGTAACGATCGCACCCTGCGCCGGATGCGCAAAG
TGGTCAACATCATCAATGCCATGGAACCGGAGATGGAAAAACTCTCCGACGAAGAACTGAAAGGGAAAACCG
CAGAGTTTCGTGCACGTCTGGAAAAAGGCGAAGTGCTGGAAAATCTGATCCCGGAAGCTTTCGCCGTGGTAC
GTGAGGCAAGTAAGCGCGTCTTTGGTATGCGTCACTTCGACGTTCAGTTACTCGGCGGTATGGTTCTTAACG
AACGCTGCATCGCCGAAATGCGTACCGGTGAAGGAAAAACCCTGACCGCAACGCTGCCTGCTTACCTGAACG
CACTAACCGGTAAAGGCGTGCACGTAGTTACCGTCAACGACTACCTGGCGCAACGTGACGCCGAAAACAACC
GTCCGCTGTTTGAATTCCTTGGCCTGACTGTCGGTATCAACCTGCCGGGCATGCCAGCACCGGCAAAGCGCG
AAGCTTACGCAGCTGACATCACTTACGGTACGAACAACGAATACGGCTTTGACTACCTGCGCGACAACATGG
CGTTCAGCCCTGAAGAACGTGTACAGCGTAAACTGCACTATGCGCTGGTGGACGAAGTGGACTCCATCCTGA
TCGATGAAGCGCGTACACCGCTGATCATTTCCGGCCCGGCAGAAGACAGCTCGGAAATGTATAAACGCGTGA
ATAAAATTATTCCGCACCTGATCCGTCAGGAAAAAGAAGACTCCGAAACCTTCCAGGGCGAAGGCCACTTCT
CGGTGGACGAAAAATCTCGCCAGGTGAACCTGACCGAACGTGGTCTGGTGCTGATTGAAGAACTGCTGGTGA
AAGAGGGCATCATGGATGAAGGGGAGTCTCTGTACTCTCCGGCCAACATCATGCTGATGCACCACGTAACGG
CGGCGCTGCGCGCTCATGCGCTGTTTACCCGTGACGTCGACTACATCGTTAAAGATGGTGAAGTTATCATCG
TTGACGAACACACCGGTCGTACCATGCAGGGCCGTCGCTGGTCCGATGGTCTGCACCAGGCTGTGGAAGCGA
AAGAAGGTGTGCAGATCCAGAACGAAAACCAAACGCTGGCTTCGATCACCTTCCAGAACTACTTCCGTCTGT
ATGAAAAACTGGCGGGGATGACCGGTACTGCTGATACCGAAGCTTTCGAATTTAGCTCAATCTACAAGCTGG
ATACCGTCGTTGTTCCGACCAACCGTCCAATGATTCGTAAAGATCTGCCGGACCTGGTCTACATGACTGAAG
CGGAAAAAATTCAGGCGATCATTGAAGATATCAAAGAACGTACTGCGAAAGGCCAGCCGGTGCTGGTGGGTA
CTATCTCCATCGAAAAATCGGAGCTGGTGTCAAACGAACTGACCAAAGCCGGTATTAAGCACAACGTCCTGA
ACGCCAAATTCCACGCCAACGAAGCGGCGATTGTTGCTCAGGCAGGTTATCCGGCTGCGGTGACTATCGCGA
CCAATATGGCGGGTCGTGGTACAGATATTGTGCTCGGTGGTAGCTGGCAGGCAGAAGTTGCCGCGCTGGAAA
ATCCGACCGCAGAGCAAATTGAAAAAATTAAAGCCGACTGGCAGGTACGTCACGATGCGGTACTGGAAGCAG
GTGGCCTGCATATCATCGGTACCGAGCGTCACGAATCCCGTCGTATCGATAACCAGTTGCGCGGTCGTTCTG
GTCGTCAGGGGGATGCTGGTTCTTCCCGTTTCTACCTGTCGATGGAAGATGCGCTGATGCGTATTTTTGCTT
CCGACCGAGTATCCGGCATGATGCGTAAACTGGGTATGAAGCCAGGCGAAGCCATTGAACACCCGTGGGTGA
CTAAAGCGATTGCCAACGCCCAGCGTAAAGTTGAAAGCCGTAACTTCGACATTCGTAAGCAACTGCTGGAAT
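
The numbered lowercase listing and the continuous uppercase block are the same sequence in two layouts. A minimal sketch of the conversion, assuming the only non-sequence characters are position numbers and whitespace (`listing_to_plain` is a hypothetical helper name):

```python
import re


def listing_to_plain(listing: str) -> str:
    """Drop position numbers and whitespace from a numbered listing, then uppercase."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


# First line of the numbered listing for sequence 1.
line = "108061 ggatacgctc agcgcgctgc tgacccagga aggcacgccg tctgaaaagg gttatcgcat"
plain = listing_to_plain(line)
print(plain)
print(len(plain))  # 60
```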

a) ORF

b) Gene size: 1941 nucleotides

c) mRNA sequence:

>lcl|ORF3 CDS
AUGCUAAUCAAAUUGUUAACUAAAGUUUUCGGUAGUCGUAACGAUCGCAC
CCUGCGCCGGAUGCGCAAAGUGGUCAACAUCAUCAAUGCCAUGGAACCGG
AGAUGGAAAAACUCUCCGACGAAGAACUGAAAGGGAAAACCGCAGAGUUU
CGUGCACGUCUGGAAAAAGGCGAAGUGCUGGAAAAUCUGAUCCCGGAAGC
UUUCGCCGUGGUACGUGAGGCAAGUAAGCGCGUCUUUGGUAUGCGUCACU
UCGACGUUCAGUUACUCGGCGGUAUGGUUCUUAACGAACGCUGCAUCGCC
GAAAUGCGUACCGGUGAAGGAAAAACCCUGACCGCAACGCUGCCUGCUUA
CCUGAACGCACUAACCGGUAAAGGCGUGCACGUAGUUACCGUCAACGACU
ACCUGGCGCAACGUGACGCCGAAAACAACCGUCCGCUGUUUGAAUUCCUU
GGCCUGACUGUCGGUAUCAACCUGCCGGGCAUGCCAGCACCGGCAAAGCG
CGAAGCUUACGCAGCUGACAUCACUUACGGUACGAACAACGAAUACGGCU
UUGACUACCUGCGCGACAACAUGGCGUUCAGCCCUGAAGAACGUGUACAG
CGUAAACUGCACUAUGCGCUGGUGGACGAAGUGGACUCCAUCCUGAUCGA
UGAAGCGCGUACACCGCUGAUCAUUUCCGGCCCGGCAGAAGACAGCUCGG
AAAUGUAUAAACGCGUGAAUAAAAUUAUUCCGCACCUGAUCCGUCAGGAA
AAAGAAGACUCCGAAACCUUCCAGGGCGAAGGCCACUUCUCGGUGGACGA
AAAAUCUCGCCAGGUGAACCUGACCGAACGUGGUCUGGUGCUGAUUGAAG
AACUGCUGGUGAAAGAGGGCAUCAUGGAUGAAGGGGAGUCUCUGUACUCU
CCGGCCAACAUCAUGCUGAUGCACCACGUAACGGCGGCGCUGCGCGCUCA
UGCGCUGUUUACCCGUGACGUCGACUACAUCGUUAAAGAUGGUGAAGUUA
UCAUCGUUGACGAACACACCGGUCGUACCAUGCAGGGCCGUCGCUGGUCC
GAUGGUCUGCACCAGGCUGUGGAAGCGAAAGAAGGUGUGCAGAUCCAGAA
CGAAAACCAAACGCUGGCUUCGAUCACCUUCCAGAACUACUUCCGUCUGU
AUGAAAAACUGGCGGGGAUGACCGGUACUGCUGAUACCGAAGCUUUCGAA
UUUAGCUCAAUCUACAAGCUGGAUACCGUCGUUGUUCCGACCAACCGUCC
AAUGAUUCGUAAAGAUCUGCCGGACCUGGUCUACAUGACUGAAGCGGAAA
AAAUUCAGGCGAUCAUUGAAGAUAUCAAAGAACGUACUGCGAAAGGCCAG
CCGGUGCUGGUGGGUACUAUCUCCAUCGAAAAAUCGGAGCUGGUGUCAAA
CGAACUGACCAAAGCCGGUAUUAAGCACAACGUCCUGAACGCCAAAUUCC
ACGCCAACGAAGCGGCGAUUGUUGCUCAGGCAGGUUAUCCGGCUGCGGUG
ACUAUCGCGACCAAUAUGGCGGGUCGUGGUACAGAUAUUGUGCUCGGUGG
UAGCUGGCAGGCAGAAGUUGCCGCGCUGGAAAAUCCGACCGCAGAGCAAA
UUGAAAAAAUUAAAGCCGACUGGCAGGUACGUCACGAUGCGGUACUGGAA
GCAGGUGGCCUGCAUAUCAUCGGUACCGAGCGUCACGAAUCCCGUCGUAU
CGAUAACCAGUUGCGCGGUCGUUCUGGUCGUCAGGGGGAUGCUGGUUCUU
CCCGUUUCUACCUGUCGAUGGAAGAUGCGCUGAUGCGUAUUUUUGCUUCC
GACCGAGUAUCCGGCAUGAUGCGUAAACUGGGUAUGAAGCCAGGCGAAGC
CAUUGAACACCCGUGGGUGACUAAAGCGAUUGCCAACGCCCAGCGUAAAG
UUGAAAGCCGUAACUUCGACAUUCGUAAGCAACUGCUGGAA

d) mRNA size: 1941 nucleotides
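
The mRNA has the same sequence as the coding strand of the ORF, with U in place of T, so its length equals that of the ORF. A minimal sketch, using only the first 30 nucleotides of the ORF (`transcribe` is a hypothetical helper name):

```python
def transcribe(coding_dna: str) -> str:
    """Return the mRNA for a coding-strand DNA sequence (T replaced by U)."""
    return coding_dna.upper().replace("T", "U")


cds_start = "ATGCTAATCAAATTGTTAACTAAAGTTTTC"  # first 30 nt of the ORF in sequence 1
mrna_start = transcribe(cds_start)
print(mrna_start)                          # AUGCUAAUCAAAUUGUUAACUAAAGUUUUC
print(len(mrna_start) == len(cds_start))   # True
```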


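e) Reverse complement

Note: the strand listed below is the complement of the coding sequence in c), written base by base in the same left-to-right order. Reading it from right to left gives the reverse complement in the 5'→3' direction.
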
TACGATTAGTTTAACAATTGATTTCAAAAGCCATCAGCATTGCTAGCGTG
GGACGCGGCCTACGCGTTTCACCAGTTGTAGTAGTTACGGTACCTTGGCC
TCTACCTTTTTGAGAGGCTGCTTCTTGACTTTCCCTTTTGGCGTCTCAAA
GCACGTGCAGACCTTTTTCCGCTTCACGACCTTTTAGACTAGGGCCTTCG
AAAGCGGCACCATGCACTCCGTTCATTCGCGCAGAAACCATACGCAGTGA
AGCTGCAAGTCAATGAGCCGCCATACCAAGAATTGCTTGCGACGTAGCGG
CTTTACGCATGGCCACTTCCTTTTTGGGACTGGCGTTGCGACGGACGAAT
GGACTTGCGTGATTGGCCATTTCCGCACGTGCATCAATGGCAGTTGCTGA
TGGACCGCGTTGCACTGCGGCTTTTGTTGGCAGGCGACAAACTTAAGGAA
CCGGACTGACAGCCATAGTTGGACGGCCCGTACGGTCGTGGCCGTTTCGC
GCTTCGAATGCGTCGACTGTAGTGAATGCCATGCTTGTTGCTTATGCCGA
AACTGATGGACGCGCTGTTGTACCGCAAGTCGGGACTTCTTGCACATGTC
GCATTTGACGTGATACGCGACCACCTGCTTCACCTGAGGTAGGACTAGCT
ACTTCGCGCATGTGGCGACTAGTAAAGGCCGGGCCGTCTTCTGTCGAGCC
TTTACATATTTGCGCACTTATTTTAATAAGGCGTGGACTAGGCAGTCCTT
TTTCTTCTGAGGCTTTGGAAGGTCCCGCTTCCGGTGAAGAGCCACCTGCT
TTTTAGAGCGGTCCACTTGGACTGGCTTGCACCAGACCACGACTAACTTC
TTGACGACCACTTTCTCCCGTAGTACCTACTTCCCCTCAGAGACATGAGA
GGCCGGTTGTAGTACGACTACGTGGTGCATTGCCGCCGCGACGCGCGAGT
ACGCGACAAATGGGCACTGCAGCTGATGTAGCAATTTCTACCACTTCAAT
AGTAGCAACTGCTTGTGTGGCCAGCATGGTACGTCCCGGCAGCGACCAGG
CTACCAGACGTGGTCCGACACCTTCGCTTTCTTCCACACGTCTAGGTCTT
GCTTTTGGTTTGCGACCGAAGCTAGTGGAAGGTCTTGATGAAGGCAGACA
TACTTTTTGACCGCCCCTACTGGCCATGACGACTATGGCTTCGAAAGCTT
AAATCGAGTTAGATGTTCGACCTATGGCAGCAACAAGGCTGGTTGGCAGG
TTACTAAGCATTTCTAGACGGCCTGGACCAGATGTACTGACTTCGCCTTT
TTTAAGTCCGCTAGTAACTTCTATAGTTTCTTGCATGACGCTTTCCGGTC
GGCCACGACCACCCATGATAGAGGTAGCTTTTTAGCCTCGACCACAGTTT
GCTTGACTGGTTTCGGCCATAATTCGTGTTGCAGGACTTGCGGTTTAAGG
TGCGGTTGCTTCGCCGCTAACAACGAGTCCGTCCAATAGGCCGACGCCAC
TGATAGCGCTGGTTATACCGCCCAGCACCATGTCTATAACACGAGCCACC
ATCGACCGTCCGTCTTCAACGGCGCGACCTTTTAGGCTGGCGTCTCGTTT
AACTTTTTTAATTTCGGCTGACCGTCCATGCAGTGCTACGCCATGACCTT
CGTCCACCGGACGTATAGTAGCCATGGCTCGCAGTGCTTAGGGCAGCATA
GCTATTGGTCAACGCGCCAGCAAGACCAGCAGTCCCCCTACGACCAAGAA
GGGCAAAGATGGACAGCTACCTTCTACGCGACTACGCATAAAAACGAAGG
CTGGCTCATAGGCCGTACTACGCATTTGACCCATACTTCGGTCCGCTTCG
GTAACTTGTGGGCACCCACTGATTTCGCTAACGGTTGCGGGTCGCATTTC
AACTTTCGGCATTGAAGCTGTAAGCATTCGTTGACGACCTT
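
The complement and the reverse complement contain the same bases and differ only in reading direction. The listing in e) starts with TACGATTAGTTTAACAATTG, which is the base-by-base complement of the first 20 nucleotides of the ORF. A minimal sketch of both operations, assuming uppercase input with only A, C, G and T:

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(dna: str) -> str:
    """Base-by-base complement, in the same left-to-right order as the input."""
    return dna.translate(COMPLEMENT)


def reverse_complement(dna: str) -> str:
    """Complement read in the opposite direction (5'->3' on the other strand)."""
    return complement(dna)[::-1]


cds_start = "ATGCTAATCAAATTGTTAAC"  # first 20 nt of the ORF in sequence 1
print(complement(cds_start))          # TACGATTAGTTTAACAATTG
print(reverse_complement(cds_start))  # GTTAACAATTTGATTAGCAT
```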

f) Amino acid sequence
5'3' Frame 1
MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGE
VLENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPA
YLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADI
TYGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDS
SEMYKRVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEG
IMDEGESLYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRR
WSDGLHQAVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYK
LDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSEL
VSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAE
VAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGD
AGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRN
FDIRKQLLE
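
The translation above can be reproduced codon by codon. The sketch below is not the tool used for this assignment. It builds the standard genetic code from the usual TCAG ordering, accepts DNA or mRNA, and writes stop codons as "-". The name `translate` and the `stop_symbol` parameter are hypothetical, and any character other than A, C, G, T or U raises a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order ("*" = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; trailing partial codons are ignored."""
    cds = cds.upper().replace("U", "T")
    codons = (cds[i:i + 3] for i in range(0, len(cds) - 2, 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


print(translate("AUGCUAAUCAAAUUGUUAACU"))  # MLIKLLT
```

The seven residues printed match the start of the sequence in f).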

g) Protein size: 647 amino acids (1941 nucleotides / 3; the fragment ends before the stop codon, so this ORF is partial)

Sequence 2

2336641 tcattgccac attccttgtg tatagccagc cattttttac gggcacagcc aaactttacc
2336701 gtgccctaat acgacaaaag cccagacttt gcagcctgga cttttcaatt caaacaaggg
2336761 agatagctcc cttttggcat gaagaagtaa aattattctt cttctggctc gtcgtcaacg
2336821 tccacttccg gagcgatttc atcgtcccct tccgcggcac tgccgtcgat ggtatccaga
2336881 tcttcctcgt caaccggttc agcaacacgt tgcagaccca ctacgttttc atcttccgca
2336941 gtacggatga ggatcacgcc ctgggtgtta cggcccacga tgctgatttc cgaaacgcga
2337001 gtacgtacca gcgtaccggc atcggtgatc atcatgatct ggtcgcagtc atctacctgt
2337061 accgcgccaa caactaaacc gttacgttcg gtaaccttga tggagataac ccctttcgtc
2337121 gcacgcgact tggttgggta ttccgccact gcggtacgtt taccgtaacc gttttgcgtt
2337181 gcggtgagga ttgcgccatc gccacgaggc acgatcagag agacgacttt atcgccttca
2337241 cctaagcgaa taccgcgaac accggtggtg ttgcagccca tcgcacggac agaagactct
2337301 ttaaagcgca ccactttacc ttcagcggag aacagcatta cttcgtcttc gccgctggtc
2337361 aggtcaacgc cgatcagctc atcgccgtca accagtttga tcgccacttt accggcggta
2337421 cgcagacggt tgaactcggt gaggacagtt ttcttcacgg taccgttagc ggtcgccatg
2337481 aagactttca cgccttcttc aaactcggtc actggcagga tcgcagtgat acgttcgtcc
2337541 tgctccagcg gcagcaggtt gacgatcgga cgaccgcgcg cgccacgagt ggcttccggc
2337601 aactgataaa ctttcatcga atagacgcga ccacggctgg agaagcacag aatatggtcg
2337661 tgagtgttcg ccaccagcag tcggtcgata aagtcttctt ctttaatacg tgcggcagat
2337721 ttacctttcc cgccacgacg ctgcgcttcg tattcagaaa gcggctgata cttaacgtag
2337781 ccctggtgag agagcgtcac gaccacatct tcctgggtga tcagatcttc caggttgatg
2337841 tctgcgctgt tggcggtgat ttcagtacga cgtttgtcac cgaactgttc acgaaccagc
2337901 tccagctctt cacggatcac ttccatcaga cgatcggcgc taccaagaat acgcaacagt
2337961 tccgcgatct gatccagcag ctctttgtat tcgtcgagca gtttttcgtg ctcaagaccg
2338021 gtcagtttct gcaaacgcag atccagaatc gcctgagctt gctgttcggt caggtagtac
2338081 agaccatcac gcacgccgaa ctctggctcc agccattccg gacgcgcagc atcgtcgcca
2338141 gcacgttcga gcatcgcggc aacgttgccc agctgccacg gattagcaac cagcgcagtt
2338201 ttcgcttctg caggcgtcgg cgcatgacgg atcagttcga tgatcgggtc gatgttcgcc
2338261 agcgccacgg ctaatgcttc aaggatatga gcacgatcgc gagctttacg cagttcgaaa
2338321 atagtacgac gggtcaccac ttcacggcgg tgacgaacaa acgccgcgat gatgtctttc
2338381 aggttcatga tcttcggctg accatggtgc aatgccacca tgttgatacc gaaagaaacc
2338441 tgcaactggg tctgggagta gaggttgttg agcacaactt caccgaccgc atcgcgtttc
2338501 acttcaatca cgatgcgcat accgtctttg tcagactcgt cacgcagcgc gctgatgcct
2338561 tccacgcgtt tttcttttac cagttccgca atcttctcga tcaggcgcgc tttgtttacc
2338621 tgatacggaa tttcgtggac gataatggtt tcacgaccgg ttttggcgtc aacttccact
2338681 tctgcgcgag cgcggatata caccttgccg cgaccggtac ggtaagcttc ttcaataccg
2338741 cgacgaccgt taatgattgc cgccgtcggg aagtccggcc ccgggatgtg ttccatcagc
2338801 ccttcaatgc tgatgtcttc atcatcaata tacgccagac aaccgttgat gacttccgtc
2338861 aggttgtgcg gcgggatgtt ggttgccata cctacggcga taccggaaga accgttcacc
2338921 agcaggttag gaattttggt tggcatgacg tccggaattt tttccgtgcc gtcatagtta
2338981 tcaacgaaat cgaccgtctc tttttcgaga tcggccatca gttcatgggc aattttcgcc
2339041 agacggattt ccgtataacg cattgccgcc gcagagtcgc cgtcgataga accgaagtta
2339101 ccctgaccgt ctaccagcat ataacgcagc gagaatggct gcgccatgcg gacgatcgtg
2339161 tcatagaccg ccgagtcacc atggggatgg tatttaccga ttacgtcacc aacgacacgg
2339221 gcagattttt tataggcttt gttccagtca ttgcctagta cgttcatggc gtaaagtacg
2339281 cgacggtgta ccggcttcag gccatctcgg acatctggca gcgcacggcc aacaatgacc
2339341 gacatcgcat aatccagata ggagctcttc agctcttcct caatgttgac cggtgtaatt
2339401 tctctcgcaa ggtcgctcat ctaaccgcta tccctctact gtatcccgga ttcaaaggtc
2339461 gcaaattata acacagccgc gcagtttgag gtaaacctat acgctttatt cacatccaat
2339521 gcctgatata ctcgtttgtc ttgccaatta cggagtagaa gtgccaatga atgccgaaaa

FASTA SEQUENCE

TCA TTG CCA CAT TCC TTG TGT ATA GCC AGC CAT TTT TTA CGG GCA CAG CCA AAC TTT
ACC GTG CCC TAA TAC GAC AAA AGC CCA GAC TTT GCA GCC TGG ACT TTT CAA TTC
AAA CAA GGG AGA TAG CTC CCT TTT GGC ATG AAG AAG TAA AAT TAT TCT TCT TCT
GGC TCG TCG TCA ACG TCC ACT TCC GGA GCG ATT TCA TCG TCC CCT TCC GCG GCA
CTG CCG TCG ATG GTA TCC AGA TCT TCC TCG TCA ACC GGT TCA GCA ACA CGT TGC
AGA CCC ACT ACG TTT TCA TCT TCC GCA GTA CGG ATG AGG ATC ACG CCC TGG GTG
TTA CGG CCC ACG ATG CTG ATT TCC GAA ACG CGA GTA CGT ACC AGC GTA CCG GCA
TCG GTG ATC ATC ATG ATC TGG TCG CAG TCA TCT ACC TGT ACC GCG CCA ACA ACT
AAA CCG TTA CGT TCG GTA ACC TTG ATG GAG ATA ACC CCT TTC GTC GCA CGC GAC
TTG GTT GGG TAT TCC GCC ACT GCG GTA CGT TTA CCG TAA CCG TTT TGC GTT GCG
GTG AGG ATT GCG CCA TCG CCA CGA GGC ACG ATC AGA GAG ACG ACT TTA TCG CCT
TCA CCT AAG CGA ATA CCG CGA ACA CCG GTG GTG TTG CAG CCC ATC GCA CGG ACA
GAA GAC TCT TTA AAG CGC ACC ACT TTA CCT TCA GCG GAG AAC AGC ATT ACT TCG
TCT TCG CCG CTG GTC AGG TCA ACG CCG ATC AGC TCA TCG CCG TCA ACC AGT TTG
ATC GCC ACT TTA CCG GCG GTA CGC AGA CGG TTG AAC TCG GTG AGG ACA GTT TTC
TTC ACG GTA CCG TTA GCG GTC GCC ATG AAG ACT TTC ACG CCT TCT TCA AAC TCG
GTC ACT GGC AGG ATC GCA GTG ATA CGT TCG TCC TGC TCC AGC GGC AGC AGG TTG
ACG ATC GGA CGA CCG CGC GCG CCA CGA GTG GCT TCC GGC AAC TGA TAA ACT TTC
ATC GAA TAG ACG CGA CCA CGG CTG GAG AAG CAC AGA ATA TGG TCG TGA GTG TTC
GCC ACC AGC AGT CGG TCG ATA AAG TCT TCT TCT TTA ATA CGT GCG GCA GAT TTA
CCT TTC CCG CCA CGA CGC TGC GCT TCG TAT TCA GAA AGC GGC TGA TAC TTA ACG
TAG CCC TGG TGA GAG AGC GTC ACG ACC ACA TCT TCC TGG GTG ATC AGA TCT TCC
AGG TTG ATG TCT GCG CTG TTG GCG GTG ATT TCA GTA CGA CGT TTG TCA CCG AAC
TGT TCA CGA ACC AGC TCC AGC TCT TCA CGG ATC ACT TCC ATC AGA CGA TCG GCG
CTA CCA AGA ATA CGC AAC AGT TCC GCG ATC TGA TCC AGC AGC TCT TTG TAT TCG
TCG AGC AGT TTT TCG TGC TCA AGA CCG GTC AGT TTC TGC AAA CGC AGA TCC AGA
ATC GCC TGA GCT TGC TGT TCG GTC AGG TAG TAC AGA CCA TCA CGC ACG CCG AAC
TCT GGC TCC AGC CAT TCC GGA CGC GCA GCA TCG TCG CCA GCA CGT TCG AGC ATC
GCG GCA ACG TTG CCC AGC TGC CAC GGA TTA GCA ACC AGC GCA GTT TTC GCT TCT
GCA GGC GTC GGC GCA TGA CGG ATC AGT TCG ATG ATC GGG TCG ATG TTC GCC AGC
GCC ACG GCT AAT GCT TCA AGG ATA TGA GCA CGA TCG CGA GCT TTA CGC AGT TCG
AAA ATA GTA CGA CGG GTC ACC ACT TCA CGG CGG TGA CGA ACA AAC GCC GCG ATG
ATG TCT TTC AGG TTC ATG ATC TTC GGC TGA CCA TGG TGC AAT GCC ACC ATG TTG
ATA CCG AAA GAA ACC TGC AAC TGG GTC TGG GAG TAG AGG TTG TTG AGC ACA ACT
TCA CCG ACC GCA TCG CGT TTC ACT TCA ATC ACG ATG CGC ATA CCG TCT TTG TCA
GAC TCG TCA CGC AGC GCG CTG ATG CCT TCC ACG CGT TTT TCT TTT ACC AGT TCC
GCA ATC TTC TCG ATC AGG CGC GCT TTG TTT ACC TGA TAC GGA ATT TCG TGG ACG
ATA ATG GTT TCA CGA CCG GTT TTG GCG TCA ACT TCC ACT TCT GCG CGA GCG CGG
ATA TAC ACC TTG CCG CGA CCG GTA CGG TAA GCT TCT TCA ATA CCG CGA CGA CCG
TTA ATG ATT GCC GCC GTC GGG AAG TCC GGC CCC GGG ATG TGT TCC ATC AGC CCT
TCA ATG CTG ATG TCT TCA TCA TCA ATA TAC GCC AGA CAA CCG TTG ATG ACT TCC
GTC AGG TTG TGC GGC GGG ATG TTG GTT GCC ATA CCT ACG GCG ATA CCG GAA GAA
CCG TTC ACC AGC AGG TTA GGA ATT TTG GTT GGC ATG ACG TCC GGA ATT TTT TCC
GTG CCG TCA TAG TTA TCA ACG AAA TCG ACC GTC TCT TTT TCG AGA TCG GCC ATC
AGT TCA TGG GCA ATT TTC GCC AGA CGG ATT TCC GTA TAA CGC ATT GCC GCC GCA
GAG TCG CCG TCG ATA GAA CCG AAG TTA CCC TGA CCG TCT ACC AGC ATA TAA CGC
AGC GAG AAT GGC TGC GCC ATG CGG ACG ATC GTG TCA TAG ACC GCC GAG TCA CCA
TGG GGA TGG TAT TTA CCG ATT ACG TCA CCA ACG ACA CGG GCA GAT TTT TTA TAG
GCT TTG TTC CAG TCA TTG CCT AGT ACG TTC ATG GCG TAA AGT ACG CGA CGG TGT
ACC GGC TTC AGG CCA TCT CGG ACA TCT GGC AGC GCA CGG CCA ACA ATG ACC GAC
ATC GCA TAA TCC AGA TAG GAG CTC TTC AGC TCT TCC TCA ATG TTG ACC GGT GTA
ATT TCT CTC GCA AGG TCG CTC ATC TAA CCG CTA TCC CTC TAC TGT ATC CCG GAT
TCA AAG GTC GCA AAT TAT AAC ACA GCC GCG CAG TTT GAG GTA AAC CTA TAC GCT
TTA TTC ACA TCC AAT GCC TGA TAT ACT CGT TTG TCT TGC CAA TTA CGG AGT AGA
AGT GCC AAT GAA TGC CGA AAA

a) ORF

b) Gene size: 2628 nucleotides
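
For this sequence the ORF lies on the strand complementary to the printed listing: the mRNA in c) begins AUGAGCGACC, and its reverse complement, ggtcgctcat, appears in the listing line numbered 2339401. The sketch below shows how an interval found on the reverse complement maps back to the coordinates printed in the listing. The function name is hypothetical. The index values were read off the listing by hand and should be treated as assumptions to re-check.

```python
def minus_strand_to_genome(rc_start: int, rc_end: int,
                           seq_len: int, first_coord: int) -> tuple[int, int]:
    """Map a 0-based, end-exclusive interval on the reverse complement
    to 1-based inclusive coordinates on the listed (forward) strand."""
    fwd_start = seq_len - rc_end       # 0-based index of the leftmost base
    fwd_end = seq_len - rc_start - 1   # 0-based index of the rightmost base
    return first_coord + fwd_start, first_coord + fwd_end


# Assumed values, derived by hand from the listing for sequence 2:
SEQ_LEN = 2940           # 49 lines of 60 nt, positions 2336641..2339580
FIRST_COORD = 2336641    # coordinate of the first listed base
RC_START = 160           # index of the ATG on the reverse complement
RC_END = RC_START + 2628  # ORF length, stop codon included

left, right = minus_strand_to_genome(RC_START, RC_END, SEQ_LEN, FIRST_COORD)
print(left, right)       # 2336793 2339420
print(right - left + 1)  # 2628
```

Under these assumptions the 2628-nucleotide ORF covers positions 2336793 to 2339420 of the listing, read on the complementary strand.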


c) mRNA sequence:
>lcl|ORF10 CDS
AUGAGCGACCUUGCGAGAGAAAUUACACCGGUCAACAUUGAGGAAGAGCU
GAAGAGCUCCUAUCUGGAUUAUGCGAUGUCGGUCAUUGUUGGCCGUGCGC
UGCCAGAUGUCCGAGAUGGCCUGAAGCCGGUACACCGUCGCGUACUUUAC
GCCAUGAACGUACUAGGCAAUGACUGGAACAAAGCCUAUAAAAAAUCUGC
CCGUGUCGUUGGUGACGUAAUCGGUAAAUACCAUCCCCAUGGUGACUCGG
CGGUCUAUGACACGAUCGUCCGCAUGGCGCAGCCAUUCUCGCUGCGUUAU
AUGCUGGUAGACGGUCAGGGUAACUUCGGUUCUAUCGACGGCGACUCUGC
GGCGGCAAUGCGUUAUACGGAAAUCCGUCUGGCGAAAAUUGCCCAUGAAC
UGAUGGCCGAUCUCGAAAAAGAGACGGUCGAUUUCGUUGAUAACUAUGAC
GGCACGGAAAAAAUUCCGGACGUCAUGCCAACCAAAAUUCCUAACCUGCU
GGUGAACGGUUCUUCCGGUAUCGCCGUAGGUAUGGCAACCAACAUCCCGC
CGCACAACCUGACGGAAGUCAUCAACGGUUGUCUGGCGUAUAUUGAUGAU
GAAGACAUCAGCAUUGAAGGGCUGAUGGAACACAUCCCGGGGCCGGACUU
CCCGACGGCGGCAAUCAUUAACGGUCGUCGCGGUAUUGAAGAAGCUUACC
GUACCGGUCGCGGCAAGGUGUAUAUCCGCGCUCGCGCAGAAGUGGAAGUU
GACGCCAAAACCGGUCGUGAAACCAUUAUCGUCCACGAAAUUCCGUAUCA
GGUAAACAAAGCGCGCCUGAUCGAGAAGAUUGCGGAACUGGUAAAAGAAA
AACGCGUGGAAGGCAUCAGCGCGCUGCGUGACGAGUCUGACAAAGACGGU
AUGCGCAUCGUGAUUGAAGUGAAACGCGAUGCGGUCGGUGAAGUUGUGCU
CAACAACCUCUACUCCCAGACCCAGUUGCAGGUUUCUUUCGGUAUCAACA
UGGUGGCAUUGCACCAUGGUCAGCCGAAGAUCAUGAACCUGAAAGACAUC
AUCGCGGCGUUUGUUCGUCACCGCCGUGAAGUGGUGACCCGUCGUACUAU
UUUCGAACUGCGUAAAGCUCGCGAUCGUGCUCAUAUCCUUGAAGCAUUAG
CCGUGGCGCUGGCGAACAUCGACCCGAUCAUCGAACUGAUCCGUCAUGCG
CCGACGCCUGCAGAAGCGAAAACUGCGCUGGUUGCUAAUCCGUGGCAGCU
GGGCAACGUUGCCGCGAUGCUCGAACGUGCUGGCGACGAUGCUGCGCGUC
CGGAAUGGCUGGAGCCAGAGUUCGGCGUGCGUGAUGGUCUGUACUACCUG
ACCGAACAGCAAGCUCAGGCGAUUCUGGAUCUGCGUUUGCAGAAACUGAC
CGGUCUUGAGCACGAAAAACUGCUCGACGAAUACAAAGAGCUGCUGGAUC
AGAUCGCGGAACUGUUGCGUAUUCUUGGUAGCGCCGAUCGUCUGAUGGAA
GUGAUCCGUGAAGAGCUGGAGCUGGUUCGUGAACAGUUCGGUGACAAACG
UCGUACUGAAAUCACCGCCAACAGCGCAGACAUCAACCUGGAAGAUCUGA
UCACCCAGGAAGAUGUGGUCGUGACGCUCUCUCACCAGGGCUACGUUAAG
UAUCAGCCGCUUUCUGAAUACGAAGCGCAGCGUCGUGGCGGGAAAGGUAA
AUCUGCCGCACGUAUUAAAGAAGAAGACUUUAUCGACCGACUGCUGGUGG
CGAACACUCACGACCAUAUUCUGUGCUUCUCCAGCCGUGGUCGCGUCUAU
UCGAUGAAAGUUUAUCAGUUGCCGGAAGCCACUCGUGGCGCGCGCGGUCG
UCCGAUCGUCAACCUGCUGCCGCUGGAGCAGGACGAACGUAUCACUGCGA
UCCUGCCAGUGACCGAGUUUGAAGAAGGCGUGAAAGUCUUCAUGGCGACC
GCUAACGGUACCGUGAAGAAAACUGUCCUCACCGAGUUCAACCGUCUGCG
UACCGCCGGUAAAGUGGCGAUCAAACUGGUUGACGGCGAUGAGCUGAUCG
GCGUUGACCUGACCAGCGGCGAAGACGAAGUAAUGCUGUUCUCCGCUGAA
GGUAAAGUGGUGCGCUUUAAAGAGUCUUCUGUCCGUGCGAUGGGCUGCAA
CACCACCGGUGUUCGCGGUAUUCGCUUAGGUGAAGGCGAUAAAGUCGUCU
CUCUGAUCGUGCCUCGUGGCGAUGGCGCAAUCCUCACCGCAACGCAAAAC
GGUUACGGUAAACGUACCGCAGUGGCGGAAUACCCAACCAAGUCGCGUGC
GACGAAAGGGGUUAUCUCCAUCAAGGUUACCGAACGUAACGGUUUAGUUG
UUGGCGCGGUACAGGUAGAUGACUGCGACCAGAUCAUGAUGAUCACCGAU
GCCGGUACGCUGGUACGUACUCGCGUUUCGGAAAUCAGCAUCGUGGGCCG
UAACACCCAGGGCGUGAUCCUCAUCCGUACUGCGGAAGAUGAAAACGUAG
UGGGUCUGCAACGUGUUGCUGAACCGGUUGACGAGGAAGAUCUGGAUACC
AUCGACGGCAGUGCCGCGGAAGGGGACGAUGAAAUCGCUCCGGAAGUGGA
CGUUGACGACGAGCCAGAAGAAGAAUAA

d) mRNA size: 2628 nucleotides

e) Reverse complement

TTC ATT GGC ACT TCT ACT CCG TAA TTG GCA AGA CAA ACG AGT ATA TCA GGC ATT
GGA TGT GAA TAA AGC GTA TAG GTT TAC CTC AAA CTG CGC GGC TGT GTT ATA ATT
TGC GAC CTT TGA ATC CGG GAT ACA GTA GAG GGA TAG CGG TTA GAT GAG CGA CCT
TGC GAG AGA AAT TAC ACC GGT CAA CAT TGA GGA AGA GCT GAA GAG CTC CTA TCT
GGA TTA TGC GAT GTC GGT CAT TGT TGG CCG TGC GCT GCC AGA TGT CCG AGA TGG
CCT GAA GCC GGT ACA CCG TCG CGT ACT TTA CGC CAT GAA CGT ACT AGG CAA TGA
CTG GAA CAA AGC CTA TAA AAA ATC TGC CCG TGT CGT TGG TGA CGT AAT CGG TAA
ATA CCA TCC CCA TGG TGA CTC GGC GGT CTA TGA CAC GAT CGT CCG CAT GGC GCA
GCC ATT CTC GCT GCG TTA TAT GCT GGT AGA CGG TCA GGG TAA CTT CGG TTC TAT
CGA CGG CGA CTC TGC GGC GGC AAT GCG TTA TAC GGA AAT CCG TCT GGC GAA AAT
TGC CCA TGA ACT GAT GGC CGA TCT CGA AAA AGA GAC GGT CGA TTT CGT TGA TAA
CTA TGA CGG CAC GGA AAA AAT TCC GGA CGT CAT GCC AAC CAA AAT TCC TAA CCT
GCT GGT GAA CGG TTC TTC CGG TAT CGC CGT AGG TAT GGC AAC CAA CAT CCC GCC
GCA CAA CCT GAC GGA AGT CAT CAA CGG TTG TCT GGC GTA TAT TGA TGA TGA AGA
CAT CAG CAT TGA AGG GCT GAT GGA ACA CAT CCC GGG GCC GGA CTT CCC GAC GGC
GGC AAT CAT TAA CGG TCG TCG CGG TAT TGA AGA AGC TTA CCG TAC CGG TCG CGG
CAA GGT GTA TAT CCG CGC TCG CGC AGA AGT GGA AGT TGA CGC CAA AAC CGG TCG
TGA AAC CAT TAT CGT CCA CGA AAT TCC GTA TCA GGT AAA CAA AGC GCG CCT GAT
CGA GAA GAT TGC GGA ACT GGT AAA AGA AAA ACG CGT GGA AGG CAT CAG CGC GCT
GCG TGA CGA GTC TGA CAA AGA CGG TAT GCG CAT CGT GAT TGA AGT GAA ACG CGA
TGC GGT CGG TGA AGT TGT GCT CAA CAA CCT CTA CTC CCA GAC CCA GTT GCA GGT
TTC TTT CGG TAT CAA CAT GGT GGC ATT GCA CCA TGG TCA GCC GAA GAT CAT GAA
CCT GAA AGA CAT CAT CGC GGC GTT TGT TCG TCA CCG CCG TGA AGT GGT GAC CCG
TCG TAC TAT TTT CGA ACT GCG TAA AGC TCG CGA TCG TGC TCA TAT CCT TGA AGC
ATT AGC CGT GGC GCT GGC GAA CAT CGA CCC GAT CAT CGA ACT GAT CCG TCA TGC
GCC GAC GCC TGC AGA AGC GAA AAC TGC GCT GGT TGC TAA TCC GTG GCA GCT GGG
CAA CGT TGC CGC GAT GCT CGA ACG TGC TGG CGA CGA TGC TGC GCG TCC GGA ATG
GCT GGA GCC AGA GTT CGG CGT GCG TGA TGG TCT GTA CTA CCT GAC CGA ACA GCA
AGC TCA GGC GAT TCT GGA TCT GCG TTT GCA GAA ACT GAC CGG TCT TGA GCA CGA
AAA ACT GCT CGA CGA ATA CAA AGA GCT GCT GGA TCA GAT CGC GGA ACT GTT GCG
TAT TCT TGG TAG CGC CGA TCG TCT GAT GGA AGT GAT CCG TGA AGA GCT GGA GCT
GGT TCG TGA ACA GTT CGG TGA CAA ACG TCG TAC TGA AAT CAC CGC CAA CAG CGC
AGA CAT CAA CCT GGA AGA TCT GAT CAC CCA GGA AGA TGT GGT CGT GAC GCT CTC
TCA CCA GGG CTA CGT TAA GTA TCA GCC GCT TTC TGA ATA CGA AGC GCA GCG TCG
TGG CGG GAA AGG TAA ATC TGC CGC ACG TAT TAA AGA AGA AGA CTT TAT CGA CCG
ACT GCT GGT GGC GAA CAC TCA CGA CCA TAT TCT GTG CTT CTC CAG CCG TGG TCG
CGT CTA TTC GAT GAA AGT TTA TCA GTT GCC GGA AGC CAC TCG TGG CGC GCG CGG
TCG TCC GAT CGT CAA CCT GCT GCC GCT GGA GCA GGA CGA ACG TAT CAC TGC GAT
CCT GCC AGT GAC CGA GTT TGA AGA AGG CGT GAA AGT CTT CAT GGC GAC CGC TAA
CGG TAC CGT GAA GAA AAC TGT CCT CAC CGA GTT CAA CCG TCT GCG TAC CGC CGG
TAA AGT GGC GAT CAA ACT GGT TGA CGG CGA TGA GCT GAT CGG CGT TGA CCT GAC
CAG CGG CGA AGA CGA AGT AAT GCT GTT CTC CGC TGA AGG TAA AGT GGT GCG CTT
TAA AGA GTC TTC TGT CCG TGC GAT GGG CTG CAA CAC CAC CGG TGT TCG CGG TAT
TCG CTT AGG TGA AGG CGA TAA AGT CGT CTC TCT GAT CGT GCC TCG TGG CGA TGG
CGC AAT CCT CAC CGC AAC GCA AAA CGG TTA CGG TAA ACG TAC CGC AGT GGC GGA
ATA CCC AAC CAA GTC GCG TGC GAC GAA AGG GGT TAT CTC CAT CAA GGT TAC CGA
ACG TAA CGG TTT AGT TGT TGG CGC GGT ACA GGT AGA TGA CTG CGA CCA GAT CAT
GAT GAT CAC CGA TGC CGG TAC GCT GGT ACG TAC TCG CGT TTC GGA AAT CAG CAT
CGT GGG CCG TAA CAC CCA GGG CGT GAT CCT CAT CCG TAC TGC GGA AGA TGA AAA
CGT AGT GGG TCT GCA ACG TGT TGC TGA ACC GGT TGA CGA GGA AGA TCT GGA TAC
CAT CGA CGG CAG TGC CGC GGA AGG GGA CGA TGA AAT CGC TCC GGA AGT GGA CGT
TGA CGA CGA GCC AGA AGA AGA ATA ATT TTA CTT CTT CAT

f) Amino acid sequence

5'3' Frame 1
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDW
NKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAA
AMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVG
MATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTG
RGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDE
SDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAF
VRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVAN
PWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHE
KLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLE
DLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHI
LCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMA
TANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKE
SSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRA
TKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIR
TAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE-
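
The trailing "-" in the translation marks the stop codon (UAA at the end of the mRNA in c)), which does not code for an amino acid. A minimal sketch of the resulting length arithmetic follows (`protein_length` is a hypothetical helper name). Sequence 1 is included for comparison: its 1941-nucleotide ORF contains no stop codon.

```python
def protein_length(cds_nt: int, ends_with_stop: bool) -> int:
    """Number of amino acids encoded by a CDS of cds_nt nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_nt // 3
    # A terminal stop codon is counted in the nucleotide length but not in the protein.
    return codons - 1 if ends_with_stop else codons


print(protein_length(2628, ends_with_stop=True))   # 875
print(protein_length(1941, ends_with_stop=False))  # 647
```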