RettBASE: IRSA MECP2 Variation Database
Since the publication of these papers, MECP2 has been sequenced in other organisms. This page is intended as a convenient reference to these sequences. I have not sequenced any of these organisms myself. Please note:
Primate MeCP2 proteins were not included because they differ very little from human MeCP2. The chimpanzee MeCP2 protein is identical to the human protein for both MeCP2_e1 and MeCP2_e2. A MeCP2_e2 sequence found in the crab-eating macaque (NCBI GI 15419705) differs from the human MeCP2_e2 sequence by only one amino acid, which makes the two sequences 99.8% identical.
A putative Fugu rubripes sequence homologous to human MECP2 has been described, but it is a computational prediction. The predicted gene has more exons than human MECP2 and does not include an equivalent of human exon 1 or exon 2. A DNA sequence that translates to "PQDLSTSRP" (an amino acid sequence found in the zebrafish protein) lies in a region of the F. rubripes sequence that Ensembl regards as intronic. In addition, when the putative Fugu sequence was compared with a putative Tetraodon nigroviridis MECP2 sequence (Ensembl Gene ID GSTENG00009035001), the predicted exons and splice sites were different.
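The 99.8% figure quoted for the macaque sequence can be reproduced with simple arithmetic. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, and the calculation assumes the macaque protein aligns over all 486 residues of human MeCP2_e2 with a single substitution, which the note above implies but does not state.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two equal-length aligned sequences match."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and aligned to the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Arithmetic behind the macaque figure, ASSUMING 486 aligned residues
# (the length of human MeCP2_e2) and exactly one differing residue.
HUMAN_E2_LENGTH = 486
macaque_identity = 100.0 * (HUMAN_E2_LENGTH - 1) / HUMAN_E2_LENGTH
print(f"{macaque_identity:.1f}%")  # 99.8%
```

With real data, `percent_identity` would be called on two rows taken from the same alignment, so that gaps and residues occupy matching columns.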
10 20 30 40 50
| | | | |
humanMeCP2_e2prot 1 MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHEPVQP------SAHHSA 54
cattleMeCP2_e2prot MVAGMLGLREEKSEEQDLQGLKDKPLKFKKVKKDKKEDKEGKHEPLQP------AAHHSA
dogMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKEKKEDKEGKHEPLQP------PAHHSA
mouseMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLRDKPLKFKKAKKDKKEDKEGKHEPLQP------SAHHSA
ratMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLKEKPLKFKKVKKDKKEDKEGKHEPLQP------SAHHSA
possumMeCP2_e2prot MVAGMLGLREEQSEDQDLQGLRDKPLKFRKLKRDKKEEKEGKHEFPQP------SSHQSA
XtropMeCP2_e1prot ---------EEKSEDQDLQGQKDKPPKLRKVKRDKKDEEE-KQETFHP------SEHQSG
XlaevisMeCP2_e1prot ---------EEKSEDQDLQGQKDKPPKLRKVKKDKKDEEE-KQEPFHS------SEHQPG
zebrafishMeCP2_e1prot -------RGEDKNEDQ--EGSKDKTQKHKKSKKERHDVEKLETTVSVPPPPSLFTQRDVG
*::.*:* :* ::*. * :* *::::: :: : . . :. .
60 70 80 90 100 110
| | | | | |
humanMeCP2_e2prot 55 EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS 113
cattleMeCP2_e2prot EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
dogMeCP2_e2prot EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
mouseMeCP2_e2prot EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
ratMeCP2_e2prot EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
possumMeCP2_e2prot EPAEAGKAETSE-EAGSAPAAPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
XtropMeCP2_e1prot EPADEGKADISE-SAEESLAVPEASASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS
XlaevisMeCP2_e1prot EPADEGKADMSE-SAEENLAVPESSASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS
zebrafishMeCP2_e1prot QQAEAGKSEPIDPEVGAALSAPESSASAKQRRSVIRDRGPMYEDPSLPQGWTRKLKQRKS
: *: **:: : :.**:***.*****:********:**:**:***********
120 130 140 150 160 170
| | | | | |
humanMeCP2_e2prot 114 GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP 173
cattleMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
dogMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
mouseMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
ratMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
possumMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
XtropMeCP2_e1prot GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP
XlaevisMeCP2_e1prot GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP
zebrafishMeCP2_e1prot GRSAGKFDVYLINPEGKAFRSKVELMAYFQKVGDTITDPNDFDFTVTGRGSPSRREKRPP
******:*******:**********:***:***** *******************:: *
180 190 200 210 220 230
| | | | | |
humanMeCP2_e2prot 174 KKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGK 233
cattleMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTTRPKAAASEGVQVKRVLEKSPGKLLVKMPFQAAPGSK
dogMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTARPKAATSEGVQVKRVLEKSPGKLLVKMPFQASPGSK
mouseMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLVVKMPFQASPGGK
ratMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLLVKMPFQASPGGK
possumMeCP2_e2prot KKSKSPKAPGTGRGRGRPKGSGTVKPRVTASEGVQVKRVIEKSPGKLLVKMPFQPSPGGK
XtropMeCP2_e1prot KKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSEGVQVKRVIEKSPGKLLVKMPFS---GSK
XlaevisMeCP2_e1prot KKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSEGVQVKRVIEKSPGKLLVKMPYS---GTK
zebrafishMeCP2_e1prot KKPKMVKP--SGRGRGRPKGSGKVR---QATEGVAVKRVIEKSPGKLLVKMPFVAP---K
**.* *. :********** . :*** ****:*******:****: *
240 250 260 270
| | | |
humanMeCP2_e2prot 234 AEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA 278
cattleMeCP2_e2prot AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
dogMeCP2_e2prot AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
mouseMeCP2_e2prot GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
ratMeCP2_e2prot GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
possumMeCP2_e2prot AEGGGATTSTQVMVIKRPGRKRKVETEPQVIPKKRGRKPG---------------SIVAA
XtropMeCP2_e1prot -EESDATTSEQVLVIKRPGRKRKSDTDPSAAPKKRGRKPGSV-------------SLAAA
XlaevisMeCP2_e1prot -EASDATTSQQVLVIKRGGRKRKSETDPSAAPKKRGRKPSNV-------------SLAAA
zebrafishMeCP2_e1prot TEPGAPLGQAPVAKARR-GRKRKSEQDPPSTPKKRGRKPATVSQSTVGTGSAAAYAAAAI
* . . . * :* ***** : :* ********. : .*
280 290 300 310 320
| | | | |
humanMeCP2_e2prot 279 AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL 328
cattleMeCP2_e2prot ATAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
dogMeCP2_e2prot AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
mouseMeCP2_e2prot AAAEAKKKAVKESSIRSVHETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
ratMeCP2_e2prot AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
possumMeCP2_e2prot AAVEAKKKAIKESSIRSIHETVLPIKKRKTREAVS------IEVKEVVKPLL----VSTV
XtropMeCP2_e1prot AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTVKPEP----LTPV
XlaevisMeCP2_e1prot AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTIKPEP----LTPV
zebrafishMeCP2_e1prot LTAEAKKKALKESSAKPVQERALPIKKRKTRETLEELEASTTSATETFEKRLTASTVTPT
: *****:**** :.: * .**********::. ...:..: ::.
330 340 350 360 370
| | | | |
humanMeCP2_e2prot 329 GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES 375
cattleMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSGSASS---PPKKE----------HHHHHHHVEP
dogMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSEP
mouseMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES
ratMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHAES
possumMeCP2_e2prot GEKSTKGLKPGKSPGRKSKESSPKGRSASTSSS--PPKKEQQQ-------QQQYHHHHYY
XtropMeCP2_e1prot IEKSIKGQKPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH
XlaevisMeCP2_e1prot IEKVMKGQNPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH
zebrafishMeCP2_e1prot GEEAETGQKPHKHPSRKHKEADPGSSSSGTTASGVAPKSHKKRDQRGQHFKHHHHHHHHH
*: .* :. * * : .*..* :. **.. ::::***
380 390 400 410 420
| | | | |
humanMeCP2_e2prot 376 PKAPVPLLPPLPPPPPEPESSEDPTSP-------PEPQDLSSSVCKEEKMPRGGSLESDG 428
cattleMeCP2_e2prot PKAPAPLLLPPPPPPPEPQSSEDPASP-------PEPQDLSSSVCKEEKMPRAGSLESDG
dogMeCP2_e2prot PKAPAPLLPPPPPPPPEPQSSEDPASP-------PEPQDLSSGVCKEEKMARGGSLESDG
mouseMeCP2_e2prot TKAPMPLLP--SPPPPEPESSEDPISP-------PEPQDLSSSICKEEKMPRGGSLESDG
ratMeCP2_e2prot PKAPMPLLP--PPPPPEPQSSEDPISP-------PEPQDLSSSICKEEKMPRAGSLESDG
possumMeCP2_e2prot PSSESPKAP--PPPHPEPEGSKDSKSP-------PEPQDLSSKVCKEEKMPRGAPPESDG
XtropMeCP2_e1prot HHHSESKAS---ATSPEPETSKDSIGA-------PEPQDLSVKIYKEEKLP-----ESDG
XlaevisMeCP2_e1prot HHHSESKAS---ATSPEPETSKDNIGV-------QEPQDLSVKMCKEEKLP-----ESDG
zebrafishMeCP2_e1prot HQHQHLQAS--TPSTYTPQAHQLSLGHSTHGGLENEPQDLSTSRPKAEHVACR--EEART
.. *: : . ****** * *::. *:
430 440 450 460 470
| | | | |
humanMeCP2_e2prot 429 CPKEPAKTQPAVA------------TAATAAEKYKHRGEGERKDIVS-SSMPRPNREEPV 475
cattleMeCP2_e2prot CPKEPAKTQPALA------------TAAPATEKYKHRGEGERKDIVS-SSMPRPNREEPV
dogMeCP2_e2prot CPKEPAKTQPTVA------------TAATAADKYKHRGEGERKDIVS-SSMPRPNREEPV
mouseMeCP2_e2prot CPKEPAKTQPMVA------------TTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV
ratMeCP2_e2prot CPKEPAKTQPMVAAA----ATTTTTTTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV
possumMeCP2_e2prot CTKELAKTQPTAAAASAAATAATATTATTAAEKFKHRAEGDRKDIVS-SSMPRPNREDPV
XtropMeCP2_e1prot CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV
XlaevisMeCP2_e1prot CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV
zebrafishMeCP2_e1prot GSSSSRDSQN----------------ASKMASMTVTGESKELRDIVPPSAVPRPSREETV
... .:* :. . : :***. * :***.**:.*
480
|
humanMeCP2_e2prot 476 DSRTPVTERVS 486
cattleMeCP2_e2prot DSRTPVTERVS
dogMeCP2_e2prot DSRTPVTERVS
mouseMeCP2_e2prot DSRTPVTERVS
ratMeCP2_e2prot DSRTPVTERVS
possumMeCP2_e2prot DSRTPVTERVS
XtropMeCP2_e1prot DTRTTVTERVS
XlaevisMeCP2_e1prot DTRTTVTERVS
zebrafishMeCP2_e1prot ESRTPVSEPVS
::**.*:* **
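The last line of each alignment block is a conservation line. It appears to follow the ClustalW convention, in which `*` marks a column where every sequence has the same residue and `:` and `.` mark columns with strongly and weakly similar residues. As a minimal sketch, the `*` positions can be recomputed from the rows themselves; the function name `conserved_columns` is hypothetical, and the similarity classes behind `:` and `.` are not reproduced here.

```python
def conserved_columns(rows: list[str]) -> str:
    """Return '*' under every column where all rows share one residue, else ' '."""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all alignment rows must have the same length")
    marks = []
    for column in zip(*rows):
        identical = len(set(column)) == 1 and column[0] != "-"
        marks.append("*" if identical else " ")
    return "".join(marks)


# Final block of the MeCP2_e2 alignment (the last 11 columns).
final_block = [
    "DSRTPVTERVS",  # human
    "DSRTPVTERVS",  # cattle
    "DSRTPVTERVS",  # dog
    "DSRTPVTERVS",  # mouse
    "DSRTPVTERVS",  # rat
    "DSRTPVTERVS",  # possum
    "DTRTTVTERVS",  # X. tropicalis
    "DTRTTVTERVS",  # X. laevis
    "ESRTPVSEPVS",  # zebrafish
]
print(repr(conserved_columns(final_block)))  # '  ** * * **'
```

The stars fall in the same columns as in the conservation line printed under the block, which is a quick way to check that a copied alignment has not lost its column spacing.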
10 20 30 40
| | | |
humanMeCP2_e1prot 1 ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK 44
cattleMeCP2_e1prot ----------------MAAAAAAAPSGGGGGGEEERLEEKSEEQDLQGLKDKPLKFKKVK
dogMeCP2_e1prot ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK
mouseMeCP2_e1prot -----------MAAAAATAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLRDKPLKFKKAK
ratMeCP2_e1prot MAAAAAAAAAAAAAAAAAAAAAAAAPSGGGGGEEERLEEKSEDQDLQGLKEKPLKFKKVK
possumMeCP2_e1prot ----------------MAAAAALS---GGGGGEEDRLEEQSEDQDLQGLRDKPLKFRKLK
XtropMeCP2_e1prot ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK
XlaevisMeCP2_e1prot ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK
zebrafishMeCP2_e1prot ------------------MAAAES-------GEE-RLRGEDKNEDQEGSKDKTQKHKKSK
*** : *** **. :.:::* :* ::*. * :* *
50 60 70 80 90
| | | | |
humanMeCP2_e1prot 45 KDKKEEKEGKHEPVQP------SAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR 97
cattleMeCP2_e1prot KDKKEDKEGKHEPLQP------AAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR
dogMeCP2_e1prot KEKKEDKEGKHEPLQP------PAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR
mouseMeCP2_e1prot KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR
ratMeCP2_e1prot KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR
possumMeCP2_e1prot RDKKEEKEGKHEFPQP------SSHQSAEPAEAGKAETSE-EAGSAPAAPEASASPKQRR
XtropMeCP2_e1prot RDKKDEEE-KQETFHP------SEHQSGEPADEGKADISE-SAEESLAVPEASASPKQRR
XlaevisMeCP2_e1prot KDKKDEEE-KQEPFHS------SEHQPGEPADEGKADMSE-SAEENLAVPESSASPKQRR
zebrafishMeCP2_e1prot KERHDVEKLETTVSVPPPPSLFTQRDVGQQAEAGKSEPIDPEVGAALSAPESSASAKQRR
::::: :: : . . :. .: *: **:: : :.**:***.****
100 110 120 130 140 150
| | | | | |
humanMeCP2_e1prot 98 SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV 157
cattleMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
dogMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
mouseMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
ratMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
possumMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
XtropMeCP2_e1prot SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV
XlaevisMeCP2_e1prot SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV
zebrafishMeCP2_e1prot SVIRDRGPMYEDPSLPQGWTRKLKQRKSGRSAGKFDVYLINPEGKAFRSKVELMAYFQKV
*:********:**:**:*****************:*******:**********:***:**
160 170 180 190 200 210
| | | | | |
humanMeCP2_e1prot 158 GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSE 217
cattleMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAAASE
dogMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTARPKAATSE
mouseMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE
ratMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE
possumMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKSKSPKAPGTGRGRGRPKGSGTVKPRVTASE
XtropMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKQPKKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSE
XlaevisMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKQPKKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSE
zebrafishMeCP2_e1prot GDTITDPNDFDFTVTGRGSPSRREKRPPKKPKMVKP--SGRGRGRPKGSGKVR---QATE
*** *******************:: ***.* *. :********** . :*
220 230 240 250 260 270
| | | | | |
humanMeCP2_e1prot 218 GVQVKRVLEKSPGKLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIP 277
cattleMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQAAPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
dogMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQASPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
mouseMeCP2_e1prot GVQVKRVLEKSPGKLVVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
ratMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
possumMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPFQPSPGGKAEGGGATTSTQVMVIKRPGRKRKVETEPQVIP
XtropMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPFS---GSK-EESDATTSEQVLVIKRPGRKRKSDTDPSAAP
XlaevisMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPYS---GTK-EASDATTSQQVLVIKRGGRKRKSETDPSAAP
zebrafishMeCP2_e1prot GVAVKRVIEKSPGKLLVKMPFVAP---KTEPGAPLGQAPVAKARR-GRKRKSEQDPPSTP
** ****:*******:****: * * . . . * :* ***** : :* *
280 290 300 310 320
| | | | |
humanMeCP2_e1prot 278 KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE 322
cattleMeCP2_e1prot KKRGRKPG---------------SVVAAATAEAKKKAVKESSIRSVQETVLPIKKRKTRE
dogMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE
mouseMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVHETVLPIKKRKTRE
ratMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE
possumMeCP2_e1prot KKRGRKPG---------------SIVAAAAVEAKKKAIKESSIRSIHETVLPIKKRKTRE
XtropMeCP2_e1prot KKRGRKPGSV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE
XlaevisMeCP2_e1prot KKRGRKPSNV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE
zebrafishMeCP2_e1prot KKRGRKPATVSQSTVGTGSAAAYAAAAILTAEAKKKALKESSAKPVQERALPIKKRKTRE
*******. : .* : *****:**** :.: * .**********
330 340 350 360 370
| | | | |
humanMeCP2_e1prot 323 TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS 371
cattleMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSGS-AS
dogMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
mouseMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
ratMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
possumMeCP2_e1prot AVS------IEVKEVVKPLL----VSTVGEKSTKGLKPGKSPGRKSKESSPKGRSASTSS
XtropMeCP2_e1prot TIS------VDVKDTVKPEP----LTPVIEKSIKGQKPAKSPESRSTEGSPKIKTG--L-
XlaevisMeCP2_e1prot TIS------VDVKDTIKPEP----LTPVIEKVMKGQNPAKSPESRSTEGSPKIKTG--L-
zebrafishMeCP2_e1prot TLEELEASTTSATETFEKRLTASTVTPTGEEAETGQKPHKHPSRKHKEADPGSSSSGTTA
::. ...:..: ::. *: .* :. * * : .*..* :.
380 390 400 410
| | | |
humanMeCP2_e1prot 372 S---PP--KKE-------HHHHHHHSESPKAPVPLLPPLPPPPPEPESSEDPTSP----- 414
cattleMeCP2_e1prot S---PP--KKE-------HHHHHHHVEPPKAPAPLLLPPPPPPPEPQSSEDPASP-----
dogMeCP2_e1prot S---PP--KKE-------HHHHHHHSEPPKAPAPLLPPPPPPPPEPQSSEDPASP-----
mouseMeCP2_e1prot S---PP--KKE-------HHHHHHHSESTKAPMPLLP--SPPPPEPESSEDPISP-----
ratMeCP2_e1prot S---PP--KKE-------HHHHHHHAESPKAPMPLLP--PPPPPEPQSSEDPISP-----
possumMeCP2_e1prot S---PP--KKEQQQ----QQQYHHHHYYPSSESPKAP--PPPHPEPEGSKDSKSP-----
XtropMeCP2_e1prot ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDSIGA-----
XlaevisMeCP2_e1prot ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDNIGV-----
zebrafishMeCP2_e1prot SGVAPKSHKKRDQRGQHFKHHHHHHHHHHQHQHLQAS--TPSTYTPQAHQLSLGHSTHGG
* **. ::::*** .. *: : .
420 430 440 450 460
| | | | |
humanMeCP2_e1prot 415 --PEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVA------------TAATAAE 460
cattleMeCP2_e1prot --PEPQDLSSSVCKEEKMPRAGSLESDGCPKEPAKTQPALA------------TAAPATE
dogMeCP2_e1prot --PEPQDLSSGVCKEEKMARGGSLESDGCPKEPAKTQPTVA------------TAATAAD
mouseMeCP2_e1prot --PEPQDLSSSICKEEKMPRGGSLESDGCPKEPAKTQPMVA------------TTTTVAE
ratMeCP2_e1prot --PEPQDLSSSICKEEKMPRAGSLESDGCPKEPAKTQPMVAAA----ATTTTTTTTTVAE
possumMeCP2_e1prot --PEPQDLSSKVCKEEKMPRGAPPESDGCTKELAKTQPTAAAASAAATAATATTATTAAE
XtropMeCP2_e1prot --PEPQDLSVKIYKEEKLP-----ESDGCAQEPAKTQP--------------------AD
XlaevisMeCP2_e1prot --QEPQDLSVKMCKEEKLP-----ESDGCAQEPAKTQP--------------------AD
zebrafishMeCP2_e1prot LENEPQDLSTSRPKAEHVACR--EEARTGSSSSRDSQN----------------ASKMAS
****** * *::. *: ... .:* :.
470 480 490
| | |
humanMeCP2_e1prot 461 KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS 498
cattleMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
dogMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
mouseMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
ratMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
possumMeCP2_e1prot KFKHRAEGDRKDIVS-SSMPRPNREDPVDSRTPVTERVS
XtropMeCP2_e1prot KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS
XlaevisMeCP2_e1prot KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS
zebrafishMeCP2_e1prot MTVTGESKELRDIVPPSAVPRPSREETVESRTPVSEPVS
. : :***. * :***.**:.*::**.*:* **
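Comparing the two alignments suggests that, at least for the human sequences listed on this page, the e1 and e2 isoforms differ only at the N-terminus and share the rest of the protein. The sketch below illustrates one way to find that boundary by walking back from the C-terminus; `split_isoforms` is a hypothetical helper, and only the first residues of each human protein are used to keep the example short.

```python
def split_isoforms(iso_a: str, iso_b: str) -> tuple[str, str, str]:
    """Split two sequences into (unique start of a, unique start of b, shared tail)."""
    shared = 0
    limit = min(len(iso_a), len(iso_b))
    # Count matching residues from the C-terminal end backwards.
    while shared < limit and iso_a[-1 - shared] == iso_b[-1 - shared]:
        shared += 1
    cut_a = len(iso_a) - shared
    cut_b = len(iso_b) - shared
    return iso_a[:cut_a], iso_b[:cut_b], iso_a[cut_a:]


# N-terminal fragments of the human proteins, both ending at the same residue.
human_e2_start = "MVAGMLGLREEKSEDQDLQG"
human_e1_start = "MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQG"

e2_only, e1_only, common = split_isoforms(human_e2_start, human_e1_start)
print(e2_only)  # MVAGMLGLR
print(e1_only)  # MAAAAAAAPSGGGGGGEEERL
print(common)   # EEKSEDQDLQG
```

The 9 residues unique to e2 and the 21 unique to e1 account for the 12-residue difference between the 486 and 498 residue totals shown at the ends of the two alignments.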
>humanMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>humanMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKDKKEEKE GKHEPVQPSA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AATSEGVQVK RVLEKSPGKL LVKMPFQTSP GGKAEGGGAT TSTQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESPKAPV PLLPPLPPPP PEPESSEDPT SPPEPQDLSS SVCKEEKMPR GGSLESDGCP KEPAKTQPAV ATAATAAEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: NCBI GI 15079579 bases 85 to 1545.
Reliability: The coding sequence portions of NCBI GI 15079579 match exactly the human genome sequence.
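Each DNA record on this page is paired with a protein record, and the pairing can be spot-checked by translating the DNA. The following is a minimal sketch using the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, whitespace in the grouped sequence is ignored, and any character other than a, c, g or t would raise a `KeyError`.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 36 bases of humanMECP2_e2dna, copied from the record above.
start = "atggtagctg ggatgttagg gctcag ggaagaaaag"
print(translate(start))  # MVAGMLGLREEK
```

The output matches the first twelve residues of humanMECP2_e2prot. The same check applied to the final bases, `gagagagttagctga`, gives `ERVS`, the end of the protein, followed by the stop codon.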
>humanMECP2_e1dna
atggccgccg ccgccgccgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>humanMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKDKKEE KEGKHEPVQP SAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQT SPGGKAEGGG ATTSTQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSESPKA PVPLLPPLPP PPPEPESSED PTSPPEPQDL SSSVCKEEKM PRGGSLESDG CPKEPAKTQP AVATAATAAE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: NCBI GI 6959307 bases 8 to 69 and 194 to 1628.
Reliability: The coding sequence portions of NCBI GI 6959307 match exactly the human genome sequence.
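The base ranges in the "Derived from" lines can be checked against the protein lengths shown in the alignments. This sketch assumes the coordinates are 1-based and inclusive, which the source does not state but which gives lengths that are whole numbers of codons; `cds_length` and `protein_length` are hypothetical helper names.

```python
def cds_length(segments: list[tuple[int, int]]) -> int:
    """Total number of bases covered by 1-based inclusive (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)


def protein_length(segments: list[tuple[int, int]]) -> int:
    """Number of residues encoded, assuming the last codon is a stop codon."""
    bases = cds_length(segments)
    if bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    return bases // 3 - 1


human_e2 = [(85, 1545)]            # NCBI GI 15079579
human_e1 = [(8, 69), (194, 1628)]  # NCBI GI 6959307, exon 1 joined to exons 3 and 4

print(cds_length(human_e2), protein_length(human_e2))  # 1461 486
print(cds_length(human_e1), protein_length(human_e1))  # 1497 498
```

Both results agree with the residue totals at the ends of the MeCP2_e2 and MeCP2_e1 alignments (486 and 498). The first e1 segment covers 62 bases, the same length as the first line of humanMECP2_e1dna.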
>cattleMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>cattleMECP2_e2prot
MVAGMLGLRE EKSEEQDLQG LKDKPLKFKK VKKDKKEDKE GKHEPLQPAA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AAASEGVQVK RVLEKSPGKL LVKMPFQAAP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAT AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSGSASS PPKKEHHHHH HHVEPPKAPA PLLLPPPPPP PEPQSSEDPA SPPEPQDLSS SVCKEEKMPR AGSLESDGCP KEPAKTQPAL ATAAPATEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.
Reliability: Fairly good agreement with sequences from the trace archive.
>cattleMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>cattleMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEEQDL QGLKDKPLKF KKVKKDKKED KEGKHEPLQP AAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAAASEGVQ VKRVLEKSPG KLLVKMPFQA APGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA ATAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSGSA SSPPKKEHHH HHHHVEPPKA PAPLLLPPPP PPPEPQSSED PASPPEPQDL SSSVCKEEKM PRAGSLESDG CPKEPAKTQP ALATAAPATE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.
Reliability: Fairly good agreement with sequences from the trace archive.
>dogMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct
>dogMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKEKKEDKE GKHEPLQPPA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTARPK AATSEGVQVK RVLEKSPGKL LVKMPFQASP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSEPPKAPA PLLPPPPPPP PEPQSSEDPA SPPEPQDLSS GVCKEEKMAR GGSLESDGCP KEPAKTQPTV ATAATAADKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: Manual assembly of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
>dogMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct
>dogMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKEKKED KEGKHEPLQP PAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTAR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQA SPGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSEPPKA PAPLLPPPPP PPPEPQSSED PASPPEPQDL SSGVCKEEKM ARGGSLESDG CPKEPAKTQP TVATAATAAD KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: Manual assembly of sequences from the trace archive.
Reliability: Exon 1 has fairly good agreement with other sequences from the trace archive; exons 3 and 4 have good agreement.
>mouseMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga
>mouseMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LRDKPLKFKK AKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL VVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVHETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESTKAPM PLLPSPPPPE PESSEDPISP PEPQDLSSSI CKEEKMPRGG SLESDGCPKE PAKTQPMVAT TTTVAEKYKH RGEGERKDIV SSSMPRPNRE EPVDSRTPVT ERVS
Derived from: NCBI GI 20072599 bases 202 to 1656.
Reliability: Good agreement with sequences from trace archive.
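The DNA entries on this page are printed in blocks of ten bases, one line per coding segment, and a codon can straddle a line break: the first line of mouseMECP2_e2dna is 26 bases long, so its last two bases pair with the first base of the next line. The following is a minimal Python sketch, assuming the standard genetic code and using hypothetical helper names (`clean`, `translate`). It joins the segments before translating and compares the result with the start of mouseMECP2_e2prot:

```python
# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(*segments):
    """Join segments and drop the spaces that separate ten-base blocks."""
    return "".join("".join(s.split()) for s in segments).lower()

def translate(dna):
    """Translate complete codons from position 1, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First line of mouseMECP2_e2dna and the first 40 bases of its second line.
segment_1 = "atggtagctg ggatgttagg gctcag"
segment_2_start = "ggaggaaaag tcagaagacc aggatctcca gggcctcaga"

prefix = translate(clean(segment_1, segment_2_start))
print(prefix)
```

This prints MVAGMLGLREEKSEDQDLQGLR, the first 22 residues of mouseMECP2_e2prot. Translating each printed line separately would shift the reading frame after the first segment.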
>mouseMECP2_e1dna
atggccgccg ctgccgccac cgccgccgcc gccgccgcgc cgagcggagg aggaggagga ggcgaggagg agagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga
>mouseMECP2_e1prot
MAAAAATAAA AAAPSGGGGG GEEERLEEKS EDQDLQGLRD KPLKFKKAKK DKKEDKEGKH EPLQPSAHHS AEPAEAGKAE TSESSGSAPA VPEASASPKQ RRSIIRDRGP MYDDPTLPEG WTRKLKQRKS GRSAGKYDVY LINPQGKAFR SKVELIAYFE KVGDTSLDPN DFDFTVTGRG SPSRREQKPP KKPKSPKAPG TGRGRGRPKG SGTGRPKAAA SEGVQVKRVL EKSPGKLVVK MPFQASPGGK GEGGGATTSA QVMVIKRPGR KRKAEADPQA IPKKRGRKPG SVVAAAAAEA KKKAVKESSI RSVHETVLPI KKRKTRETVS IEVKEVVKPL LVSTLGEKSG KGLKTCKSPG RKSKESSPKG RSSSASSPPK KEHHHHHHHS ESTKAPMPLL PSPPPPEPES SEDPISPPEP QDLSSSICKE EKMPRGGSLE SDGCPKEPAK TQPMVATTTT VAEKYKHRGE GERKDIVSSS MPRPNREEPV DSRTPVTERV S
Derived from: NCBI GI 20072599 bases 27 to 103 and 228 to 1656.
Reliability: Good agreement with sequences from trace archive.
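The "Derived from" coordinates appear to be 1-based and inclusive. This is an assumption, though the arithmetic is consistent with it: bases 27 to 103 span 77 bases, which is the length of the first line of mouseMECP2_e1dna. The following is a minimal sketch of converting such ranges to Python slices. The function names are hypothetical, and `record` stands for the source nucleotide record, which is not reproduced on this page:

```python
def cds_length(ranges):
    """Total number of bases covered by 1-based inclusive (start, end) ranges."""
    return sum(end - start + 1 for start, end in ranges)

def splice(record, ranges):
    """Concatenate the 1-based inclusive ranges of `record` in the order given."""
    return "".join(record[start - 1:end] for start, end in ranges)

# Coordinates quoted on this page for NCBI GI 20072599.
MOUSE_E1 = [(27, 103), (228, 1656)]
MOUSE_E2 = [(202, 1656)]

for name, ranges in (("e1", MOUSE_E1), ("e2", MOUSE_E2)):
    n = cds_length(ranges)
    # One codon is the stop codon, so the protein has n // 3 - 1 residues.
    print(name, n, "bases,", n // 3 - 1, "residues")

# Toy record, used only to show the slicing convention.
print(splice("abcdefghij", [(2, 4), (8, 10)]))
```

Under that assumption the totals are 1506 bases (501 residues plus a stop codon) for e1 and 1455 bases (484 residues plus a stop codon) for e2, which agrees with the lengths of the two mouse protein entries above.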
>ratMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga
>ratMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKEKPLKFKK VKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL LVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHAESPKAPM PLLPPPPPPE PQSSEDPISP PEPQDLSSSI CKEEKMPRAG SLESDGCPKE PAKTQPMVAA AATTTTTTTT TVAEKYKHRG EGERKDIVSS SMPRPNREEP VDSRTPVTER VS
Derived from: NCBI GI 115312277 nucleotides 218 to 1696.
Reliability: Exons 2 to 4 match the rat genome exactly.
>ratMECP2_e1dna
atggccgccg ccgctgccgc cgctgccgcc gccgccgccg ccgctgccgc cgccgccgcc gccgccgccg ccgcgccgag cggaggagga ggaggcgagg aggagagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga
>ratMECP2_e1prot
MAAAAAAAAA AAAAAAAAAA AAAAAPSGGG GGEEERLEEK SEDQDLQGLK EKPLKFKKVK KDKKEDKEGK HEPLQPSAHH SAEPAEAGKA ETSESSGSAP AVPEASASPK QRRSIIRDRG PMYDDPTLPE GWTRKLKQRK SGRSAGKYDV YLINPQGKAF RSKVELIAYF EKVGDTSLDP NDFDFTVTGR GSPSRREQKP PKKPKSPKAP GTGRGRGRPK GSGTGRPKAA ASEGVQVKRV LEKSPGKLLV KMPFQASPGG KGEGGGATTS AQVMVIKRPG RKRKAEADPQ AIPKKRGRKP GSVVAAAAAE AKKKAVKESS IRSVQETVLP IKKRKTRETV SIEVKEVVKP LLVSTLGEKS GKGLKTCKSP GRKSKESSPK GRSSSASSPP KKEHHHHHHH AESPKAPMPL LPPPPPPEPQ SSEDPISPPE PQDLSSSICK EEKMPRAGSL ESDGCPKEPA KTQPMVAAAA TTTTTTTTTV AEKYKHRGEG ERKDIVSSSM PRPNREEPVD SRTPVTERVS
Derived from: NCBI GI 115312277 nucleotides 10 to 119, and 244 to 1696.
Reliability: Exons 3 and 4 match the rat genome exactly. In exon 1, the rat genome has an extra triplet creating an extra alanine in the multi-alanine section.
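The length of the multi-alanine section can be measured directly from the protein entries. The following is a minimal sketch, using a hypothetical helper `longest_run` and only the N-terminal residues copied from ratMECP2_e1prot and mouseMECP2_e1prot on this page:

```python
import re

def longest_run(protein, residue="A"):
    """Length of the longest uninterrupted run of `residue` in `protein`."""
    seq = "".join(protein.split()).upper()
    runs = re.findall(re.escape(residue) + "+", seq)
    return max((len(r) for r in runs), default=0)

# N-terminal residues as printed on this page (blocks of ten).
rat_e1_start = "MAAAAAAAAA AAAAAAAAAA AAAAAPSGGG GGEEERLEEK"
mouse_e1_start = "MAAAAATAAA AAAPSGGGGG GEEERLEEKS"

print("rat:", longest_run(rat_e1_start))
print("mouse:", longest_run(mouse_e1_start))
```

The rat entry listed here gives a run of 24 alanines; going by the reliability note, the genome sequence would give one more. The mouse run is interrupted by a threonine, so its longest uninterrupted stretch is 6.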
>possumMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga
>possumMECP2_e2prot
MVAGMLGLRE EQSEDQDLQG LRDKPLKFRK LKRDKKEEKE GKHEFPQPSS HQSAEPAEAG KAETSEEAGS APAAPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKSKSPK APGTGRGRGR PKGSGTVKPR VTASEGVQVK RVIEKSPGKL LVKMPFQPSP GGKAEGGGAT TSTQVMVIKR PGRKRKVETE PQVIPKKRGR KPGSIVAAAA VEAKKKAIKE SSIRSIHETV LPIKKRKTRE AVSIEVKEVV KPLLVSTVGE KSTKGLKPGK SPGRKSKESS PKGRSASTSS SPPKKEQQQQ QQYHHHHYYP SSESPKAPPP PHPEPEGSKD SKSPPEPQDL SSKVCKEEKM PRGAPPESDG CTKELAKTQP TAAAASAAAT AATATTATTA AEKFKHRAEG DRKDIVSSSM PRPNREDPVD SRTPVTERVS
Derived from: Manual assembly of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
>possumMECP2_e1dna
atggccgccg ccgccgcgct gagcggagga ggaggaggcg aggaggacag act
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga
>possumMECP2_e1prot
MAAAAALSGG GGGEEDRLEE QSEDQDLQGL RDKPLKFRKL KRDKKEEKEG KHEFPQPSSH QSAEPAEAGK AETSEEAGSA PAAPEASASP KQRRSIIRDR GPMYDDPTLP EGWTRKLKQR KSGRSAGKYD VYLINPQGKA FRSKVELIAY FEKVGDTSLD PNDFDFTVTG RGSPSRREQK PPKKSKSPKA PGTGRGRGRP KGSGTVKPRV TASEGVQVKR VIEKSPGKLL VKMPFQPSPG GKAEGGGATT STQVMVIKRP GRKRKVETEP QVIPKKRGRK PGSIVAAAAV EAKKKAIKES SIRSIHETVL PIKKRKTREA VSIEVKEVVK PLLVSTVGEK STKGLKPGKS PGRKSKESSP KGRSASTSSS PPKKEQQQQQ QYHHHHYYPS SESPKAPPPP HPEPEGSKDS KSPPEPQDLS SKVCKEEKMP RGAPPESDGC TKELAKTQPT AAAASAAATA ATATTATTAA EKFKHRAEGD RKDIVSSSMP RPNREDPVDS RTPVTERVS
Derived from: Manual assembly of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
Frogs and fish only have the MECP2_e1 transcript form.
>XtropMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaagatc aagatcttca aggacagaaa gataaaccac caaaactcag gaaagtaaaa agagacaaga aggatgagga agaaaagcag gaaacgtttc atccctctga gcaccagtca ggagaacctg cagatgaagg gaaagctgat atatctgaaa gtgctgagga aagccttgct gttcctgaag cctctgcctc tcccaagcag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactaaagca aagaaaatct ggtcgttctg ctggaaagtt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgaacttata gcatacttcc aaaaggtagg cgacacatcg ctggacccta atgattttga cttcactgta actgggagag ggagtccgtc tcgaagggaa cagaagcaac cgaaaaagtc taaagctcca aaatcttctg gaacagggag aggaagagga agacccaaag gaagtgtaaa agtaaagtca cctgtaaaat ctgaaggagt acaggttaaa agggtgatag agaagagtcc agggaagctt ttggtaaaaa tgcctttttc tggaagtaaa gaggaatccg atgcaacaac ctcagaacag gttttggtaa ttaaaagacc cggtcgtaaa agaaagtcag atacagaccc atcggcagct cctaaaaaac ggggaagaaa gccaggcagt gtgagcttgg ctgctgcagc agcagaagca gcaaagaaaa aagcaatcaa agagtcttcc atcaagcctc ttttagagac tgtgttacca ataaagaaac gcaagaccag ggagactatc agtgtagatg taaaagatac agtaaaaccg gagcctctta cacctgttat agaaaaaagc attaaaggac agaaacctgc aaaaagtcca gaaagcagaa gcacagaggg tagcccaaaa attaaaactg gcttgccgaa aaaggagctg cagcagcacc atcatcatca ccaccatcat catcaccatc atcactccga atccaaggca tctgccacca gtccagagcc agagacttca aaggacagca ttggggcccc agagccccag gacttaagtg tcaaaatata taaagaggag aagctacccg agagtgatgg ctgtgctcag gagccagcca agacgcagcc tgctgataaa tgtagaaacc gagcagaagg tgaaagaaaa gacattgtat catctgtccc tagaccaaca agagaagaac ccgtggacac cagaacaacg gttacggaaa gagttagctg a
>XtropMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKRDKKDE EEKQETFHPS EHQSGEPADE GKADISESAE ESLAVPEASA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKSKAP KSSGTGRGRG RPKGSVKVKS PVKSEGVQVK RVIEKSPGKL LVKMPFSGSK EESDATTSEQ VLVIKRPGRK RKSDTDPSAA PKKRGRKPGS VSLAAAAAEA AKKKAIKESS IKPLLETVLP IKKRKTRETI SVDVKDTVKP EPLTPVIEKS IKGQKPAKSP ESRSTEGSPK IKTGLPKKEL QQHHHHHHHH HHHHHSESKA SATSPEPETS KDSIGAPEPQ DLSVKIYKEE KLPESDGCAQ EPAKTQPADK CRNRAEGERK DIVSSVPRPT REEPVDTRTT VTERVS
Derived from: Manual assembly of sequences from the trace archive.
Reliability: Fairly good agreement with other sequences from the trace archive.
>XlaevisMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaggatc aagatcttca aggacaaaaa gataaaccac caaaactcag gaaagtaaaa aaagacaaga aggatgagga agaaaagcag gaaccatttc attcctctga gcatcagccc ggagaacctg cagatgaagg gaaagctgat atgtctgaaa gtgctgagga aaaccttgct gttcctgaat cttctgcctc tcccaaacag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactcaagca aagaaaatct ggtcgttctg ctggaaaatt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgagcttata gcatacttcc aaaaggtagg ggacacatct ctagacccta atgattttga cttcactgta actgggagag ggagcccgtc tcgaagggaa cagaagcaac cgaaaaagcc taaagctcca aaatcttctg tatcagggag aggaagagga agacctaaag gaagtataaa aaaagttaag ccacctgtaa aatctgaagg agtacaagtc aaaagggtga tagagaagag tccgggaaaa cttttggtta aaatgcctta ttctggaact aaagaggcat cagatgcaac aacgtcacaa caggttttgg tcattaaaag aggcggtcgt aaaagaaaat cagaaactga tccatctgca gctcctaaaa aaagggggag aaagccaagc aacgtgagct tggctgctgc agcagcagaa gcagcaaaga aaaaagcaat caaagagtct tccatcaagc ctcttttaga gactgtgtta ccaataaaga aacgcaagac cagggagact atcagtgtag atgtaaaaga tacaataaaa ccagagcctc ttacacctgt tatagaaaaa gtcatgaaag gacaaaaccc tgcaaaaagt ccagaaagca gaagcacaga gggtagccca aaaattaaaa ctggcttgcc gaaaaaagag ctgcagcagc accatcatca tcatcaccac caccatcacc atcatcactc cgaatctaag gcatctgcca ccagtccaga gccagagact tcaaaggaca acattggggt tcaggagccc caggacttaa gtgtcaaaat gtgtaaagag gagaagctac cagaaagtga tggctgtgct caggagccag ccaagactca gcctgctgat aaatgtagaa accgagcaga aggtgaaaga aaagacattg tttcatctgt ccctagacca acaagagaag agcccgtgga caccagaaca acggtgacag aaagagttag ctga
>XlaevisMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKKDKKDE EEKQEPFHSS EHQPGEPADE GKADMSESAE ENLAVPESSA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKPKAP KSSVSGRGRG RPKGSIKKVK PPVKSEGVQV KRVIEKSPGK LLVKMPYSGT KEASDATTSQ QVLVIKRGGR KRKSETDPSA APKKRGRKPS NVSLAAAAAE AAKKKAIKES SIKPLLETVL PIKKRKTRET ISVDVKDTIK PEPLTPVIEK VMKGQNPAKS PESRSTEGSP KIKTGLPKKE LQQHHHHHHH HHHHHHSESK ASATSPEPET SKDNIGVQEP QDLSVKMCKE EKLPESDGCA QEPAKTQPAD KCRNRAEGER KDIVSSVPRP TREEPVDTRT TVTERVS
Derived from: NCBI GI 4139225 bases 14 to 1417.
Reliability: Not enough trace archive sequences to compare with this sequence.
>zebrafishMECP2_e1dna
atggccgccg cagagagcgg agaggagaga ct
cagaggtgag gacaagaatg aagaccagga gggctcaaaa gacaagacgc agaagcataa gaaaagcaaa aaggaaaggc atgatgtgga aaaactggag accacagtct ctgttcctcc gcccccgtct ctctttacgc agagggatgt cggacagcag gcagaggcag ggaagtctga acccattgac cctgaagttg gagctgctct cagcgctcca gaatcttccg catcggccaa gcagcggcgg tctgtcattc gggacagagg cccaatgtat gaagatcctt cgctgcctca gggctggaca cgcaagctga aacagcgcaa atcagggcgc tccgctggca aatttgacgt ctaccttatc aa
cccagaaggg aaagccttcc gttccaaggt ggagctcatg gcatacttcc aaaaggttgg cgataccatt acagatccca atgactttga cttcacggtc acgggcaggg gaagcccgtc tcgcagagaa aaaagaccgc caaaaaagcc taaaatggtc aaaccctctg gacgtggaag ggggcggcct aaaggtagcg gcaaggtacg acaggctaca gaaggggtgg cggtgaaacg cgtcatagaa aagagtccag gaaaactctt agtaaagatg ccctttgtgg cccccaaaac tgaaccaggg gctcctttag ggcaagcgcc agttgccaaa gcacgccgag gacgtaagag gaaatcagag caggatccgc caagcacccc taaaaaacgt ggacgcaagc cagcaactgt ttcacagtca acagtgggga cggggtctgc tgctgcatac gccgctgcag ccattctcac cgccgaagcc aagaaaaaag ccctgaagga gtcttccgct aagcctgttc aggagagggc tcttcctatc aaaaaacgca aaacccgaga gactttagag gagctggagg catccaccac ctcagcgaca gagacctttg agaaacgact gactgcatca actgtgaccc ctaccgggga ggaggcagaa acaggacaga agcctcacaa gcatcccagc cggaagcaca aagaggcaga tccgggaagc agcagcagtg ggacgacagc cagcggagtt gcaccgaaga gtcacaagaa gagagatcag cgagggcagc actttaaaca ccaccaccac catcatcatc accaccatca acaccaacac ctgcaggcct ccacaccctc cacctacact ccgcaggctc accagctctc cctgggtcac tccacgcacg gcgggctgga aaacgagccg caggacttga gcacctccag gcccaaagcg gagcacgtgg cctgcaggga ggaggccaga actggcagct cctcgagtag ggactcccag aacgcaagca agatggcttc catgaccgtg acgggggaaa gcaaggagct gagagacatt gttcctccct ccgccgtccc gaggccgagt cgagaggaaa cggtggagtc ccggacacca gtgagcgagc cagtgagctg a
>zebrafishMECP2_e1prot
MAAAESGEER LRGEDKNEDQ EGSKDKTQKH KKSKKERHDV EKLETTVSVP PPPSLFTQRD VGQQAEAGKS EPIDPEVGAA LSAPESSASA KQRRSVIRDR GPMYEDPSLP QGWTRKLKQR KSGRSAGKFD VYLINPEGKA FRSKVELMAY FQKVGDTITD PNDFDFTVTG RGSPSRREKR PPKKPKMVKP SGRGRGRPKG SGKVRQATEG VAVKRVIEKS PGKLLVKMPF VAPKTEPGAP LGQAPVAKAR RGRKRKSEQD PPSTPKKRGR KPATVSQSTV GTGSAAAYAA AAILTAEAKK KALKESSAKP VQERALPIKK RKTRETLEEL EASTTSATET FEKRLTASTV TPTGEEAETG QKPHKHPSRK HKEADPGSSS SGTTASGVAP KSHKKRDQRG QHFKHHHHHH HHHHQHQHLQ ASTPSTYTPQ AHQLSLGHST HGGLENEPQD LSTSRPKAEH VACREEARTG SSSSRDSQNA SKMASMTVTG ESKELRDIVP PSAVPRPSRE ETVESRTPVS EPVS
Derived from: NCBI GI 37574905 bases 1 to 1575.
Reliability: Fairly good agreement with sequences from the trace archive.
Timur M. Yusufzai and Alan P. Wolffe. 2000. Functional consequences of Rett syndrome mutations on human MeCP2. Nucleic Acids Research 28(21): 4172-4179.
J.D. Thompson, D.G. Higgins, and T.J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22(22): 4673-4680.
MECP2 data is curated by Dr. John Christodoulou, formerly with Andrew Grimm.
The MECP2 database and website are maintained by the Western Sydney Genetics Program.
©2001 Western Sydney Genetics Program