MeCP2 sequence alignment
Alignments have been used to help determine whether or not a variation at a given amino acid is pathogenic or harmless. Alignments of MeCP2 genes have featured in Amir et al. 1999 and Yusufzai and Wolffe 2000.Since the publication of these papers, MECP2 has been sequenced in other organisms. This page is intended as a convenient reference to these sequences. I have not sequenced any of these organisms myself. Please note:
- Sequences are provided on a "best effort" basis.
- The alignments have been produced using ClustalW (Thompson et al. 1994) without any subsequent hand-based correction.
- The relationship between the conservation of an amino acid and the pathogenicity of a mutation to that amino acid is not fully understood.
- We advise that you use these data with caution when trying to determine if a particular sequence variation might be pathogenic.
Some MeCP2 sequences have not been included on this page.
Chicken MAR-binding protein ARBP (NCBI GI 2388804) was not included because the protein does not align properly due to the presence of trinucleotide amplifications, and it is truncated compared to other MeCP2 proteins.Primate MeCP2 proteins were not included as there were virtually no differences between these sequences and human MeCP2. Chimpanzee MeCP2 protein is identical to human MeCP2 protein for both the chimpanzee MeCP_e1 and MeCP2_e2. A MeCP2_e2 sequence found in the crab-eating macaque (NCBI GI 15419705) has only one amino acid different to the human MeCP2_e2 sequence, which means the two sequences are 99.8% similar.
A putative Fugu rubripes sequence homologous to human MECP2 has been described. The sequence was predicted by a computer program. The putative sequence has more exons than the human MECP2 sequence, and the predicted sequence did not include the equivalent of human exon 1 or exon 2. A DNA sequence that translates to "PQDLSTSRP" (an amino acid sequence found in zebrafish protein) was found in F. rubripes sequence regarded by ensembl as intronic. In addition, when the putative Fugu sequence was compared with a putative Tetraodon nigroviridis MECP2 sequence (ENSEMBL Gene ID GSTENG00009035001), the predicted exons and splice sites were different.
Alignment of MeCP2 homologues
Notes: Xtrop is short for Xenopus tropicalis, the Western clawed frog, and Xlaevis is short for Xenopus laevis, the African clawed frog. MeCP2_e2 refers to the protein based on the coding sequence in exons 2, 3 and 4, whereas MeCP2_e1 refers to the protein based on the coding sequence in exons 1, 3 and 4. The symbols underneath the alignment indicate how strongly conserved the amino acids are. An asterisk ("*") indicates that the amino acid is identical in all species. A colon (":") indicates that the amino acid is strongly conserved in all of the organisms, while a full stop indicates that the amino acid is weakly conserved. A space (" ") indicates the amino acid is not conserved in all of the species. This is a page that explains what counts as strong or weak conservation.MeCP2_e2 transcript
10 20 30 40 50 | | | | | humanMeCP2_e2prot 1 MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHEPVQP------SAHHSA 54 cattleMeCP2_e2prot MVAGMLGLREEKSEEQDLQGLKDKPLKFKKVKKDKKEDKEGKHEPLQP------AAHHSA dogMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKEKKEDKEGKHEPLQP------PAHHSA mouseMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLRDKPLKFKKAKKDKKEDKEGKHEPLQP------SAHHSA ratMeCP2_e2prot MVAGMLGLREEKSEDQDLQGLKEKPLKFKKVKKDKKEDKEGKHEPLQP------SAHHSA possumMeCP2_e2prot MVAGMLGLREEQSEDQDLQGLRDKPLKFRKLKRDKKEEKEGKHEFPQP------SSHQSA XtropMeCP2_e1prot ---------EEKSEDQDLQGQKDKPPKLRKVKRDKKDEEE-KQETFHP------SEHQSG XlaevisMeCP2_e1prot ---------EEKSEDQDLQGQKDKPPKLRKVKKDKKDEEE-KQEPFHS------SEHQPG zebrafishMeCP2_e1prot -------RGEDKNEDQ--EGSKDKTQKHKKSKKERHDVEKLETTVSVPPPPSLFTQRDVG *::.*:* :* ::*. * :* *::::: :: : . . :. . 60 70 80 90 100 110 | | | | | | humanMeCP2_e2prot 55 EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS 113 cattleMeCP2_e2prot EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS dogMeCP2_e2prot EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS mouseMeCP2_e2prot EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS ratMeCP2_e2prot EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS possumMeCP2_e2prot EPAEAGKAETSE-EAGSAPAAPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS XtropMeCP2_e1prot EPADEGKADISE-SAEESLAVPEASASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS XlaevisMeCP2_e1prot EPADEGKADMSE-SAEENLAVPESSASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS zebrafishMeCP2_e1prot QQAEAGKSEPIDPEVGAALSAPESSASAKQRRSVIRDRGPMYEDPSLPQGWTRKLKQRKS : *: **:: : :.**:***.*****:********:**:**:*********** 120 130 140 150 160 170 | | | | | | humanMeCP2_e2prot 114 GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP 173 cattleMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP dogMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP mouseMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP ratMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP possumMeCP2_e2prot GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP XtropMeCP2_e1prot GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP XlaevisMeCP2_e1prot GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP zebrafishMeCP2_e1prot GRSAGKFDVYLINPEGKAFRSKVELMAYFQKVGDTITDPNDFDFTVTGRGSPSRREKRPP ******:*******:**********:***:***** *******************:: * 180 190 200 210 220 230 | | | | | | humanMeCP2_e2prot 174 KKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGK 233 cattleMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTTRPKAAASEGVQVKRVLEKSPGKLLVKMPFQAAPGSK dogMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTARPKAATSEGVQVKRVLEKSPGKLLVKMPFQASPGSK mouseMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLVVKMPFQASPGGK ratMeCP2_e2prot KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLLVKMPFQASPGGK possumMeCP2_e2prot KKSKSPKAPGTGRGRGRPKGSGTVKPRVTASEGVQVKRVIEKSPGKLLVKMPFQPSPGGK XtropMeCP2_e1prot KKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSEGVQVKRVIEKSPGKLLVKMPFS---GSK XlaevisMeCP2_e1prot KKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSEGVQVKRVIEKSPGKLLVKMPYS---GTK zebrafishMeCP2_e1prot KKPKMVKP--SGRGRGRPKGSGKVR---QATEGVAVKRVIEKSPGKLLVKMPFVAP---K **.* *. :********** . :*** ****:*******:****: * 240 250 260 270 | | | | humanMeCP2_e2prot 234 AEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA 278 cattleMeCP2_e2prot AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA dogMeCP2_e2prot AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA mouseMeCP2_e2prot GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA ratMeCP2_e2prot GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA possumMeCP2_e2prot AEGGGATTSTQVMVIKRPGRKRKVETEPQVIPKKRGRKPG---------------SIVAA XtropMeCP2_e1prot -EESDATTSEQVLVIKRPGRKRKSDTDPSAAPKKRGRKPGSV-------------SLAAA XlaevisMeCP2_e1prot -EASDATTSQQVLVIKRGGRKRKSETDPSAAPKKRGRKPSNV-------------SLAAA zebrafishMeCP2_e1prot TEPGAPLGQAPVAKARR-GRKRKSEQDPPSTPKKRGRKPATVSQSTVGTGSAAAYAAAAI * . . . * :* ***** : :* ********. : .* 280 290 300 310 320 | | | | | humanMeCP2_e2prot 279 AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL 328 cattleMeCP2_e2prot ATAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL dogMeCP2_e2prot AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL mouseMeCP2_e2prot AAAEAKKKAVKESSIRSVHETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL ratMeCP2_e2prot AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL possumMeCP2_e2prot AAVEAKKKAIKESSIRSIHETVLPIKKRKTREAVS------IEVKEVVKPLL----VSTV XtropMeCP2_e1prot AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTVKPEP----LTPV XlaevisMeCP2_e1prot AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTIKPEP----LTPV zebrafishMeCP2_e1prot LTAEAKKKALKESSAKPVQERALPIKKRKTRETLEELEASTTSATETFEKRLTASTVTPT : *****:**** :.: * .**********::. ...:..: ::. 330 340 350 360 370 | | | | | humanMeCP2_e2prot 329 GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES 375 cattleMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSGSASS---PPKKE----------HHHHHHHVEP dogMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSEP mouseMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES ratMeCP2_e2prot GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHAES possumMeCP2_e2prot GEKSTKGLKPGKSPGRKSKESSPKGRSASTSSS--PPKKEQQQ-------QQQYHHHHYY XtropMeCP2_e1prot IEKSIKGQKPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH XlaevisMeCP2_e1prot IEKVMKGQNPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH zebrafishMeCP2_e1prot GEEAETGQKPHKHPSRKHKEADPGSSSSGTTASGVAPKSHKKRDQRGQHFKHHHHHHHHH *: .* :. * * : .*..* :. **.. ::::*** 380 390 400 410 420 | | | | | humanMeCP2_e2prot 376 PKAPVPLLPPLPPPPPEPESSEDPTSP-------PEPQDLSSSVCKEEKMPRGGSLESDG 428 cattleMeCP2_e2prot PKAPAPLLLPPPPPPPEPQSSEDPASP-------PEPQDLSSSVCKEEKMPRAGSLESDG dogMeCP2_e2prot PKAPAPLLPPPPPPPPEPQSSEDPASP-------PEPQDLSSGVCKEEKMARGGSLESDG mouseMeCP2_e2prot TKAPMPLLP--SPPPPEPESSEDPISP-------PEPQDLSSSICKEEKMPRGGSLESDG ratMeCP2_e2prot PKAPMPLLP--PPPPPEPQSSEDPISP-------PEPQDLSSSICKEEKMPRAGSLESDG possumMeCP2_e2prot PSSESPKAP--PPPHPEPEGSKDSKSP-------PEPQDLSSKVCKEEKMPRGAPPESDG XtropMeCP2_e1prot HHHSESKAS---ATSPEPETSKDSIGA-------PEPQDLSVKIYKEEKLP-----ESDG XlaevisMeCP2_e1prot HHHSESKAS---ATSPEPETSKDNIGV-------QEPQDLSVKMCKEEKLP-----ESDG zebrafishMeCP2_e1prot HQHQHLQAS--TPSTYTPQAHQLSLGHSTHGGLENEPQDLSTSRPKAEHVACR--EEART .. *: : . ****** * *::. *: 430 440 450 460 470 | | | | | humanMeCP2_e2prot 429 CPKEPAKTQPAVA------------TAATAAEKYKHRGEGERKDIVS-SSMPRPNREEPV 475 cattleMeCP2_e2prot CPKEPAKTQPALA------------TAAPATEKYKHRGEGERKDIVS-SSMPRPNREEPV dogMeCP2_e2prot CPKEPAKTQPTVA------------TAATAADKYKHRGEGERKDIVS-SSMPRPNREEPV mouseMeCP2_e2prot CPKEPAKTQPMVA------------TTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV ratMeCP2_e2prot CPKEPAKTQPMVAAA----ATTTTTTTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV possumMeCP2_e2prot CTKELAKTQPTAAAASAAATAATATTATTAAEKFKHRAEGDRKDIVS-SSMPRPNREDPV XtropMeCP2_e1prot CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV XlaevisMeCP2_e1prot CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV zebrafishMeCP2_e1prot GSSSSRDSQN----------------ASKMASMTVTGESKELRDIVPPSAVPRPSREETV ... .:* :. . : :***. * :***.**:.* 480 | humanMeCP2_e2prot 476 DSRTPVTERVS 486 cattleMeCP2_e2prot DSRTPVTERVS dogMeCP2_e2prot DSRTPVTERVS mouseMeCP2_e2prot DSRTPVTERVS ratMeCP2_e2prot DSRTPVTERVS possumMeCP2_e2prot DSRTPVTERVS XtropMeCP2_e1prot DTRTTVTERVS XlaevisMeCP2_e1prot DTRTTVTERVS zebrafishMeCP2_e1prot ESRTPVSEPVS ::**.*:* **
MeCP2_e1 transcript
10 20 30 40 | | | | humanMeCP2_e1prot 1 ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK 44 cattleMeCP2_e1prot ----------------MAAAAAAAPSGGGGGGEEERLEEKSEEQDLQGLKDKPLKFKKVK dogMeCP2_e1prot ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK mouseMeCP2_e1prot -----------MAAAAATAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLRDKPLKFKKAK ratMeCP2_e1prot MAAAAAAAAAAAAAAAAAAAAAAAAPSGGGGGEEERLEEKSEDQDLQGLKEKPLKFKKVK possumMeCP2_e1prot ----------------MAAAAALS---GGGGGEEDRLEEQSEDQDLQGLRDKPLKFRKLK XtropMeCP2_e1prot ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK XlaevisMeCP2_e1prot ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK zebrafishMeCP2_e1prot ------------------MAAAES-------GEE-RLRGEDKNEDQEGSKDKTQKHKKSK *** : *** **. :.:::* :* ::*. * :* * 50 60 70 80 90 | | | | | humanMeCP2_e1prot 45 KDKKEEKEGKHEPVQP------SAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR 97 cattleMeCP2_e1prot KDKKEDKEGKHEPLQP------AAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR dogMeCP2_e1prot KEKKEDKEGKHEPLQP------PAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR mouseMeCP2_e1prot KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR ratMeCP2_e1prot KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR possumMeCP2_e1prot RDKKEEKEGKHEFPQP------SSHQSAEPAEAGKAETSE-EAGSAPAAPEASASPKQRR XtropMeCP2_e1prot RDKKDEEE-KQETFHP------SEHQSGEPADEGKADISE-SAEESLAVPEASASPKQRR XlaevisMeCP2_e1prot KDKKDEEE-KQEPFHS------SEHQPGEPADEGKADMSE-SAEENLAVPESSASPKQRR zebrafishMeCP2_e1prot KERHDVEKLETTVSVPPPPSLFTQRDVGQQAEAGKSEPIDPEVGAALSAPESSASAKQRR ::::: :: : . . :. .: *: **:: : :.**:***.**** 100 110 120 130 140 150 | | | | | | humanMeCP2_e1prot 98 SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV 157 cattleMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV dogMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV mouseMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV ratMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV possumMeCP2_e1prot SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV XtropMeCP2_e1prot SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV XlaevisMeCP2_e1prot SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV zebrafishMeCP2_e1prot SVIRDRGPMYEDPSLPQGWTRKLKQRKSGRSAGKFDVYLINPEGKAFRSKVELMAYFQKV *:********:**:**:*****************:*******:**********:***:** 160 170 180 190 200 210 | | | | | | humanMeCP2_e1prot 158 GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSE 217 cattleMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAAASE dogMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTARPKAATSE mouseMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE ratMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE possumMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKPPKKSKSPKAPGTGRGRGRPKGSGTVKPRVTASE XtropMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKQPKKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSE XlaevisMeCP2_e1prot GDTSLDPNDFDFTVTGRGSPSRREQKQPKKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSE zebrafishMeCP2_e1prot GDTITDPNDFDFTVTGRGSPSRREKRPPKKPKMVKP--SGRGRGRPKGSGKVR---QATE *** *******************:: ***.* *. :********** . :* 220 230 240 250 260 270 | | | | | | humanMeCP2_e1prot 218 GVQVKRVLEKSPGKLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIP 277 cattleMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQAAPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP dogMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQASPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP mouseMeCP2_e1prot GVQVKRVLEKSPGKLVVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP ratMeCP2_e1prot GVQVKRVLEKSPGKLLVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP possumMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPFQPSPGGKAEGGGATTSTQVMVIKRPGRKRKVETEPQVIP XtropMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPFS---GSK-EESDATTSEQVLVIKRPGRKRKSDTDPSAAP XlaevisMeCP2_e1prot GVQVKRVIEKSPGKLLVKMPYS---GTK-EASDATTSQQVLVIKRGGRKRKSETDPSAAP zebrafishMeCP2_e1prot GVAVKRVIEKSPGKLLVKMPFVAP---KTEPGAPLGQAPVAKARR-GRKRKSEQDPPSTP ** ****:*******:****: * * . . . * :* ***** : :* * 280 290 300 310 320 | | | | | humanMeCP2_e1prot 278 KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE 322 cattleMeCP2_e1prot KKRGRKPG---------------SVVAAATAEAKKKAVKESSIRSVQETVLPIKKRKTRE dogMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE mouseMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVHETVLPIKKRKTRE ratMeCP2_e1prot KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE possumMeCP2_e1prot KKRGRKPG---------------SIVAAAAVEAKKKAIKESSIRSIHETVLPIKKRKTRE XtropMeCP2_e1prot KKRGRKPGSV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE XlaevisMeCP2_e1prot KKRGRKPSNV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE zebrafishMeCP2_e1prot KKRGRKPATVSQSTVGTGSAAAYAAAAILTAEAKKKALKESSAKPVQERALPIKKRKTRE *******. : .* : *****:**** :.: * .********** 330 340 350 360 370 | | | | | humanMeCP2_e1prot 323 TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS 371 cattleMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSGS-AS dogMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS mouseMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS ratMeCP2_e1prot TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS possumMeCP2_e1prot AVS------IEVKEVVKPLL----VSTVGEKSTKGLKPGKSPGRKSKESSPKGRSASTSS XtropMeCP2_e1prot TIS------VDVKDTVKPEP----LTPVIEKSIKGQKPAKSPESRSTEGSPKIKTG--L- XlaevisMeCP2_e1prot TIS------VDVKDTIKPEP----LTPVIEKVMKGQNPAKSPESRSTEGSPKIKTG--L- zebrafishMeCP2_e1prot TLEELEASTTSATETFEKRLTASTVTPTGEEAETGQKPHKHPSRKHKEADPGSSSSGTTA ::. ...:..: ::. *: .* :. * * : .*..* :. 380 390 400 410 | | | | humanMeCP2_e1prot 372 S---PP--KKE-------HHHHHHHSESPKAPVPLLPPLPPPPPEPESSEDPTSP----- 414 cattleMeCP2_e1prot S---PP--KKE-------HHHHHHHVEPPKAPAPLLLPPPPPPPEPQSSEDPASP----- dogMeCP2_e1prot S---PP--KKE-------HHHHHHHSEPPKAPAPLLPPPPPPPPEPQSSEDPASP----- mouseMeCP2_e1prot S---PP--KKE-------HHHHHHHSESTKAPMPLLP--SPPPPEPESSEDPISP----- ratMeCP2_e1prot S---PP--KKE-------HHHHHHHAESPKAPMPLLP--PPPPPEPQSSEDPISP----- possumMeCP2_e1prot S---PP--KKEQQQ----QQQYHHHHYYPSSESPKAP--PPPHPEPEGSKDSKSP----- XtropMeCP2_e1prot ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDSIGA----- XlaevisMeCP2_e1prot ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDNIGV----- zebrafishMeCP2_e1prot SGVAPKSHKKRDQRGQHFKHHHHHHHHHHQHQHLQAS--TPSTYTPQAHQLSLGHSTHGG * **. ::::*** .. *: : . 420 430 440 450 460 | | | | | humanMeCP2_e1prot 415 --PEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVA------------TAATAAE 460 cattleMeCP2_e1prot --PEPQDLSSSVCKEEKMPRAGSLESDGCPKEPAKTQPALA------------TAAPATE dogMeCP2_e1prot --PEPQDLSSGVCKEEKMARGGSLESDGCPKEPAKTQPTVA------------TAATAAD mouseMeCP2_e1prot --PEPQDLSSSICKEEKMPRGGSLESDGCPKEPAKTQPMVA------------TTTTVAE ratMeCP2_e1prot --PEPQDLSSSICKEEKMPRAGSLESDGCPKEPAKTQPMVAAA----ATTTTTTTTTVAE possumMeCP2_e1prot --PEPQDLSSKVCKEEKMPRGAPPESDGCTKELAKTQPTAAAASAAATAATATTATTAAE XtropMeCP2_e1prot --PEPQDLSVKIYKEEKLP-----ESDGCAQEPAKTQP--------------------AD XlaevisMeCP2_e1prot --QEPQDLSVKMCKEEKLP-----ESDGCAQEPAKTQP--------------------AD zebrafishMeCP2_e1prot LENEPQDLSTSRPKAEHVACR--EEARTGSSSSRDSQN----------------ASKMAS ****** * *::. *: ... .:* :. 470 480 490 | | | humanMeCP2_e1prot 461 KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS 498 cattleMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS dogMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS mouseMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS ratMeCP2_e1prot KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS possumMeCP2_e1prot KFKHRAEGDRKDIVS-SSMPRPNREDPVDSRTPVTERVS XtropMeCP2_e1prot KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS XlaevisMeCP2_e1prot KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS zebrafishMeCP2_e1prot MTVTGESKELRDIVPPSAVPRPSREETVESRTPVSEPVS . : :***. * :***.**:.*::**.*:* **
Origin and reliability of MECP2 homologue sequences
Human (Homo sapiens):
MECP2_e2 transcript
>humanMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>humanMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKDKKEEKE GKHEPVQPSA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AATSEGVQVK RVLEKSPGKL LVKMPFQTSP GGKAEGGGAT TSTQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESPKAPV PLLPPLPPPP PEPESSEDPT SPPEPQDLSS SVCKEEKMPR GGSLESDGCP KEPAKTQPAV ATAATAAEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: NCBI GI 15079579 bases 85 to 1545.
Reliability: The coding sequence portions of NCBI GI 15079579 match exactly the human genome sequence.
MECP2_e1 transcript
>humanMECP2_e1dna
atggccgccg ccgccgccgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>humanMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKDKKEE KEGKHEPVQP SAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQT SPGGKAEGGG ATTSTQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSESPKA PVPLLPPLPP PPPEPESSED PTSPPEPQDL SSSVCKEEKM PRGGSLESDG CPKEPAKTQP AVATAATAAE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: NCBI GI 6959307 bases 8 to 69 and 194 to 1628.
Reliability: The coding sequence portions of NCBI GI 6959307 match exactly the human genome sequence.
Cattle (Bos taurus):
MECP2_e2 transcript
>cattleMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>cattleMECP2_e2prot
MVAGMLGLRE EKSEEQDLQG LKDKPLKFKK VKKDKKEDKE GKHEPLQPAA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AAASEGVQVK RVLEKSPGKL LVKMPFQAAP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAT AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSGSASS PPKKEHHHHH HHVEPPKAPA PLLLPPPPPP PEPQSSEDPA SPPEPQDLSS SVCKEEKMPR AGSLESDGCP KEPAKTQPAL ATAAPATEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.
Reliability: Fairly good agreement with sequences from the trace archive.
MECP2_e1 transcript
>cattleMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga
>cattleMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEEQDL QGLKDKPLKF KKVKKDKKED KEGKHEPLQP AAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAAASEGVQ VKRVLEKSPG KLLVKMPFQA APGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA ATAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSGSA SSPPKKEHHH HHHHVEPPKA PAPLLLPPPP PPPEPQSSED PASPPEPQDL SSSVCKEEKM PRAGSLESDG CPKEPAKTQP ALATAAPATE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.
Reliability: Fairly good agreement with sequences from the trace archive.
Dog (Canis familiaris):
MECP2_e2 transcript
>dogMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct
>dogMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKEKKEDKE GKHEPLQPPA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTARPK AATSEGVQVK RVLEKSPGKL LVKMPFQASP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSEPPKAPA PLLPPPPPPP PEPQSSEDPA SPPEPQDLSS GVCKEEKMAR GGSLESDGCP KEPAKTQPTV ATAATAADKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS
Derived from: Hand-based assembling of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
MECP2_e1 transcript
>dogMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct
>dogMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKEKKED KEGKHEPLQP PAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTAR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQA SPGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSEPPKA PAPLLPPPPP PPPEPQSSED PASPPEPQDL SSGVCKEEKM ARGGSLESDG CPKEPAKTQP TVATAATAAD KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS
Derived from: Hand-based assembling of sequences from the trace archive.
Reliability: Exon 1 has fairly good agreement with other sequences from the trace archive, exons 3 and 4 have good agreement with other sequences from the trace archive.
House mouse (Mus musculus):
MECP2_e2 transcript
>mouseMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga
>mouseMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LRDKPLKFKK AKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL VVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVHETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESTKAPM PLLPSPPPPE PESSEDPISP PEPQDLSSSI CKEEKMPRGG SLESDGCPKE PAKTQPMVAT TTTVAEKYKH RGEGERKDIV SSSMPRPNRE EPVDSRTPVT ERVS
Derived from: NCBI GI 20072599 bases 202 to 1656.
Reliability: Good agreement with sequences from trace archive.
MECP2_e1 transcript
>mouseMECP2_e1dna
atggccgccg ctgccgccac cgccgccgcc gccgccgcgc cgagcggagg aggaggagga ggcgaggagg agagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga
>mouseMECP2_e1prot
MAAAAATAAA AAAPSGGGGG GEEERLEEKS EDQDLQGLRD KPLKFKKAKK DKKEDKEGKH EPLQPSAHHS AEPAEAGKAE TSESSGSAPA VPEASASPKQ RRSIIRDRGP MYDDPTLPEG WTRKLKQRKS GRSAGKYDVY LINPQGKAFR SKVELIAYFE KVGDTSLDPN DFDFTVTGRG SPSRREQKPP KKPKSPKAPG TGRGRGRPKG SGTGRPKAAA SEGVQVKRVL EKSPGKLVVK MPFQASPGGK GEGGGATTSA QVMVIKRPGR KRKAEADPQA IPKKRGRKPG SVVAAAAAEA KKKAVKESSI RSVHETVLPI KKRKTRETVS IEVKEVVKPL LVSTLGEKSG KGLKTCKSPG RKSKESSPKG RSSSASSPPK KEHHHHHHHS ESTKAPMPLL PSPPPPEPES SEDPISPPEP QDLSSSICKE EKMPRGGSLE SDGCPKEPAK TQPMVATTTT VAEKYKHRGE GERKDIVSSS MPRPNREEPV DSRTPVTERV S
Derived from: NCBI GI 20072599 bases 27 to 103 and 228 to 1656.
Reliability: Good agreement with sequences from trace archive.
Norway rat (Rattus norvegicus):
MECP2_e2 transcript
>ratMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga
>ratMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKEKPLKFKK VKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL LVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHAESPKAPM PLLPPPPPPE PQSSEDPISP PEPQDLSSSI CKEEKMPRAG SLESDGCPKE PAKTQPMVAA AATTTTTTTT TVAEKYKHRG EGERKDIVSS SMPRPNREEP VDSRTPVTER VS
Derived from: NCBI GI 115312277 nucleotides 218 to 1696.
Reliability: Exons 2 to 4 match the rat genome exactly.
MECP2_e1 transcript
>ratMECP2_e1dna
atggccgccg ccgctgccgc cgctgccgcc gccgccgccg ccgctgccgc cgccgccgcc gccgccgccg ccgcgccgag cggaggagga ggaggcgagg aggagagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga
>ratMECP2_e1prot
MAAAAAAAAA AAAAAAAAAA AAAAAPSGGG GGEEERLEEK SEDQDLQGLK EKPLKFKKVK KDKKEDKEGK HEPLQPSAHH SAEPAEAGKA ETSESSGSAP AVPEASASPK QRRSIIRDRG PMYDDPTLPE GWTRKLKQRK SGRSAGKYDV YLINPQGKAF RSKVELIAYF EKVGDTSLDP NDFDFTVTGR GSPSRREQKP PKKPKSPKAP GTGRGRGRPK GSGTGRPKAA ASEGVQVKRV LEKSPGKLLV KMPFQASPGG KGEGGGATTS AQVMVIKRPG RKRKAEADPQ AIPKKRGRKP GSVVAAAAAE AKKKAVKESS IRSVQETVLP IKKRKTRETV SIEVKEVVKP LLVSTLGEKS GKGLKTCKSP GRKSKESSPK GRSSSASSPP KKEHHHHHHH AESPKAPMPL LPPPPPPEPQ SSEDPISPPE PQDLSSSICK EEKMPRAGSL ESDGCPKEPA KTQPMVAAAA TTTTTTTTTV AEKYKHRGEG ERKDIVSSSM PRPNREEPVD SRTPVTERVS
Derived from: NCBI GI 115312277 nucleotides 10 to 119, and 244 to 1696. Reliability: Exons 3 and 4 match the rat genome exactly. In exon 1, the rat genome has an extra triplet creating an extra alanine in the multi-alanine section.
Gray short-tailed opossum (Monodelphis domestica):
MECP2_e2 transcript
>possumMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga
>possumMECP2_e2prot
MVAGMLGLRE EQSEDQDLQG LRDKPLKFRK LKRDKKEEKE GKHEFPQPSS HQSAEPAEAG KAETSEEAGS APAAPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKSKSPK APGTGRGRGR PKGSGTVKPR VTASEGVQVK RVIEKSPGKL LVKMPFQPSP GGKAEGGGAT TSTQVMVIKR PGRKRKVETE PQVIPKKRGR KPGSIVAAAA VEAKKKAIKE SSIRSIHETV LPIKKRKTRE AVSIEVKEVV KPLLVSTVGE KSTKGLKPGK SPGRKSKESS PKGRSASTSS SPPKKEQQQQ QQYHHHHYYP SSESPKAPPP PHPEPEGSKD SKSPPEPQDL SSKVCKEEKM PRGAPPESDG CTKELAKTQP TAAAASAAAT AATATTATTA AEKFKHRAEG DRKDIVSSSM PRPNREDPVD SRTPVTERVS
Derived from: Hand-based assembling of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
MECP2_e1 transcript
>possumMECP2_e1dna
atggccgccg ccgccgcgct gagcggagga ggaggaggcg aggaggacag act
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga
>possumMECP2_e1prot
MAAAAALSGG GGGEEDRLEE QSEDQDLQGL RDKPLKFRKL KRDKKEEKEG KHEFPQPSSH QSAEPAEAGK AETSEEAGSA PAAPEASASP KQRRSIIRDR GPMYDDPTLP EGWTRKLKQR KSGRSAGKYD VYLINPQGKA FRSKVELIAY FEKVGDTSLD PNDFDFTVTG RGSPSRREQK PPKKSKSPKA PGTGRGRGRP KGSGTVKPRV TASEGVQVKR VIEKSPGKLL VKMPFQPSPG GKAEGGGATT STQVMVIKRP GRKRKVETEP QVIPKKRGRK PGSIVAAAAV EAKKKAIKES SIRSIHETVL PIKKRKTREA VSIEVKEVVK PLLVSTVGEK STKGLKPGKS PGRKSKESSP KGRSASTSSS PPKKEQQQQQ QYHHHHYYPS SESPKAPPPP HPEPEGSKDS KSPPEPQDLS SKVCKEEKMP RGAPPESDGC TKELAKTQPT AAAASAAATA ATATTATTAA EKFKHRAEGD RKDIVSSSMP RPNREDPVDS RTPVTERVS
Derived from: Hand-based assembling of sequences from the trace archive.
Reliability: Good agreement with other sequences from the trace archive.
Western clawed frog (Xenopus tropicalis):
Frogs and fish only have the MECP2_e1 transcript form.
>XtropMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaagatc aagatcttca aggacagaaa gataaaccac caaaactcag gaaagtaaaa agagacaaga aggatgagga agaaaagcag gaaacgtttc atccctctga gcaccagtca ggagaacctg cagatgaagg gaaagctgat atatctgaaa gtgctgagga aagccttgct gttcctgaag cctctgcctc tcccaagcag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactaaagca aagaaaatct ggtcgttctg ctggaaagtt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgaacttata gcatacttcc aaaaggtagg cgacacatcg ctggacccta atgattttga cttcactgta actgggagag ggagtccgtc tcgaagggaa cagaagcaac cgaaaaagtc taaagctcca aaatcttctg gaacagggag aggaagagga agacccaaag gaagtgtaaa agtaaagtca cctgtaaaat ctgaaggagt acaggttaaa agggtgatag agaagagtcc agggaagctt ttggtaaaaa tgcctttttc tggaagtaaa gaggaatccg atgcaacaac ctcagaacag gttttggtaa ttaaaagacc cggtcgtaaa agaaagtcag atacagaccc atcggcagct cctaaaaaac ggggaagaaa gccaggcagt gtgagcttgg ctgctgcagc agcagaagca gcaaagaaaa aagcaatcaa agagtcttcc atcaagcctc ttttagagac tgtgttacca ataaagaaac gcaagaccag ggagactatc agtgtagatg taaaagatac agtaaaaccg gagcctctta cacctgttat agaaaaaagc attaaaggac agaaacctgc aaaaagtcca gaaagcagaa gcacagaggg tagcccaaaa attaaaactg gcttgccgaa aaaggagctg cagcagcacc atcatcatca ccaccatcat catcaccatc atcactccga atccaaggca tctgccacca gtccagagcc agagacttca aaggacagca ttggggcccc agagccccag gacttaagtg tcaaaatata taaagaggag aagctacccg agagtgatgg ctgtgctcag gagccagcca agacgcagcc tgctgataaa tgtagaaacc gagcagaagg tgaaagaaaa gacattgtat catctgtccc tagaccaaca agagaagaac ccgtggacac cagaacaacg gttacggaaa gagttagctg a
>XtropMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKRDKKDE EEKQETFHPS EHQSGEPADE GKADISESAE ESLAVPEASA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKSKAP KSSGTGRGRG RPKGSVKVKS PVKSEGVQVK RVIEKSPGKL LVKMPFSGSK EESDATTSEQ VLVIKRPGRK RKSDTDPSAA PKKRGRKPGS VSLAAAAAEA AKKKAIKESS IKPLLETVLP IKKRKTRETI SVDVKDTVKP EPLTPVIEKS IKGQKPAKSP ESRSTEGSPK IKTGLPKKEL QQHHHHHHHH HHHHHSESKA SATSPEPETS KDSIGAPEPQ DLSVKIYKEE KLPESDGCAQ EPAKTQPADK CRNRAEGERK DIVSSVPRPT REEPVDTRTT VTERVS
Derived from: Hand-based assembling of sequences from the trace archive.
Reliability: Fairly good agreement with other sequences from the trace archive.
African clawed frog (Xenopus laevis):
>XlaevisMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaggatc aagatcttca aggacaaaaa gataaaccac caaaactcag gaaagtaaaa aaagacaaga aggatgagga agaaaagcag gaaccatttc attcctctga gcatcagccc ggagaacctg cagatgaagg gaaagctgat atgtctgaaa gtgctgagga aaaccttgct gttcctgaat cttctgcctc tcccaaacag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactcaagca aagaaaatct ggtcgttctg ctggaaaatt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgagcttata gcatacttcc aaaaggtagg ggacacatct ctagacccta atgattttga cttcactgta actgggagag ggagcccgtc tcgaagggaa cagaagcaac cgaaaaagcc taaagctcca aaatcttctg tatcagggag aggaagagga agacctaaag gaagtataaa aaaagttaag ccacctgtaa aatctgaagg agtacaagtc aaaagggtga tagagaagag tccgggaaaa cttttggtta aaatgcctta ttctggaact aaagaggcat cagatgcaac aacgtcacaa caggttttgg tcattaaaag aggcggtcgt aaaagaaaat cagaaactga tccatctgca gctcctaaaa aaagggggag aaagccaagc aacgtgagct tggctgctgc agcagcagaa gcagcaaaga aaaaagcaat caaagagtct tccatcaagc ctcttttaga gactgtgtta ccaataaaga aacgcaagac cagggagact atcagtgtag atgtaaaaga tacaataaaa ccagagcctc ttacacctgt tatagaaaaa gtcatgaaag gacaaaaccc tgcaaaaagt ccagaaagca gaagcacaga gggtagccca aaaattaaaa ctggcttgcc gaaaaaagag ctgcagcagc accatcatca tcatcaccac caccatcacc atcatcactc cgaatctaag gcatctgcca ccagtccaga gccagagact tcaaaggaca acattggggt tcaggagccc caggacttaa gtgtcaaaat gtgtaaagag gagaagctac cagaaagtga tggctgtgct caggagccag ccaagactca gcctgctgat aaatgtagaa accgagcaga aggtgaaaga aaagacattg tttcatctgt ccctagacca acaagagaag agcccgtgga caccagaaca acggtgacag aaagagttag ctga
>XlaevisMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKKDKKDE EEKQEPFHSS EHQPGEPADE GKADMSESAE ENLAVPESSA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKPKAP KSSVSGRGRG RPKGSIKKVK PPVKSEGVQV KRVIEKSPGK LLVKMPYSGT KEASDATTSQ QVLVIKRGGR KRKSETDPSA APKKRGRKPS NVSLAAAAAE AAKKKAIKES SIKPLLETVL PIKKRKTRET ISVDVKDTIK PEPLTPVIEK VMKGQNPAKS PESRSTEGSP KIKTGLPKKE LQQHHHHHHH HHHHHHSESK ASATSPEPET SKDNIGVQEP QDLSVKMCKE EKLPESDGCA QEPAKTQPAD KCRNRAEGER KDIVSSVPRP TREEPVDTRT TVTERVS
Derived from: NCBI GI 4139225 bases 14 to 1417.
Reliability: Not enough trace archive sequences to compare with this sequence.
Zebrafish (Danio rerio):
>zebrafishMECP2_e1dna
atggccgccg cagagagcgg agaggagaga ct
cagaggtgag gacaagaatg aagaccagga gggctcaaaa gacaagacgc agaagcataa gaaaagcaaa aaggaaaggc atgatgtgga aaaactggag accacagtct ctgttcctcc gcccccgtct ctctttacgc agagggatgt cggacagcag gcagaggcag ggaagtctga acccattgac cctgaagttg gagctgctct cagcgctcca gaatcttccg catcggccaa gcagcggcgg tctgtcattc gggacagagg cccaatgtat gaagatcctt cgctgcctca gggctggaca cgcaagctga aacagcgcaa atcagggcgc tccgctggca aatttgacgt ctaccttatc aa
cccagaaggg aaagccttcc gttccaaggt ggagctcatg gcatacttcc aaaaggttgg cgataccatt acagatccca atgactttga cttcacggtc acgggcaggg gaagcccgtc tcgcagagaa aaaagaccgc caaaaaagcc taaaatggtc aaaccctctg gacgtggaag ggggcggcct aaaggtagcg gcaaggtacg acaggctaca gaaggggtgg cggtgaaacg cgtcatagaa aagagtccag gaaaactctt agtaaagatg ccctttgtgg cccccaaaac tgaaccaggg gctcctttag ggcaagcgcc agttgccaaa gcacgccgag gacgtaagag gaaatcagag caggatccgc caagcacccc taaaaaacgt ggacgcaagc cagcaactgt ttcacagtca acagtgggga cggggtctgc tgctgcatac gccgctgcag ccattctcac cgccgaagcc aagaaaaaag ccctgaagga gtcttccgct aagcctgttc aggagagggc tcttcctatc aaaaaacgca aaacccgaga gactttagag gagctggagg catccaccac ctcagcgaca gagacctttg agaaacgact gactgcatca actgtgaccc ctaccgggga ggaggcagaa acaggacaga agcctcacaa gcatcccagc cggaagcaca aagaggcaga tccgggaagc agcagcagtg ggacgacagc cagcggagtt gcaccgaaga gtcacaagaa gagagatcag cgagggcagc actttaaaca ccaccaccac catcatcatc accaccatca acaccaacac ctgcaggcct ccacaccctc cacctacact ccgcaggctc accagctctc cctgggtcac tccacgcacg gcgggctgga aaacgagccg caggacttga gcacctccag gcccaaagcg gagcacgtgg cctgcaggga ggaggccaga actggcagct cctcgagtag ggactcccag aacgcaagca agatggcttc catgaccgtg acgggggaaa gcaaggagct gagagacatt gttcctccct ccgccgtccc gaggccgagt cgagaggaaa cggtggagtc ccggacacca gtgagcgagc cagtgagctg a
>zebrafishMECP2_e1prot
MAAAESGEER LRGEDKNEDQ EGSKDKTQKH KKSKKERHDV EKLETTVSVP PPPSLFTQRD VGQQAEAGKS EPIDPEVGAA LSAPESSASA KQRRSVIRDR GPMYEDPSLP QGWTRKLKQR KSGRSAGKFD VYLINPEGKA FRSKVELMAY FQKVGDTITD PNDFDFTVTG RGSPSRREKR PPKKPKMVKP SGRGRGRPKG SGKVRQATEG VAVKRVIEKS PGKLLVKMPF VAPKTEPGAP LGQAPVAKAR RGRKRKSEQD PPSTPKKRGR KPATVSQSTV GTGSAAAYAA AAILTAEAKK KALKESSAKP VQERALPIKK RKTRETLEEL EASTTSATET FEKRLTASTV TPTGEEAETG QKPHKHPSRK HKEADPGSSS SGTTASGVAP KSHKKRDQRG QHFKHHHHHH HHHHQHQHLQ ASTPSTYTPQ AHQLSLGHST HGGLENEPQD LSTSRPKAEH VACREEARTG SSSSRDSQNA SKMASMTVTG ESKELRDIVP PSAVPRPSRE ETVESRTPVS EPVS
Derived from: NCBI GI 37574905 base 1 to 1575.
Reliability: Fairly good agreement with sequences from the trace archive.
Citations
Ruthie E. Amir, Ignatia B. Van den Veyver, Mimi Wan, Charles Q. Tran, Uta Francke & Huda Y. Zoghbi 1999 Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2. Nature Genetics. 23(2): 185-8.Timur M. Yusufzai and Alan P. Wolffe. 2000 Functional consequences of Rett syndrome mutations on human MeCP2. Nucleic Acids Research. 28(21): 4172-4179.
J.D. Thompson, D.G. Higgins, and T.J. Gibson. 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research. 22(22): 4673-4680.