MeCP2 homologues

MeCP2 sequence alignment

Alignments have been used to help determine whether or not a variation at a given amino acid is pathogenic or harmless. Alignments of MeCP2 genes have featured in Amir et al. 1999 and Yusufzai and Wolffe 2000.

Since the publication of these papers, MECP2 has been sequenced in other organisms. This page is intended as a convenient reference to these sequences. I have not sequenced any of these organisms myself. Please note:

Some MeCP2 sequences have not been included on this page.

Chicken MAR-binding protein ARBP (NCBI GI 2388804) was not included because the protein does not align properly due to the presence of trinucleotide amplifications, and it is truncated compared to other MeCP2 proteins.

Primate MeCP2 proteins were not included as there were virtually no differences between these sequences and human MeCP2. Chimpanzee MeCP2 protein is identical to human MeCP2 protein for both the chimpanzee MeCP_e1 and MeCP2_e2. A MeCP2_e2 sequence found in the crab-eating macaque (NCBI GI 15419705) has only one amino acid different to the human MeCP2_e2 sequence, which means the two sequences are 99.8% similar.

A putative Fugu rubripes sequence homologous to human MECP2 has been described. The sequence was predicted by a computer program. The putative sequence has more exons than the human MECP2 sequence, and the predicted sequence did not include the equivalent of human exon 1 or exon 2. A DNA sequence that translates to "PQDLSTSRP" (an amino acid sequence found in zebrafish protein) was found in F. rubripes sequence regarded by ensembl as intronic. In addition, when the putative Fugu sequence was compared with a putative Tetraodon nigroviridis MECP2 sequence (ENSEMBL Gene ID GSTENG00009035001), the predicted exons and splice sites were different.

Alignment of MeCP2 homologues

Notes: Xtrop is short for Xenopus tropicalis, the Western clawed frog, and Xlaevis is short for Xenopus laevis, the African clawed frog. MeCP2_e2 refers to the protein based on the coding sequence in exons 2, 3 and 4, whereas MeCP2_e1 refers to the protein based on the coding sequence in exons 1, 3 and 4. The symbols underneath the alignment indicate how strongly conserved the amino acids are. An asterisk ("*") indicates that the amino acid is identical in all species. A colon (":") indicates that the amino acid is strongly conserved in all of the organisms, while a full stop indicates that the amino acid is weakly conserved. A space (" ") indicates the amino acid is not conserved in all of the species. This is a page that explains what counts as strong or weak conservation.

MeCP2_e2 transcript

                                    10        20        30        40              50     
                                    |         |         |         |               |      
humanMeCP2_e2prot      1   MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKDKKEEKEGKHEPVQP------SAHHSA 54
cattleMeCP2_e2prot         MVAGMLGLREEKSEEQDLQGLKDKPLKFKKVKKDKKEDKEGKHEPLQP------AAHHSA
dogMeCP2_e2prot            MVAGMLGLREEKSEDQDLQGLKDKPLKFKKVKKEKKEDKEGKHEPLQP------PAHHSA
mouseMeCP2_e2prot          MVAGMLGLREEKSEDQDLQGLRDKPLKFKKAKKDKKEDKEGKHEPLQP------SAHHSA
ratMeCP2_e2prot            MVAGMLGLREEKSEDQDLQGLKEKPLKFKKVKKDKKEDKEGKHEPLQP------SAHHSA
possumMeCP2_e2prot         MVAGMLGLREEQSEDQDLQGLRDKPLKFRKLKRDKKEEKEGKHEFPQP------SSHQSA
XtropMeCP2_e1prot          ---------EEKSEDQDLQGQKDKPPKLRKVKRDKKDEEE-KQETFHP------SEHQSG
XlaevisMeCP2_e1prot        ---------EEKSEDQDLQGQKDKPPKLRKVKKDKKDEEE-KQEPFHS------SEHQPG
zebrafishMeCP2_e1prot      -------RGEDKNEDQ--EGSKDKTQKHKKSKKERHDVEKLETTVSVPPPPSLFTQRDVG
                                    *::.*:*  :* ::*. * :* *::::: :: :     .      . :. .

                                60         70        80        90        100       110   
                                |          |         |         |         |         |     
humanMeCP2_e2prot     55   EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS 113
cattleMeCP2_e2prot         EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
dogMeCP2_e2prot            EPAEAGKAETSE-GSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
mouseMeCP2_e2prot          EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
ratMeCP2_e2prot            EPAEAGKAETSE-SSGSAPAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
possumMeCP2_e2prot         EPAEAGKAETSE-EAGSAPAAPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKS
XtropMeCP2_e1prot          EPADEGKADISE-SAEESLAVPEASASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS
XlaevisMeCP2_e1prot        EPADEGKADMSE-SAEENLAVPESSASPKQRRSVIRDRGPMYEDPTLPEGWTRKLKQRKS
zebrafishMeCP2_e1prot      QQAEAGKSEPIDPEVGAALSAPESSASAKQRRSVIRDRGPMYEDPSLPQGWTRKLKQRKS
                           : *: **::  :       :.**:***.*****:********:**:**:***********

                                 120       130       140       150       160       170   
                                 |         |         |         |         |         |     
humanMeCP2_e2prot    114   GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP 173
cattleMeCP2_e2prot         GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
dogMeCP2_e2prot            GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
mouseMeCP2_e2prot          GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
ratMeCP2_e2prot            GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
possumMeCP2_e2prot         GRSAGKYDVYLINPQGKAFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQKPP
XtropMeCP2_e1prot          GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP
XlaevisMeCP2_e1prot        GRSAGKFDVYLINPNGKAFRSKVELIAYFQKVGDTSLDPNDFDFTVTGRGSPSRREQKQP
zebrafishMeCP2_e1prot      GRSAGKFDVYLINPEGKAFRSKVELMAYFQKVGDTITDPNDFDFTVTGRGSPSRREKRPP
                           ******:*******:**********:***:*****  *******************:: *

                                 180       190       200       210       220       230   
                                 |         |         |         |         |         |     
humanMeCP2_e2prot    174   KKPKSPKAPGTGRGRGRPKGSGTTRPKAATSEGVQVKRVLEKSPGKLLVKMPFQTSPGGK 233
cattleMeCP2_e2prot         KKPKSPKAPGTGRGRGRPKGSGTTRPKAAASEGVQVKRVLEKSPGKLLVKMPFQAAPGSK
dogMeCP2_e2prot            KKPKSPKAPGTGRGRGRPKGSGTARPKAATSEGVQVKRVLEKSPGKLLVKMPFQASPGSK
mouseMeCP2_e2prot          KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLVVKMPFQASPGGK
ratMeCP2_e2prot            KKPKSPKAPGTGRGRGRPKGSGTGRPKAAASEGVQVKRVLEKSPGKLLVKMPFQASPGGK
possumMeCP2_e2prot         KKSKSPKAPGTGRGRGRPKGSGTVKPRVTASEGVQVKRVIEKSPGKLLVKMPFQPSPGGK
XtropMeCP2_e1prot          KKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSEGVQVKRVIEKSPGKLLVKMPFS---GSK
XlaevisMeCP2_e1prot        KKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSEGVQVKRVIEKSPGKLLVKMPYS---GTK
zebrafishMeCP2_e1prot      KKPKMVKP--SGRGRGRPKGSGKVR---QATEGVAVKRVIEKSPGKLLVKMPFVAP---K
                           **.*  *.  :********** .       :*** ****:*******:****:      *

                                 240       250       260       270                       
                                 |         |         |         |                         
humanMeCP2_e2prot    234   AEGGGATTSTQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA 278
cattleMeCP2_e2prot         AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
dogMeCP2_e2prot            AEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
mouseMeCP2_e2prot          GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
ratMeCP2_e2prot            GEGGGATTSAQVMVIKRPGRKRKAEADPQAIPKKRGRKPG---------------SVVAA
possumMeCP2_e2prot         AEGGGATTSTQVMVIKRPGRKRKVETEPQVIPKKRGRKPG---------------SIVAA
XtropMeCP2_e1prot          -EESDATTSEQVLVIKRPGRKRKSDTDPSAAPKKRGRKPGSV-------------SLAAA
XlaevisMeCP2_e1prot        -EASDATTSQQVLVIKRGGRKRKSETDPSAAPKKRGRKPSNV-------------SLAAA
zebrafishMeCP2_e1prot      TEPGAPLGQAPVAKARR-GRKRKSEQDPPSTPKKRGRKPATVSQSTVGTGSAAAYAAAAI
                            * . .  .  *   :* ***** : :*   ********.               : .* 

                            280       290       300       310             320            
                            |         |         |         |               |              
humanMeCP2_e2prot    279   AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL 328
cattleMeCP2_e2prot         ATAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
dogMeCP2_e2prot            AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
mouseMeCP2_e2prot          AAAEAKKKAVKESSIRSVHETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
ratMeCP2_e2prot            AAAEAKKKAVKESSIRSVQETVLPIKKRKTRETVS------IEVKEVVKPLL----VSTL
possumMeCP2_e2prot         AAVEAKKKAIKESSIRSIHETVLPIKKRKTREAVS------IEVKEVVKPLL----VSTV
XtropMeCP2_e1prot          AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTVKPEP----LTPV
XlaevisMeCP2_e1prot        AAEAAKKKAIKESSIKPLLETVLPIKKRKTRETIS------VDVKDTIKPEP----LTPV
zebrafishMeCP2_e1prot      LTAEAKKKALKESSAKPVQERALPIKKRKTRETLEELEASTTSATETFEKRLTASTVTPT
                            :  *****:**** :.: * .**********::.       ...:..:       ::. 

                            330       340       350       360                    370     
                            |         |         |         |                      |       
humanMeCP2_e2prot    329   GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES 375
cattleMeCP2_e2prot         GEKSGKGLKTCKSPGRKSKESSPKGRSGSASS---PPKKE----------HHHHHHHVEP
dogMeCP2_e2prot            GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSEP
mouseMeCP2_e2prot          GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHSES
ratMeCP2_e2prot            GEKSGKGLKTCKSPGRKSKESSPKGRSSSASS---PPKKE----------HHHHHHHAES
possumMeCP2_e2prot         GEKSTKGLKPGKSPGRKSKESSPKGRSASTSSS--PPKKEQQQ-------QQQYHHHHYY
XtropMeCP2_e1prot          IEKSIKGQKPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH
XlaevisMeCP2_e1prot        IEKVMKGQNPAKSPESRSTEGSPKIKTG-------LPKKELQQ-------HHHHHHHHHH
zebrafishMeCP2_e1prot      GEEAETGQKPHKHPSRKHKEADPGSSSSGTTASGVAPKSHKKRDQRGQHFKHHHHHHHHH
                            *:  .* :. * *  : .*..*   :.        **..          ::::***   

                               380       390       400              410       420        
                               |         |         |                |         |          
humanMeCP2_e2prot    376   PKAPVPLLPPLPPPPPEPESSEDPTSP-------PEPQDLSSSVCKEEKMPRGGSLESDG 428
cattleMeCP2_e2prot         PKAPAPLLLPPPPPPPEPQSSEDPASP-------PEPQDLSSSVCKEEKMPRAGSLESDG
dogMeCP2_e2prot            PKAPAPLLPPPPPPPPEPQSSEDPASP-------PEPQDLSSGVCKEEKMARGGSLESDG
mouseMeCP2_e2prot          TKAPMPLLP--SPPPPEPESSEDPISP-------PEPQDLSSSICKEEKMPRGGSLESDG
ratMeCP2_e2prot            PKAPMPLLP--PPPPPEPQSSEDPISP-------PEPQDLSSSICKEEKMPRAGSLESDG
possumMeCP2_e2prot         PSSESPKAP--PPPHPEPEGSKDSKSP-------PEPQDLSSKVCKEEKMPRGAPPESDG
XtropMeCP2_e1prot          HHHSESKAS---ATSPEPETSKDSIGA-------PEPQDLSVKIYKEEKLP-----ESDG
XlaevisMeCP2_e1prot        HHHSESKAS---ATSPEPETSKDNIGV-------QEPQDLSVKMCKEEKLP-----ESDG
zebrafishMeCP2_e1prot      HQHQHLQAS--TPSTYTPQAHQLSLGHSTHGGLENEPQDLSTSRPKAEHVACR--EEART
                                       ..   *:  :   .         ******    * *::.     *:  

                            430       440                   450       460        470     
                            |         |                     |         |          |       
humanMeCP2_e2prot    429   CPKEPAKTQPAVA------------TAATAAEKYKHRGEGERKDIVS-SSMPRPNREEPV 475
cattleMeCP2_e2prot         CPKEPAKTQPALA------------TAAPATEKYKHRGEGERKDIVS-SSMPRPNREEPV
dogMeCP2_e2prot            CPKEPAKTQPTVA------------TAATAADKYKHRGEGERKDIVS-SSMPRPNREEPV
mouseMeCP2_e2prot          CPKEPAKTQPMVA------------TTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV
ratMeCP2_e2prot            CPKEPAKTQPMVAAA----ATTTTTTTTTVAEKYKHRGEGERKDIVS-SSMPRPNREEPV
possumMeCP2_e2prot         CTKELAKTQPTAAAASAAATAATATTATTAAEKFKHRAEGDRKDIVS-SSMPRPNREDPV
XtropMeCP2_e1prot          CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV
XlaevisMeCP2_e1prot        CAQEPAKTQP--------------------ADKCRNRAEGERKDIVS-S-VPRPTREEPV
zebrafishMeCP2_e1prot      GSSSSRDSQN----------------ASKMASMTVTGESKELRDIVPPSAVPRPSREETV
                            ...  .:*                     :.      . : :***. * :***.**:.*

                               480                                                       
                               |                                                         
humanMeCP2_e2prot    476   DSRTPVTERVS 486
cattleMeCP2_e2prot         DSRTPVTERVS
dogMeCP2_e2prot            DSRTPVTERVS
mouseMeCP2_e2prot          DSRTPVTERVS
ratMeCP2_e2prot            DSRTPVTERVS
possumMeCP2_e2prot         DSRTPVTERVS
XtropMeCP2_e1prot          DTRTTVTERVS
XlaevisMeCP2_e1prot        DTRTTVTERVS
zebrafishMeCP2_e1prot      ESRTPVSEPVS
                           ::**.*:* **

MeCP2_e1 transcript

                                                    10        20        30        40     
                                                    |         |         |         |      
humanMeCP2_e1prot      1   ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK 44
cattleMeCP2_e1prot         ----------------MAAAAAAAPSGGGGGGEEERLEEKSEEQDLQGLKDKPLKFKKVK
dogMeCP2_e1prot            ----------------MAAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLKDKPLKFKKVK
mouseMeCP2_e1prot          -----------MAAAAATAAAAAAPSGGGGGGEEERLEEKSEDQDLQGLRDKPLKFKKAK
ratMeCP2_e1prot            MAAAAAAAAAAAAAAAAAAAAAAAAPSGGGGGEEERLEEKSEDQDLQGLKEKPLKFKKVK
possumMeCP2_e1prot         ----------------MAAAAALS---GGGGGEEDRLEEQSEDQDLQGLRDKPLKFRKLK
XtropMeCP2_e1prot          ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK
XlaevisMeCP2_e1prot        ------------------MAAAPS-------GEE-RLEEKSEDQDLQGQKDKPPKLRKVK
zebrafishMeCP2_e1prot      ------------------MAAAES-------GEE-RLRGEDKNEDQEGSKDKTQKHKKSK
                                              *** :       *** **. :.:::* :* ::*. * :* *

                                50        60              70         80        90        
                                |         |               |          |         |         
humanMeCP2_e1prot     45   KDKKEEKEGKHEPVQP------SAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR 97
cattleMeCP2_e1prot         KDKKEDKEGKHEPLQP------AAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR
dogMeCP2_e1prot            KEKKEDKEGKHEPLQP------PAHHSAEPAEAGKAETSE-GSGSAPAVPEASASPKQRR
mouseMeCP2_e1prot          KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR
ratMeCP2_e1prot            KDKKEDKEGKHEPLQP------SAHHSAEPAEAGKAETSE-SSGSAPAVPEASASPKQRR
possumMeCP2_e1prot         RDKKEEKEGKHEFPQP------SSHQSAEPAEAGKAETSE-EAGSAPAAPEASASPKQRR
XtropMeCP2_e1prot          RDKKDEEE-KQETFHP------SEHQSGEPADEGKADISE-SAEESLAVPEASASPKQRR
XlaevisMeCP2_e1prot        KDKKDEEE-KQEPFHS------SEHQPGEPADEGKADMSE-SAEENLAVPESSASPKQRR
zebrafishMeCP2_e1prot      KERHDVEKLETTVSVPPPPSLFTQRDVGQQAEAGKSEPIDPEVGAALSAPESSASAKQRR
                           ::::: :: :     .      . :. .: *: **::  :       :.**:***.****

                             100       110       120       130       140       150       
                             |         |         |         |         |         |         
humanMeCP2_e1prot     98   SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV 157
cattleMeCP2_e1prot         SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
dogMeCP2_e1prot            SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
mouseMeCP2_e1prot          SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
ratMeCP2_e1prot            SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
possumMeCP2_e1prot         SIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGKAFRSKVELIAYFEKV
XtropMeCP2_e1prot          SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV
XlaevisMeCP2_e1prot        SVIRDRGPMYEDPTLPEGWTRKLKQRKSGRSAGKFDVYLINPNGKAFRSKVELIAYFQKV
zebrafishMeCP2_e1prot      SVIRDRGPMYEDPSLPQGWTRKLKQRKSGRSAGKFDVYLINPEGKAFRSKVELMAYFQKV
                           *:********:**:**:*****************:*******:**********:***:**

                             160       170       180       190       200       210       
                             |         |         |         |         |         |         
humanMeCP2_e1prot    158   GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAATSE 217
cattleMeCP2_e1prot         GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTTRPKAAASE
dogMeCP2_e1prot            GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTARPKAATSE
mouseMeCP2_e1prot          GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE
ratMeCP2_e1prot            GDTSLDPNDFDFTVTGRGSPSRREQKPPKKPKSPKAPGTGRGRGRPKGSGTGRPKAAASE
possumMeCP2_e1prot         GDTSLDPNDFDFTVTGRGSPSRREQKPPKKSKSPKAPGTGRGRGRPKGSGTVKPRVTASE
XtropMeCP2_e1prot          GDTSLDPNDFDFTVTGRGSPSRREQKQPKKSKAPKSSGTGRGRGRPKGSVK-VKSPVKSE
XlaevisMeCP2_e1prot        GDTSLDPNDFDFTVTGRGSPSRREQKQPKKPKAPKSSVSGRGRGRPKGSIKKVKPPVKSE
zebrafishMeCP2_e1prot      GDTITDPNDFDFTVTGRGSPSRREKRPPKKPKMVKP--SGRGRGRPKGSGKVR---QATE
                           ***  *******************:: ***.*  *.  :********** .       :*

                             220       230       240       250       260       270       
                             |         |         |         |         |         |         
humanMeCP2_e1prot    218   GVQVKRVLEKSPGKLLVKMPFQTSPGGKAEGGGATTSTQVMVIKRPGRKRKAEADPQAIP 277
cattleMeCP2_e1prot         GVQVKRVLEKSPGKLLVKMPFQAAPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
dogMeCP2_e1prot            GVQVKRVLEKSPGKLLVKMPFQASPGSKAEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
mouseMeCP2_e1prot          GVQVKRVLEKSPGKLVVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
ratMeCP2_e1prot            GVQVKRVLEKSPGKLLVKMPFQASPGGKGEGGGATTSAQVMVIKRPGRKRKAEADPQAIP
possumMeCP2_e1prot         GVQVKRVIEKSPGKLLVKMPFQPSPGGKAEGGGATTSTQVMVIKRPGRKRKVETEPQVIP
XtropMeCP2_e1prot          GVQVKRVIEKSPGKLLVKMPFS---GSK-EESDATTSEQVLVIKRPGRKRKSDTDPSAAP
XlaevisMeCP2_e1prot        GVQVKRVIEKSPGKLLVKMPYS---GTK-EASDATTSQQVLVIKRGGRKRKSETDPSAAP
zebrafishMeCP2_e1prot      GVAVKRVIEKSPGKLLVKMPFVAP---KTEPGAPLGQAPVAKARR-GRKRKSEQDPPSTP
                           ** ****:*******:****:      * * . .  .  *   :* ***** : :*   *

                             280                      290       300       310       320  
                             |                        |         |         |         |    
humanMeCP2_e1prot    278   KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE 322
cattleMeCP2_e1prot         KKRGRKPG---------------SVVAAATAEAKKKAVKESSIRSVQETVLPIKKRKTRE
dogMeCP2_e1prot            KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE
mouseMeCP2_e1prot          KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVHETVLPIKKRKTRE
ratMeCP2_e1prot            KKRGRKPG---------------SVVAAAAAEAKKKAVKESSIRSVQETVLPIKKRKTRE
possumMeCP2_e1prot         KKRGRKPG---------------SIVAAAAVEAKKKAIKESSIRSIHETVLPIKKRKTRE
XtropMeCP2_e1prot          KKRGRKPGSV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE
XlaevisMeCP2_e1prot        KKRGRKPSNV-------------SLAAAAAEAAKKKAIKESSIKPLLETVLPIKKRKTRE
zebrafishMeCP2_e1prot      KKRGRKPATVSQSTVGTGSAAAYAAAAILTAEAKKKALKESSAKPVQERALPIKKRKTRE
                           *******.               : .*  :  *****:**** :.: * .**********

                                        330           340       350       360        370 
                                        |             |         |         |          |   
humanMeCP2_e1prot    323   TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS 371
cattleMeCP2_e1prot         TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSGS-AS
dogMeCP2_e1prot            TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
mouseMeCP2_e1prot          TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
ratMeCP2_e1prot            TVS------IEVKEVVKPLL----VSTLGEKSGKGLKTCKSPGRKSKESSPKGRSSS-AS
possumMeCP2_e1prot         AVS------IEVKEVVKPLL----VSTVGEKSTKGLKPGKSPGRKSKESSPKGRSASTSS
XtropMeCP2_e1prot          TIS------VDVKDTVKPEP----LTPVIEKSIKGQKPAKSPESRSTEGSPKIKTG--L-
XlaevisMeCP2_e1prot        TIS------VDVKDTIKPEP----LTPVIEKVMKGQNPAKSPESRSTEGSPKIKTG--L-
zebrafishMeCP2_e1prot      TLEELEASTTSATETFEKRLTASTVTPTGEEAETGQKPHKHPSRKHKEADPGSSSSGTTA
                           ::.       ...:..:       ::.  *:  .* :. * *  : .*..*   :.    

                                               380       390       400       410         
                                               |         |         |         |           
humanMeCP2_e1prot    372   S---PP--KKE-------HHHHHHHSESPKAPVPLLPPLPPPPPEPESSEDPTSP----- 414
cattleMeCP2_e1prot         S---PP--KKE-------HHHHHHHVEPPKAPAPLLLPPPPPPPEPQSSEDPASP-----
dogMeCP2_e1prot            S---PP--KKE-------HHHHHHHSEPPKAPAPLLPPPPPPPPEPQSSEDPASP-----
mouseMeCP2_e1prot          S---PP--KKE-------HHHHHHHSESTKAPMPLLP--SPPPPEPESSEDPISP-----
ratMeCP2_e1prot            S---PP--KKE-------HHHHHHHAESPKAPMPLLP--PPPPPEPQSSEDPISP-----
possumMeCP2_e1prot         S---PP--KKEQQQ----QQQYHHHHYYPSSESPKAP--PPPHPEPEGSKDSKSP-----
XtropMeCP2_e1prot          ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDSIGA-----
XlaevisMeCP2_e1prot        ----P---KKELQQ----HHHHHHHHHHHHHSESKAS---ATSPEPETSKDNIGV-----
zebrafishMeCP2_e1prot      SGVAPKSHKKRDQRGQHFKHHHHHHHHHHQHQHLQAS--TPSTYTPQAHQLSLGHSTHGG
                               *   **.       ::::***               ..   *:  :   .      

                                  420       430       440       450                   460
                                  |         |         |         |                     |  
humanMeCP2_e1prot    415   --PEPQDLSSSVCKEEKMPRGGSLESDGCPKEPAKTQPAVA------------TAATAAE 460
cattleMeCP2_e1prot         --PEPQDLSSSVCKEEKMPRAGSLESDGCPKEPAKTQPALA------------TAAPATE
dogMeCP2_e1prot            --PEPQDLSSGVCKEEKMARGGSLESDGCPKEPAKTQPTVA------------TAATAAD
mouseMeCP2_e1prot          --PEPQDLSSSICKEEKMPRGGSLESDGCPKEPAKTQPMVA------------TTTTVAE
ratMeCP2_e1prot            --PEPQDLSSSICKEEKMPRAGSLESDGCPKEPAKTQPMVAAA----ATTTTTTTTTVAE
possumMeCP2_e1prot         --PEPQDLSSKVCKEEKMPRGAPPESDGCTKELAKTQPTAAAASAAATAATATTATTAAE
XtropMeCP2_e1prot          --PEPQDLSVKIYKEEKLP-----ESDGCAQEPAKTQP--------------------AD
XlaevisMeCP2_e1prot        --QEPQDLSVKMCKEEKLP-----ESDGCAQEPAKTQP--------------------AD
zebrafishMeCP2_e1prot      LENEPQDLSTSRPKAEHVACR--EEARTGSSSSRDSQN----------------ASKMAS
                              ******    * *::.     *:   ...  .:*                     :.

                                    470        480       490                             
                                    |          |         |                               
humanMeCP2_e1prot    461   KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS 498
cattleMeCP2_e1prot         KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
dogMeCP2_e1prot            KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
mouseMeCP2_e1prot          KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
ratMeCP2_e1prot            KYKHRGEGERKDIVS-SSMPRPNREEPVDSRTPVTERVS
possumMeCP2_e1prot         KFKHRAEGDRKDIVS-SSMPRPNREDPVDSRTPVTERVS
XtropMeCP2_e1prot          KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS
XlaevisMeCP2_e1prot        KCRNRAEGERKDIVS-S-VPRPTREEPVDTRTTVTERVS
zebrafishMeCP2_e1prot      MTVTGESKELRDIVPPSAVPRPSREETVESRTPVSEPVS
                                 . : :***. * :***.**:.*::**.*:* **

Origin and reliability of MECP2 homologue sequences

Human (Homo sapiens):

MECP2_e2 transcript

>humanMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga

>humanMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKDKKEEKE GKHEPVQPSA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AATSEGVQVK RVLEKSPGKL LVKMPFQTSP GGKAEGGGAT TSTQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESPKAPV PLLPPLPPPP PEPESSEDPT SPPEPQDLSS SVCKEEKMPR GGSLESDGCP KEPAKTQPAV ATAATAAEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS

Derived from: NCBI GI 15079579 bases 85 to 1545.

Reliability: The coding sequence portions of NCBI GI 15079579 match exactly the human genome sequence.

MECP2_e1 transcript

>humanMECP2_e1dna
atggccgccg ccgccgccgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggacctcca gggcctcaag gacaaacccc tcaagtttaa aaaggtgaag aaagataaga aagaagagaa agagggcaag catgagcccg tgcagccatc agcccaccac tctgctgagc ccgcagaggc aggcaaagca gagacatcag aagggtcagg ctccgccccg gctgtgccgg aagcttctgc ctcccccaaa cagcggcgct ccatcatccg tgaccgggga cccatgtatg atgaccccac cctgcctgaa ggctggacac ggaagcttaa gcaaaggaaa tctggccgct ctgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacatcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggccgggga cgccccaaag ggagcggcac cacgagaccc aaggcggcca cgtcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcctgggaag ctccttgtca agatgccttt tcaaacttcg ccagggggca aggctgaggg gggtggggcc accacatcca cccaggtcat ggtgatcaaa cgccccggca ggaagcgaaa agctgaggcc gaccctcagg ccattcccaa gaaacggggc cgaaagccgg ggagtgtggt ggcagccgct gccgccgagg ccaaaaagaa agccgtgaag gagtcttcta tccgatctgt gcaggagacc gtactcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc caccctcggt gagaagagcg ggaaaggact gaagacctgt aagagccctg ggcggaaaag caaggagagc agccccaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagtcccc aaaggccccc gtgccactgc tcccacccct gcccccacct ccacctgagc ccgagagctc cgaggacccc accagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccc agaggaggct cactggagag cgacggctgc cccaaggagc cagctaagac tcagcccgcg gttgccaccg ccgccacggc cgcagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga

>humanMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKDKKEE KEGKHEPVQP SAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQT SPGGKAEGGG ATTSTQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSESPKA PVPLLPPLPP PPPEPESSED PTSPPEPQDL SSSVCKEEKM PRGGSLESDG CPKEPAKTQP AVATAATAAE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS

Derived from: NCBI GI 6959307 bases 8 to 69 and 194 to 1628.

Reliability: The coding sequence portions of NCBI GI 6959307 match exactly the human genome sequence.

Cattle (Bos taurus):

MECP2_e2 transcript

>cattleMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga

>cattleMECP2_e2prot
MVAGMLGLRE EKSEEQDLQG LKDKPLKFKK VKKDKKEDKE GKHEPLQPAA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTTRPK AAASEGVQVK RVLEKSPGKL LVKMPFQAAP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAT AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSGSASS PPKKEHHHHH HHVEPPKAPA PLLLPPPPPP PEPQSSEDPA SPPEPQDLSS SVCKEEKMPR AGSLESDGCP KEPAKTQPAL ATAAPATEKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS

Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.

Reliability: Fairly good agreement with sequences from the trace archive.

MECP2_e1 transcript

>cattleMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tccgaagagc aggatctcca gggcctgaag gacaaacctt tgaagttcaa aaaggtgaag aaggataaga aagaagacaa agagggcaag catgagcccc tgcagccagc agcccaccac tctgccgagc cagcagaggc cggcaaagca gagacctcag aagggtcagg ctcggcccca gccgtgccag aagcttctgc atcccccaag cagcggcgct ccatcattcg tgatcggggc cccatgtacg atgaccccac tctgccggaa ggttggaccc gaaagcttaa gcaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tacttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta accgggagag ggagcccctc ccggcgagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagtggcac cacgagaccc aaggcagctg cgtcagaggg tgtgcaagtg aaaagggttc tggagaaaag tcctggaaag ctactcgtca agatgccttt ccaagctgcg ccgggcagca aggcagaagg gggtggggcc accacctcag cccaggtcat ggtcatcaag cgccccggcc ggaagcgaaa agcggaggcc gacccccagg ccattcccaa gaaacgaggc cgaaagccgg gcagtgtggt tgctgccgcc actgccgagg ccaaaaagaa agccgtgaag gagtcatcta tccggtccgt tcaggagacc gtgctcccca tcaagaagcg caagacccgg gagacggtga gcattgaggt gaaggaggta gtgaagcccc tgctggtgtc cacgctcggc gagaagagcg ggaagggact gaagacctgc aagagcccag ggcggaaaag caaggagagc agtcccaagg ggcgcagcgg cagcgcctcc tcgcccccca agaaggagca ccaccaccac caccaccacg tggagccccc gaaggccccc gcgccgctgc tcctgccccc gcccccaccc ccgcccgagc cccagagctc cgaggaccct gccagccccc ctgagcccca ggacttgagc agcagcgtct gcaaagagga gaagatgccg agagcaggct cgctggagag cgatggctgc cccaaggagc ctgctaagac tcagcccgcg ctcgccaccg cggccccggc cacagaaaag tacaaacacc gaggggaggg agagcgcaaa gacattgtct catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ctga

>cattleMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEEQDL QGLKDKPLKF KKVKKDKKED KEGKHEPLQP AAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTTR PKAAASEGVQ VKRVLEKSPG KLLVKMPFQA APGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA ATAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSGSA SSPPKKEHHH HHHHVEPPKA PAPLLLPPPP PPPEPQSSED PASPPEPQDL SSSVCKEEKM PRAGSLESDG CPKEPAKTQP ALATAAPATE KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS

Derived from: ENSEMBL data, except for the section highlighted in red, which is derived from the trace archive.

Reliability: Fairly good agreement with sequences from the trace archive.

Dog (Canis familiaris):

MECP2_e2 transcript

>dogMECP2_e2dna
atggtagctg gaatgttagg gctcag
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct

>dogMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKDKPLKFKK VKKEKKEDKE GKHEPLQPPA HHSAEPAEAG KAETSEGSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTARPK AATSEGVQVK RVLEKSPGKL LVKMPFQASP GSKAEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSEPPKAPA PLLPPPPPPP PEPQSSEDPA SPPEPQDLSS GVCKEEKMAR GGSLESDGCP KEPAKTQPTV ATAATAADKY KHRGEGERKD IVSSSMPRPN REEPVDSRTP VTERVS

Derived from: Hand-based assembling of sequences from the trace archive.

Reliability: Good agreement with other sequences from the trace archive.

MECP2_e1 transcript

>dogMECP2_e1dna
atggccgccg ccgccgctgc cgcgccgagc ggaggaggag gaggaggcga ggaggagaga ct
ggaagaaaag tcagaagacc aggatctcca gggcctcaag gacaaacccc tgaaatttaa aaaggtgaag aaagagaaga aagaagacaa agagggcaag catgagcccc tgcagccacc ggctcaccac tctgctgaac cagcagaggc aggcaaagcg gagacctcag aagggtcagg ctcagcccca gctgtcccgg aagcttctgc ctcccccaaa cagcgacgct ctatcattcg tgaccgggga cccatgtatg acgaccccac tctgcctgaa ggttggaccc gaaagcttaa acaaaggaaa tctggccgct ccgctgggaa gtatgatgtg tatttgatca a
tccccaggga aaagcctttc gctctaaagt ggagttgatt gcgtacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagcccctc ccggcgagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag aggtcgggga cgccccaaag ggagcggcac tgcgagaccc aaggcagcaa catcagaggg tgtgcaggtg aaaagggtcc tggagaaaag tcccgggaag ctgctcgtca agatgccttt tcaagcttcg cccgggagca aggctgaagg gggcggggcc accacgtcag cccaggtcat ggttatcaaa cgcccaggcc ggaagcgaaa agccgaggct gacccccagg ccattcccaa gaagcggggc cgaaagccag gcagtgtggt ggcagctgcc gccgcagagg ccaaaaagaa agccgtgaag gagtcttcca tccggtccgt gcaggagact gtgctcccca tcaagaagcg caagactcgg gagacggtca gcattgaggt gaaggaggtg gtgaagcccc tgctggtgtc caccctcggc gagaagagtg gaaagggact gaagacctgc aagagccccg gacggaaaag caaggagagc agcccgaagg ggcgcagcag cagcgcctcc tcacccccca agaaggagca ccaccaccat caccaccact cagagccccc gaaggcaccc gcgccgctgc ttccgccccc gccccctccc ccacctgagc cccagagctc cgaggacccc gccagccccc ctgagcccca ggacttgagc agcggcgtct gcaaagagga gaagatggcg agaggaggct cgctggagag cgacggctgc cccaaggagc cagctaagac tcagcccacg gtcgcgaccg ccgccacggc cgcagacaag tacaaacacc gaggggaggg agagcgcaaa gacattgttt catcctccat gccaaggcca aacagagagg agcctgtgga cagccggacg cccgtgaccg agagagttag ct

>dogMECP2_e1prot
MAAAAAAAPS GGGGGGEEER LEEKSEDQDL QGLKDKPLKF KKVKKEKKED KEGKHEPLQP PAHHSAEPAE AGKAETSEGS GSAPAVPEAS ASPKQRRSII RDRGPMYDDP TLPEGWTRKL KQRKSGRSAG KYDVYLINPQ GKAFRSKVEL IAYFEKVGDT SLDPNDFDFT VTGRGSPSRR EQKPPKKPKS PKAPGTGRGR GRPKGSGTAR PKAATSEGVQ VKRVLEKSPG KLLVKMPFQA SPGSKAEGGG ATTSAQVMVI KRPGRKRKAE ADPQAIPKKR GRKPGSVVAA AAAEAKKKAV KESSIRSVQE TVLPIKKRKT RETVSIEVKE VVKPLLVSTL GEKSGKGLKT CKSPGRKSKE SSPKGRSSSA SSPPKKEHHH HHHHSEPPKA PAPLLPPPPP PPPEPQSSED PASPPEPQDL SSGVCKEEKM ARGGSLESDG CPKEPAKTQP TVATAATAAD KYKHRGEGER KDIVSSSMPR PNREEPVDSR TPVTERVS

Derived from: Hand-based assembling of sequences from the trace archive.

Reliability: Exon 1 has fairly good agreement with other sequences from the trace archive, exons 3 and 4 have good agreement with other sequences from the trace archive.

House mouse (Mus musculus):

MECP2_e2 transcript

>mouseMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga

>mouseMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LRDKPLKFKK AKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL VVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVHETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHSESTKAPM PLLPSPPPPE PESSEDPISP PEPQDLSSSI CKEEKMPRGG SLESDGCPKE PAKTQPMVAT TTTVAEKYKH RGEGERKDIV SSSMPRPNRE EPVDSRTPVT ERVS

Derived from: NCBI GI 20072599 bases 202 to 1656.

Reliability: Good agreement with sequences from trace archive.

MECP2_e1 transcript

>mouseMECP2_e1dna
atggccgccg ctgccgccac cgccgccgcc gccgccgcgc cgagcggagg aggaggagga ggcgaggagg agagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaga gacaagccac tgaagtttaa gaaggcgaag aaagacaaga aggaggacaa agaaggcaag catgagccac tacaaccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gaaacatcag aaagctcagg ctctgcccca gcagtgccag aagcctcggc ttcccccaaa cagcggcgct ccattatccg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacac gaaagcttaa acaaaggaag tctggccgat ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcttttc gctctaaagt agaattgatt gcatactttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcacggta actgggagag ggagcccctc caggagagag cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgccccaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaggtg aaaagggtcc tggagaagag ccctgggaaa cttgttgtca agatgccttt ccaagcatcg cctgggggta agggtgaggg aggtggggct accacatctg cccaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gcagctgagg ccaaaaagaa agccgtgaag gagtcttcca tacggtctgt gcatgagact gtgctcccca tcaagaagcg caagacccgg gagacggtca gcatcgaggt caaggaagtg gtgaagcccc tgctggtgtc cacccttggt gagaaaagcg ggaagggact gaagacctgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tccccaccta agaaggagca ccatcatcac caccatcact cagagtccac aaaggccccc atgccactgc tcccatcccc acccccacct gagcctgaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aagagaagat gccccgagga ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc tatggtcgcc accactacca cagttgcaga aaagtacaaa caccgagggg agggagagcg caaagacatt gtttcatctt ccatgccaag gccaaacaga gaggagcctg tggacagccg gacgcccgtg accgagagag ttagctga

>mouseMECP2_e1prot
MAAAAATAAA AAAPSGGGGG GEEERLEEKS EDQDLQGLRD KPLKFKKAKK DKKEDKEGKH EPLQPSAHHS AEPAEAGKAE TSESSGSAPA VPEASASPKQ RRSIIRDRGP MYDDPTLPEG WTRKLKQRKS GRSAGKYDVY LINPQGKAFR SKVELIAYFE KVGDTSLDPN DFDFTVTGRG SPSRREQKPP KKPKSPKAPG TGRGRGRPKG SGTGRPKAAA SEGVQVKRVL EKSPGKLVVK MPFQASPGGK GEGGGATTSA QVMVIKRPGR KRKAEADPQA IPKKRGRKPG SVVAAAAAEA KKKAVKESSI RSVHETVLPI KKRKTRETVS IEVKEVVKPL LVSTLGEKSG KGLKTCKSPG RKSKESSPKG RSSSASSPPK KEHHHHHHHS ESTKAPMPLL PSPPPPEPES SEDPISPPEP QDLSSSICKE EKMPRGGSLE SDGCPKEPAK TQPMVATTTT VAEKYKHRGE GERKDIVSSS MPRPNREEPV DSRTPVTERV S

Derived from: NCBI GI 20072599 bases 27 to 103 and 228 to 1656.

Reliability: Good agreement with sequences from trace archive.

Norway rat (Rattus norvegicus):

MECP2_e2 transcript

>ratMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga

>ratMECP2_e2prot
MVAGMLGLRE EKSEDQDLQG LKEKPLKFKK VKKDKKEDKE GKHEPLQPSA HHSAEPAEAG KAETSESSGS APAVPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKPKSPK APGTGRGRGR PKGSGTGRPK AAASEGVQVK RVLEKSPGKL LVKMPFQASP GGKGEGGGAT TSAQVMVIKR PGRKRKAEAD PQAIPKKRGR KPGSVVAAAA AEAKKKAVKE SSIRSVQETV LPIKKRKTRE TVSIEVKEVV KPLLVSTLGE KSGKGLKTCK SPGRKSKESS PKGRSSSASS PPKKEHHHHH HHAESPKAPM PLLPPPPPPE PQSSEDPISP PEPQDLSSSI CKEEKMPRAG SLESDGCPKE PAKTQPMVAA AATTTTTTTT TVAEKYKHRG EGERKDIVSS SMPRPNREEP VDSRTPVTER VS

Derived from: NCBI GI 115312277 nucleotides 218 to 1696.

Reliability: Exons 2 to 4 match the rat genome exactly.

MECP2_e1 transcript

>ratMECP2_e1dna
atggccgccg ccgctgccgc cgctgccgcc gccgccgccg ccgctgccgc cgccgccgcc gccgccgccg ccgcgccgag cggaggagga ggaggcgagg aggagagact
ggaggaaaag tcagaagacc aggatctcca gggcctcaaa gagaaacccc tgaagtttaa gaaggtgaag aaagacaaga aggaagacaa agagggcaaa catgaaccac tacagccttc agcccaccat tctgcagagc cagcagaggc aggcaaagca gagacatcag aaagctcagg ctctgcccca gcagtaccag aagcctctgc ttctcccaaa cagcgacgtt ccatcattcg tgaccgggga cctatgtatg atgaccccac cttgcctgaa ggttggacgc gaaagcttaa acagaggaag tctggtcgct ctgctggaaa gtatgatgta tatttgatca a
tccccaggga aaagcctttc gctctaaagt agaattgatt gcatattttg aaaaggtggg agacacctcc ttggacccta atgattttga cttcactgta actgggagag ggagcccttc caggagagaa cagaaaccac ctaagaagcc caaatctccc aaagctccag gaactggcag gggtcgggga cgcccgaaag ggagcggcac tgggagacca aaggcagcag catcagaagg tgttcaagtg aaaagggtcc tggagaagag ccctgggaaa cttctcgtca agatgccttt ccaagcatca cctgggggta agggtgaggg aggtggggct accacatctg cgcaggtcat ggtgatcaaa cgccctggca gaaagcgaaa agctgaagct gacccccagg ccattcctaa gaaacggggt agaaagcctg ggagtgtggt ggcagctgct gctgcagagg ccaaaaagaa agctgtgaag gaatcttcta tacggtctgt gcaggagact gtgctcccca tcaagaagcg caagacccgg gaaaccgtca gcattgaggt caaggaggtg gtgaagcccc tgctggtgtc tacacttggt gagaagagtg gaaagggact gaagacatgc aagagccctg ggcgtaaaag caaggagagc agccccaagg ggcgcagcag cagtgcctcc tcaccaccta agaaggagca ccatcatcac caccatcacg cagagtcccc aaaggccccc atgccattgc ttccacctcc acccccacct gagcctcaga gctctgagga ccccatcagc ccccctgagc ctcaggactt gagcagcagc atctgcaaag aggagaagat gccccgagca ggctcactgg aaagcgatgg ctgccccaag gagccagcta agactcagcc catggttgct gccgccgcca ccaccaccac caccaccacc accacagttg cagaaaagta caaacaccga ggggagggag agcgcaaaga cattgtttca tcctccatgc cgaggccaaa cagagaggag cctgtggaca gccggacgcc cgtgaccgag agagttagct ga

>ratMECP2_e1prot
MAAAAAAAAA AAAAAAAAAA AAAAAPSGGG GGEEERLEEK SEDQDLQGLK EKPLKFKKVK KDKKEDKEGK HEPLQPSAHH SAEPAEAGKA ETSESSGSAP AVPEASASPK QRRSIIRDRG PMYDDPTLPE GWTRKLKQRK SGRSAGKYDV YLINPQGKAF RSKVELIAYF EKVGDTSLDP NDFDFTVTGR GSPSRREQKP PKKPKSPKAP GTGRGRGRPK GSGTGRPKAA ASEGVQVKRV LEKSPGKLLV KMPFQASPGG KGEGGGATTS AQVMVIKRPG RKRKAEADPQ AIPKKRGRKP GSVVAAAAAE AKKKAVKESS IRSVQETVLP IKKRKTRETV SIEVKEVVKP LLVSTLGEKS GKGLKTCKSP GRKSKESSPK GRSSSASSPP KKEHHHHHHH AESPKAPMPL LPPPPPPEPQ SSEDPISPPE PQDLSSSICK EEKMPRAGSL ESDGCPKEPA KTQPMVAAAA TTTTTTTTTV AEKYKHRGEG ERKDIVSSSM PRPNREEPVD SRTPVTERVS

Derived from: NCBI GI 115312277 nucleotides 10 to 119, and 244 to 1696. Reliability: Exons 3 and 4 match the rat genome exactly. In exon 1, the rat genome has an extra triplet creating an extra alanine in the multi-alanine section.

Gray short-tailed opossum (Monodelphis domestica):

MECP2_e2 transcript

>possumMECP2_e2dna
atggtagctg ggatgttagg gctcag
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga

>possumMECP2_e2prot
MVAGMLGLRE EQSEDQDLQG LRDKPLKFRK LKRDKKEEKE GKHEFPQPSS HQSAEPAEAG KAETSEEAGS APAAPEASAS PKQRRSIIRD RGPMYDDPTL PEGWTRKLKQ RKSGRSAGKY DVYLINPQGK AFRSKVELIA YFEKVGDTSL DPNDFDFTVT GRGSPSRREQ KPPKKSKSPK APGTGRGRGR PKGSGTVKPR VTASEGVQVK RVIEKSPGKL LVKMPFQPSP GGKAEGGGAT TSTQVMVIKR PGRKRKVETE PQVIPKKRGR KPGSIVAAAA VEAKKKAIKE SSIRSIHETV LPIKKRKTRE AVSIEVKEVV KPLLVSTVGE KSTKGLKPGK SPGRKSKESS PKGRSASTSS SPPKKEQQQQ QQYHHHHYYP SSESPKAPPP PHPEPEGSKD SKSPPEPQDL SSKVCKEEKM PRGAPPESDG CTKELAKTQP TAAAASAAAT AATATTATTA AEKFKHRAEG DRKDIVSSSM PRPNREDPVD SRTPVTERVS

Derived from: Hand-based assembling of sequences from the trace archive.

Reliability: Good agreement with other sequences from the trace archive.

MECP2_e1 transcript

>possumMECP2_e1dna
atggccgccg ccgccgcgct gagcggagga ggaggaggcg aggaggacag act
ggaagaacag tctgaagacc aagacctcca gggcctcaga gataaacccc tgaagttcag aaagttgaag agggataaaa aggaggagaa agaaggaaaa catgaattcc cacagccatc atcacaccag tctgccgaac cagcagaggc aggaaaagca gaaacatcag aagaggctgg gtcagcccct gctgcacctg aagcttcagc ttctcctaaa caacggcgtt ctatcatccg agaccggggg cccatgtatg atgatcccac actaccagag ggctggacaa gaaaactgaa gcagaggaaa tcaggccgtt ctgctgggaa gtacgatgtc tatttgatca a
tccacaggga aaagcttttc gctccaaggt agagttgatt gcatacttcg aaaaggtagg cgacacctcc ctggacccta atgattttga cttcacggta actgggagag ggagtccctc ccgacgagag cagaaaccac ccaagaagtc caaatccccc aaggctccag ggacaggccg agggagggga cggcccaaag ggagcggcac agtgaaaccc cgggtcacag cctcagaagg ggtccaggtc aaaagggtga ttgagaaaag tcctgggaag ctcctagtca agatgccttt tcagccgtca cctgggggaa aggctgaagg gggtggggcc accacgtcca cccaagtcat ggtgatcaag cgccctggca ggaaacggaa agttgagacc gagccacagg tcatccctaa gaaacggggc cgtaagccgg ggagcatagt ggccgcagct gccgtggaag ccaagaagaa agcaatcaaa gagtcttcca tcaggtccat tcatgagacc gtgctgccca tcaaaaagcg gaagaccagg gaagccgtca gcatcgaggt gaaggaggtg gtgaagcctc tacttgtctc caccgtgggg gagaagagca cgaagggact caagcctgga aagagcccag gtcggaaaag caaagagagc agccccaaag ggcggagtgc cagcacctcc tcttcccccc cgaagaagga gcagcagcag cagcagcagt accaccacca ccactactac ccttcctcag agtcccccaa ggccccaccc ccacctcacc ccgagccaga gggctccaag gacagcaaaa gcccccccga acctcaggac ttaagcagca aagtttgcaa agaagagaag atgccaagag gggctccacc agagagtgat ggctgcacaa aggagctcgc taagactcag cccacagctg ctgccgcctc cgctgctgcc accgccgcca ccgccaccac cgccaccacg gcagcagaaa agttcaaaca ccgagcagag ggagaccgaa aggacattgt ctcgtcctcc atgccgaggc caaaccgaga ggatcctgtg gacagccgga cgcccgtgac agagagagtt agctga

>possumMECP2_e1prot
MAAAAALSGG GGGEEDRLEE QSEDQDLQGL RDKPLKFRKL KRDKKEEKEG KHEFPQPSSH QSAEPAEAGK AETSEEAGSA PAAPEASASP KQRRSIIRDR GPMYDDPTLP EGWTRKLKQR KSGRSAGKYD VYLINPQGKA FRSKVELIAY FEKVGDTSLD PNDFDFTVTG RGSPSRREQK PPKKSKSPKA PGTGRGRGRP KGSGTVKPRV TASEGVQVKR VIEKSPGKLL VKMPFQPSPG GKAEGGGATT STQVMVIKRP GRKRKVETEP QVIPKKRGRK PGSIVAAAAV EAKKKAIKES SIRSIHETVL PIKKRKTREA VSIEVKEVVK PLLVSTVGEK STKGLKPGKS PGRKSKESSP KGRSASTSSS PPKKEQQQQQ QYHHHHYYPS SESPKAPPPP HPEPEGSKDS KSPPEPQDLS SKVCKEEKMP RGAPPESDGC TKELAKTQPT AAAASAAATA ATATTATTAA EKFKHRAEGD RKDIVSSSMP RPNREDPVDS RTPVTERVS

Derived from: Hand-based assembling of sequences from the trace archive.

Reliability: Good agreement with other sequences from the trace archive.

Western clawed frog (Xenopus tropicalis):

Frogs and fish only have the MECP2_e1 transcript form.

>XtropMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaagatc aagatcttca aggacagaaa gataaaccac caaaactcag gaaagtaaaa agagacaaga aggatgagga agaaaagcag gaaacgtttc atccctctga gcaccagtca ggagaacctg cagatgaagg gaaagctgat atatctgaaa gtgctgagga aagccttgct gttcctgaag cctctgcctc tcccaagcag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactaaagca aagaaaatct ggtcgttctg ctggaaagtt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgaacttata gcatacttcc aaaaggtagg cgacacatcg ctggacccta atgattttga cttcactgta actgggagag ggagtccgtc tcgaagggaa cagaagcaac cgaaaaagtc taaagctcca aaatcttctg gaacagggag aggaagagga agacccaaag gaagtgtaaa agtaaagtca cctgtaaaat ctgaaggagt acaggttaaa agggtgatag agaagagtcc agggaagctt ttggtaaaaa tgcctttttc tggaagtaaa gaggaatccg atgcaacaac ctcagaacag gttttggtaa ttaaaagacc cggtcgtaaa agaaagtcag atacagaccc atcggcagct cctaaaaaac ggggaagaaa gccaggcagt gtgagcttgg ctgctgcagc agcagaagca gcaaagaaaa aagcaatcaa agagtcttcc atcaagcctc ttttagagac tgtgttacca ataaagaaac gcaagaccag ggagactatc agtgtagatg taaaagatac agtaaaaccg gagcctctta cacctgttat agaaaaaagc attaaaggac agaaacctgc aaaaagtcca gaaagcagaa gcacagaggg tagcccaaaa attaaaactg gcttgccgaa aaaggagctg cagcagcacc atcatcatca ccaccatcat catcaccatc atcactccga atccaaggca tctgccacca gtccagagcc agagacttca aaggacagca ttggggcccc agagccccag gacttaagtg tcaaaatata taaagaggag aagctacccg agagtgatgg ctgtgctcag gagccagcca agacgcagcc tgctgataaa tgtagaaacc gagcagaagg tgaaagaaaa gacattgtat catctgtccc tagaccaaca agagaagaac ccgtggacac cagaacaacg gttacggaaa gagttagctg a

>XtropMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKRDKKDE EEKQETFHPS EHQSGEPADE GKADISESAE ESLAVPEASA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKSKAP KSSGTGRGRG RPKGSVKVKS PVKSEGVQVK RVIEKSPGKL LVKMPFSGSK EESDATTSEQ VLVIKRPGRK RKSDTDPSAA PKKRGRKPGS VSLAAAAAEA AKKKAIKESS IKPLLETVLP IKKRKTRETI SVDVKDTVKP EPLTPVIEKS IKGQKPAKSP ESRSTEGSPK IKTGLPKKEL QQHHHHHHHH HHHHHSESKA SATSPEPETS KDSIGAPEPQ DLSVKIYKEE KLPESDGCAQ EPAKTQPADK CRNRAEGERK DIVSSVPRPT REEPVDTRTT VTERVS

Derived from: Hand-based assembling of sequences from the trace archive.

Reliability: Fairly good agreement with other sequences from the trace archive.

African clawed frog (Xenopus laevis):

>XlaevisMECP2_e1dna
atggccgctg cgccgagcgg agaggagaga ct
ggaagaaaaa tctgaggatc aagatcttca aggacaaaaa gataaaccac caaaactcag gaaagtaaaa aaagacaaga aggatgagga agaaaagcag gaaccatttc attcctctga gcatcagccc ggagaacctg cagatgaagg gaaagctgat atgtctgaaa gtgctgagga aaaccttgct gttcctgaat cttctgcctc tcccaaacag aggcggtctg ttattagaga caggggtccc atgtacgaag accccactct tcctgaaggc tggacacgaa aactcaagca aagaaaatct ggtcgttctg ctggaaaatt tgatgtatat ttaatcaa
ccctaatgga aaagcttttc ggtccaaagt tgagcttata gcatacttcc aaaaggtagg ggacacatct ctagacccta atgattttga cttcactgta actgggagag ggagcccgtc tcgaagggaa cagaagcaac cgaaaaagcc taaagctcca aaatcttctg tatcagggag aggaagagga agacctaaag gaagtataaa aaaagttaag ccacctgtaa aatctgaagg agtacaagtc aaaagggtga tagagaagag tccgggaaaa cttttggtta aaatgcctta ttctggaact aaagaggcat cagatgcaac aacgtcacaa caggttttgg tcattaaaag aggcggtcgt aaaagaaaat cagaaactga tccatctgca gctcctaaaa aaagggggag aaagccaagc aacgtgagct tggctgctgc agcagcagaa gcagcaaaga aaaaagcaat caaagagtct tccatcaagc ctcttttaga gactgtgtta ccaataaaga aacgcaagac cagggagact atcagtgtag atgtaaaaga tacaataaaa ccagagcctc ttacacctgt tatagaaaaa gtcatgaaag gacaaaaccc tgcaaaaagt ccagaaagca gaagcacaga gggtagccca aaaattaaaa ctggcttgcc gaaaaaagag ctgcagcagc accatcatca tcatcaccac caccatcacc atcatcactc cgaatctaag gcatctgcca ccagtccaga gccagagact tcaaaggaca acattggggt tcaggagccc caggacttaa gtgtcaaaat gtgtaaagag gagaagctac cagaaagtga tggctgtgct caggagccag ccaagactca gcctgctgat aaatgtagaa accgagcaga aggtgaaaga aaagacattg tttcatctgt ccctagacca acaagagaag agcccgtgga caccagaaca acggtgacag aaagagttag ctga

>XlaevisMECP2_e1prot
MAAAPSGEER LEEKSEDQDL QGQKDKPPKL RKVKKDKKDE EEKQEPFHSS EHQPGEPADE GKADMSESAE ENLAVPESSA SPKQRRSVIR DRGPMYEDPT LPEGWTRKLK QRKSGRSAGK FDVYLINPNG KAFRSKVELI AYFQKVGDTS LDPNDFDFTV TGRGSPSRRE QKQPKKPKAP KSSVSGRGRG RPKGSIKKVK PPVKSEGVQV KRVIEKSPGK LLVKMPYSGT KEASDATTSQ QVLVIKRGGR KRKSETDPSA APKKRGRKPS NVSLAAAAAE AAKKKAIKES SIKPLLETVL PIKKRKTRET ISVDVKDTIK PEPLTPVIEK VMKGQNPAKS PESRSTEGSP KIKTGLPKKE LQQHHHHHHH HHHHHHSESK ASATSPEPET SKDNIGVQEP QDLSVKMCKE EKLPESDGCA QEPAKTQPAD KCRNRAEGER KDIVSSVPRP TREEPVDTRT TVTERVS

Derived from: NCBI GI 4139225 bases 14 to 1417.

Reliability: Not enough trace archive sequences to compare with this sequence.

Zebrafish (Danio rerio):

>zebrafishMECP2_e1dna
atggccgccg cagagagcgg agaggagaga ct
cagaggtgag gacaagaatg aagaccagga gggctcaaaa gacaagacgc agaagcataa gaaaagcaaa aaggaaaggc atgatgtgga aaaactggag accacagtct ctgttcctcc gcccccgtct ctctttacgc agagggatgt cggacagcag gcagaggcag ggaagtctga acccattgac cctgaagttg gagctgctct cagcgctcca gaatcttccg catcggccaa gcagcggcgg tctgtcattc gggacagagg cccaatgtat gaagatcctt cgctgcctca gggctggaca cgcaagctga aacagcgcaa atcagggcgc tccgctggca aatttgacgt ctaccttatc aa
cccagaaggg aaagccttcc gttccaaggt ggagctcatg gcatacttcc aaaaggttgg cgataccatt acagatccca atgactttga cttcacggtc acgggcaggg gaagcccgtc tcgcagagaa aaaagaccgc caaaaaagcc taaaatggtc aaaccctctg gacgtggaag ggggcggcct aaaggtagcg gcaaggtacg acaggctaca gaaggggtgg cggtgaaacg cgtcatagaa aagagtccag gaaaactctt agtaaagatg ccctttgtgg cccccaaaac tgaaccaggg gctcctttag ggcaagcgcc agttgccaaa gcacgccgag gacgtaagag gaaatcagag caggatccgc caagcacccc taaaaaacgt ggacgcaagc cagcaactgt ttcacagtca acagtgggga cggggtctgc tgctgcatac gccgctgcag ccattctcac cgccgaagcc aagaaaaaag ccctgaagga gtcttccgct aagcctgttc aggagagggc tcttcctatc aaaaaacgca aaacccgaga gactttagag gagctggagg catccaccac ctcagcgaca gagacctttg agaaacgact gactgcatca actgtgaccc ctaccgggga ggaggcagaa acaggacaga agcctcacaa gcatcccagc cggaagcaca aagaggcaga tccgggaagc agcagcagtg ggacgacagc cagcggagtt gcaccgaaga gtcacaagaa gagagatcag cgagggcagc actttaaaca ccaccaccac catcatcatc accaccatca acaccaacac ctgcaggcct ccacaccctc cacctacact ccgcaggctc accagctctc cctgggtcac tccacgcacg gcgggctgga aaacgagccg caggacttga gcacctccag gcccaaagcg gagcacgtgg cctgcaggga ggaggccaga actggcagct cctcgagtag ggactcccag aacgcaagca agatggcttc catgaccgtg acgggggaaa gcaaggagct gagagacatt gttcctccct ccgccgtccc gaggccgagt cgagaggaaa cggtggagtc ccggacacca gtgagcgagc cagtgagctg a

>zebrafishMECP2_e1prot
MAAAESGEER LRGEDKNEDQ EGSKDKTQKH KKSKKERHDV EKLETTVSVP PPPSLFTQRD VGQQAEAGKS EPIDPEVGAA LSAPESSASA KQRRSVIRDR GPMYEDPSLP QGWTRKLKQR KSGRSAGKFD VYLINPEGKA FRSKVELMAY FQKVGDTITD PNDFDFTVTG RGSPSRREKR PPKKPKMVKP SGRGRGRPKG SGKVRQATEG VAVKRVIEKS PGKLLVKMPF VAPKTEPGAP LGQAPVAKAR RGRKRKSEQD PPSTPKKRGR KPATVSQSTV GTGSAAAYAA AAILTAEAKK KALKESSAKP VQERALPIKK RKTRETLEEL EASTTSATET FEKRLTASTV TPTGEEAETG QKPHKHPSRK HKEADPGSSS SGTTASGVAP KSHKKRDQRG QHFKHHHHHH HHHHQHQHLQ ASTPSTYTPQ AHQLSLGHST HGGLENEPQD LSTSRPKAEH VACREEARTG SSSSRDSQNA SKMASMTVTG ESKELRDIVP PSAVPRPSRE ETVESRTPVS EPVS

Derived from: NCBI GI 37574905 base 1 to 1575.

Reliability: Fairly good agreement with sequences from the trace archive.

Citations

Ruthie E. Amir, Ignatia B. Van den Veyver, Mimi Wan, Charles Q. Tran, Uta Francke & Huda Y. Zoghbi 1999 Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2. Nature Genetics. 23(2): 185-8.

Timur M. Yusufzai and Alan P. Wolffe. 2000 Functional consequences of Rett syndrome mutations on human MeCP2. Nucleic Acids Research. 28(21): 4172-4179.

J.D. Thompson, D.G. Higgins, and T.J. Gibson. 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research. 22(22): 4673-4680.