Prokaryote - Sulfolobus solfataricus

 

For:

 

 leuC (13815772 - 3-isopropylmalate dehydratase, large subunit (isopropylmalate isomerase) (alpha IPM isomerase) (IPMI) (leuC)) Sulfolobus solfataricus complete genome.

 

VC2492 (9657072 - 3-isopropylmalate dehydratase, large subunit) Vibrio cholerae chromosome I, complete chromosome.

 

NMB1036 (7226276 - 3-isopropylmalate dehydratase, large subunit) Neisseria meningitidis serogroup B strain MC58 complete genome.

 

CC0196 (13421319 - 3-isopropylmalate dehydratase, large subunit) Caulobacter crescentus complete genome.

 

HI0988 (1574017 - 3-isopropylmalate dehydratase, alpha subunit (leuC)) Haemophilus influenzae Rd complete genome.

XF2375 (9107553 - 3-isopropylmalate dehydratase large subunit) Xylella fastidiosa, complete genome.

CLUSTALW

VC2492          ------------------------------------------------------------
XF2375          TATCTTGAGCTCATTGAGTCTCATGATTTCTTTGTGGACTGGCCTACTTGTTGGGAGTCT
leuC            --------------------------------------------GTTTCTTTTTCGGATT
NMB1036         -----------------------------------CTTTCGTCTACCTCTGTGTGTTTTT
HI0988          ---------------------------------------ATACTTGCCTTCCTCCGTTTA
CC0196          -----GGTCAAGCTTGGGTGCAACCACACATAGATTGACAAACCAGCGTTTCGTGAGA-A
                                                                            
 
VC2492          -----------------AGTTTGAGAGTGGGGTTACCCCACTCCTCTTTAGCTCTCTGA-
XF2375          CCAGATGAATCGCACATGGGTTTATGCCGAGGCTAAGTGCCGAGTGTTCGACACTCTAT-
leuC            TAGTACGATGTAAATGTAATTTCTTCGTAAG---AAATGCTTATTTATTAATAATTTTA-
NMB1036         TGACAAGATGCAGATGATAATGCCTTTGAAAGAATGACGACAAGTATTCTTTCGTATCG-
HI0988          CGGGGGAAGGTGCCTGAAGGGCGGAAGGGGGCAATCTCCACCCTTAGTTATTTAGAAAT-
CC0196          CGTGATGCCACCCCCGGCGCTTGTCGCTCGGGCTTCCAGACCGTCTTCTTTCTAGTGGAC
                                                                            
 
VC2492          -GCTCAAGGAAGAAGC--------------------------------------------
XF2375          -ATTTTTGGCGTATGGGATGACATGGTGCGCACGTGG---AGATTCCACACGAAATCATC
leuC            -ACTCCTTA---------------------------------------------------
NMB1036         -AATCATGCTTTA---GATATAACTGCTTACTGAAAT---GGACAATATGTCTAATTTTT
HI0988          -ATCCCCAATAAACAGGGGGGACAAAGTATTTAAAGT---GCGGTTGAAAAACACATCGA
CC0196          GGCCCTCTTCGGGCGGAGCCCATCGGCCCCCGAAGGCTGAAAACGATGCGAGGGCGCGCC
                                                                            
 
VC2492          ------------------------------------------------------------
XF2375          CAATAATTCTCACATTTGAAAATGCTGAATTCCCATGATTGATCTTGATAAGCTCGCTCT
leuC            ------------------------------------------------------------
NMB1036         TTGTTAAAATTGTTTACAAAATAACACCGACTCAAAAATTAGACAAAATCTGTTGCGCGG
HI0988          ATTTTAACCGCACTTCCCTTAACGAATAGAGAAAAATAT---------------------
CC0196          GAAACATGGGCGCGCCCTATGTCGTTCGGGGCTGTTCGGGAGACTGGAAGGCCACAGGGC
                                                                            
 
VC2492          ------------------------------------------------------------
XF2375          AGTATGGTTGCCAACGAAGGGTGATATCAGTCGCCGATTGTTGTAACCACATTAGGAATT
leuC            ------------------------------------------------------------
NMB1036         TATAAAGAATACGTCTAAAATCCGCCCAATCCGCATTACCATTTATCCAAAAGAACAACA
HI0988          ------------------------------------------------------------
CC0196          GCAAGAAATTGTCGCGAGCCTGGGGCGCCACTGGATTTTGGCGCACGATCCGCCCTATAT
                                                                            
 
VC2492          -------
XF2375          GCTGCT-
leuC            -------
NMB1036         TC-----
HI0988          -------
CC0196          CCCCGCC

 

DIALIGN produced similar results, managing to align only a small stretch of T/A.

http://www.genomatix.de/cgi-bin/dialign/dialign.pl?SHOW=user_17_1.seq_48277.html&TASK=dialign
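One way to put a number on how T/A-heavy each upstream region is would be to compute the A+T fraction of a row after dropping the gap characters. The following is only a minimal sketch: it assumes "T/A" above refers to thymine/adenine content, and the function name at_fraction is hypothetical, not part of CLUSTALW or DIALIGN. The two rows are copied from the second block of the alignment above.

```python
def at_fraction(aligned_row):
    """Fraction of A/T among the real bases of one aligned row (gaps ignored)."""
    bases = [b for b in aligned_row.upper() if b in "ACGT"]
    if not bases:
        return 0.0  # row consists only of gap characters
    return sum(b in "AT" for b in bases) / len(bases)


# Rows taken from the second CLUSTALW block above.
rows = {
    "leuC": "TAGTACGATGTAAATGTAATTTCTTCGTAAG---AAATGCTTATTTATTAATAATTTTA-",
    "CC0196": "CGTGATGCCACCCCCGGCGCTTGTCGCTCGGGCTTCCAGACCGTCTTCTTTCTAGTGGAC",
}

for name, row in rows.items():
    print(f"{name:<16}{at_fraction(row):.2f}")
```

Comparing the fractions across organisms would show whether the short aligned stretch simply reflects shared base composition rather than a conserved motif.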

 

 


For:

 

thrS (13815789 - Threonyl-tRNA synthetase (thrS)) Sulfolobus solfataricus complete genome.

 

NMB0720 (7225946 - threonyl-tRNA synthetase) Neisseria meningitidis serogroup B strain MC58 complete genome.

 

HI1367 (1574199 - threonyl-tRNA synthetase (thrS)) Haemophilus influenzae Rd complete genome.

 

TP0837 (3323150 - threonyl-tRNA synthetase (thrS)) Treponema pallidum complete genome.

 

XF0736 (9105624 - threonyl-tRNA synthetase) Xylella fastidiosa, complete genome.

 

CLUSTALW

 

thrS            ------------------------------------------------------------

NMB0720         ----------------------------TGAATGTCAGTTGGGCGACAGGGGTCGAAATA

TP0837          ------------------------------------------------------------

XF0736          ------------------------------------------------------------

HI1367          TTTTTATTCCTTAACTGGGTATTTAAAAGTGCGGTTATTCTAGCCTTTATTTTTAAGAGA

                                                                           

 

thrS            ------------------------------------------------------------

NMB0720         TATTTTAAAAGACGGCATTATAAATGATTTCCCACGGTTTTTCAGACGACATCCCCAAAT

TP0837          -CGCGCTCACATCCAAAACAGAAAGCATCCTCTACTATACCCTACATACCACGTCCCTTC

XF0736          -AGGCGCCCTCCTGGCGCTTTTTTTGTTGCTTCGCGGGAAGCTCAAGATTGCATTGCTGA

HI1367          TAAAGCCTTTAGTCAGAAAAACATTATTTCCTTTTTGTAAAATATGGGCATTTATTTAAC

                                                                           

 

thrS            ---------------------------C-A-ATACTCTTTGTAACAGCTAAAGATTTAAG

NMB0720         CTTGCCGCAATGTTGCATAAAGAAACGC-ACATACCTCTTGCAAAAATTAAAACGACCCG

TP0837          CTACAGACTGCAGTGACGGCGCAGGCGC-ACTGGCTCAGTGCTTCCTCCAAAACGGC--G

XF0736          ATGTTTGCATTGATGCGTCGGCATTAGC-A-TCATTGCCTGATCCTGCTTAGCAGA---A

HI1367          AATACTTCAAGAAAGTGATAGAATTCGCCACGCATTTTTTGCCTAATTTCGCTCGTTC-A

                                           * *         **                  

 

thrS            TTAAATGACATAGCGATATT----------------------------------------

NMB0720         ATAAAATGCAAAAATTCTTTGAAGGCACGTAGCTCAGTTGGTTAGAGCACCACCTTGACA

TP0837          CCCATTGACAAACCACCCATAAGGTCTCACG-----------------------------

XF0736          TTCAACAACAATGATCCCTAAATACTCAATATATCAATCGCTGACACGCCTTTTATACT-

HI1367          ATGGTAGGAATGAAGCCAAGTAGGGCAAAAGTGCGGTTAATTTTTATAGAATTTTAAATG

                         *                                                 

 

thrS            ------------------------------------------------------------

NMB0720         TGGTGGGGGTCGTTGGTTCGAATCCAATCGTGCCTACCAAATTCCCATAACGGCATTTAT

TP0837          ------------------------------------------------------------

XF0736          ------------------------------------------------------------

HI1367          AACACGTCACCTTTCGTATGGGTGACCACTGTATAAGGAAAAAAC---------------

                                                                            

 

thrS            ------------------------------

NMB0720         GCCGTTATTTTTTAATCTTTCGGAGCGTTT

TP0837          ------------------------------

XF0736          ------------------------------

HI1367          ------------------------------

 

DIALIGN produced similar results, managing to align only a small stretch of T/A.

http://www.genomatix.de/cgi-bin/dialign/dialign.pl?SHOW=user_17_2.seq_78486.html&TASK=dialign
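The "*" line under each CLUSTALW block marks columns in which every sequence carries the same base. As a rough check on how little is conserved, those markers can be recomputed from the rows themselves. This is a minimal sketch, not CLUSTALW's own code: the function name conserved_columns is hypothetical, and the excerpt is the 14-column window of the third thrS block above that contains all of its "*" marks.

```python
def conserved_columns(rows):
    """Return a marker line with '*' under every fully conserved column.

    A column counts as conserved only if all rows share the same
    character and that character is not a gap ('-').
    """
    if len({len(r) for r in rows}) != 1:
        raise ValueError("aligned rows must all have the same length")
    marks = []
    for column in zip(*rows):
        first = column[0]
        same = first != "-" and all(base == first for base in column)
        marks.append("*" if same else " ")
    return "".join(marks)


# Excerpt (14 columns) from the third thrS block above.
excerpt = {
    "thrS": "C-A-ATACTCTTTG",
    "NMB0720": "C-ACATACCTCTTG",
    "TP0837": "C-ACTGGCTCAGTG",
    "XF0736": "C-A-TCATTGCCTG",
    "HI1367": "CCACGCATTTTTTG",
}

marker = conserved_columns(list(excerpt.values()))
for name, row in excerpt.items():
    print(f"{name:<16}{row}")
print(" " * 16 + marker)
print(marker.count("*"), "of", len(marker), "columns conserved")
```

Only four columns in this window are identical across all five sequences, which is consistent with the weak alignment reported here.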

 


For:

alaS (13813483 - Alanyl-tRNA synthetase (alaS)) Sulfolobus solfataricus complete genome.

 

NMB1595 (7226845 - alanyl-tRNA synthetase) Neisseria meningitidis serogroup B strain MC58 complete genome.

 

CC2529 (13424088 - alanyl-tRNA synthetase) Caulobacter crescentus complete genome.

 

HI0814 (1573826 - alanyl-tRNA synthetase (alaS)) Haemophilus influenzae Rd complete genome.

 

XF0124 (9104908 - alanyl-tRNA synthetase) Xylella fastidiosa, complete genome.

 

VC0545 (9654973 - alanyl-tRNA synthetase) Vibrio cholerae chromosome I, complete chromosome.

 

alaS            ---AATTCTTACAAAGGATAATAAAAGTAACACTTT---TTCCGAGAAATAGTCACATTT

HI0814          -----------------TTAACAAGA--AATTGTTT---TTTGAATAATCTGAAATAACT

VC0545          --------------------ATCGCTTTCACCCTCTACATTCCGCTTATTAATCTCGCTC

NMB1595         -----------------TTATCATTCCTTGCATATCGGGTTGGAGAAAGCGGCCATTATA

CC2529          ---------------------------GGGGCTTTTGGCTTGCGGCATGCAGACGCCGCG

XF0124          ATCGTGCATGAGGGAGGATGAGGATTCGGCTATTCCGTGTCGTGTTTGAAAGCGCTGCTG

                                                       *                   

 

alaS            --TAAATTTGTTTTTATAATACAGAATTTAATTTATGGATTAGTCCCTTCCTCATGGAAA

HI0814          --TAAATTT-TAACCGCACTTCTCAAATAGCGTCGTGAAGT-GCGGTTTTATTTTGGAGT

VC0545          --TGGATTT-TATGCATCTTGGTTGAGCTGCCGCTCGCCAC-ACATTCAGTTTGCGGAA-

NMB1595         --GCCGATATTGGCAACAGGGCTTCAGACGGCATTCAAAATCCCGCCACACTCTTCCGA-

CC2529          AGCGAGCATGCGCTGGATGCGCCGATGTCGGCCTTTTCCGCATCCTTGGGCGGCCCCACC

XF0124          --CAAACCGTTGTCTATCCGCGCCACGCTGTGGCGCATGATTGCATGCTTGCTCTCTTTT

                                                                           

 

alaS            TCTTAAATGAAATCTTAATGGCTATTTGTTTTAGAATCTA-AATTACGAGAGGAG---TC

HI0814          TTTGGCAAAAAGCCTTTTTATAAATTT---CAAAAATATGCAAAAATAGGGCGAC---TT

VC0545          ---GAGAAGAAACTAGGCGAACTGTTA--TCCTTGCTTTACAATAGCGCAAAATT---CT

NMB1595         ---AAACCGCCGCTTCCATAGCTAGAAA--CAGGGATTTGCGGTAAGATACCGCCG--TT

CC2529          TTCGACTCGGTTTCCCCTTCGCCGAAAA--CCGCGCTATGCAGCGCCCGACTTCCG--GC

XF0124          CTCTCCTGGGGTTGGTGCTATAAGGAGA---CGAATTGCAACATGCCGTGCATCTGATGT

                                                    *                      

 

alaS            AGTTTTTAAGCTT----TAGATACTT----TTCTTTTTATTGT-----------------

HI0814          AGTTTTTATTTTTATTATGGGTAACAAGGATTCTTTTTA---------------------

VC0545          AACTTGAGTATTT--CAGGAAGAGCTG---------------------------------

NMB1595         CGTTTTCCCTGCTTTTACCATGACAAGACATTTGAGAGACATTGAAAAAAT---------

CC2529          CGCTCGCCGGACCGGTATTCAAGACTTTGAGACGCTC-----------------------

XF0124          GGCCCCCGTTTCCGTCATGACTGAGTGATATCTATAGTGATAAGGGCACACTTTCGTTTG

                                                                           

 

alaS            ------------------------------------------------------------

HI0814          ------------------------------------------------------------

VC0545          ------------------------------------------------------------

NMB1595         ------------------------------------------------------------

CC2529          ------------------------------------------------------------

XF0124          CTTTGTTGCTCCGGTCTGGTTTCTGGATTTCGGCGATTTGCTGCGTTGTTGCCTATAACG

                                                                           

 

alaS            ------

HI0814          ------

VC0545          ------

NMB1595         ------

CC2529          ------

XF0124          CAGAGG

 

DIALIGN produced similar results, managing to align only a small stretch of T/A.

http://www.genomatix.de/cgi-bin/dialign/dialign.pl?SHOW=user_17_3.seq_67215.html&TASK=dialign