Definition Mesorhizobium loti MAFF303099 chromosome, complete genome.
Accession NC_002678
Length 7,036,071

The map label for this gene is 13474327

Identifier: 13474327

GI number: 13474327

Start: 4122839

End: 4125415

Strand: Reverse

Name: 13474327

Synonym: mll5190

Alternate gene names: NA

Gene position: 4125415-4122839 (Counterclockwise)
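
The Start and End coordinates are consistent with the 2577-base gene sequence listed below if they are read as 1-based and inclusive. A minimal sketch of that check follows; the function name gene_length is hypothetical and the coordinate convention is an assumption, not something the record states.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature, assuming 1-based inclusive coordinates.

    abs() makes the result independent of strand, so the reverse-strand
    notation 4125415-4122839 gives the same answer as 4122839-4125415.
    """
    return abs(end - start) + 1


print(gene_length(4122839, 4125415))  # 2577
```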

Preceding gene: 13474328

Following gene: 13474326

Centisome position: 58.63
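
The centisome value appears to be the gene position expressed as a percentage of the chromosome length. Assuming the position used is the first coordinate of the Gene position field (4125415, the 5' end on the reverse strand) and the length is the 7,036,071 given in the header, the figure can be reproduced with the sketch below. The function name centisome is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to two decimals."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * position / genome_length, 2)


# Assumption: the record uses the reverse-strand 5' coordinate (4125415).
print(centisome(4125415, 7036071))  # 58.63
```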

GC content: 63.33
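
GC content is presumably the percentage of G and C bases in the gene sequence. The sketch below shows one common way to compute it. The function name gc_percent is hypothetical, and whether the record was computed exactly this way (for example, how ambiguous bases were handled) is an assumption.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy input: the first three codons of the gene sequence (GTG GCG GAA).
print(gc_percent("GTGGCGGAA"))  # 66.67
```

To check the reported 63.33, pass the full 2577-base sequence with its line breaks removed.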

Gene sequence:

>2577_bases
GTGGCGGAACGGCGCGTGGCGCTGGTCATGGCCGACAACGACTACCGGCTGGTCCGGCCGCTGGCCAATCCCATCCATGA
CGGCGAGGCGATGGAAGCCGCACTTAAGAAGCTCGGCTTCGAGGTCATCCTCGAAACCAACCGGGACCTTCGGCGCATGC
GACGCGCGCTCGACGACTTTCGCGAGGATGCCAAGGGCGCCGATGTCGCGCTGGTTTATTTCTCCGGCCATGGCGTCGAA
ATCTCCGGTGACAACCGGCTGCTGCCAGTCGATGCCGACGCGTCGTCGGTCGATCAACTGGACAAGACCAGCCTGCCGCT
GGAGGAGGTGCGCGATGCCGTCGCCGCGACCGCCAAGGTCGGGCTGATCGTGCTCGATGCCTGCCGCAGCGATCCCTTCT
CGGCTAGCAGCGGCGATGGTCGCGGCGCGACCTCGCTGACCAAGGATGTCGCCGACAAGGTCAAGCCGGGCCTCGGCCGC
GTCGGGCGGGCGGAAAACATCCTGTTCGCCTTTTCGGCCGCCCCTGGCGAGACGGCCGCCGATGGAACCGGAGAGAATTC
GCCCTTCACGACGGCGCTGACCAAATATCTCGCCACGGACGGGCTCGAGATCCGCTCGGTGCTGACCTTGGTGCAGCAGG
AAGTCTATGACCTGTCGCGCGGCAAGCAGTTGCCCTATGTCGAAAGCGGCCTGCCGAAACTGTTCTTTGCCGCTGCCGCC
AAGGAACAGCTGCCAGAGCGCGAACGGCTGCTGCTCGCCATGGCCGACGTGACGCCGGAAATGCGCGGCGAGGTCGAGCA
GATCGCCAGCGATGCCGACATGCCGCTGGCGCCGCTCTATGGCGCGCTGATCAGTTCCGACGCCAGCCATCTCTCCGCCG
ACAGCCTGAACGCCCGGCTGCGCGAGGCCGCCGACGCCTTCGTCAAGGTGCGCAGCGAAATGAAGACGCTCGCCTCCGAC
GATCCGCAGGTGGCGGAATTGCGCCGGCAGGCCGAGGAACAGCTTTCACTAGGCGCCTTCGATGGCGCCCGCGCTCTTCT
GGCCAAGGCCGCCGACATCGACAATGTTTCGCGCCAGGCGCTGAAGGTCAATTTCGCCAGCCGCACCCTGTCCGAGGCCG
CGACGCGTTTTCTATCCGGTGGTGCGGCGCGCGCCGACCTCGACTACACCACGGCCATTGGTGACTTTGAAACGGTGCTG
TCGCTGTATGGTGAGGCTGGGCAGACCTCGCTCAGCCTGGAACAGGCCGACCGTCAGAGCCGCACGCTGGAGGAACTCGG
TATCCTCTATACGACGGTGGGCAATGTCGAGGCGGCCGGCCGCGCCTTTACCGCGCTGTTGACCAATCTCGAGCAGCGGT
CGCGTCAGGAAACAGACCCCAGCGTCAAGCGCGATCTCGCCATCAGCCACATCAAGCTCGCCAACATAAAGATGGTGCAA
GGCGACCTGCCAACCTCCCTAGAGCACTATGAGGCGGCCAGGGACATGCTGCAGGACCTGACCGCAAGCGTGCCGGACGA
GAAAAGCTGGCTCGGTGATCTTGCCATGGCCAATGACAAGATCGGCAACGTGCTGGCGACGCAGGGCGATGTGGGTGCGG
CGGCCAAGGCCTACCAGCAAAGCCTGTCGATCAAGCGGAAATTGGTGGATGCTCAGCCGAACAGCGCCTCCCTGCTGCGC
GACCTGACCATAACCTATGATGAAATCGGCGACCTTGCCCGGACCGCCGGGCAGTTGGACGGCGCCCAGACGGCCTTTGA
AGAGAGCCTGAGAATTCGGCTTGTGTTGGCTGAGAACAAACCCGATCCTGAGCGCCAGCGCGCGGTTTCTGTCAGCCATG
AGAGGATCGGGGATGTACTGCGCGAGCGCGGCGACGCCGCCGGGGCGCTCGTGGCCTACAGCAAGAGCCAGGCGATCGCC
GAGGAACTGGTACGCCATGACCCTAACGACACCGACTTGAAGCGCGACCTGTCGATCAGTTACGCCAAGATCGGCAACGC
GCTCAACGACCAGGAGAACTGGCCGGCCGCGCTGGCTTCCTATCAGCAAGCGCTGGCCGTGGCGCGCGAACTTGCCGCTG
ATGACCCTGGCAACACCGACTGGCAGCGCGACCTGTCGGTCTGTCTGGAAAAGGTTGCCGGTGTGCTTGATGCCCAGGGC
GATATCCGCAGCGCACTGCAAAACTACCAGGACAGTCTTGCCATTGTCGATCGGCTGACCAAGCTCGATCCAGGTAATTC
CGACTGGCAACGCGACCTGTCGATTACGCTTTCGGAAATCGGCATGCTGGAGACCAGACAGCGCCATTTCGAGGGCTCAA
GGAAGGCCTTCGAAGCCAGCCTCGGCATCCGCCAGAAACTCGCCCAGTCCGATCCGAACAATGCCATCTGGCAATTTGAT
CTAGTACAGGCCTACATCAACTATGCCTATGTGGCGAAGGATCCGAAGGCTGTGCTGACCAAGGCGCTGAACCTGACGCT
GGACCTCGACCGCACAGGCCGGCTGGCGCCAAGAGACAAACCCACGATCAAATATTTGCGCGGCCTTCTCGCCAAACTCA
ACGCCGGAAAAAAATAG
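
The gene sequence starts with GTG rather than ATG, yet the protein sequence starts with M. This fits the usual bacterial convention that an alternative start codon is read as methionine at the initiation position only. The sketch below illustrates this on the first six codons. The codon table is deliberately partial (only the codons needed here), and the names CODONS, START_CODONS and translate_prefix are hypothetical.

```python
# Partial codon table: only the codons that occur in the first 18 bases.
CODONS = {"GTG": "V", "GCG": "A", "GAA": "E", "CGG": "R", "CGC": "R"}

# Common bacterial initiation codons (an assumption about this record).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate whole codons; a start codon in first position becomes M."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
        else:
            # Codons missing from the partial table are shown as X.
            protein.append(CODONS.get(codon, "X"))
    return "".join(protein)


print(translate_prefix("GTGGCGGAACGGCGCGTG"))  # MAERRV
```

Note that the internal GTG (sixth codon) is still read as valine.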

Upstream 100 bases:

>100_bases
TAAAGAGGTGGAATGGTCGGTGGGTGACTGAGCCGGGGGCGGTTCTAAATTGCGCTTGCTGTTTGCCTTGCTGCTGACGA
TTGCGACCACCGCCACCGCG

Downstream 100 bases:

>100_bases
CCGGCTGCCGGCCCAGGATTTGGGCCGGAAATGCCCGCAAACCTTGGAAAAACCGCCTTTAGCGCCTATATCTAACGCTA
TATCAGGCGCGCGATTCGGG

Product: hypothetical protein

Products: NA

Alternate protein names: TPR Repeat-Containing Protein; Caspase Domain Protein; Peptidase C; Tetratricopeptide TPR_2 Repeat Protein; TPR Repeat-Containing Serine/Threonine Protein Kinase; Caspase-Like Domain-Containing Protein; TPR Domain-Containing Protein; Peptidase Protein; TPR Repeat Protein; Caspase Domain-Containing Protein; TPR Repeat-Containing Caspase; SEFIR Domain Protein; NB-ARC Domain-Containing Protein; Transcriptional Regulator; Tetratricopeptide Repeat-Containing Protein; GUN4-Like Family; Tetratricopeptide Repeat Protein; Caspase; Serine/Threonine Protein Kinase; Peptidylprolyl Isomerase; Caspase-1 P; ICE-Like Protease

Number of amino acids: Translated: 858; Mature: 857
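
Both residue counts follow from the 2577-base gene length: 2577 / 3 = 859 codons, one of which is the stop codon, leaving 858 translated residues. The mature count of 857 matches the mature sequence below, which lacks the leading M. A minimal sketch, with the hypothetical function name protein_lengths and the assumption that maturation here means only removal of the initiator methionine:

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one stop codon and that the mature form
    differs only by loss of the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = cds_bases // 3 - 1  # drop the stop codon
    mature = translated - 1          # drop the initiator Met
    return translated, mature


print(protein_lengths(2577))  # (858, 857)
```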

Protein sequence:

>858_residues
MAERRVALVMADNDYRLVRPLANPIHDGEAMEAALKKLGFEVILETNRDLRRMRRALDDFREDAKGADVALVYFSGHGVE
ISGDNRLLPVDADASSVDQLDKTSLPLEEVRDAVAATAKVGLIVLDACRSDPFSASSGDGRGATSLTKDVADKVKPGLGR
VGRAENILFAFSAAPGETAADGTGENSPFTTALTKYLATDGLEIRSVLTLVQQEVYDLSRGKQLPYVESGLPKLFFAAAA
KEQLPERERLLLAMADVTPEMRGEVEQIASDADMPLAPLYGALISSDASHLSADSLNARLREAADAFVKVRSEMKTLASD
DPQVAELRRQAEEQLSLGAFDGARALLAKAADIDNVSRQALKVNFASRTLSEAATRFLSGGAARADLDYTTAIGDFETVL
SLYGEAGQTSLSLEQADRQSRTLEELGILYTTVGNVEAAGRAFTALLTNLEQRSRQETDPSVKRDLAISHIKLANIKMVQ
GDLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMANDKIGNVLATQGDVGAAAKAYQQSLSIKRKLVDAQPNSASLLR
DLTITYDEIGDLARTAGQLDGAQTAFEESLRIRLVLAENKPDPERQRAVSVSHERIGDVLRERGDAAGALVAYSKSQAIA
EELVRHDPNDTDLKRDLSISYAKIGNALNDQENWPAALASYQQALAVARELAADDPGNTDWQRDLSVCLEKVAGVLDAQG
DIRSALQNYQDSLAIVDRLTKLDPGNSDWQRDLSITLSEIGMLETRQRHFEGSRKAFEASLGIRQKLAQSDPNNAIWQFD
LVQAYINYAYVAKDPKAVLTKALNLTLDLDRTGRLAPRDKPTIKYLRGLLAKLNAGKK

Sequences:

>Translated_858_residues
MAERRVALVMADNDYRLVRPLANPIHDGEAMEAALKKLGFEVILETNRDLRRMRRALDDFREDAKGADVALVYFSGHGVE
ISGDNRLLPVDADASSVDQLDKTSLPLEEVRDAVAATAKVGLIVLDACRSDPFSASSGDGRGATSLTKDVADKVKPGLGR
VGRAENILFAFSAAPGETAADGTGENSPFTTALTKYLATDGLEIRSVLTLVQQEVYDLSRGKQLPYVESGLPKLFFAAAA
KEQLPERERLLLAMADVTPEMRGEVEQIASDADMPLAPLYGALISSDASHLSADSLNARLREAADAFVKVRSEMKTLASD
DPQVAELRRQAEEQLSLGAFDGARALLAKAADIDNVSRQALKVNFASRTLSEAATRFLSGGAARADLDYTTAIGDFETVL
SLYGEAGQTSLSLEQADRQSRTLEELGILYTTVGNVEAAGRAFTALLTNLEQRSRQETDPSVKRDLAISHIKLANIKMVQ
GDLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMANDKIGNVLATQGDVGAAAKAYQQSLSIKRKLVDAQPNSASLLR
DLTITYDEIGDLARTAGQLDGAQTAFEESLRIRLVLAENKPDPERQRAVSVSHERIGDVLRERGDAAGALVAYSKSQAIA
EELVRHDPNDTDLKRDLSISYAKIGNALNDQENWPAALASYQQALAVARELAADDPGNTDWQRDLSVCLEKVAGVLDAQG
DIRSALQNYQDSLAIVDRLTKLDPGNSDWQRDLSITLSEIGMLETRQRHFEGSRKAFEASLGIRQKLAQSDPNNAIWQFD
LVQAYINYAYVAKDPKAVLTKALNLTLDLDRTGRLAPRDKPTIKYLRGLLAKLNAGKK
>Mature_857_residues
AERRVALVMADNDYRLVRPLANPIHDGEAMEAALKKLGFEVILETNRDLRRMRRALDDFREDAKGADVALVYFSGHGVEI
SGDNRLLPVDADASSVDQLDKTSLPLEEVRDAVAATAKVGLIVLDACRSDPFSASSGDGRGATSLTKDVADKVKPGLGRV
GRAENILFAFSAAPGETAADGTGENSPFTTALTKYLATDGLEIRSVLTLVQQEVYDLSRGKQLPYVESGLPKLFFAAAAK
EQLPERERLLLAMADVTPEMRGEVEQIASDADMPLAPLYGALISSDASHLSADSLNARLREAADAFVKVRSEMKTLASDD
PQVAELRRQAEEQLSLGAFDGARALLAKAADIDNVSRQALKVNFASRTLSEAATRFLSGGAARADLDYTTAIGDFETVLS
LYGEAGQTSLSLEQADRQSRTLEELGILYTTVGNVEAAGRAFTALLTNLEQRSRQETDPSVKRDLAISHIKLANIKMVQG
DLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMANDKIGNVLATQGDVGAAAKAYQQSLSIKRKLVDAQPNSASLLRD
LTITYDEIGDLARTAGQLDGAQTAFEESLRIRLVLAENKPDPERQRAVSVSHERIGDVLRERGDAAGALVAYSKSQAIAE
ELVRHDPNDTDLKRDLSISYAKIGNALNDQENWPAALASYQQALAVARELAADDPGNTDWQRDLSVCLEKVAGVLDAQGD
IRSALQNYQDSLAIVDRLTKLDPGNSDWQRDLSITLSEIGMLETRQRHFEGSRKAFEASLGIRQKLAQSDPNNAIWQFDL
VQAYINYAYVAKDPKAVLTKALNLTLDLDRTGRLAPRDKPTIKYLRGLLAKLNAGKK
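
The two records above are in FASTA-like form (a ">" header line followed by wrapped sequence lines). The sketch below shows one way to read such a block and confirm that a mature sequence equals the translated one without its first residue. The function name parse_fasta and the shortened toy records are hypothetical stand-ins, not the real 858- and 857-residue entries.

```python
def parse_fasta(text: str) -> dict[str, str]:
    """Map each header (without '>') to its sequence with line breaks removed."""
    records: dict[str, list[str]] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


# Toy stand-in for the block above, wrapped across lines on purpose.
example = """>Translated_6_residues
MAER
RV
>Mature_5_residues
AERRV
"""
records = parse_fasta(example)
print(records["Mature_5_residues"] == records["Translated_6_residues"][1:])  # True
```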

Specific function: Unknown

COG id: COG4249

COG function: function code R; Uncharacterized protein containing caspase domain

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 93190; Mature: 93059
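
The two molecular weights differ by 131, which is close to the average mass of one methionine residue (about 131.19 Da). That is consistent with the mature form lacking only the initiator Met. A small sketch of the comparison follows; the constant and function names are hypothetical, and the residue mass is a standard textbook value rather than something taken from this record.

```python
MET_RESIDUE_MASS = 131.19  # average mass of a Met residue in Da (assumed value)


def mass_difference(translated_mw: float, mature_mw: float) -> float:
    """Mass lost between the translated and mature forms."""
    return translated_mw - mature_mw


diff = mass_difference(93190, 93059)
print(diff)                               # 131
print(abs(diff - MET_RESIDUE_MASS) < 1)   # True
```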

Theoretical pI: Translated: 4.76; Mature: 4.76

Prosite motif: PS50208 CASPASE_P20

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.5 %Cys+Met (Mature Protein)
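
These percentages are presumably residue counts divided by sequence length, rounded to one decimal place. A minimal sketch under that assumption, using the hypothetical function name residue_percent and toy input:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    if not protein:
        raise ValueError("protein sequence is empty")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MACM"  # toy sequence, not the real protein
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Running it on the translated and mature sequences listed earlier should give the values in the table if the assumption holds.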

Secondary structure:

>Translated Secondary Structure
MAERRVALVMADNDYRLVRPLANPIHDGEAMEAALKKLGFEVILETNRDLRRMRRALDDF
CCCCEEEEEEECCCCHHHHHHHCCCCCCHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHH
REDAKGADVALVYFSGHGVEISGDNRLLPVDADASSVDQLDKTSLPLEEVRDAVAATAKV
HHHCCCCCEEEEEEECCCEEECCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHH
GLIVLDACRSDPFSASSGDGRGATSLTKDVADKVKPGLGRVGRAENILFAFSAAPGETAA
HEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCC
DGTGENSPFTTALTKYLATDGLEIRSVLTLVQQEVYDLSRGKQLPYVESGLPKLFFAAAA
CCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCHHHHHHHHH
KEQLPERERLLLAMADVTPEMRGEVEQIASDADMPLAPLYGALISSDASHLSADSLNARL
HHHCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHH
REAADAFVKVRSEMKTLASDDPQVAELRRQAEEQLSLGAFDGARALLAKAADIDNVSRQA
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHHHHHH
LKVNFASRTLSEAATRFLSGGAARADLDYTTAIGDFETVLSLYGEAGQTSLSLEQADRQS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHH
RTLEELGILYTTVGNVEAAGRAFTALLTNLEQRSRQETDPSVKRDLAISHIKLANIKMVQ
HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHEEEEEEEEEC
GDLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMANDKIGNVLATQGDVGAAAKAYQQ
CCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHCCEEECCCCCHHHHHHHHH
SLSIKRKLVDAQPNSASLLRDLTITYDEIGDLARTAGQLDGAQTAFEESLRIRLVLAENK
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCEEEEEEECCC
PDPERQRAVSVSHERIGDVLRERGDAAGALVAYSKSQAIAEELVRHDPNDTDLKRDLSIS
CCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHCCCCCCHHHHHCCHH
YAKIGNALNDQENWPAALASYQQALAVARELAADDPGNTDWQRDLSVCLEKVAGVLDAQG
HHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCH
DIRSALQNYQDSLAIVDRLTKLDPGNSDWQRDLSITLSEIGMLETRQRHFEGSRKAFEAS
HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
LGIRQKLAQSDPNNAIWQFDLVQAYINYAYVAKDPKAVLTKALNLTLDLDRTGRLAPRDK
HHHHHHHHCCCCCCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHCEEEECCCCCCCCCCC
PTIKYLRGLLAKLNAGKK
CHHHHHHHHHHHHCCCCC
>Mature Secondary Structure 
AERRVALVMADNDYRLVRPLANPIHDGEAMEAALKKLGFEVILETNRDLRRMRRALDDF
CCCEEEEEEECCCCHHHHHHHCCCCCCHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHH
REDAKGADVALVYFSGHGVEISGDNRLLPVDADASSVDQLDKTSLPLEEVRDAVAATAKV
HHHCCCCCEEEEEEECCCEEECCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHH
GLIVLDACRSDPFSASSGDGRGATSLTKDVADKVKPGLGRVGRAENILFAFSAAPGETAA
HEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCC
DGTGENSPFTTALTKYLATDGLEIRSVLTLVQQEVYDLSRGKQLPYVESGLPKLFFAAAA
CCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCHHHHHHHHH
KEQLPERERLLLAMADVTPEMRGEVEQIASDADMPLAPLYGALISSDASHLSADSLNARL
HHHCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHH
REAADAFVKVRSEMKTLASDDPQVAELRRQAEEQLSLGAFDGARALLAKAADIDNVSRQA
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHHHHHH
LKVNFASRTLSEAATRFLSGGAARADLDYTTAIGDFETVLSLYGEAGQTSLSLEQADRQS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHH
RTLEELGILYTTVGNVEAAGRAFTALLTNLEQRSRQETDPSVKRDLAISHIKLANIKMVQ
HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHEEEEEEEEEC
GDLPTSLEHYEAARDMLQDLTASVPDEKSWLGDLAMANDKIGNVLATQGDVGAAAKAYQQ
CCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCHHCCEEECCCCCHHHHHHHHH
SLSIKRKLVDAQPNSASLLRDLTITYDEIGDLARTAGQLDGAQTAFEESLRIRLVLAENK
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCEEEEEEECCC
PDPERQRAVSVSHERIGDVLRERGDAAGALVAYSKSQAIAEELVRHDPNDTDLKRDLSIS
CCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHCCCCCCHHHHHCCHH
YAKIGNALNDQENWPAALASYQQALAVARELAADDPGNTDWQRDLSVCLEKVAGVLDAQG
HHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCH
DIRSALQNYQDSLAIVDRLTKLDPGNSDWQRDLSITLSEIGMLETRQRHFEGSRKAFEAS
HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
LGIRQKLAQSDPNNAIWQFDLVQAYINYAYVAKDPKAVLTKALNLTLDLDRTGRLAPRDK
HHHHHHHHCCCCCCCEEHHHHHHHHHHHHHCCCCHHHHHHHHHHCEEEECCCCCCCCCCC
PTIKYLRGLLAKLNAGKK
CHHHHHHHHHHHHCCCCC
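
The prediction lines above appear to use the usual three-state alphabet: H for helix, E for strand, C for coil. That reading is an assumption, since the record does not define the letters. The sketch below summarises such a string as percentages; the function name ss_composition is hypothetical.

```python
from collections import Counter


def ss_composition(ss: str) -> dict[str, float]:
    """Percentage of H, E and C states in a secondary-structure string."""
    counts = Counter(ss.upper())
    total = sum(counts[state] for state in "HEC")
    if total == 0:
        raise ValueError("no H, E or C states found")
    return {state: round(100.0 * counts[state] / total, 1) for state in "HEC"}


print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

To summarise the full prediction, concatenate only the state lines (every second line of the block), not the residue lines.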

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA