Definition: Mesorhizobium loti MAFF303099 plasmid pMLb, complete sequence.
Accession: NC_002682
Length: 208,315 bp

The map label for this gene is trbE [H]

Identifier: 13488457

GI number: 13488457

Start: 77842

End: 80292

Strand: Reverse

Name: trbE [H]

Synonym: mll9608

Alternate gene names: 13488457

Gene position: 80292-77842 (Counterclockwise)
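
Because the gene lies on the reverse strand, its coding sequence is the reverse complement of plasmid positions 77842 to 80292. The sketch below shows one way to extract such a gene, assuming 1-based inclusive coordinates. The function names and the toy replicon are illustrative and are not part of this record.

```python
# Hypothetical helpers; coordinates are assumed to be 1-based and inclusive.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(replicon: str, start: int, end: int, strand: str) -> str:
    """Slice start..end and orient the result 5'->3' for the given strand."""
    region = replicon[start - 1:end]
    return reverse_complement(region) if strand == "Reverse" else region


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate pair."""
    return end - start + 1


toy_replicon = "AACCGTTA"  # made-up sequence, for illustration only
print(extract_gene(toy_replicon, 3, 6, "Reverse"))  # ACGG
print(gene_length(77842, 80292))                    # 2451
```

The length of 2451 agrees with the ">2451_bases" header of the gene sequence in this record.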

Preceding gene: 13488458

Following gene: 13488456

Centisome position: 38.54
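
The centisome value appears consistent with expressing coordinate 80292 (the 5' end of this reverse-strand gene) as a percentage of the 208,315-base plasmid. The record does not state its formula, so the short Python sketch below is an assumption that happens to reproduce the listed figure; the function name is illustrative.

```python
def centisome(position: int, replicon_length: int) -> float:
    """Position as a percentage of replicon length, rounded to two decimals."""
    return round(100.0 * position / replicon_length, 2)


print(centisome(80292, 208315))  # 38.54
```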

GC content: 64.95
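
GC content is conventionally the percentage of G and C bases in a sequence. The following Python sketch computes it from FASTA-style text such as the gene sequence listed in this record. The function name is illustrative, and the figure of 64.95 is not re-derived here.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G+C bases in FASTA-style text, ignoring '>' header lines."""
    bases = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("no sequence found")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


print(gc_percent(">toy\nATGC\nGGCC\n"))  # 75.0
```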

Gene sequence:

>2451_bases
ATGATGAGCCTTGCCGAATACCGCCGGACCGCCGCCCGCCTTGCCGACTACCTGCCTTGGGTCGCTCTCGTCGCACCGGG
CGTGGTGCTCAACAAGGACGGAAGTTTTCAGCGCACCGCGCGGTTTCGCGGGCCCGATCTCGACTCCGCCGTGGCGGCGG
AACTTGTGGCGGCGGCATCGCGCATCAACAATGCCTTTCGTCGCCTCGGCTCGGGCTGGAGCATCTTCGTTGAAGCGCAG
CGGCACGAAGCGGCTTCCTATCCCGAGAGCCAGTTCCCCAATGCGGCTTCCGGCCTGCTCGATGCCGAGCGCAAGGCCGA
CTTCGACGAGGCGGGGGTTCATTTCGTATCGAGCTACTTTCTCACCTTCCTGTTCCTGCCGCCGCCCGAAGACGCCACGC
GCGCCGAGGGATGGTTCTACGAGGGGCGCGACCACGCGGGCGCGGATCCGGGCGAGATCGTGCGCGCTTTTACCGATCGG
ACCGGGCGCGTTCTGGCGCTGCTCGACGGACTCATGCCTGAATGCCAGTGGCTCGATGATGAAGACACACTGACCTACCT
GCATTCCACGATCTCGACCAAACGCCATCGGGTGCGCGTGCCGGAAACGCCGATCTATCTCGATGCGCTGCTGGCCGACC
AACCACTTGCCGGCGGGCTCGAGCCCCGGCTTGGCGACAAGCATCTTCGCGTGCTCTCGATTGTCGGCTTCCCGACCGCC
ACGACACCCGGCCTGCTCGATGACCTCAACAGGCTGGCGTTCCCCTACCGCTGGTCGACCCGCGCGATCCTGCTCGACAA
GCCCGACGCTGTCCGCCTGTTGAACAGGATACGGCGGCAATGGTTCGCCAAGCGCAAGAGCATCGCCGCGATCCTGAAGG
AGGTGATGACCAACGAAGCGTCGGCCCTCCTCGACACCGACGCCGCCAACAAGGCGGCCGACGCCGACACGGCGTTGCAG
GAGCTTGGCGCCGACATGGTGGGCATGGCCTACGTCACCGCGAGCGTCGCCGTTTGGGATTCCGACCCGCGTGTGGCCGA
CGAAAAATTGCGGCTCGTCGAAAAGGTCATCCAGGGTCGTGACTTCACGGCAATTGCCGAGACCGTCAACGCCGTCGATG
CCTGGCTCGGTTCCTTGCCCGGGCACGTCTATGCCAATGTCCGTCAGCCTCCCATCTCAACGCTCAATCTCGCCCACATG
ATCCCGCTGTCTGCCGTGTGGGCGGGGCCGGAACGGGACGAGCATCTGGTCGGTCCTCCCCTGCTTTACGGCAAGACCGA
AGGCTCGACCCCGTTCCGGTTGGTCCTGCATGTCGGCGATGTCGGCCACACGCTGGTCGTTGGCCCCACGGGGGCCGGCA
AGTCGGTGCTGCTGGCGCTGATGGCGCTGCAGTTCTGCCGCTACCACAACGCTCAGATCTTCGCCTTCGACTTCGGCGGC
TCCATCCGCGCCGCGGCGTTCTCCATGGGCGGCGACTGGCACGATCTCGGGGGGCATCTCACCGAAGGCATGGACGCCTC
CGTTTCGCTGCAGCCGCTCGCCCGGATTCATGACACGTACGAGCGCGCCTGGGCGGCTGACTGGATCGTCGCGCTCCTGA
TGCGCGAAGGTATCCAGATCACCCCGGACGCAAAGGAGCATATCTGGGCGGCGCTGACGTCGCTCGCCTCGGCTCCTCTC
GAGGAGCGCACCCTCACCGGGCTCAGCGTGCTCTTGCAAGCCAATGACCTGAAACAGGCATTGCGCCCATACTGTTTGGG
AGGGCCTCACGGCCGATTGCTCGACGCCGAGGCCGAACACCTCGGCCAGGCATCGGTGCAGGCCTTCGAGATCGAAGGGC
TGGTGGGCGCGCAAGCCGCGCCGGCTGTCCTGTCCTATCTGTTTCATCGCGTCAGTGACCGGCTCGATGGGCGTCCAACG
CTGCTCATCATCGACGAGGGCTGGCTTGCCCTCGATGACGAGGGGTTTGCGGGGCAGTTGCGCGAGTGGCTGAAGACGCT
GCGCAAGAAGAACGCCAGCGTCGTCTTCGCCACCCAGTCACTTGCCGATATCGACAATTCGGCCATCGCGCCGGCGATCA
TCGAAAGCTGTCCAACCCGGCTTCTGTTGCCTAATGCGCGCGCGATTGAGCCGCAAATTGCCGACATCTATCGACGCTTC
GGCTTGAACGACCGCCAGATCGAAATCCTGGCGCGAGCGACACCCAAGCGCGATTACTACTGCCAGTCCCGTCGTGGCAA
CCGTCTCTTCGAGCTCGGCCTGTCGGAGGTCGGGCTCGCACTTTGCGCCGCGTCCTCGAAGAGCGATCAGGCCCTGATCA
GCGCCATCATCTCCGAGCATGGCCGCGATGGCTTCCTACCGGCCTGGCTGCATGCCCGCAATGTCGGCTGGGCCGCCGAC
CTCATTCCCAATCTCACCGTCCCTGAACCTGGAAAGGACCCCCAGCCATGA

Upstream 100 bases:

>100_bases
GCCACTGCGGTATGGGCCGCCAGGCGCGACCCGCTGTTTTTCGAGGTTGGACGCAGGCACCTGCGTCTGCCCGGGCATCT
GTCGGTTTGAGGGCGCGGTT

Downstream 100 bases:

>100_bases
CTGTTCGTCCAAGCCGCCCGCGCGCCCTTCTTCTGGTCGCCGCGATGCTCGTCGCGCCGATCGCGATTTCGCCGATGATG
ACTGCGCCAGCACATGCGTT

Product: conjugal transfer ATPase TrbE

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 816; Mature: 816
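
The residue count follows from the gene length: 2451 bases form 817 codons, and the last codon (TGA in the gene sequence above) is a stop codon that encodes no residue. A minimal Python check, with an illustrative function name:

```python
def protein_length(cds_bases: int) -> int:
    """Residues encoded by a coding sequence that ends in one stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bases // 3 - 1  # subtract the terminal stop codon


print(protein_length(2451))  # 816
```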

Protein sequence:

>816_residues
MMSLAEYRRTAARLADYLPWVALVAPGVVLNKDGSFQRTARFRGPDLDSAVAAELVAAASRINNAFRRLGSGWSIFVEAQ
RHEAASYPESQFPNAASGLLDAERKADFDEAGVHFVSSYFLTFLFLPPPEDATRAEGWFYEGRDHAGADPGEIVRAFTDR
TGRVLALLDGLMPECQWLDDEDTLTYLHSTISTKRHRVRVPETPIYLDALLADQPLAGGLEPRLGDKHLRVLSIVGFPTA
TTPGLLDDLNRLAFPYRWSTRAILLDKPDAVRLLNRIRRQWFAKRKSIAAILKEVMTNEASALLDTDAANKAADADTALQ
ELGADMVGMAYVTASVAVWDSDPRVADEKLRLVEKVIQGRDFTAIAETVNAVDAWLGSLPGHVYANVRQPPISTLNLAHM
IPLSAVWAGPERDEHLVGPPLLYGKTEGSTPFRLVLHVGDVGHTLVVGPTGAGKSVLLALMALQFCRYHNAQIFAFDFGG
SIRAAAFSMGGDWHDLGGHLTEGMDASVSLQPLARIHDTYERAWAADWIVALLMREGIQITPDAKEHIWAALTSLASAPL
EERTLTGLSVLLQANDLKQALRPYCLGGPHGRLLDAEAEHLGQASVQAFEIEGLVGAQAAPAVLSYLFHRVSDRLDGRPT
LLIIDEGWLALDDEGFAGQLREWLKTLRKKNASVVFATQSLADIDNSAIAPAIIESCPTRLLLPNARAIEPQIADIYRRF
GLNDRQIEILARATPKRDYYCQSRRGNRLFELGLSEVGLALCAASSKSDQALISAIISEHGRDGFLPAWLHARNVGWAAD
LIPNLTVPEPGKDPQP

Sequences:

>Translated_816_residues
MMSLAEYRRTAARLADYLPWVALVAPGVVLNKDGSFQRTARFRGPDLDSAVAAELVAAASRINNAFRRLGSGWSIFVEAQ
RHEAASYPESQFPNAASGLLDAERKADFDEAGVHFVSSYFLTFLFLPPPEDATRAEGWFYEGRDHAGADPGEIVRAFTDR
TGRVLALLDGLMPECQWLDDEDTLTYLHSTISTKRHRVRVPETPIYLDALLADQPLAGGLEPRLGDKHLRVLSIVGFPTA
TTPGLLDDLNRLAFPYRWSTRAILLDKPDAVRLLNRIRRQWFAKRKSIAAILKEVMTNEASALLDTDAANKAADADTALQ
ELGADMVGMAYVTASVAVWDSDPRVADEKLRLVEKVIQGRDFTAIAETVNAVDAWLGSLPGHVYANVRQPPISTLNLAHM
IPLSAVWAGPERDEHLVGPPLLYGKTEGSTPFRLVLHVGDVGHTLVVGPTGAGKSVLLALMALQFCRYHNAQIFAFDFGG
SIRAAAFSMGGDWHDLGGHLTEGMDASVSLQPLARIHDTYERAWAADWIVALLMREGIQITPDAKEHIWAALTSLASAPL
EERTLTGLSVLLQANDLKQALRPYCLGGPHGRLLDAEAEHLGQASVQAFEIEGLVGAQAAPAVLSYLFHRVSDRLDGRPT
LLIIDEGWLALDDEGFAGQLREWLKTLRKKNASVVFATQSLADIDNSAIAPAIIESCPTRLLLPNARAIEPQIADIYRRF
GLNDRQIEILARATPKRDYYCQSRRGNRLFELGLSEVGLALCAASSKSDQALISAIISEHGRDGFLPAWLHARNVGWAAD
LIPNLTVPEPGKDPQP
>Mature_816_residues
MMSLAEYRRTAARLADYLPWVALVAPGVVLNKDGSFQRTARFRGPDLDSAVAAELVAAASRINNAFRRLGSGWSIFVEAQ
RHEAASYPESQFPNAASGLLDAERKADFDEAGVHFVSSYFLTFLFLPPPEDATRAEGWFYEGRDHAGADPGEIVRAFTDR
TGRVLALLDGLMPECQWLDDEDTLTYLHSTISTKRHRVRVPETPIYLDALLADQPLAGGLEPRLGDKHLRVLSIVGFPTA
TTPGLLDDLNRLAFPYRWSTRAILLDKPDAVRLLNRIRRQWFAKRKSIAAILKEVMTNEASALLDTDAANKAADADTALQ
ELGADMVGMAYVTASVAVWDSDPRVADEKLRLVEKVIQGRDFTAIAETVNAVDAWLGSLPGHVYANVRQPPISTLNLAHM
IPLSAVWAGPERDEHLVGPPLLYGKTEGSTPFRLVLHVGDVGHTLVVGPTGAGKSVLLALMALQFCRYHNAQIFAFDFGG
SIRAAAFSMGGDWHDLGGHLTEGMDASVSLQPLARIHDTYERAWAADWIVALLMREGIQITPDAKEHIWAALTSLASAPL
EERTLTGLSVLLQANDLKQALRPYCLGGPHGRLLDAEAEHLGQASVQAFEIEGLVGAQAAPAVLSYLFHRVSDRLDGRPT
LLIIDEGWLALDDEGFAGQLREWLKTLRKKNASVVFATQSLADIDNSAIAPAIIESCPTRLLLPNARAIEPQIADIYRRF
GLNDRQIEILARATPKRDYYCQSRRGNRLFELGLSEVGLALCAASSKSDQALISAIISEHGRDGFLPAWLHARNVGWAAD
LIPNLTVPEPGKDPQP

Specific function: Unknown

COG id: COG3451

COG function: function code U; Type IV secretory pathway, VirB4 components

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the trbE/virB4 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR004346
- InterPro:   IPR018145 [H]

Pfam domain/function: PF03135 CagE_TrbE_VirB [H]

EC number: NA

Molecular weight: Translated: 89247; Mature: 89247
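
A protein's molecular weight is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The record does not say how its value was obtained, so the Python sketch below is only one plausible approach. The mass table holds standard average residue masses in daltons supplied for illustration, and the result for this protein may differ slightly from the listed 89247.

```python
# Standard average residue masses (Da); not taken from this record.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added for the N- and C-terminal groups


def average_mass(protein: str) -> float:
    """Estimated average molecular mass (Da) of an unmodified peptide chain."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein.upper()) + WATER_MASS


print(round(average_mass("GG"), 2))  # 132.12, the mass of glycylglycine
```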

Theoretical pI: Translated: 5.71; Mature: 5.71

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
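
These percentages can be reproduced by counting residues. Counting the protein sequence above gives 6 Cys and 11 Met in 816 residues, which also explains why the combined figure is 2.1 rather than the sum of the two rounded values. The Python sketch below uses illustrative function names.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    hits = sum(protein.count(aa) for aa in residues)
    return round(100.0 * hits / len(protein), 1)


def percent_from_counts(count: int, length: int) -> float:
    """Percentage for a known residue count, rounded to one decimal."""
    return round(100.0 * count / length, 1)


print(residue_percent("MCAA", "CM"))   # 50.0 (toy sequence)
print(percent_from_counts(6, 816))     # 0.7  (Cys)
print(percent_from_counts(11, 816))    # 1.3  (Met)
print(percent_from_counts(17, 816))    # 2.1  (Cys+Met)
```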

Secondary structure:

>Translated Secondary Structure
MMSLAEYRRTAARLADYLPWVALVAPGVVLNKDGSFQRTARFRGPDLDSAVAAELVAAAS
CCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCHHHHHHHHHHHHH
RINNAFRRLGSGWSIFVEAQRHEAASYPESQFPNAASGLLDAERKADFDEAGVHFVSSYF
HHHHHHHHHCCCCEEEEEECHHHCCCCCHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHH
LTFLFLPPPEDATRAEGWFYEGRDHAGADPGEIVRAFTDRTGRVLALLDGLMPECQWLDD
HHEEECCCCCCCCCCCCCEECCCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCC
EDTLTYLHSTISTKRHRVRVPETPIYLDALLADQPLAGGLEPRLGDKHLRVLSIVGFPTA
CHHHHHHHHHHHHCHHEEECCCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEECCCCC
TTPGLLDDLNRLAFPYRWSTRAILLDKPDAVRLLNRIRRQWFAKRKSIAAILKEVMTNEA
CCCHHHHHHHHHCCCEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
SALLDTDAANKAADADTALQELGADMVGMAYVTASVAVWDSDPRVADEKLRLVEKVIQGR
HHHHCCCCCCCCCCHHHHHHHHCCHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHHHCCC
DFTAIAETVNAVDAWLGSLPGHVYANVRQPPISTLNLAHMIPLSAVWAGPERDEHLVGPP
CHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHCCHHHHCCCCCCCCCCCCCC
LLYGKTEGSTPFRLVLHVGDVGHTLVVGPTGAGKSVLLALMALQFCRYHNAQIFAFDFGG
EEECCCCCCCCEEEEEEECCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCC
SIRAAAFSMGGDWHDLGGHLTEGMDASVSLQPLARIHDTYERAWAADWIVALLMREGIQI
CCEEHHHHCCCCHHHHCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
TPDAKEHIWAALTSLASAPLEERTLTGLSVLLQANDLKQALRPYCLGGPHGRLLDAEAEH
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEECCHHHH
LGQASVQAFEIEGLVGAQAAPAVLSYLFHRVSDRLDGRPTLLIIDEGWLALDDEGFAGQL
CCHHHCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEECCCCCHHHH
REWLKTLRKKNASVVFATQSLADIDNSAIAPAIIESCPTRLLLPNARAIEPQIADIYRRF
HHHHHHHHHCCCCEEEEEHHHHHCCCCCHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHH
GLNDRQIEILARATPKRDYYCQSRRGNRLFELGLSEVGLALCAASSKSDQALISAIISEH
CCCCCEEEEEEECCCCCCCEECCCCCCEEEECCHHHHHHHHHHCCCCCHHHHHHHHHHHC
GRDGFLPAWLHARNVGWAADLIPNLTVPEPGKDPQP
CCCCCCHHHHHHCCCCCHHHHCCCCCCCCCCCCCCC
>Mature Secondary Structure
MMSLAEYRRTAARLADYLPWVALVAPGVVLNKDGSFQRTARFRGPDLDSAVAAELVAAAS
CCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCHHHHHHHHHHHHH
RINNAFRRLGSGWSIFVEAQRHEAASYPESQFPNAASGLLDAERKADFDEAGVHFVSSYF
HHHHHHHHHCCCCEEEEEECHHHCCCCCHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHH
LTFLFLPPPEDATRAEGWFYEGRDHAGADPGEIVRAFTDRTGRVLALLDGLMPECQWLDD
HHEEECCCCCCCCCCCCCEECCCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCC
EDTLTYLHSTISTKRHRVRVPETPIYLDALLADQPLAGGLEPRLGDKHLRVLSIVGFPTA
CHHHHHHHHHHHHCHHEEECCCCCHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEECCCCC
TTPGLLDDLNRLAFPYRWSTRAILLDKPDAVRLLNRIRRQWFAKRKSIAAILKEVMTNEA
CCCHHHHHHHHHCCCEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
SALLDTDAANKAADADTALQELGADMVGMAYVTASVAVWDSDPRVADEKLRLVEKVIQGR
HHHHCCCCCCCCCCHHHHHHHHCCHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHHHCCC
DFTAIAETVNAVDAWLGSLPGHVYANVRQPPISTLNLAHMIPLSAVWAGPERDEHLVGPP
CHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHCCHHHHCCCCCCCCCCCCCC
LLYGKTEGSTPFRLVLHVGDVGHTLVVGPTGAGKSVLLALMALQFCRYHNAQIFAFDFGG
EEECCCCCCCCEEEEEEECCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCC
SIRAAAFSMGGDWHDLGGHLTEGMDASVSLQPLARIHDTYERAWAADWIVALLMREGIQI
CCEEHHHHCCCCHHHHCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
TPDAKEHIWAALTSLASAPLEERTLTGLSVLLQANDLKQALRPYCLGGPHGRLLDAEAEH
CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEECCHHHH
LGQASVQAFEIEGLVGAQAAPAVLSYLFHRVSDRLDGRPTLLIIDEGWLALDDEGFAGQL
CCHHHCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEECCCCCHHHH
REWLKTLRKKNASVVFATQSLADIDNSAIAPAIIESCPTRLLLPNARAIEPQIADIYRRF
HHHHHHHHHCCCCEEEEEHHHHHCCCCCHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHH
GLNDRQIEILARATPKRDYYCQSRRGNRLFELGLSEVGLALCAASSKSDQALISAIISEH
CCCCCEEEEEEECCCCCCCEECCCCCCEEEECCHHHHHHHHHHCCCCCHHHHHHHHHHHC
GRDGFLPAWLHARNVGWAADLIPNLTVPEPGKDPQP
CCCCCCHHHHHHCCCCCHHHHCCCCCCCCCCCCCCC
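
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The Python sketch below separates the two and tallies the states. It assumes the usual reading of H, E, and C as helix, strand, and coil, which the record does not define; the function names and toy input are illustrative.

```python
from collections import Counter


def split_interleaved(block_lines):
    """Split alternating residue/state lines into two aligned strings."""
    rows = [ln.strip() for ln in block_lines]
    rows = [ln for ln in rows if ln and not ln.startswith(">")]
    if len(rows) % 2:
        raise ValueError("expected an even number of residue/state lines")
    residues = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> dict:
    """Percentage of positions assigned to each state symbol."""
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in counts.items()}


toy_block = [">Toy Secondary Structure", "MMSL", "CCHH", "AE", "HE"]
residues, states = split_interleaved(toy_block)
print(residues, states)         # MMSLAE CCHHHE
print(state_fractions(states))  # {'C': 33.3, 'H': 50.0, 'E': 16.7}
```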

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8763954 [H]