Definition Sinorhizobium medicae WSM419 chromosome, complete genome.
Accession NC_009636
Length 3,781,904

The map label for this gene is exoP [H]

Identifier: 150396077

GI number: 150396077

Start: 912695

End: 914824

Strand: Direct

Name: exoP [H]

Synonym: Smed_0854

Alternate gene names: 150396077

Gene position: 912695-914824 (Clockwise)

Preceding gene: 150396072

Following gene: 150396081

Centisome position: 24.13
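
The gene length and centisome position can be cross-checked from the coordinates listed above. The sketch below assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state the formula, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_781_904   # Length
GENE_START = 912_695        # Start
GENE_END = 914_824          # End


def gene_length(start, end):
    """Length of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome_position(start, genome_length):
    """Start coordinate as a percentage of the chromosome length.

    Assumption: this is how the record derives its centisome value.
    """
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))
    print(centisome_position(GENE_START, GENOME_LENGTH))
```

Under that assumption, the coordinates give 2130 bases and a centisome position of 24.13, which agree with the values in this record.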

GC content: 61.78

Gene sequence:

>2130_bases
ATGTCGGGCGGGCAGCAGGATGTAGATATCGACCTCGGCGGGCTCTTCCGCGCCATCTGGCAACGGCGCGTGCGCGTCCT
GCTCGTCACTGTTGGGGCCGCTGCCGTCAGCTTCGCCGCCGCCAGGATGGTCGCTCCCGACTATCAAGGCGAGACCCGCG
TTTTGATCGAATCGCGTGAGCCCGAGTTCAGCGGCAGCAACCAAGTCTCGCAGAGCGGATCCGACCGAATGTTCGATGAA
TCGGGCATTCTCAGCCAGGTGCAGGTCCTGCGCTCGGCGGACCTGATCAAGCAGGTCGCGCGCAACATGAAACTGCACGA
ACGCGAAGAATTCGATCCCTCCGCCCGGCCTTCCGCCGCCTCCGATCTCCTGGTGATGCTGGGTCTCAAGAAGAATCCCC
TCGATCTTCCGCCCGAGGAACGGGTCCTCAAAGAGTTCAATGAGAAGCTGCAGGTCTACCAGGTCGAAAAGTCGCGTGTG
ATCGCCATCGCCTTCACTTCGAAAGATCCGAAGCTTGCGGCCGCTATTCCGAACGAAATGGCCGATGTCTATCGCTCCCT
GCAGAGCGGCGCCAAGCTCGACTCCAACTCCGATGCAAGCCGGTGGTTGGAACCGGAAATCGCCAACCTGCGCGAGAAGG
TGCGCGAGGCGGAGGCCAAGGTTGCCATTTACCGCGCGGAATCGGGTCTGCTTCCGACCGGAGAGACGCAGAATTTTGCT
ACACGCCAGCTGACCGACATTTCGACGGAGCTCGCGCGGGTGCGCGCGGAGCGTGCCAATGCGGCTGCCCGCGCCGAGGG
TATGCGGACGGCGCTCGCGGACGGTCGTCCGGCCGACACCCTTGCCGACATCGTCGGTTCGCCGATGATACAGCGTCTCA
AGGAAAGCCGCTCCAATGTTCAGTCGCAGATCGCCGATCTGTCGCCGGCGCTGCTCGACGGTCATCCGCGCTTGAAGGGG
CTCAAGTCGCAGCTGGAGGGGATCGAAGCGCAGATCCGGTCGGAGACGCGGAAGATTCTCGCGAGCCTCGAAAACGAAGC
GAAGGTCGGGCAATTGCGCGAACAGCAGCTCGTCCAGCAGCTGAACACGCTCAAGGCGCAATCGGCGCAGGCGGGCGAGG
AGGAAGTGGGACTTCGTGCGCTCGAGCGCGAGGCGGCGGCACAGCGCCAGTTGCTTGAAACTTATCTTGCCCGTTATCGC
GAGGCCACCTCGCGCACTGTGGCGAATGCAACTCCGGCCGATGCACGCGTGATTTCGCGCGCCGTCATCCCCACCAGCCA
GAGCTTCCCCAAGGTGCTGCCGATAACGATCGTCGCAGCCTTTGCCAGCTTCCTCGTCAGTTGCGTCGTTATCATGCTAA
GGGAGCTTTTCAGCGGTCGCGCGCTGAGGCCTGTTTCCATACCCGAAGCGTCGACCGCAGGTCAGCCCGCAGGAGAACCG
CCAGTTCCGGTTCTGGTCCAGCAGATTCCCGCGCCAATCCTCGTTTCGGCGATATCTGCGGATGAGGAAGAAGGCCGGAG
CGAAAACGAAGCACCGAACCATGATTTTTCAATGGAGTCGGTTGCCGAACATATTCGAGCCAAAGGCGTGCGGGTTGCCG
TCTCGGTATCGCCGGGCGGCGACGAAGGATCCACTGCGACCGTAATGCTGGCGCGCTTTCTGGCCGAAGAGGGGCAGAAA
GTCGTTCTGATCGACCTTTCCGGTTCAGCCTGCCCGACGCGGCTCATGGCGCATTCGCAGGATCTTGCCGGAGTCACGAA
TTTGCTCATGGGGGAGGTCGCCTTTTCCGAGGCGATTCACTCAGACAGCTTGTCAGAAGCACATATAATACCCTGTGGCG
ACGCCGACCCGCATGCGGCGATGCGCGGTATCGACCGGCTGCGGATCATCGTCGATGCGCTCTCCAGTGCCTACGACCTC
GTCTTGATTGAATGCGGTTCGGCTGACGCAGACGCGGTCGCCAAGTTACGACACGAGGGTACCGAGATCATCCTGTCGGC
ACCGTCGATCAGCAACGATCAGATTGTCGAGATGTTGATGCGTTTCGGCGAAGCGGGATATCGCGATGTCGTGCTTATGA
CCGGGCAGGGGCAGAAAGGTCCTGATTTTCCCGACCGCCGCGCGGCGTAG
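
A GC content figure such as the one listed above is normally the percentage of G and C bases in the sequence. The sketch below is one way to recompute it from a pasted sequence; the function name is hypothetical, and the record does not say how its own value was calculated or rounded.

```python
def gc_content(sequence):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be passed in directly (without its '>' header line).
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Toy input, not the gene above.
    print(gc_content("ATGC"))
```

Passing the 2130-base gene sequence to gc_content should give a value comparable to the GC content field, assuming the record uses the same definition.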

Upstream 100 bases:

>100_bases
CCCTTGTGCGTGACGCGGTGCGGTTCCAGGGAAATTAACGGTTGGGTTACCATGTTCGTTTACCTTCTCGCCAACTTTGA
AAGCTTTGGGGTGCAGGGGT

Downstream 100 bases:

>100_bases
AGGGCTTTCCGGGAACTGTGCTGCGGCTTCCCAGTAGGAGGCATTGGCGCCAGGTCTACGGTTCATCTGCGCTTGCAGCC
GACATCTGCCGTAACCGTCG

Product: exopolysaccharide transport protein family

Products: ADP; protein tyrosine phosphate [C]

Alternate protein names: NA

Number of amino acids: Translated: 709; Mature: 708
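
The two residue counts follow from the gene length. The sketch below assumes the 2130 bases include the terminal stop codon (the gene sequence above ends in TAG) and that the mature form differs only by loss of the initiator methionine, which is consistent with the mature sequence in this record starting at the second residue. The function name is illustrative only.

```python
def protein_lengths(n_bases, has_stop_codon=True, initiator_met_removed=True):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumptions: the stop codon is counted in n_bases, and maturation
    removes only the N-terminal methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1 if has_stop_codon else codons
    mature = translated - 1 if initiator_met_removed else translated
    return translated, mature


if __name__ == "__main__":
    print(protein_lengths(2130))
```

For 2130 bases this gives 709 translated and 708 mature residues, matching the field above.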

Protein sequence:

>709_residues
MSGGQQDVDIDLGGLFRAIWQRRVRVLLVTVGAAAVSFAAARMVAPDYQGETRVLIESREPEFSGSNQVSQSGSDRMFDE
SGILSQVQVLRSADLIKQVARNMKLHEREEFDPSARPSAASDLLVMLGLKKNPLDLPPEERVLKEFNEKLQVYQVEKSRV
IAIAFTSKDPKLAAAIPNEMADVYRSLQSGAKLDSNSDASRWLEPEIANLREKVREAEAKVAIYRAESGLLPTGETQNFA
TRQLTDISTELARVRAERANAAARAEGMRTALADGRPADTLADIVGSPMIQRLKESRSNVQSQIADLSPALLDGHPRLKG
LKSQLEGIEAQIRSETRKILASLENEAKVGQLREQQLVQQLNTLKAQSAQAGEEEVGLRALEREAAAQRQLLETYLARYR
EATSRTVANATPADARVISRAVIPTSQSFPKVLPITIVAAFASFLVSCVVIMLRELFSGRALRPVSIPEASTAGQPAGEP
PVPVLVQQIPAPILVSAISADEEEGRSENEAPNHDFSMESVAEHIRAKGVRVAVSVSPGGDEGSTATVMLARFLAEEGQK
VVLIDLSGSACPTRLMAHSQDLAGVTNLLMGEVAFSEAIHSDSLSEAHIIPCGDADPHAAMRGIDRLRIIVDALSSAYDL
VLIECGSADADAVAKLRHEGTEIILSAPSISNDQIVEMLMRFGEAGYRDVVLMTGQGQKGPDFPDRRAA

Sequences:

>Translated_709_residues
MSGGQQDVDIDLGGLFRAIWQRRVRVLLVTVGAAAVSFAAARMVAPDYQGETRVLIESREPEFSGSNQVSQSGSDRMFDE
SGILSQVQVLRSADLIKQVARNMKLHEREEFDPSARPSAASDLLVMLGLKKNPLDLPPEERVLKEFNEKLQVYQVEKSRV
IAIAFTSKDPKLAAAIPNEMADVYRSLQSGAKLDSNSDASRWLEPEIANLREKVREAEAKVAIYRAESGLLPTGETQNFA
TRQLTDISTELARVRAERANAAARAEGMRTALADGRPADTLADIVGSPMIQRLKESRSNVQSQIADLSPALLDGHPRLKG
LKSQLEGIEAQIRSETRKILASLENEAKVGQLREQQLVQQLNTLKAQSAQAGEEEVGLRALEREAAAQRQLLETYLARYR
EATSRTVANATPADARVISRAVIPTSQSFPKVLPITIVAAFASFLVSCVVIMLRELFSGRALRPVSIPEASTAGQPAGEP
PVPVLVQQIPAPILVSAISADEEEGRSENEAPNHDFSMESVAEHIRAKGVRVAVSVSPGGDEGSTATVMLARFLAEEGQK
VVLIDLSGSACPTRLMAHSQDLAGVTNLLMGEVAFSEAIHSDSLSEAHIIPCGDADPHAAMRGIDRLRIIVDALSSAYDL
VLIECGSADADAVAKLRHEGTEIILSAPSISNDQIVEMLMRFGEAGYRDVVLMTGQGQKGPDFPDRRAA
>Mature_708_residues
SGGQQDVDIDLGGLFRAIWQRRVRVLLVTVGAAAVSFAAARMVAPDYQGETRVLIESREPEFSGSNQVSQSGSDRMFDES
GILSQVQVLRSADLIKQVARNMKLHEREEFDPSARPSAASDLLVMLGLKKNPLDLPPEERVLKEFNEKLQVYQVEKSRVI
AIAFTSKDPKLAAAIPNEMADVYRSLQSGAKLDSNSDASRWLEPEIANLREKVREAEAKVAIYRAESGLLPTGETQNFAT
RQLTDISTELARVRAERANAAARAEGMRTALADGRPADTLADIVGSPMIQRLKESRSNVQSQIADLSPALLDGHPRLKGL
KSQLEGIEAQIRSETRKILASLENEAKVGQLREQQLVQQLNTLKAQSAQAGEEEVGLRALEREAAAQRQLLETYLARYRE
ATSRTVANATPADARVISRAVIPTSQSFPKVLPITIVAAFASFLVSCVVIMLRELFSGRALRPVSIPEASTAGQPAGEPP
VPVLVQQIPAPILVSAISADEEEGRSENEAPNHDFSMESVAEHIRAKGVRVAVSVSPGGDEGSTATVMLARFLAEEGQKV
VLIDLSGSACPTRLMAHSQDLAGVTNLLMGEVAFSEAIHSDSLSEAHIIPCGDADPHAAMRGIDRLRIIVDALSSAYDLV
LIECGSADADAVAKLRHEGTEIILSAPSISNDQIVEMLMRFGEAGYRDVVLMTGQGQKGPDFPDRRAA

Specific function: Required for synthesis of the extracellular polysaccharide colanic acid. The autophosphorylated form is inactive. Probably involved in the export of colanic acid from the cell to the medium. [C]

COG id: COG3206

COG function: function code M; Uncharacterized protein involved in exopolysaccharide biosynthesis

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Probable) [H]

Metabolic importance: Non-essential [C]

Operon status: Not Known

Operon components: None

Similarity: To B. solanacearum epsB [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002586
- InterPro:   IPR005702
- InterPro:   IPR005700
- InterPro:   IPR003856 [H]

Pfam domain/function: PF01656 CbiA; PF02706 Wzz [H]

EC number: 2.7.1.112 [C]

Molecular weight: Translated: 76618; Mature: 76487

Theoretical pI: Translated: 5.01; Mature: 5.01

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
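
The percentages above are simple residue frequencies. The sketch below shows one way to compute them from a protein sequence, rounding to one decimal place as the record does; the function name is hypothetical.

```python
def residue_percent(sequence, residues):
    """Percentage of the sequence made up of the given one-letter residues."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)


if __name__ == "__main__":
    # Toy input, not the protein above.
    print(residue_percent("MSGGCQ", "CM"))
```

The translated sequence in this record contains 4 Cys and 17 Met in 709 residues, which corresponds to the 0.6, 2.4 and 3.0 percent figures; the mature form has one Met fewer in 708 residues.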

Secondary structure:

>Translated Secondary Structure
MSGGQQDVDIDLGGLFRAIWQRRVRVLLVTVGAAAVSFAAARMVAPDYQGETRVLIESRE
CCCCCCCCCCCHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCCCCCEEEEEECCC
PEFSGSNQVSQSGSDRMFDESGILSQVQVLRSADLIKQVARNMKLHEREEFDPSARPSAA
CCCCCCCCHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCHH
SDLLVMLGLKKNPLDLPPEERVLKEFNEKLQVYQVEKSRVIAIAFTSKDPKLAAAIPNEM
HHHHHEECCCCCCCCCCCHHHHHHHHHHHHEEEEECCCEEEEEEEECCCCCEEHHCCHHH
ADVYRSLQSGAKLDSNSDASRWLEPEIANLREKVREAEAKVAIYRAESGLLPTGETQNFA
HHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHEEEEECCCCCCCCCCCCCHH
TRQLTDISTELARVRAERANAAARAEGMRTALADGRPADTLADIVGSPMIQRLKESRSNV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHH
QSQIADLSPALLDGHPRLKGLKSQLEGIEAQIRSETRKILASLENEAKVGQLREQQLVQQ
HHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LNTLKAQSAQAGEEEVGLRALEREAAAQRQLLETYLARYREATSRTVANATPADARVISR
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
AVIPTSQSFPKVLPITIVAAFASFLVSCVVIMLRELFSGRALRPVSIPEASTAGQPAGEP
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
PVPVLVQQIPAPILVSAISADEEEGRSENEAPNHDFSMESVAEHIRAKGVRVAVSVSPGG
CHHHHHHHCCHHHHHHHHCCCHHHCCCCCCCCCCCCCHHHHHHHHHHCCEEEEEEECCCC
DEGSTATVMLARFLAEEGQKVVLIDLSGSACPTRLMAHSQDLAGVTNLLMGEVAFSEAIH
CCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
SDSLSEAHIIPCGDADPHAAMRGIDRLRIIVDALSSAYDLVLIECGSADADAVAKLRHEG
CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHCC
TEIILSAPSISNDQIVEMLMRFGEAGYRDVVLMTGQGQKGPDFPDRRAA
CEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCC
>Mature Secondary Structure 
SGGQQDVDIDLGGLFRAIWQRRVRVLLVTVGAAAVSFAAARMVAPDYQGETRVLIESRE
CCCCCCCCCCHHHHHHHHHHHHHHEEEEEHHHHHHHHHHHHHCCCCCCCCEEEEEECCC
PEFSGSNQVSQSGSDRMFDESGILSQVQVLRSADLIKQVARNMKLHEREEFDPSARPSAA
CCCCCCCCHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCHH
SDLLVMLGLKKNPLDLPPEERVLKEFNEKLQVYQVEKSRVIAIAFTSKDPKLAAAIPNEM
HHHHHEECCCCCCCCCCCHHHHHHHHHHHHEEEEECCCEEEEEEEECCCCCEEHHCCHHH
ADVYRSLQSGAKLDSNSDASRWLEPEIANLREKVREAEAKVAIYRAESGLLPTGETQNFA
HHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHEEEEECCCCCCCCCCCCCHH
TRQLTDISTELARVRAERANAAARAEGMRTALADGRPADTLADIVGSPMIQRLKESRSNV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHH
QSQIADLSPALLDGHPRLKGLKSQLEGIEAQIRSETRKILASLENEAKVGQLREQQLVQQ
HHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LNTLKAQSAQAGEEEVGLRALEREAAAQRQLLETYLARYREATSRTVANATPADARVISR
HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
AVIPTSQSFPKVLPITIVAAFASFLVSCVVIMLRELFSGRALRPVSIPEASTAGQPAGEP
HHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
PVPVLVQQIPAPILVSAISADEEEGRSENEAPNHDFSMESVAEHIRAKGVRVAVSVSPGG
CHHHHHHHCCHHHHHHHHCCCHHHCCCCCCCCCCCCCHHHHHHHHHHCCEEEEEEECCCC
DEGSTATVMLARFLAEEGQKVVLIDLSGSACPTRLMAHSQDLAGVTNLLMGEVAFSEAIH
CCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHH
SDSLSEAHIIPCGDADPHAAMRGIDRLRIIVDALSSAYDLVLIECGSADADAVAKLRHEG
CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHCC
TEIILSAPSISNDQIVEMLMRFGEAGYRDVVLMTGQGQKGPDFPDRRAA
CEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: ATP; a protein tyrosine [C]

Specific reaction: ATP + a protein tyrosine = ADP + protein tyrosine phosphate [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8226645; 8226646; 8246891; 11481431 [H]