Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is entS [H]

Identifier: 157160087

GI number: 157160087

Start: 658381

End: 659631

Strand: Direct

Name: entS [H]

Synonym: EcHS_A0642

Alternate gene names: 157160087

Gene position: 658381-659631 (Clockwise)

Preceding gene: 157160083

Following gene: 157160089

Centisome position: 14.18
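
The centisome value appears consistent with the gene start coordinate expressed as a percentage of the genome length given in the header. A minimal sketch under that assumption follows; the function names are illustrative and not part of the record.

```python
def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive start/end coordinates."""
    return end - start + 1


# Coordinates and genome length are taken from this record.
print(round(centisome(658381, 4643538), 2))
print(gene_length(658381, 659631))
```

With the coordinates listed here, the first value rounds to the reported 14.18 and the second matches the 1251-base gene sequence.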

GC content: 58.03
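
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below assumes the reported figure was computed over the gene sequence of this record; the helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input for illustration; paste the full gene sequence to check the record.
print(gc_content("ATGC"))
```

For a 1251-base gene, a value of 58.03 would correspond to 726 G or C bases.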

Gene sequence:

>1251_bases
ATGAATAAACAATCCTGGCTGCTTAACCTCAGCCTGTTGAAAACGCACCCGGCGTTTCGCGCAGTATTCCTCGCTCGTTT
CATCTCAATTGTGTCTCTGGGTTTGCTCGGCGTCGCGGTGCCGGTGCAGATCCAGATGATGACACATTCCACCTGGCAGG
TGGGGCTTTCGGTGACGCTGACCGGCGGCGCGATGTTTGTTGGCCTGATGGTCGGCGGTGTGCTGGCGGATCGCTATGAG
CGCAAAAAAGTGATTTTGCTGGCGCGCGGCACCTGTGGCATTGGCTTCATTGGACTGTGCCTTAATGCACTGCTGCCGGA
GCCGTCATTGCTGGCAATCTATTTACTTGGTTTATGGGATGGTTTTTTCGCATCGCTTGGCGTTACGGCGCTATTGGCGG
CGACACCAGCACTGGTAGGGCGTGAAAACTTAATGCAGGCCGGGGCGATCACCATGTTGACCGTGCGTCTGGGGTCGGTG
ATTTCGCCCATGATTGGCGGTTTATTGCTGGCGACCGGTGGCGTAGCCTGGAACTACGGGCTGGCGGCGGCGGGCACGTT
TATTACCTTGCTACCGTTGTTAAGCCTTCCGGCGTTGCCACCGCCACCGCAGCCGCGTGAGCATCCGTTGAAATCATTAC
TGGCAGGATTTCGTTTTCTGCTTGCCAGCCCGCTGGTGGGCGGGATTGCGCTGCTGGGTGGTTTATTGACGATGGCGAGC
GCGGTGCGGGTACTGTATCCGGCGCTGGCTGACAACTGGCAGATGTCAGCGGCACAGATTGGTTTTCTCTACGCGGCGAT
CCCGCTCGGCGCGGCTATTGGTGCGTTAACCAGCGGGAAGCTGGCACATAGTGCGCGACCAGGGTTATTGATGCTGCTCT
CCACGCTGGGATCGTTCCTCGCCATTGGTCTGTTTGGCCTGATGCCGATGTGGATTTTAGGCGTGGTTTGTCTGGCGCTG
TTCGGCTGGTTGAGTGCGGTCAGCTCGTTGCTGCAATACACAATGCTGCAAACGCAAACCCCGGAAGCGATGTTAGGGCG
GATTAACGGTTTGTGGACGGCGCAGAACGTGACGGGCGATGCCATAGGCGCGGCGCTGCTGGGTGGTTTGGGCGCGATGA
TGACACCGGTTGCTTCCGCAAGCGCGAGCGGTTTTGGTTTGTTGATTATCGGCGTGTTGTTATTGCTGGTGCTGGTGGAG
TTGCGACATTTTCGCCAGACGCCGCCGCAGGTGACAGCGTCCGACAGTTAA

Upstream 100 bases:

>100_bases
ATGATAATGAAATTAATTATCGTTATCGATCTTATTTGGATATGTTAGCATGTGCAGCCTAAGAATAGGTATTTAAAATA
TTTGATGGCAAGGCATTGTA

Downstream 100 bases:

>100_bases
TGCTTAAAACAGCGCCTTAAGCCTATCCAGCACTTGCATGGCGCTGTAGTAATCCAGACGGAACGTCTCGGTTCCCAGCG
CATAAACCTGCTTGTTTTGT

Product: enterobactin exporter EntS

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 416; Mature: 416

Protein sequence:

>416_residues
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFVGLMVGGVLADRYE
RKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSV
ISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFLAIGLFGLMPMWILGVVCLAL
FGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVE
LRHFRQTPPQVTASDS
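
The 416-residue protein follows from the 1251-base gene sequence: 417 codons, the last of which is a TAA stop. A minimal sketch of that translation, assuming the standard codon assignments (the names `CODON_TABLE` and `translate` are illustrative only):

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATAAACAATCCTGG"))  # first six codons of the gene sequence
print(translate("GCGTCCGACAGTTAA"))     # last five codons, ending in the stop
```

The two fragments reproduce the first six and last four residues of the protein sequence shown above.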

Sequences:

>Translated_416_residues
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFVGLMVGGVLADRYE
RKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSV
ISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFLAIGLFGLMPMWILGVVCLAL
FGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVE
LRHFRQTPPQVTASDS
>Mature_416_residues
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFVGLMVGGVLADRYE
RKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSV
ISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFLAIGLFGLMPMWILGVVCLAL
FGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVE
LRHFRQTPPQVTASDS

Specific function: Exports the siderophore enterobactin out of the cell [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the major facilitator superfamily. EntS (TC 2.A.1.38) family [H]

Homologues:

Organism=Escherichia coli, GI1786806, Length=416, Percent_Identity=100, Blast_Score=797, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR020846
- InterPro:   IPR010290
- InterPro:   IPR016196 [H]

Pfam domain/function: PF05977 DUF894 [H]

EC number: NA

Molecular weight: Translated: 43283; Mature: 43283

Theoretical pI: Translated: 10.28; Mature: 10.28

Prosite motif: PS50850 MFS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)
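
These percentages are residue counts divided by the protein length. A minimal sketch, assuming one-decimal rounding; `residue_percent` is a hypothetical helper, and the stand-in string below only mimics the composition counted in the protein sequence of this record (3 Cys and 17 Met in 416 residues).

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


# Synthetic stand-in with the same Cys/Met counts and length as the record.
stand_in = "C" * 3 + "M" * 17 + "A" * 396
print(round(residue_percent(stand_in, "C"), 1))
print(round(residue_percent(stand_in, "M"), 1))
print(round(residue_percent(stand_in, "CM"), 1))
```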

Secondary structure:

>Translated Secondary Structure
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTL
CCCCHHEEEHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCEEEEEEEEE
TGGAMFVGLMVGGVLADRYERKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWD
CCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH
GFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYG
HHHHHHHHHHHHHHHHHHHCHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCC
LAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
HHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFL
HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHHHHHHH
AIGLFGLMPMWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCEEECCCCHH
AIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVELRHFRQTPPQVTASDS
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
>Mature Secondary Structure
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTL
CCCCHHEEEHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCEEEEEEEEE
TGGAMFVGLMVGGVLADRYERKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWD
CCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH
GFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYG
HHHHHHHHHHHHHHHHHHHCHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCC
LAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
HHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFL
HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHHHHHHHH
AIGLFGLMPMWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCEEECCCCHH
AIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVELRHFRQTPPQVTASDS
HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
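
The per-residue prediction strings above can be summarized as state fractions. The sketch below assumes the conventional reading of the three letters (H for helix, E for strand, C for coil), which the record itself does not state; `state_fractions` is an illustrative name.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Percentage of residues assigned to each secondary-structure state."""
    counts = Counter(states.upper())
    total = sum(counts.values())
    return {state: 100.0 * n / total for state, n in counts.items()}


# First prediction line of the record (60 residues).
first_line = "CCCCHHEEEHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCEEEEEEEEE"
print(state_fractions(first_line))
```

Concatenating all prediction lines before calling the function gives the composition for the whole protein.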

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA