Definition | Escherichia coli IAI39 chromosome, complete genome. |
---|---|
Accession | NC_011750 |
Length | 5,132,068 |
The map label for this gene is atoE
Identifier: 218700692
GI number: 218700692
Start: 2436847
End: 2438169
Strand: Direct
Name: atoE
Synonym: ECIAI39_2361
Alternate gene names: 218700692
Gene position: 2436847-2438169 (Clockwise)
Preceding gene: 218700691
Following gene: 218700693
Centisome position: 47.48
GC content: 53.06
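The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming the Start and End values are 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values copied from the record: Start, End, and the chromosome Length.
length = gene_length(2436847, 2438169)
position = centisome(2436847, 5132068)

# A coding sequence of `length` bases holds length // 3 codons;
# the final codon is the stop codon, so it adds no residue.
residues = length // 3 - 1

print(length, round(position, 2), residues)  # 1323 47.48 440
```

Under these assumptions the computed values agree with the listed gene sequence length (1323 bases), centisome position (47.48), and number of amino acids (440).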
Gene sequence:
>1323_bases ATGATTGGTCGCATATCGCGTTTTATGACGCGTTTTGTCAGCCGGTGGCTTCCCGATCCACTGATCTTTGCCATGTTGCT GACATTGCTAACATTCGTGATCGCGCTTTGGTTAACACCACAAACGCCGATCAGCATGGTGAAAATGTGGGGTGACGGTT TCTGGAACTTGCTGGCGTTTGGTATGCAGATGGCGCTTATCATCGTTACCGGTCATGCCCTTGCCAGCTCTGCTCCGGTA AAAAGTTTGCTGCGTACTGCCGCCTCCGCCGCAAAGACGCCCGTACAGGGCGTCATGCTGGTTACTTTCTTCGGTTCAGT CGCTTGTGTCATCAACTGGGGATTTGGTTTGGTTGTCGGCGCAATGTTTGCCCGTGAAGTCGCCCGCCGAGTACCCGGTT CTGATTATCCGTTGCTCATTGCCTGCGCCTACATTGGTTTTCTCACCTGGGGTGGCGGTTTCTCTGGCTCAATGCCTCTG TTGGCTGCAACACCGGGCAACCCGGTTGAGCATATCGCCGGGCTGATCCCGGTGGGCGATACTCTGTTCAGTGGTTTTAA CATTTTCATCACTGTGGCGTTGATTGTGGTGATGCCATTTATCACCCGCATGATGATGCCAAAACCGTCTGACGTGGTGA GTATCGATCCGAAACTACTCATGGAAGAGGCTGATTTCCAAAAGCAGCTACCGAAAGATGCCCCACCATCCGAGCGACTG GAAGAAAGCCGCATTCTGACGTTGATCATCGGCGCACTCGGTATCGCTTACCTTGCGATGTACTTCAGCGAACATGGCTT CAACATCACCATCAATACCGTCAACCTGATGTTTATGATTGCGGGTCTGCTGCTACATAAAACGCCAATGGCTTATATGC GTGCTATCAGCGCGGCAGCACGCAGTACTGCCGGTATTCTGGTGCAATTCCCCTTCTACGCTGGGATCCAACTGATGATG GAGCATTCCGGTCTGGGCGGACTCATTACCGAATTCTTCATCAATGTTGCGAACAAAGACACCTTCCCGGTAATGACCTT TTTTAGTTCTGCACTGATTAACTTCGCCGTTCCGTCTGGCGGCGGTCACTGGGTTATTCAGGGACCTTTCGTGATACCCG CAGCCCAGGCGCTGGGCGCTGATCTCGGTAAATCGGTAATGGCGATCGCCTACGGCGAGCAATGGATGAACATGGCACAA CCGTTCTGGGCGCTGCCAGCACTGGCAATCGCCGGACTCGGTGTCCGCGACATCATGGGCTATTGCATCACTGCCCTGCT CTTCTCCGGCGTCATTTTCGTCATTGGTTTAACGCTGTTCTGA
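A GC content figure like the one listed for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming GC content is the percentage of G and C bases over the full gene sequence; the helper names are hypothetical, and the toy input stands in for the full sequence.

```python
def strip_fasta_header(record: str) -> str:
    """Join whitespace-separated sequence chunks, dropping '>' header tokens."""
    return "".join(tok for tok in record.split() if not tok.startswith(">"))


def gc_content(record: str) -> float:
    """Percentage of G and C bases in a sequence (header and spaces ignored)."""
    seq = strip_fasta_header(record).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy record in the same layout as the gene sequence field.
print(round(gc_content(">6_bases GGC CAT"), 2))  # 66.67
```

Passing the full gene sequence line to `gc_content` would give a value to compare against the listed GC content.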
Upstream 100 bases:
>100_bases TCACCGAAATTGCCGACGGGTGTGATTTAGCCACCGTGCGTGCCAAAACAGAAGCTCGGTTTGAAGTAGCTGCCGATCTG AATACGCAACGGGGTGATTT
Downstream 100 bases:
>100_bases CGGCAACCCTACAGACAGAAGGAATATAAAATGAAAAATTGTGTCATCGTCAGTGCGGTACGTACTGCTATCGGTAGTTT TAACGGTTCACTCGCTTCCA
Product: short chain fatty acid transporter
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 440; Mature: 440
Protein sequence:
>440_residues MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAFGMQMALIIVTGHALASSAPV KSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVGAMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPL LAATPGNPVEHIAGLIPVGDTLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAARSTAGILVQFPFYAGIQLMM EHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSGGGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQ PFWALPALAIAGLGVRDIMGYCITALLFSGVIFVIGLTLF
Sequences:
>Translated_440_residues MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAFGMQMALIIVTGHALASSAPV KSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVGAMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPL LAATPGNPVEHIAGLIPVGDTLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAARSTAGILVQFPFYAGIQLMM EHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSGGGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQ PFWALPALAIAGLGVRDIMGYCITALLFSGVIFVIGLTLF >Mature_440_residues MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAFGMQMALIIVTGHALASSAPV KSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVGAMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPL LAATPGNPVEHIAGLIPVGDTLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAARSTAGILVQFPFYAGIQLMM EHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSGGGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQ PFWALPALAIAGLGVRDIMGYCITALLFSGVIFVIGLTLF
Specific function: Responsible for the intake of short-chain fatty acids
COG id: COG2031
COG function: function code I; Short chain fatty acids transporter
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Escherichia coli, GI1788553, Length=440, Percent_Identity=100, Blast_Score=880, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): ATOE_ECOLI (P76460)
Other databases:
- EMBL: U00096
- EMBL: AP009048
- PIR: E64992
- RefSeq: AP_002822.1
- RefSeq: NP_416727.1
- ProteinModelPortal: P76460
- STRING: P76460
- EnsemblBacteria: EBESCT00000002160
- EnsemblBacteria: EBESCT00000014270
- GeneID: 946721
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW2217
- KEGG: eco:b2223
- EchoBASE: EB1622
- EcoGene: EG11671
- eggNOG: COG2031
- GeneTree: EBGT00050000012097
- HOGENOM: HBG697931
- OMA: LGFSMQM
- ProtClustDB: CLSK2301701
- BioCyc: EcoCyc:EG11671-MONOMER
- Genevestigator: P76460
- InterPro: IPR006161
- InterPro: IPR006160
- TIGRFAMs: TIGR00366
Pfam domain/function: PF02667 SCFA_trans
EC number: NA
Molecular weight: Translated: 47528; Mature: 47528
Theoretical pI: Translated: 8.46; Mature: 8.46
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
Not available (12 regions listed; coordinates missing)
Cys/Met content:
0.7 %Cys (Translated Protein)
6.1 %Met (Translated Protein)
6.8 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
6.1 %Met (Mature Protein)
6.8 %Cys+Met (Mature Protein)
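Percentages of this kind can be derived from a protein sequence by counting residues. The sketch below assumes each value is the residue count divided by the total sequence length, rounded to one decimal place; the function name and the toy sequence are hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


# Toy 10-residue sequence with one Met and one Cys.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the 440-residue protein sequence, the same function would give figures to compare with the Cys, Met, and Cys+Met values listed.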
Secondary structure:
>Translated Secondary Structure
MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAF
CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCHHHHHHHH
GMQMALIIVTGHALASSAPVKSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVG
HHHHHHHHEECHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHH
AMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPLLAATPGNPVEHIAGLIPVGD
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCHHHHHHHHCCCCH
TLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCHHHH
EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAA
HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHCCHHHHHHHHHHHH
RSTAGILVQFPFYAGIQLMMEHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSG
HCCCCEEEEECHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCC
GGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQPFWALPALAIAGLGVRDIMG
CCEEEEECCEECHHHHHHHHHHCHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCHHHHHH
YCITALLFSGVIFVIGLTLF
HHHHHHHHHHHHHHHHHHCH
>Mature Secondary Structure
MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAF
CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCHHHHHHHH
GMQMALIIVTGHALASSAPVKSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVG
HHHHHHHHEECHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHH
AMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPLLAATPGNPVEHIAGLIPVGD
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCHHHHHHHHCCCCH
TLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCHHHH
EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAA
HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHCCHHHHHHHHHHHH
RSTAGILVQFPFYAGIQLMMEHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSG
HCCCCEEEEECHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCC
GGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQPFWALPALAIAGLGVRDIMG
CCEEEEECCEECHHHHHHHHHHCHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCHHHHHH
YCITALLFSGVIFVIGLTLF
HHHHHHHHHHHHHHHHHHCH
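A per-residue prediction string like the one above can be summarized as the share of each state. This is a minimal sketch, assuming the common convention that H marks helix, E marks strand, and C marks coil; the record does not define the letters, and the function name is hypothetical.

```python
from collections import Counter


def ss_composition(ss: str) -> dict[str, float]:
    """Percentage of H, E and C states in a prediction string (whitespace ignored)."""
    states = "".join(ss.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    return {state: 100.0 * counts[state] / len(states) for state in "HEC"}


# Toy prediction string: 4 helix, 2 strand, 4 coil positions.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

Only the state lines should be passed in, not the interleaved amino acid lines, since residue letters such as M or G would be rejected as unknown states.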
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503