Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is bhsA
Identifier: 157160639
GI number: 157160639
Start: 1232778
End: 1233035
Strand: Direct
Name: bhsA
Synonym: EcHS_A1235
Alternate gene names: 157160639
Gene position: 1232778-1233035 (Clockwise)
Preceding gene: 157160637
Following gene: 157160643
Centisome position: 26.55
GC content: 47.29
Gene sequence:
>258_bases ATGAAAAACGTAAAAACCCTCATCGCTGCGGCGATTTTAAGCTCCATGTCATTTGCCAGCTTTGCGGCTGTCGAAGTTCA GTCAACGCCAGAAGGCCAACAAAAAGTCGGTACAATCAGTGCTAACGCGGGGACAAATCTGGGATCGCTGGAAGAGCAGC TGGCGCAAAAAGCGGATGAGATGGGCGCAAAATCTTTCCGTATTACTTCTGTAACCGGTCCGAATACCCTCCATGGAACA GCAGTAATTTATAAATAA
Upstream 100 bases:
>100_bases TAAAAAATATCTTGTATGTGATCCAGATCACATCTATCATTTAGTTATCGATCGTTAAGTAATTGCTTGCGACGTCATTC ATCTGCATAAGGCCACTATT
Downstream 100 bases:
>100_bases GCATTAACCCTCATTAATGCCTGCTACTGCTGATTTTTTCCCCGCGACATGCCGTGTCGCGGGGATTTTTTTTATCCAGG ATTTACAGAGTTTGTGGGCT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 85; Mature: 85
Protein sequence:
>85_residues MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADEMGAKSFRITSVTGPNTLHGT AVIYK
Sequences:
>Translated_85_residues MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADEMGAKSFRITSVTGPNTLHGT AVIYK >Mature_85_residues MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADEMGAKSFRITSVTGPNTLHGT AVIYK
Specific function: May be involved in the regulation of biofilm formation
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm (Potential)
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the BhsA/McbA family
Homologues:
Organism=Escherichia coli, GI1787355, Length=85, Percent_Identity=100, Blast_Score=171, Evalue=9e-45, Organism=Escherichia coli, GI87081785, Length=86, Percent_Identity=43.0232558139535, Blast_Score=68, Evalue=1e-13,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): BHSA_ECO57 (P0AB42)
Other databases:
- EMBL: AE005174 - EMBL: BA000007 - PIR: B90815 - PIR: F85674 - RefSeq: NP_287246.1 - RefSeq: NP_309517.1 - ProteinModelPortal: P0AB42 - SMR: P0AB42 - EnsemblBacteria: EBESCT00000026174 - EnsemblBacteria: EBESCT00000059002 - GeneID: 912348 - GeneID: 959465 - GenomeReviews: AE005174_GR - GenomeReviews: BA000007_GR - KEGG: ece:Z1751 - KEGG: ecs:ECs1490 - GeneTree: EBGT00050000009057 - HOGENOM: HBG677470 - OMA: GDNQLRG - ProtClustDB: CLSK879955 - BioCyc: ECOL83334:ECS1490-MONOMER - InterPro: IPR010854
Pfam domain/function: PF07338 DUF1471
EC number: NA
Molecular weight: Translated: 8815; Mature: 8815
Theoretical pI: Translated: 9.14; Mature: 9.14
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 3.5 %Met (Translated Protein) 3.5 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 3.5 %Met (Mature Protein) 3.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADE CCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHEEEEECCCCCCCCHHHHHHHHHHHH MGAKSFRITSVTGPNTLHGTAVIYK CCCCCEEEEEECCCCCCCCEEEEEC >Mature Secondary Structure MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADE CCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHEEEEECCCCCCCCHHHHHHHHHHHH MGAKSFRITSVTGPNTLHGTAVIYK CCCCCEEEEEECCCCCCCCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796