Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is hpaA [C]
Identifier: 157163793
GI number: 157163793
Start: 4586007
End: 4586897
Strand: Reverse
Name: hpaA [C]
Synonym: EcHS_A4577
Alternate gene names: 157163793
Gene position: 4586897-4586007 (Counterclockwise)
Preceding gene: 157163794
Following gene: 157163792
Centisome position: 98.78
GC content: 52.64
Gene sequence:
>891_bases ATGTGTGACCGTCAGATTGCCAATATTGATATCAGCAAAGAGTACGATGAAAGCCTGGGCACGGACGATGTGCATTATCA GTCCTTCGCCCGCATGGCGGCATTTTTTGGCCGCCATATGCTGCCACATCGCCACGAACAGTACTTTCAGATGCATTTCC TCAATAGCGGACAGATTGAGCTACAGCTTGACGATCATCGCTACTCGGTGGAAGCGCCCCTGTTTGTCCTGACGCCGCCG TCAGTACCTCATGCGTTTATTACGGAGTCTGACGCCGACGGTCATGTGTTGACGGTACGGGAAGATCTGATCTGGCCCCT GCTGGAAGTTCTTTATCCAGGCACTCGGGAAACCTTCGGCCTGCCGGGGATTTGCCTGTCACTGGCAGATAAACCCGACG AACTGGCGGCGCTGGAACACTATTGGCAACTGATAGAGCGGGAATCGGTAGAACAACTGCCTGGACGGGAACACACCCTG ACGTTACTGGCACAGGCAGTGTTCACCCTACTGCTGCGTAACGCAAAACTCGACGACCATGCCGCCAGCGGAATGCGCGG AGAATTAAAACTGTTCCAGCGTTTTCATATGCTGATTGAAAGCCATTTTCATCAGCACTGGACAGTACCGGATTACGCTA ACGAACTGCATATCACCGAATCACGCCTCACGGACATCTGCCGCCGCTTTGCCAACCGTCCGCCAAAACGGTTGATTTTC GACAGGCAGCTGCGAGAAGCCAAGCGGCTGCTGCTGTTTTCTGATAACGCCGTGAACAATATTGCCTGGCAACTCGGTTT TAAGGATCCAGCTTATTTTGCGCGCTTTTTTAATCGCTTAGTCGGTTGCTCGCCCAGTGCTTATCGTGCCAAAAAAGTAC CTGTGACGTGA
Upstream 100 bases:
>100_bases ATTGTGGTTTGTTGCCGCGCTGCTGGTGATTGGTGCGGGGATTATCTGGGCAATTCCAATGCAGTCCTCCCGTCCGCGAG CGACCCCGTAAGGAACGACG
Downstream 100 bases:
>100_bases AAATCCCTTAGCCTTAACGGGAAACCAGGCACCACCTGCTATTCCCCTTTCTCCGCCTGGAGCAAGGGGATTTAGTAACG TCAAAAATCGCTAAAAGCGA
Product: 4-hydroxyphenylacetate catabolism regulatory protein HpaA
Products: NA
Alternate protein names: Transcriptional Regulator AraC Family; Transcriptional Regulator; Helix-Turn-Helix Domain-Containing Protein; 4-Hydroxyphenylacetate Catabolism Regulatory Protein HpaA; Transcriptional Regulator PobR; Transcriptional Regulatory Protein; Regulator Of 4HPA-Hydroxylase Operon; AraC-Family Transcriptional Regulator; Helix-Turn-Helix- Domain Containing Protein AraC Type; PobR Regulator; AraC Family Transcription Regulator; Transcriptional Regulator AraC Family Protein; Transcriptional Regulator Arac Family; Transcriptional Regulator Pobr Arac Family Protein; AraC Protein; AraC Family Regulatory Protein; Transcriptional Regulatory Protein AraC-Type; Regulatory Protein; AraC/XylS Family Transcription Factor; AraC-Type Regulatory Protein; Helix-Turn-Helix AraC Type; Regulator Of 4hpa-Hydroxylase Operon; Transcriptional Regulator Protein; AraC Family Transcription Regulator Protein
Number of amino acids: Translated: 296; Mature: 296
Protein sequence:
>296_residues MCDRQIANIDISKEYDESLGTDDVHYQSFARMAAFFGRHMLPHRHEQYFQMHFLNSGQIELQLDDHRYSVEAPLFVLTPP SVPHAFITESDADGHVLTVREDLIWPLLEVLYPGTRETFGLPGICLSLADKPDELAALEHYWQLIERESVEQLPGREHTL TLLAQAVFTLLLRNAKLDDHAASGMRGELKLFQRFHMLIESHFHQHWTVPDYANELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNNIAWQLGFKDPAYFARFFNRLVGCSPSAYRAKKVPVT
Sequences:
>Translated_296_residues MCDRQIANIDISKEYDESLGTDDVHYQSFARMAAFFGRHMLPHRHEQYFQMHFLNSGQIELQLDDHRYSVEAPLFVLTPP SVPHAFITESDADGHVLTVREDLIWPLLEVLYPGTRETFGLPGICLSLADKPDELAALEHYWQLIERESVEQLPGREHTL TLLAQAVFTLLLRNAKLDDHAASGMRGELKLFQRFHMLIESHFHQHWTVPDYANELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNNIAWQLGFKDPAYFARFFNRLVGCSPSAYRAKKVPVT >Mature_296_residues MCDRQIANIDISKEYDESLGTDDVHYQSFARMAAFFGRHMLPHRHEQYFQMHFLNSGQIELQLDDHRYSVEAPLFVLTPP SVPHAFITESDADGHVLTVREDLIWPLLEVLYPGTRETFGLPGICLSLADKPDELAALEHYWQLIERESVEQLPGREHTL TLLAQAVFTLLLRNAKLDDHAASGMRGELKLFQRFHMLIESHFHQHWTVPDYANELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNNIAWQLGFKDPAYFARFFNRLVGCSPSAYRAKKVPVT
Specific function: Unknown
COG id: COG2207
COG function: function code K; AraC-type DNA-binding domain-containing proteins
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 34354; Mature: 34354
Theoretical pI: Translated: 6.61; Mature: 6.61
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein) 2.0 %Met (Translated Protein) 3.4 %Cys+Met (Translated Protein) 1.4 %Cys (Mature Protein) 2.0 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MCDRQIANIDISKEYDESLGTDDVHYQSFARMAAFFGRHMLPHRHEQYFQMHFLNSGQIE CCCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEE LQLDDHRYSVEAPLFVLTPPSVPHAFITESDADGHVLTVREDLIWPLLEVLYPGTRETFG EEECCCEEEECCCEEEECCCCCCCCEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCC LPGICLSLADKPDELAALEHYWQLIERESVEQLPGREHTLTLLAQAVFTLLLRNAKLDDH CCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCHH AASGMRGELKLFQRFHMLIESHFHQHWTVPDYANELHITESRLTDICRRFANRPPKRLIF HHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHHHHHHHHHHCCCHHHHHH DRQLREAKRLLLFSDNAVNNIAWQLGFKDPAYFARFFNRLVGCSPSAYRAKKVPVT HHHHHHHHHEEEECCCCCCHHEEEECCCCHHHHHHHHHHHHCCCCCHHCCCCCCCC >Mature Secondary Structure MCDRQIANIDISKEYDESLGTDDVHYQSFARMAAFFGRHMLPHRHEQYFQMHFLNSGQIE CCCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEE LQLDDHRYSVEAPLFVLTPPSVPHAFITESDADGHVLTVREDLIWPLLEVLYPGTRETFG EEECCCEEEECCCEEEECCCCCCCCEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCC LPGICLSLADKPDELAALEHYWQLIERESVEQLPGREHTLTLLAQAVFTLLLRNAKLDDH CCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCHH AASGMRGELKLFQRFHMLIESHFHQHWTVPDYANELHITESRLTDICRRFANRPPKRLIF HHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHHHHHHHHHHCCCHHHHHH DRQLREAKRLLLFSDNAVNNIAWQLGFKDPAYFARFFNRLVGCSPSAYRAKKVPVT HHHHHHHHHEEEECCCCCCHHEEEECCCCHHHHHHHHHHHHCCCCCHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA