Definition | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 chromosome, complete genome. |
---|---|
Accession | NC_011094 |
Length | 4,709,075 |
Click here to switch to the map view.
The map label for this gene is hpaA [C]
Identifier: 194737210
GI number: 194737210
Start: 1154095
End: 1154991
Strand: Direct
Name: hpaA [C]
Synonym: SeSA_A1172
Alternate gene names: 194737210
Gene position: 1154095-1154991 (Clockwise)
Preceding gene: 194736359
Following gene: 194736694
Centisome position: 24.51
GC content: 52.17
Gene sequence:
>897_bases ATGTGCCAACGTGCGATCGCCAATATTGATATCAGCAAAGAGTATGACGAAAGCATGGGCAGTAACGATGTGCATTATCA GTCGTTTGCTCGTATGGCGGATTTCTTTGGTCGTGATATGCAGGCGCATCGCCACGACCAGTTTTTTCAAATGCACTTTC TTGATACCGGGCAGATTGAGCTACAGCTCGACGATCATCGCTATTCGGTGCAGGCGCCGCTATTTGTGCTTACGCCGCCC TCGGTGCCGCATGCTTTTATTACCGAATCGGATAGCGATGGCCATGTTCTGACGGTACGCGAAGAGCTGGTTTGGCCGCT GCTGGAAGTGCTTTATCCCGGCACCAGAGAGGCCTTTGGCCTGCCGGGAATCTGCCTGTCGCTGGCGGATAAACCCAACG AGCTGGCGGCGCTCAAACATTACTGGCAGCTAATTGAGCGGGAGTCCACGGAACAACTGGCTGGCTGCGAACATACCCTG GTGCTACTGGCGCAGGCGGTATTTACCTTGCTGTTGCGTAATGCGAAGCTGGACGATCACGCCGCAACCGGGATGCGCGG TGAACTGAAACTTTTTCAGCGCTTTACCCTGTTAATTGACAACCACTTCCATCAGCACTGGACGGTGCCCGATTATGCCT GCGAGCTGCATATTACCGAATCTCGTTTGACCGATATTTGCCGACGTTTTGCTAATCGCCCGCCTAAACGCCTGATTTTT GATCGGCAATTACGCGAGGCGAAACGACTGCTGCTTTTTTCCGACAATGCTGTCAACGAGATCGCCTGGCAATTAGGTTT TAAAGATCCGGCTTATTTCGCCCGTTTCTTTAATCGCCTTGCTGGCTGTTCTCCTTCGCAGTTTCGCCAACGTGAAGTTC CCTCTTTTCTCAACTAA
Upstream 100 bases:
>100_bases ACTCTGGTTTGTCGCTTCTCTGTTAGTCGTCGGCGCTGCCATTATCTGGCTCATTCCCATGAAAGCATCGCGTCCGCGCG CCACCCCTTGAGGAGAAACT
Downstream 100 bases:
>100_bases GAAGAGTAAAAACATGATGAAAAAAAGCGTCGCTATGCTGGCGGTTTGTATGCTGGCGCAAAGCCACCTTGCCATTGCTG CCGGTACTCCTGCGCCTCAA
Product: 4-hydroxyphenylacetate catabolism regulatory protein HpaA
Products: NA
Alternate protein names: AraC Family Transcriptional Regulator; Transcriptional Regulator; Helix-Turn-Helix Domain-Containing Protein; 4-Hydroxyphenylacetate Catabolism Regulatory Protein HpaA; Transcriptional Regulator PobR; Transcriptional Regulatory Protein; Regulator Of 4HPA-Hydroxylase Operon; Helix-Turn-Helix- Domain Containing Protein AraC Type; Transcriptional Regulator AraC Family Protein; PobR Regulator; AraC-Family Transcriptional Regulator; Transcriptional Regulator Arac Family; Transcriptional Regulator Pobr Arac Family Protein; AraC Protein; AraC Family Regulatory Protein; AraC Family Transcriptional Regulatory Protein; Transcriptional Regulatory Protein AraC-Type; AraC Family Transcription Regulator; AraC/XylS Family Transcription Factor; AraC-Type Regulatory Protein; Bacterial Regulatory Helix-Turn-Helix Protein; Helix-Turn-Helix AraC Type; Regulator Of 4hpa-Hydroxylase Operon; Transcriptional Regulator Protein; AraC Family Transcription Regulator Protein
Number of amino acids: Translated: 298; Mature: 298
Protein sequence:
>298_residues MCQRAIANIDISKEYDESMGSNDVHYQSFARMADFFGRDMQAHRHDQFFQMHFLDTGQIELQLDDHRYSVQAPLFVLTPP SVPHAFITESDSDGHVLTVREELVWPLLEVLYPGTREAFGLPGICLSLADKPNELAALKHYWQLIERESTEQLAGCEHTL VLLAQAVFTLLLRNAKLDDHAATGMRGELKLFQRFTLLIDNHFHQHWTVPDYACELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNEIAWQLGFKDPAYFARFFNRLAGCSPSQFRQREVPSFLN
Sequences:
>Translated_298_residues MCQRAIANIDISKEYDESMGSNDVHYQSFARMADFFGRDMQAHRHDQFFQMHFLDTGQIELQLDDHRYSVQAPLFVLTPP SVPHAFITESDSDGHVLTVREELVWPLLEVLYPGTREAFGLPGICLSLADKPNELAALKHYWQLIERESTEQLAGCEHTL VLLAQAVFTLLLRNAKLDDHAATGMRGELKLFQRFTLLIDNHFHQHWTVPDYACELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNEIAWQLGFKDPAYFARFFNRLAGCSPSQFRQREVPSFLN >Mature_298_residues MCQRAIANIDISKEYDESMGSNDVHYQSFARMADFFGRDMQAHRHDQFFQMHFLDTGQIELQLDDHRYSVQAPLFVLTPP SVPHAFITESDSDGHVLTVREELVWPLLEVLYPGTREAFGLPGICLSLADKPNELAALKHYWQLIERESTEQLAGCEHTL VLLAQAVFTLLLRNAKLDDHAATGMRGELKLFQRFTLLIDNHFHQHWTVPDYACELHITESRLTDICRRFANRPPKRLIF DRQLREAKRLLLFSDNAVNEIAWQLGFKDPAYFARFFNRLAGCSPSQFRQREVPSFLN
Specific function: Unknown
COG id: COG2207
COG function: function code K; AraC-type DNA-binding domain-containing proteins
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 34550; Mature: 34550
Theoretical pI: Translated: 6.39; Mature: 6.39
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein) 2.0 %Met (Translated Protein) 4.0 %Cys+Met (Translated Protein) 2.0 %Cys (Mature Protein) 2.0 %Met (Mature Protein) 4.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MCQRAIANIDISKEYDESMGSNDVHYQSFARMADFFGRDMQAHRHDQFFQMHFLDTGQIE CCCHHHHCCCCCCHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHEEEEEECCCEEE LQLDDHRYSVQAPLFVLTPPSVPHAFITESDSDGHVLTVREELVWPLLEVLYPGTREAFG EEECCCEEEEECCEEEECCCCCCCEEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCHHCC LPGICLSLADKPNELAALKHYWQLIERESTEQLAGCEHTLVLLAQAVFTLLLRNAKLDDH CCHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCH AATGMRGELKLFQRFTLLIDNHFHQHWTVPDYACELHITESRLTDICRRFANRPPKRLIF HHCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHCCCHHHHHH DRQLREAKRLLLFSDNAVNEIAWQLGFKDPAYFARFFNRLAGCSPSQFRQREVPSFLN HHHHHHHHHEEEECCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHCCCCCCC >Mature Secondary Structure MCQRAIANIDISKEYDESMGSNDVHYQSFARMADFFGRDMQAHRHDQFFQMHFLDTGQIE CCCHHHHCCCCCCHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHEEEEEECCCEEE LQLDDHRYSVQAPLFVLTPPSVPHAFITESDSDGHVLTVREELVWPLLEVLYPGTREAFG EEECCCEEEEECCEEEECCCCCCCEEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCHHCC LPGICLSLADKPNELAALKHYWQLIERESTEQLAGCEHTLVLLAQAVFTLLLRNAKLDDH CCHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCH AATGMRGELKLFQRFTLLIDNHFHQHWTVPDYACELHITESRLTDICRRFANRPPKRLIF HHCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHCCCHHHHHH DRQLREAKRLLLFSDNAVNEIAWQLGFKDPAYFARFFNRLAGCSPSQFRQREVPSFLN HHHHHHHHHEEEECCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA