| Definition | Azoarcus sp. BH72 chromosome, complete genome. |
|---|---|
| Accession | NC_008702 |
| Length | 4,376,040 |
The map label for this gene is hoxA [H]
Identifier: 119900094
GI number: 119900094
Start: 4164280
End: 4165737
Strand: Direct
Name: hoxA [H]
Synonym: azo3805
Alternate gene names: 119900094
Gene position: 4164280-4165737 (Clockwise)
Preceding gene: 119900093
Following gene: 119900096
Centisome position: 95.16
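The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
length = gene_length(4164280, 4165737)
position = centisome(4164280, 4376040)
codons = length // 3  # includes the terminal stop codon

print(length)              # 1458
print(codons - 1)          # 485 amino acids once the stop codon is excluded
print(round(position, 2))  # 95.16
```

Under these assumptions the results agree with the 1458-base gene sequence, the 485-residue translated protein, and the centisome position listed in this record.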
GC content: 69.82
Gene sequence:
>1458_bases ATGCCCCCGCCGCTGCCGCCCGAACTGCCGTCCATCCTCGTCGTCGACGACGAGATCCGCTCGCAGGAAGCCCTGCGCCG CACGCTGGAGGAAGACTTCGAAGTCTTCACCGCGTCCGGTGCCGACGACGCGATCGCCATCCTCGAGCGCGAATGGATAC AGATCGTGCTGTGCGACCAGCGCATGCCGGGCAGCTCCGGCGTCGCGCTGCTGCGCCAGGTGCGCGAGCGCTGGCCCGAG GCGGTGCGCATCATCATTTCCGGCTACACCGATTCGGAAGACATCATCGCCGGCATCAACGAGGCCGGCATCTACCAGTA CCTGCTCAAGCCCTGGCAGCCGGAACAGCTGCTGCTGGCGCTGAAATCCGCCGCCGAGATGGCCCGGCTGCACGCCGAGA ACCAGCGCCTGACGCTGGAACTGCGCACCGCCGCGCCGGTGCTCGAAAAACAGGTCAGCCACCGGCGCGCCAGCGTGCGC CAGCAGTTCGCGCTCGACGCCGTGCTGCGCGCGCCCGGCTCGCCGATGAACGCGGTGTGCGCGCTGGTGAAAAAGCTCGC CGCGCTCGACATCCCGGTGCTGCTGACCGGCGAATCCGGCACCGGCAAGGAACTGCTGGCGCGCGCGCTGCACTACGACA GCCCGCGCGCCGGCGAGGCCTTCGTGGTGGAGAACTGCGGCGCGCTGCCCGACCAGCTGCTGGAATCCGAACTCTTCGGC CACAAGCGCGGCGCCTTCACCGGCGCCTTCGAAGACCGCGTCGGCCTTTTCAAGCAGGCCGACGGCGGCACCATGCTGCT CGACGAGATCGGTGAAACCTCGTTCGCCTTCCAGGTGAAGCTGCTGCGCGCGCTGCAGGAAGGCGAGGTGCGGCCGGTGG GAGCGCCGCGGCCGATTCCGGTGGATGCGCGGGTGATTGCCGCCACCAACCGCGACCTCGAAGCCGAGGTGCGCGCTGGC CGCTTCCGCGAGGATCTCTACTACCGGCTGGCGGCACTCACCATCCATGTGCCGCCGCTGCGCGAGCGCACGATGGACAT CCCGCTGATCGCGCAGGCGCTGGTGGATGAAACCCAGGCCGCACTCGGGCGCCGCTTCGAGCCGCTGTCGGCCGAGGTGA TCACCTGCCTGCAGGCCTGGCGCTGGCCGGGCAATGTGCGCGAACTGCGCAACGAGGTGCTGCGCATGATCGCGCTCGCC GACGACGAGCGCCTGTCGGCCGCCCACCTGAGCCCGCGCGTGCTGCGTGCGGGCGACGCGCACGAGGAGCCGGCGCTGTC GATGCTGAGCGGGCTGGACGGCGACCTCAAGACCCGGCTGGAAGCGCTCGAAGCGCGCATCGTCAAGGAAAGCCTGATCC GCCACCGCTGGAACAAGACCCGCGCGGCTAAGGAACTCGGCCTGTCGCGCGTCGGCCTGCGCAGCAAGCTGGCGCGCTAT GGGCTGGAGCGCGACTGA
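The GC content reported for this gene is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the whitespace stripping is there only because the sequence in this record is wrapped with spaces.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored so that wrapped sequences can be pasted directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Small illustrative inputs, not taken from the record.
print(gc_content("ATGC"))   # 50.0
print(gc_content("GG CC"))  # 100.0
```

Passing the 1458-base sequence above to `gc_content` would be one way to check the listed value.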
Upstream 100 bases:
>100_bases CACCACTTCGTGCAGTTGCGCACCGCGCTCGGCGGCATGCGCATGCTCGACTGGCTGTCGGGCGAACCGCTGCCGCGCAT CTGCTGAGCGCCGCCGACCG
Downstream 100 bases:
>100_bases AGCCGCGCGGCCACCTCAGCCGCGCAGGCCGCGGCGCTCATCCACCTCGGCGAGCGCACGCAGGATGAAGGCAGAGGTCG GCCCGACGCCTTCGCTTTCC
Product: hydrogenase transcriptional regulatory protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 485; Mature: 484
Protein sequence:
>485_residues MPPPLPPELPSILVVDDEIRSQEALRRTLEEDFEVFTASGADDAIAILEREWIQIVLCDQRMPGSSGVALLRQVRERWPE AVRIIISGYTDSEDIIAGINEAGIYQYLLKPWQPEQLLLALKSAAEMARLHAENQRLTLELRTAAPVLEKQVSHRRASVR QQFALDAVLRAPGSPMNAVCALVKKLAALDIPVLLTGESGTGKELLARALHYDSPRAGEAFVVENCGALPDQLLESELFG HKRGAFTGAFEDRVGLFKQADGGTMLLDEIGETSFAFQVKLLRALQEGEVRPVGAPRPIPVDARVIAATNRDLEAEVRAG RFREDLYYRLAALTIHVPPLRERTMDIPLIAQALVDETQAALGRRFEPLSAEVITCLQAWRWPGNVRELRNEVLRMIALA DDERLSAAHLSPRVLRAGDAHEEPALSMLSGLDGDLKTRLEALEARIVKESLIRHRWNKTRAAKELGLSRVGLRSKLARY GLERD
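The protein sequence can be related to the gene sequence by codon-by-codon translation. The sketch below assumes the standard codon assignments (which the bacterial translation table 11 shares); the record itself does not name the table used, and `translate` is an illustrative helper rather than the database's own tool.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 18 bases of the gene sequence in this record.
print(translate("ATGCCCCCGCCGCTGCCG"))  # MPPPLP
print(translate("GGGCTGGAGCGCGACTGA"))  # GLERD
```

The two fragments reproduce the first six and last five residues of the protein sequence listed above.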
Sequences:
>Translated_485_residues MPPPLPPELPSILVVDDEIRSQEALRRTLEEDFEVFTASGADDAIAILEREWIQIVLCDQRMPGSSGVALLRQVRERWPE AVRIIISGYTDSEDIIAGINEAGIYQYLLKPWQPEQLLLALKSAAEMARLHAENQRLTLELRTAAPVLEKQVSHRRASVR QQFALDAVLRAPGSPMNAVCALVKKLAALDIPVLLTGESGTGKELLARALHYDSPRAGEAFVVENCGALPDQLLESELFG HKRGAFTGAFEDRVGLFKQADGGTMLLDEIGETSFAFQVKLLRALQEGEVRPVGAPRPIPVDARVIAATNRDLEAEVRAG RFREDLYYRLAALTIHVPPLRERTMDIPLIAQALVDETQAALGRRFEPLSAEVITCLQAWRWPGNVRELRNEVLRMIALA DDERLSAAHLSPRVLRAGDAHEEPALSMLSGLDGDLKTRLEALEARIVKESLIRHRWNKTRAAKELGLSRVGLRSKLARY GLERD
>Mature_484_residues PPPLPPELPSILVVDDEIRSQEALRRTLEEDFEVFTASGADDAIAILEREWIQIVLCDQRMPGSSGVALLRQVRERWPEA VRIIISGYTDSEDIIAGINEAGIYQYLLKPWQPEQLLLALKSAAEMARLHAENQRLTLELRTAAPVLEKQVSHRRASVRQ QFALDAVLRAPGSPMNAVCALVKKLAALDIPVLLTGESGTGKELLARALHYDSPRAGEAFVVENCGALPDQLLESELFGH KRGAFTGAFEDRVGLFKQADGGTMLLDEIGETSFAFQVKLLRALQEGEVRPVGAPRPIPVDARVIAATNRDLEAEVRAGR FREDLYYRLAALTIHVPPLRERTMDIPLIAQALVDETQAALGRRFEPLSAEVITCLQAWRWPGNVRELRNEVLRMIALAD DERLSAAHLSPRVLRAGDAHEEPALSMLSGLDGDLKTRLEALEARIVKESLIRHRWNKTRAAKELGLSRVGLRSKLARYG LERD
Specific function: Probable member of the two-component regulatory system involved in the regulation of the hydrogenase activity. HoxA is probably phosphorylated by a sensory component (which could be hoxX) and then acts in conjunction with sigma-54 as a transcriptional act
COG id: COG2204
COG function: function code T; Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
Gene ontology:
Cell location: Cytoplasmic [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 sigma-54 factor interaction domain [H]
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Escherichia coli | GI1788550 | 482 | 36.9294605809129 | 287 | 9e-79 |
| Escherichia coli | GI1790437 | 469 | 37.5266524520256 | 261 | 9e-71 |
| Escherichia coli | GI1788905 | 479 | 35.9081419624217 | 256 | 3e-69 |
| Escherichia coli | GI1790299 | 488 | 33.6065573770492 | 244 | 7e-66 |
| Escherichia coli | GI1789233 | 307 | 40.7166123778502 | 224 | 1e-59 |
| Escherichia coli | GI87082152 | 320 | 41.875 | 223 | 2e-59 |
| Escherichia coli | GI1789087 | 310 | 43.2258064516129 | 218 | 9e-58 |
| Escherichia coli | GI87082117 | 307 | 43.3224755700326 | 214 | 7e-57 |
| Escherichia coli | GI1787583 | 309 | 39.4822006472492 | 202 | 5e-53 |
| Escherichia coli | GI87081872 | 306 | 40.5228758169935 | 192 | 5e-50 |
| Escherichia coli | GI1786524 | 307 | 39.7394136807818 | 190 | 1e-49 |
| Escherichia coli | GI1789828 | 316 | 36.3924050632911 | 137 | 1e-33 |
| Escherichia coli | GI87081858 | 300 | 32 | 123 | 2e-29 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003593
- InterPro: IPR011006
- InterPro: IPR020441
- InterPro: IPR009057
- InterPro: IPR002197
- InterPro: IPR002078
- InterPro: IPR001789 [H]
Pfam domain/function: PF02954 HTH_8; PF00072 Response_reg; PF00158 Sigma54_activat [H]
EC number: NA
Molecular weight: Translated: 53877; Mature: 53745
Theoretical pI: Translated: 5.78; Mature: 5.78
Prosite motif: PS50110 RESPONSE_REGULATORY ; PS00675 SIGMA54_INTERACT_1 ; PS00676 SIGMA54_INTERACT_2 ; PS00688 SIGMA54_INTERACT_3 ; PS50045 SIGMA54_INTERACT_4
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
2.3 %Cys+Met (Mature Protein)
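The Cys/Met figures are presumably residue counts expressed as a percentage of the protein length. A minimal sketch of that calculation, with a hypothetical helper name and small illustrative inputs:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)


# Illustrative toy peptide, not taken from the record.
peptide = "MCAAAAAAAA"
print(residue_percent(peptide, "C"))   # 10.0
print(residue_percent(peptide, "M"))   # 10.0
print(residue_percent(peptide, "CM"))  # 20.0
```

Applied to the translated and mature sequences in this record, the same function could be used to check the percentages listed above; the mature form lacks the initial Met, which is why its Met value is lower.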
Secondary structure:
>Translated Secondary Structure
MPPPLPPELPSILVVDDEIRSQEALRRTLEEDFEVFTASGADDAIAILEREWIQIVLCDQ
CCCCCCCCCCEEEEECHHHHHHHHHHHHHHHHHHEEECCCCCHHHHHHHHHHEEEEEECC
RMPGSSGVALLRQVRERWPEAVRIIISGYTDSEDIIAGINEAGIYQYLLKPWQPEQLLLA
CCCCCCHHHHHHHHHHHHHHHHEEHEECCCCCHHHHHCCCHHHHHHHHHCCCCHHHHHHH
LKSAAEMARLHAENQRLTLELRTAAPVLEKQVSHRRASVRQQFALDAVLRAPGSPMNAVC
HHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH
ALVKKLAALDIPVLLTGESGTGKELLARALHYDSPRAGEAFVVENCGALPDQLLESELFG
HHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHC
HKRGAFTGAFEDRVGLFKQADGGTMLLDEIGETSFAFQVKLLRALQEGEVRPVGAPRPIP
CCCCCCCCCHHHHHHHHEECCCCEEEHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCC
VDARVIAATNRDLEAEVRAGRFREDLYYRLAALTIHVPPLRERTMDIPLIAQALVDETQA
CCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHEEECCCHHHHCCCCHHHHHHHHHHHHH
ALGRRFEPLSAEVITCLQAWRWPGNVRELRNEVLRMIALADDERLSAAHLSPRVLRAGDA
HHCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHCCCCC
HEEPALSMLSGLDGDLKTRLEALEARIVKESLIRHRWNKTRAAKELGLSRVGLRSKLARY
CCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHC
GLERD
CCCCC
>Mature Secondary Structure
PPPLPPELPSILVVDDEIRSQEALRRTLEEDFEVFTASGADDAIAILEREWIQIVLCDQ
CCCCCCCCCEEEEECHHHHHHHHHHHHHHHHHHEEECCCCCHHHHHHHHHHEEEEEECC
RMPGSSGVALLRQVRERWPEAVRIIISGYTDSEDIIAGINEAGIYQYLLKPWQPEQLLLA
CCCCCCHHHHHHHHHHHHHHHHEEHEECCCCCHHHHHCCCHHHHHHHHHCCCCHHHHHHH
LKSAAEMARLHAENQRLTLELRTAAPVLEKQVSHRRASVRQQFALDAVLRAPGSPMNAVC
HHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH
ALVKKLAALDIPVLLTGESGTGKELLARALHYDSPRAGEAFVVENCGALPDQLLESELFG
HHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHC
HKRGAFTGAFEDRVGLFKQADGGTMLLDEIGETSFAFQVKLLRALQEGEVRPVGAPRPIP
CCCCCCCCCHHHHHHHHEECCCCEEEHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCC
VDARVIAATNRDLEAEVRAGRFREDLYYRLAALTIHVPPLRERTMDIPLIAQALVDETQA
CCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHEEECCCHHHHCCCCHHHHHHHHHHHHH
ALGRRFEPLSAEVITCLQAWRWPGNVRELRNEVLRMIALADDERLSAAHLSPRVLRAGDA
HHCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHCCCCC
HEEPALSMLSGLDGDLKTRLEALEARIVKESLIRHRWNKTRAAKELGLSRVGLRSKLARY
CCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHC
GLERD
CCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2001989; 12948488 [H]