Definition: Candidatus Solibacter usitatus Ellin6076 chromosome, complete genome.
Accession: NC_008536
Length: 9,965,640 bp

The map label for this gene is fhlA [H]

Identifier: 116622082

GI number: 116622082

Start: 3745889

End: 3748006

Strand: Direct

Name: fhlA [H]

Synonym: Acid_2969

Alternate gene names: 116622082

Gene position: 3745889-3748006 (Clockwise)

Preceding gene: 116622069

Following gene: 116622083

Centisome position: 37.59
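The centisome position expresses where the gene sits on the chromosome as a percentage of the total genome length. The sketch below assumes the value is derived from the gene's start coordinate divided by the genome length; that definition is an assumption (the record does not state it), although it does reproduce the figure given here. The function name centisome is hypothetical.

```python
def centisome(start: int, genome_length: int) -> float:
    """Position of a gene start as a percentage of the genome length."""
    if genome_length <= 0:
        raise ValueError("genome length must be positive")
    return 100.0 * start / genome_length


# Values taken from this record: gene start and chromosome length.
position = centisome(3745889, 9965640)
print(round(position, 2))
```

Using the end coordinate instead of the start shifts the result slightly, so the choice of coordinate matters when comparing against other records.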

GC content: 59.21
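GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation is shown here; the function name gc_content is hypothetical, and the example string is only the first ten bases of the gene sequence listed in this record, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the line breaks introduced by FASTA-style wrapping.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene, used only as a short illustration.
print(gc_content("ATGCCCTCCG"))
```

Passing the complete 2118-base sequence to the same function should give a value comparable to the one reported above, assuming the database counts only unambiguous bases.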

Gene sequence:

>2118_bases
ATGCCCTCCGCGGCCTCACAGCAACAAGTTACGTGGCTTGTTGAAAACGGTCCTAGAGAACTCGAAGCCCTGTTACGCGC
GATTGTCTATCACCCCGCGGCGCCCATTCTTATCGCCGACAACGATCGGCACTCGGTGCACGCAAGTTCTGGAGCCGGAA
AGCTGCTCGGCGTACCACGTGATGCGATCATCGGCCGGAGTTTAGATGATTTCGCCGACCCGGCGTTTAAACCTCAAATC
TCCGAGCTATGGCAGGCTTTTTTGGAGCGCGGAAAACAGGCGGGCACACTTCGCCTGGTCGGTCCCGATGGAAAGTCGCG
CGAGGTCGAGTACACGGTCAAGGGCAACGTCCTGCCCGTGCGCCACGTTTTGGTGCTGCGCGACAAGGAGGGTGCCGCCG
GCGCCACCCCCGGGGAGATTCCCGCTTGGGTACAGGACTACGCCATCTTTCTGCTGGATGTGGACGGCACGGTGGTGGCT
TGGTATTCGGGCGCCGAGCGAATCTATGGCTACGCAGCGTCCGAGGTAATCGATCGACATATTTCAGTGCTCTACCCGGA
TGACGAAGAACGGCGCTCCTATGTGACTGAGGAGTTGGACCGGTCTGTCGCCGAGGGCCACTTCGGTAGCGAAGGCTGGT
GTATGCGCCAGGACGGAGAGCGTTTCTGGGGAAACGTAGTCACGGTCTCCTTGAAAGACGAAGATGGAAAGCTGCAGGGG
TTCGCGAGAGTGGTCCGCGATTTTAGCGAACGCCGCGAACGTGATGAGAAACTGCGCGGCAAGCGATCCCGTCTCCGCAC
AACATCGACCGATACGTTGATCGCCGGTATCGTATCCGGCGAGTTCGACCGCATCCCGGAAGCCAATGACGCCTTCCTTG
AACTGGTCGGGTATACTCGCGAAGACCTGTTGACCGGGTGCCTGCGATGGCCTGACCTGACCCCACCCGAGTATTTCGCG
CTTGACGAACTGGCACACGAAGAAGGGCTACGATTCGGCGCCTGCGCCCCGTATGAAAAGGAATTGATTCGCAAGGGCGG
GTCGCGAGTTCAAGTTCTGGTGGCAACCGCCGTCTTGAAACTCGCCCCATTTCGCTGGATCAGCTTCGTACAGGAACTGA
GCGCTTTTGATCGCCGACTGCCCGTGGCAGAAACCGGCGAGTTTACCGAGGAAATCGTCGTTCTCAAGAATGAGTTCGTC
GAAATCGTGGGCCGCAGCGCCGCGATGAACCGCGTCCTGGGCCAGATCGAACTGGTTGCTCCCACCAATGCCACCGTGCT
GATCCTGGGCGAAACCGGGACCGGCAAAGAACTGGTTGCGCGCGCCGTACACCAGATGAGCCCTCGCCGCGACCGTCCTT
TCGTGACGTTGAACTGCGCCGCCATTCCTACCGGCCTGCTGGAAAGCGAGTTGTTTGGATATGAACGGGGCGCCTTCACC
GGAGCGTTGTCGCAAAAGATTGGCCGCTTTGAGATGGCGAATCGTGGAACGCTCTTCCTGGACGAGGTCGGTGACATTCC
TCTGGACCTGCAGCCCAAGTTGCTCCGCGCCTTACAGGAGAAGTCCTTCGAGAGGTTGGGAGGAACGAAGACGATTCCGA
TCGACGTCCGGTTGGTGGCTGCTACGAATCGCAACCTGACCCAGATGATGGGGGACAAGCTTTTCAGAAGCGATCTCTAT
TACCGCCTGAAAGTCTTCCCGATTACAACCTCCCCGTTGCGCGATCACCCCGAGGACATCCCGATACTGGCCCGGCACTT
CATGCAAAAGTACGCCCGGGAGATGGGCAAACAGATCGATACGATCCCCCCGGACGCGCTTCAAGCACTGGTCAAATGGC
CATGGCCGGGCAATGTTCGTGAACTCGAAAACTTCATCGAACGCGCGGTCATCCTGACGCAGGGTTCCAGCTTGCGGGCT
CCTTTGGCTGAGATACGGCCCGATGCAGCTGAGCTTCCCGGGGACAGCACGCTGGAGTTTGTCGAGCGAGCACACCTCTT
GAAGATCCTCGGTGAAACCGGTTGGGTCATCAGCAAGGCTGCCGCAAGGCTCGGCATGCCGCGAACCACTCTCAACGCAA
TGATGGGTAAGCTCGGAATCTCCCGCAAGGACGCGTGA

Upstream 100 bases:

>100_bases
TGGACACCGCCATTCTTGGTTTCCAGCAGTTTTGTAGAACACCGCGAGGTGTTTGCAGACGGTAAAGGTTCCCGGATCGA
CGGGTCAGGAGAAAAGCAGC

Downstream 100 bases:

>100_bases
CCACGTGAGGTCACGCGACCGCGGGCGGTCGTGAAATCGATAAGAACCCATTCGTTCCGCTAAAACGGCGGGCAAACCCA
GCGGCCGGCAACGTGCCCTT

Product: Fis family transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 705; Mature: 704
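The two residue counts follow from the gene coordinates. A short sketch of the arithmetic is given below, assuming inclusive coordinates, a terminal stop codon that is not translated (the gene sequence ends in TGA), and a mature protein that differs from the translated one only by removal of the initiator methionine. The last assumption is inferred from the sequences in this record rather than stated by it.

```python
start, end = 3745889, 3748006    # gene coordinates from this record

gene_length = end - start + 1    # coordinates are inclusive
codons, remainder = divmod(gene_length, 3)

translated = codons - 1          # the final codon is a stop codon
mature = translated - 1          # assumes the initiator Met is removed

print(gene_length, remainder, translated, mature)
```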

Protein sequence:

>705_residues
MPSAASQQQVTWLVENGPRELEALLRAIVYHPAAPILIADNDRHSVHASSGAGKLLGVPRDAIIGRSLDDFADPAFKPQI
SELWQAFLERGKQAGTLRLVGPDGKSREVEYTVKGNVLPVRHVLVLRDKEGAAGATPGEIPAWVQDYAIFLLDVDGTVVA
WYSGAERIYGYAASEVIDRHISVLYPDDEERRSYVTEELDRSVAEGHFGSEGWCMRQDGERFWGNVVTVSLKDEDGKLQG
FARVVRDFSERRERDEKLRGKRSRLRTTSTDTLIAGIVSGEFDRIPEANDAFLELVGYTREDLLTGCLRWPDLTPPEYFA
LDELAHEEGLRFGACAPYEKELIRKGGSRVQVLVATAVLKLAPFRWISFVQELSAFDRRLPVAETGEFTEEIVVLKNEFV
EIVGRSAAMNRVLGQIELVAPTNATVLILGETGTGKELVARAVHQMSPRRDRPFVTLNCAAIPTGLLESELFGYERGAFT
GALSQKIGRFEMANRGTLFLDEVGDIPLDLQPKLLRALQEKSFERLGGTKTIPIDVRLVAATNRNLTQMMGDKLFRSDLY
YRLKVFPITTSPLRDHPEDIPILARHFMQKYAREMGKQIDTIPPDALQALVKWPWPGNVRELENFIERAVILTQGSSLRA
PLAEIRPDAAELPGDSTLEFVERAHLLKILGETGWVISKAAARLGMPRTTLNAMMGKLGISRKDA

Sequences:

>Translated_705_residues
MPSAASQQQVTWLVENGPRELEALLRAIVYHPAAPILIADNDRHSVHASSGAGKLLGVPRDAIIGRSLDDFADPAFKPQI
SELWQAFLERGKQAGTLRLVGPDGKSREVEYTVKGNVLPVRHVLVLRDKEGAAGATPGEIPAWVQDYAIFLLDVDGTVVA
WYSGAERIYGYAASEVIDRHISVLYPDDEERRSYVTEELDRSVAEGHFGSEGWCMRQDGERFWGNVVTVSLKDEDGKLQG
FARVVRDFSERRERDEKLRGKRSRLRTTSTDTLIAGIVSGEFDRIPEANDAFLELVGYTREDLLTGCLRWPDLTPPEYFA
LDELAHEEGLRFGACAPYEKELIRKGGSRVQVLVATAVLKLAPFRWISFVQELSAFDRRLPVAETGEFTEEIVVLKNEFV
EIVGRSAAMNRVLGQIELVAPTNATVLILGETGTGKELVARAVHQMSPRRDRPFVTLNCAAIPTGLLESELFGYERGAFT
GALSQKIGRFEMANRGTLFLDEVGDIPLDLQPKLLRALQEKSFERLGGTKTIPIDVRLVAATNRNLTQMMGDKLFRSDLY
YRLKVFPITTSPLRDHPEDIPILARHFMQKYAREMGKQIDTIPPDALQALVKWPWPGNVRELENFIERAVILTQGSSLRA
PLAEIRPDAAELPGDSTLEFVERAHLLKILGETGWVISKAAARLGMPRTTLNAMMGKLGISRKDA
>Mature_704_residues
PSAASQQQVTWLVENGPRELEALLRAIVYHPAAPILIADNDRHSVHASSGAGKLLGVPRDAIIGRSLDDFADPAFKPQIS
ELWQAFLERGKQAGTLRLVGPDGKSREVEYTVKGNVLPVRHVLVLRDKEGAAGATPGEIPAWVQDYAIFLLDVDGTVVAW
YSGAERIYGYAASEVIDRHISVLYPDDEERRSYVTEELDRSVAEGHFGSEGWCMRQDGERFWGNVVTVSLKDEDGKLQGF
ARVVRDFSERRERDEKLRGKRSRLRTTSTDTLIAGIVSGEFDRIPEANDAFLELVGYTREDLLTGCLRWPDLTPPEYFAL
DELAHEEGLRFGACAPYEKELIRKGGSRVQVLVATAVLKLAPFRWISFVQELSAFDRRLPVAETGEFTEEIVVLKNEFVE
IVGRSAAMNRVLGQIELVAPTNATVLILGETGTGKELVARAVHQMSPRRDRPFVTLNCAAIPTGLLESELFGYERGAFTG
ALSQKIGRFEMANRGTLFLDEVGDIPLDLQPKLLRALQEKSFERLGGTKTIPIDVRLVAATNRNLTQMMGDKLFRSDLYY
RLKVFPITTSPLRDHPEDIPILARHFMQKYAREMGKQIDTIPPDALQALVKWPWPGNVRELENFIERAVILTQGSSLRAP
LAEIRPDAAELPGDSTLEFVERAHLLKILGETGWVISKAAARLGMPRTTLNAMMGKLGISRKDA

Specific function: Required for induction of expression of the formate dehydrogenase H and hydrogenase-3 structural genes [H]

COG id: COG3604

COG function: function code KT; Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1789087, Length=541, Percent_Identity=42.3290203327172, Blast_Score=366, Evalue=1e-102,
Organism=Escherichia coli, GI87082117, Length=323, Percent_Identity=55.7275541795666, Blast_Score=361, Evalue=1e-100,
Organism=Escherichia coli, GI1790437, Length=301, Percent_Identity=48.1727574750831, Blast_Score=270, Evalue=3e-73,
Organism=Escherichia coli, GI1788550, Length=317, Percent_Identity=44.4794952681388, Blast_Score=264, Evalue=1e-71,
Organism=Escherichia coli, GI1789233, Length=231, Percent_Identity=48.9177489177489, Blast_Score=241, Evalue=2e-64,
Organism=Escherichia coli, GI1790299, Length=246, Percent_Identity=48.780487804878, Blast_Score=239, Evalue=6e-64,
Organism=Escherichia coli, GI1788905, Length=311, Percent_Identity=42.7652733118971, Blast_Score=236, Evalue=5e-63,
Organism=Escherichia coli, GI87082152, Length=236, Percent_Identity=47.4576271186441, Blast_Score=228, Evalue=1e-60,
Organism=Escherichia coli, GI1787583, Length=315, Percent_Identity=38.4126984126984, Blast_Score=201, Evalue=1e-52,
Organism=Escherichia coli, GI1786524, Length=253, Percent_Identity=43.0830039525692, Blast_Score=196, Evalue=5e-51,
Organism=Escherichia coli, GI87081872, Length=232, Percent_Identity=43.9655172413793, Blast_Score=194, Evalue=1e-50,
Organism=Escherichia coli, GI1789828, Length=269, Percent_Identity=34.9442379182156, Blast_Score=135, Evalue=9e-33,
Organism=Escherichia coli, GI87081858, Length=284, Percent_Identity=31.3380281690141, Blast_Score=133, Evalue=3e-32,
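Each homologue line above is a comma-separated list of key=value fields, with the GI number given as a bare "GI..." token and a trailing comma at the end. A minimal parsing sketch is shown below; the function name parse_homologue and the choice of a plain dictionary are hypothetical, not part of the database's own tooling.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dictionary."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue                      # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]      # bare GI token has no '=' sign
    # Convert the numeric fields; the key names are those used in the record.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1789087, Length=541, "
        "Percent_Identity=42.3290203327172, Blast_Score=366, Evalue=1e-102,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Once parsed, the hits can be sorted or filtered numerically, for example by Blast_Score or by an Evalue cutoff.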

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR003018
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR002078 [H]

Pfam domain/function: PF01590 GAF; PF02954 HTH_8; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 78490; Mature: 78359

Theoretical pI: Translated: 5.78; Mature: 5.78

Prosite motif: PS50112 PAS; PS50113 PAC; PS00675 SIGMA54_INTERACT_1; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4; PS01228 COF_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
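The percentages above can be checked from residue counts. The sketch below assumes 4 Cys and 12 Met in the translated protein (tallied from the protein sequence listed in this record, so worth rechecking) and that the mature protein loses only the initiator Met. It also suggests why the mature Cys+Met figure (2.1) is not the sum of the rounded Cys and Met figures (0.6 + 1.6): the combined value appears to be computed from the raw counts and rounded once.

```python
def percent(count: int, length: int) -> float:
    """Percentage of residues, rounded to one decimal place."""
    return round(100.0 * count / length, 1)


cys, met = 4, 12                      # assumed counts for the translated protein
translated_len, mature_len = 705, 704

translated = (percent(cys, translated_len),
              percent(met, translated_len),
              percent(cys + met, translated_len))

# The mature protein is assumed to lack only the initiator methionine.
mature = (percent(cys, mature_len),
          percent(met - 1, mature_len),
          percent(cys + met - 1, mature_len))

print(translated)
print(mature)
```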

Secondary structure:

>Translated Secondary Structure
MPSAASQQQVTWLVENGPRELEALLRAIVYHPAAPILIADNDRHSVHASSGAGKLLGVPR
CCCCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCCEEECCCCCCCEECCCC
DAIIGRSLDDFADPAFKPQISELWQAFLERGKQAGTLRLVGPDGKSREVEYTVKGNVLPV
HHHHCCCCHHHCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEEECCEEEE
RHVLVLRDKEGAAGATPGEIPAWVQDYAIFLLDVDGTVVAWYSGAERIYGYAASEVIDRH
EEEEEEECCCCCCCCCCCCCCHHHHCEEEEEEECCCCEEEEECCHHHHHHHHHHHHHHHC
ISVLYPDDEERRSYVTEELDRSVAEGHFGSEGWCMRQDGERFWGNVVTVSLKDEDGKLQG
CEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCHHHCCCEEEEEEECCCCHHHH
FARVVRDFSERRERDEKLRGKRSRLRTTSTDTLIAGIVSGEFDRIPEANDAFLELVGYTR
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHCCCCCCHHHHHHHCCCH
EDLLTGCLRWPDLTPPEYFALDELAHEEGLRFGACAPYEKELIRKGGSRVQVLVATAVLK
HHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCEEEHHHHHHHHH
LAPFRWISFVQELSAFDRRLPVAETGEFTEEIVVLKNEFVEIVGRSAAMNRVLGQIELVA
HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEE
PTNATVLILGETGTGKELVARAVHQMSPRRDRPFVTLNCAAIPTGLLESELFGYERGAFT
CCCCEEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEEEHHHCCCHHHHHHHHCCCCCCHH
GALSQKIGRFEMANRGTLFLDEVGDIPLDLQPKLLRALQEKSFERLGGTKTIPIDVRLVA
HHHHHHHCCCEECCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEEEE
ATNRNLTQMMGDKLFRSDLYYRLKVFPITTSPLRDHPEDIPILARHFMQKYAREMGKQID
ECCCCHHHHHHHHHHHHCCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC
TIPPDALQALVKWPWPGNVRELENFIERAVILTQGSSLRAPLAEIRPDAAELPGDSTLEF
CCCHHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCCCCCCCHHHCCCCHHHCCCCHHHHH
VERAHLLKILGETGWVISKAAARLGMPRTTLNAMMGKLGISRKDA
HHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure 
PSAASQQQVTWLVENGPRELEALLRAIVYHPAAPILIADNDRHSVHASSGAGKLLGVPR
CCCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCCEEECCCCCCCEECCCC
DAIIGRSLDDFADPAFKPQISELWQAFLERGKQAGTLRLVGPDGKSREVEYTVKGNVLPV
HHHHCCCCHHHCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEEECCEEEE
RHVLVLRDKEGAAGATPGEIPAWVQDYAIFLLDVDGTVVAWYSGAERIYGYAASEVIDRH
EEEEEEECCCCCCCCCCCCCCHHHHCEEEEEEECCCCEEEEECCHHHHHHHHHHHHHHHC
ISVLYPDDEERRSYVTEELDRSVAEGHFGSEGWCMRQDGERFWGNVVTVSLKDEDGKLQG
CEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCHHHCCCEEEEEEECCCCHHHH
FARVVRDFSERRERDEKLRGKRSRLRTTSTDTLIAGIVSGEFDRIPEANDAFLELVGYTR
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHCCCCCCHHHHHHHCCCH
EDLLTGCLRWPDLTPPEYFALDELAHEEGLRFGACAPYEKELIRKGGSRVQVLVATAVLK
HHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCEEEHHHHHHHHH
LAPFRWISFVQELSAFDRRLPVAETGEFTEEIVVLKNEFVEIVGRSAAMNRVLGQIELVA
HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEE
PTNATVLILGETGTGKELVARAVHQMSPRRDRPFVTLNCAAIPTGLLESELFGYERGAFT
CCCCEEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEEEHHHCCCHHHHHHHHCCCCCCHH
GALSQKIGRFEMANRGTLFLDEVGDIPLDLQPKLLRALQEKSFERLGGTKTIPIDVRLVA
HHHHHHHCCCEECCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEEEE
ATNRNLTQMMGDKLFRSDLYYRLKVFPITTSPLRDHPEDIPILARHFMQKYAREMGKQID
ECCCCHHHHHHHHHHHHCCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC
TIPPDALQALVKWPWPGNVRELENFIERAVILTQGSSLRAPLAEIRPDAAELPGDSTLEF
CCCHHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCCCCCCCHHHCCCCHHHCCCCHHHHH
VERAHLLKILGETGWVISKAAARLGMPRTTLNAMMGKLGISRKDA
HHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2280686; 2118503; 9278503 [H]