Definition: Pelobacter propionicus DSM 2379 chromosome, complete genome.
Accession: NC_008609
Length: 4,008,000

The map label for this gene is fhlA [H]

Identifier: 118578840

GI number: 118578840

Start: 409383

End: 411446

Strand: Direct

Name: fhlA [H]

Synonym: Ppro_0398

Alternate gene names: 118578840

Gene position: 409383-411446 (Clockwise)

Preceding gene: 118578836

Following gene: 118578841

Centisome position: 10.21
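
The coordinate fields above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the listed chromosome length; the record does not state the exact formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive start/end coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
print(gene_length(409383, 411446))           # 2064
print(round(centisome(409383, 4008000), 2))  # 10.21
```

Under these assumptions the results agree with the 2064-base gene sequence and the listed centisome position of 10.21.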

GC content: 59.69

Gene sequence:

>2064_bases
ATGCTGGAAAGAGCCCCCAAGAAGACCCTCCACCCGCTGCTTACCTGCGACTCTCCACGATCCGTGGCCGACGACGATCA
GTGGGGCACCACGCATCACTCATGTGAAGAACTGAAAATAAACGAGACCCGGCTCAAGCGTGCCGCGGATGCTTCAGGCA
CCGCCCTCTGGGCGCTGGATCTCGCGACCTCTGCGATCTGGAGCAATCATGTTGCCAGGGATATCTTCTGTCTTCCCCAA
AACGGGCAGATCCGTCTGGAACAGATTCTGGACAGGGTTCATCCCGATGATCTCGATATGGTCAGGGAGGGGCTGGACGC
CATGATGCGGAGGGGTGAGACAACCAGCATCGAGTACCGCCTGCTGCTCCCGGACGGATCAATTCGCTGGATTCACTCGC
GGGGGGGGCCGCACAGCTGTAATGGCGGAAAATCATACTGCGTCATGGGAGCTTCGGCGGATGTAACCCAACGGAAGCAG
AATGAACAGACCCTTCTCCGGCAGCTGAGCTTCGAGTCATTCCTGGCGGAGACCTCGTCCGTCTTTGCCAAGTCCACGCT
GCCGTCTGATCTGGACCATCAGATCGAGCATGCCCTGTGGAGACTCCTGGACCATTTTCATGGCGACCGTTGCGGGCTGA
TCAAGGTGGATCTGGAGGGTGAAAAAACCATCATCACCCATGCCGTGTACCGGGAGGGACTTGAGCGTTTGCCCGGTGAT
GTCGACCTCGTGGCCCTGTTCCCCTGGTCGTTCCGTCAGTTGCGTGAGGGGCGCTGCTACTGCTTCTCCGATCTCGCCGA
ACTGCCGGTCGAGGCGGACAGGGACCGCGCTTCCTGGATGGCCATGGGGATTCGCTCCGGACTGCACGTTCCCCTGAGGG
TTGAAAACAGGGTCCATTACCTGATCGTCGTGGAATCCCATTCCGGCATCCTTGCCGGGACCGAGGATGTCCTTAACCGC
CTGCAGATAATCGGGGATCTGTTCGTGAACGCGCTTCACCGGAAAGCGACGGAGGACGAGCTCAGAAGTTCCTGTGACGA
AATCGCCAGGCTGAAGGACAAGCTGGAGTTGGAAGCAGACTATCTTCGCAGCGAGGTTCGCGCGTCCCGCTTCAACGATC
AGATTGTCGGCCAGAGCGAACCGATCCAAGGGGTACTTGCCATGGTGGAGCAGGTGGCGCCAACCCCCTCCACCGTTCTT
GTGTACGGTGAAACCGGGACAGGCAAGGAGCTGGTGGCCCAGGCGATCCATAACCACAGCCCACGCCGGGACAAATTGAT
GGTGAAGGTGAATTGCGCCTCGCTCCCCTCGTCGCTGGTGGAGAGCGAGCTGTTCGGGCGGGAGCGGGGAGCCTATACCG
GCGCCCTGACGCGCCAGATGGGGCGTTTCGAGCTTGCCGACGGTTCCTCTCTCTTTCTGGACGAGATAGCGGAGTTGTCG
CTGGAGCTGCAGAGCAAGCTGCTGCGGGTGCTGCAGGAGGGGGAGTTCGAGCGCTTGGGATCTCCCCGGACGATCAAGGT
TGACGTGCGCGTGATAGCCGCAACCAACCGCAATCTGCTGGAAGAGGTGCGCAAGGGCAGATTTCGCGAAGACCTGTACT
ACCGCCTGAGCGTTTTTCCCATCGTCGTCCCCCCCCTTCGTGAGCGGCGGGAGGACATCCCCCTGCTGGCGTGGGAGTTC
GTGCGGGATTTCAACGAGAAGATGGGCAAACGGATTCTCAGGATCGCAAAGAGGGACATGCTTGCGCTGCAGTCCTATTC
CTGGCCGGGCAACGTGCGGGAGCTGAGGAACGTCATTGAATACGCGGTGATCGTCAGTCCGGGAGACGAGTTGAAGATAC
GGCTTCCGGAAAATATCGTTAACGCCCCTCCCCAGATGACAACTCTCGAAGAGATGGAGCGCCACTACATCCAGGATGTT
CTGCGCCGGACGGAGTGGCGCATAAAAGGCGAGGGGGGCGCCGCCCATATCCTGGATCTCAATCCGGCCACACTCTATTC
CCGGATGAAAAAACTGGGCATACTCCCTCCCCGGGAGAAAGACGGCATACCATCCTCAGGTTGA
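
A GC content value such as the one listed for this gene is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, not necessarily the database's own method; `gc_content` is a hypothetical helper name, and the example input is only the first ten bases of the gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence above: 4 of 10 are G or C.
print(gc_content("ATGCTGGAAA"))  # 40.0
```

Passing the full 2064-base sequence (with its line breaks) to the same function would give the gene-wide value to compare against the GC content field.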

Upstream 100 bases:

>100_bases
CTAAATATCGACCTCGGGATGACATTTCAACTTGCGACGAACAACATCTGCACTATCATTCAGTTAAAACGATTTCAGAT
TCGGCCATGAGGAATCACCT

Downstream 100 bases:

>100_bases
GATTTCGTCTCTCGATCCGGTGCCGAGCCAGCGCGCCCGTCCCGTCCCGGCAGTGTAACTCTCTACATTGTCTCTTGTTT
TAAAACAATATTCAGCTTTC

Product: Fis family transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 687; Mature: 687

Protein sequence:

>687_residues
MLERAPKKTLHPLLTCDSPRSVADDDQWGTTHHSCEELKINETRLKRAADASGTALWALDLATSAIWSNHVARDIFCLPQ
NGQIRLEQILDRVHPDDLDMVREGLDAMMRRGETTSIEYRLLLPDGSIRWIHSRGGPHSCNGGKSYCVMGASADVTQRKQ
NEQTLLRQLSFESFLAETSSVFAKSTLPSDLDHQIEHALWRLLDHFHGDRCGLIKVDLEGEKTIITHAVYREGLERLPGD
VDLVALFPWSFRQLREGRCYCFSDLAELPVEADRDRASWMAMGIRSGLHVPLRVENRVHYLIVVESHSGILAGTEDVLNR
LQIIGDLFVNALHRKATEDELRSSCDEIARLKDKLELEADYLRSEVRASRFNDQIVGQSEPIQGVLAMVEQVAPTPSTVL
VYGETGTGKELVAQAIHNHSPRRDKLMVKVNCASLPSSLVESELFGRERGAYTGALTRQMGRFELADGSSLFLDEIAELS
LELQSKLLRVLQEGEFERLGSPRTIKVDVRVIAATNRNLLEEVRKGRFREDLYYRLSVFPIVVPPLRERREDIPLLAWEF
VRDFNEKMGKRILRIAKRDMLALQSYSWPGNVRELRNVIEYAVIVSPGDELKIRLPENIVNAPPQMTTLEEMERHYIQDV
LRRTEWRIKGEGGAAHILDLNPATLYSRMKKLGILPPREKDGIPSSG
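
The protein sequence corresponds to a codon-by-codon translation of the gene sequence: 2064 bases are 688 codons, and the final codon (TGA) is a stop, leaving 687 residues. The sketch below assumes the standard codon assignments (which the bacterial translation table shares for all non-start codons) and unambiguous A/C/G/T input; `translate` is an illustrative name, not part of this record.

```python
from itertools import product

# Standard codon table, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the gene, then its last three codons.
print(translate("ATGCTGGAAAGAGCCCCCAAGAAGACC"))  # MLERAPKKT
print(translate("TCAGGTTGA"))                    # SG
```

The two outputs match the start (MLERAPKKT) and end (SG) of the protein sequence above.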

Sequences:

Translated (687 residues) and Mature (687 residues): both identical to the protein sequence listed above.

Specific function: Required for induction of expression of the formate dehydrogenase H and hydrogenase-3 structural genes [H]

COG id: COG3604

COG function: function code KT; Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1789087, Length=336, Percent_Identity=52.6785714285714, Blast_Score=315, Evalue=8e-87,
Organism=Escherichia coli, GI87082117, Length=410, Percent_Identity=45.3658536585366, Blast_Score=305, Evalue=4e-84,
Organism=Escherichia coli, GI1790437, Length=313, Percent_Identity=47.2843450479233, Blast_Score=257, Evalue=2e-69,
Organism=Escherichia coli, GI1788550, Length=317, Percent_Identity=44.4794952681388, Blast_Score=248, Evalue=7e-67,
Organism=Escherichia coli, GI87082152, Length=324, Percent_Identity=44.7530864197531, Blast_Score=248, Evalue=1e-66,
Organism=Escherichia coli, GI1789233, Length=325, Percent_Identity=40.6153846153846, Blast_Score=236, Evalue=2e-63,
Organism=Escherichia coli, GI1790299, Length=240, Percent_Identity=46.6666666666667, Blast_Score=232, Evalue=6e-62,
Organism=Escherichia coli, GI1788905, Length=284, Percent_Identity=42.2535211267606, Blast_Score=219, Evalue=6e-58,
Organism=Escherichia coli, GI1786524, Length=256, Percent_Identity=44.53125, Blast_Score=192, Evalue=9e-50,
Organism=Escherichia coli, GI1787583, Length=247, Percent_Identity=44.1295546558704, Blast_Score=191, Evalue=2e-49,
Organism=Escherichia coli, GI87081872, Length=236, Percent_Identity=43.2203389830509, Blast_Score=190, Evalue=2e-49,
Organism=Escherichia coli, GI87081858, Length=295, Percent_Identity=36.271186440678, Blast_Score=154, Evalue=1e-38,
Organism=Escherichia coli, GI1789828, Length=283, Percent_Identity=33.5689045936396, Blast_Score=130, Evalue=3e-31,
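
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI token. A minimal parsing sketch follows, assuming every line has exactly the fields shown above; `parse_homologue` is a hypothetical helper, not part of any database tooling.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1789087
    # Convert the numeric fields; assumes they are all present.
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789087, Length=336, "
    "Percent_Identity=52.6785714285714, Blast_Score=315, Evalue=8e-87,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Blast_Score"])
# Escherichia coli 1789087 336 315
```

Parsed records can then be sorted or filtered, for example by `Evalue` or `Percent_Identity`.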

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR003018
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR002078 [H]

Pfam domain/function: PF01590 GAF; PF02954 HTH_8; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 77861; Mature: 77861

Theoretical pI: Translated: 6.44; Mature: 6.44

Prosite motif: PS50112 PAS; PS50113 PAC; PS00675 SIGMA54_INTERACT_1; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
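
The percentages above are residue counts divided by the protein length. A minimal sketch, assuming one-letter amino acid codes; `residue_percent` is an illustrative name and the short test string is a made-up example, not taken from the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues)
    return 100.0 * hits / len(seq)


toy = "MCAAAAAAAA"  # hypothetical 10-residue example: 1 Met, 1 Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Counting the 687-residue sequence in this record gives 10 Cys and 15 Met, so the listed values follow as 100*10/687, 100*15/687 and 100*25/687, rounded to one decimal place (1.5, 2.2 and 3.6).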

Secondary structure:

>Translated Secondary Structure
MLERAPKKTLHPLLTCDSPRSVADDDQWGTTHHSCEELKINETRLKRAADASGTALWALD
CCCCCCCHHHCHHEECCCCCCCCCCCCCCCCCCCHHHCCCCHHHHHHHCCCCCCEEEEHH
LATSAIWSNHVARDIFCLPQNGQIRLEQILDRVHPDDLDMVREGLDAMMRRGETTSIEYR
HHHHHHHHHHHHHHEEEECCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEE
LLLPDGSIRWIHSRGGPHSCNGGKSYCVMGASADVTQRKQNEQTLLRQLSFESFLAETSS
EEECCCCEEEEECCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
VFAKSTLPSDLDHQIEHALWRLLDHFHGDRCGLIKVDLEGEKTIITHAVYREGLERLPGD
HHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCC
VDLVALFPWSFRQLREGRCYCFSDLAELPVEADRDRASWMAMGIRSGLHVPLRVENRVHY
EEEEEECCHHHHHHHCCCCEEEHHHHHCCCCCCCCHHHHHHHHHHCCCCCEEEECCCEEE
LIVVESHSGILAGTEDVLNRLQIIGDLFVNALHRKATEDELRSSCDEIARLKDKLELEAD
EEEEECCCCCEECHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
YLRSEVRASRFNDQIVGQSEPIQGVLAMVEQVAPTPSTVLVYGETGTGKELVAQAIHNHS
HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHCCC
PRRDKLMVKVNCASLPSSLVESELFGRERGAYTGALTRQMGRFELADGSSLFLDEIAELS
CCCCEEEEEEEHHHCCHHHHHHHHHHCCCCCHHHHHHHHCCCEEECCCCHHHHHHHHHHH
LELQSKLLRVLQEGEFERLGSPRTIKVDVRVIAATNRNLLEEVRKGRFREDLYYRLSVFP
HHHHHHHHHHHHCCCHHHCCCCCEEEEEEEEEEECCHHHHHHHHHCCHHHHHHHEEEEEE
IVVPPLRERREDIPLLAWEFVRDFNEKMGKRILRIAKRDMLALQSYSWPGNVRELRNVIE
EECCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH
YAVIVSPGDELKIRLPENIVNAPPQMTTLEEMERHYIQDVLRRTEWRIKGEGGAAHILDL
EEEEECCCCCEEEECCHHHHCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCCCEEEEEEC
NPATLYSRMKKLGILPPREKDGIPSSG
CHHHHHHHHHHHCCCCCCCCCCCCCCC
>Mature Secondary Structure
Identical to the translated secondary structure above.

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2280686; 2118503; 9278503 [H]