Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is hoxH [H]

Identifier: 222524577

GI number: 222524577

Start: 1666707

End: 1668128

Strand: Direct

Name: hoxH [H]

Synonym: Chy400_1301

Alternate gene names: 222524577

Gene position: 1666707-1668128 (Clockwise)

Preceding gene: 222524576

Following gene: 222524578

Centisome position: 31.63
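
The centisome value is consistent with the gene start expressed as a percentage of the chromosome length given at the top of the record, and the 1422-base gene length follows from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming 1-based inclusive coordinates; the function names are illustrative and not part of the record:

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length."""
    return 100.0 * start / genome_length


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Values taken from the Start, End and Length fields of this record.
    print(round(centisome(1666707, 5268950), 2))  # 31.63
    print(gene_length(1666707, 1668128))          # 1422
```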

GC content: 56.75
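
GC content is conventionally the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from a FASTA-style sequence body; it assumes the sequence contains only A, C, G and T, and the function name is hypothetical rather than the tool used to build this record:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Toy input; paste the gene sequence lines here to check the record value.
    print(gc_content("ATGC\nGGCC"))  # 75.0
```

For a 1422-base gene, a value of 56.75 would correspond to roughly 807 G or C bases.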

Gene sequence:

>1422_bases
ATGGGTCAACGTATCATCATCGATCCGGTGACACGCATCGAGGGGCATGCCAAAATCAGCATTCATCTCGATGACCATGG
CGACGTTGCCGAAACACGATTTCACGTCACCGAATTTCGTGGCTTCGAGCGCTTTTGCATCGGACGACCCTTCTGGGAGA
TGCCGGGCATTACGGCGCGTATCTGCGGCATCTGCCCGGTGAGCCACTTGCTTGCCTCGGCCCGTGCCGGTGACGGTATC
CTTGCCGTGTTGATTCCACCGGCTGCGGAAAAATTGCGCCGGTTGATGAATCTTGCCCAGATCATTCAGTCACACGCGCT
CAGTTTCTTTCATCTGAGCGGGCCTGATCTGATGCTCGGTTTCGACAGTGACCCGGCGCAGCGGAATATCTTTGGACTGA
TCGCTGCCGACCCGGACGTGGCACGGAACGGGATTCGCCTGCGCCAGTTTGGCCAGGAAGTGATCGAGATGTTGGGTGGG
CGCAAAATCCATCCGGCATGGGCCGTACCGGGAGGTGTGCGGAGCGGTTTGAGTGCGAGCAGTCGTGATCATATTCGGGC
TAAGCTGCCCGAAATGCTGACCATCGCCTGCGATGCCCTTGGGCGGCTGAAACAACTCAGTGCGAAGTATCAGCGTGAAA
TTGAAACGTTCGGCGTGTTTCCCAGCCTCTACCTGGGTATGGTAGGGCCGGGCGGACGATGGATGCACTACGGCGGCAAG
CTGCGCGTCATTGATCACACCGGTAATATCTTGATCGATGAACTCGATCCTATCGACTACCGTGATGTCATCGGCGAAGC
GGTTGAGCCGTGGAGCTACCTGAAGTTTCCGTACATTCGAGCCTTGGGCTATCCGCAGGGCATGTATCGGGTAGGGCCAC
TGGCGCGGATCAACATCTGCACAACCATGGGTGCTGAACGGGCTGATGCTGAGTTACGCGAGTTGCGCCAGGCTGTTGGG
CCGGTTATCCATGGCAGCTTCTACTATCACCACGCCCGCCTGATCGAGATTATCGCCGCGCTAGAGCGGATGGAAGTGCT
GGTTGAAGACGACGATCTGCTCTCTACCGAACTCCGTGCCGATGCCGGTGTGAATCGTCACGAAGCGGTTGGCGTCAGCG
AAGCGCCACGGGGCACGCTCTTCCACCATTATCGCGTCGATGAGAAAGGTTTGATTACCGATGTCAATCTGATCATTGCG
ACCGGTCAGAATAATCTGGCAATCAATCGCACCGTTGGTCAGATTGCACAGAGCTATATTCGGCATGGTCAGTTTGATGA
AGGCATTCTCAATCGGATCGAAGCCGGGGTGCGTGCCTACGATCCGTGTCTGAGCTGCTCAACCCATGCGGTCGGTCAGA
TGGCGATGGAGGTGGATTTGTACAATGCCCAGGGCGAATTGATCAATCGCCTCTCCCGCTAA

Upstream 100 bases:

>100_bases
GGATTCGGGCATTGCTTGAAGCCCTGATTGAGGGCCGTGAACCACCAGCCGATATTCGCTACGGATGAAGGGCGACGATA
TTCTGAGCAAGGAGTACACG

Downstream 100 bases:

>100_bases
GGCAGCAGCATCGGAAGATAGCATCAGGAACATCACAGCACCGACCGCACGCGCCCGGTGACCCGATGGGGTAGTTCCGT
CGTGGTTTGCCGACTCGAAG

Product: nickel-dependent hydrogenase large subunit

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 473; Mature: 472
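
The two counts can be related to the 1422-base gene sequence: it contains 474 codons, the last of which is the TAA stop codon, leaving 473 translated residues. The mature sequence listed in this record is the translated one without its initial methionine. A small sketch of that bookkeeping, assuming the mature form differs only by removal of the initiator Met (the function name is illustrative):

```python
from __future__ import annotations


def residue_counts(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one terminal stop codon and that the
    mature protein only lacks the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # drop the stop codon
    mature = translated - 1   # drop the initiator Met (assumption)
    return translated, mature


if __name__ == "__main__":
    print(residue_counts(1422))  # (473, 472)
```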

Protein sequence:

>473_residues
MGQRIIIDPVTRIEGHAKISIHLDDHGDVAETRFHVTEFRGFERFCIGRPFWEMPGITARICGICPVSHLLASARAGDGI
LAVLIPPAAEKLRRLMNLAQIIQSHALSFFHLSGPDLMLGFDSDPAQRNIFGLIAADPDVARNGIRLRQFGQEVIEMLGG
RKIHPAWAVPGGVRSGLSASSRDHIRAKLPEMLTIACDALGRLKQLSAKYQREIETFGVFPSLYLGMVGPGGRWMHYGGK
LRVIDHTGNILIDELDPIDYRDVIGEAVEPWSYLKFPYIRALGYPQGMYRVGPLARINICTTMGAERADAELRELRQAVG
PVIHGSFYYHHARLIEIIAALERMEVLVEDDDLLSTELRADAGVNRHEAVGVSEAPRGTLFHHYRVDEKGLITDVNLIIA
TGQNNLAINRTVGQIAQSYIRHGQFDEGILNRIEAGVRAYDPCLSCSTHAVGQMAMEVDLYNAQGELINRLSR

Sequences:

>Translated_473_residues
MGQRIIIDPVTRIEGHAKISIHLDDHGDVAETRFHVTEFRGFERFCIGRPFWEMPGITARICGICPVSHLLASARAGDGI
LAVLIPPAAEKLRRLMNLAQIIQSHALSFFHLSGPDLMLGFDSDPAQRNIFGLIAADPDVARNGIRLRQFGQEVIEMLGG
RKIHPAWAVPGGVRSGLSASSRDHIRAKLPEMLTIACDALGRLKQLSAKYQREIETFGVFPSLYLGMVGPGGRWMHYGGK
LRVIDHTGNILIDELDPIDYRDVIGEAVEPWSYLKFPYIRALGYPQGMYRVGPLARINICTTMGAERADAELRELRQAVG
PVIHGSFYYHHARLIEIIAALERMEVLVEDDDLLSTELRADAGVNRHEAVGVSEAPRGTLFHHYRVDEKGLITDVNLIIA
TGQNNLAINRTVGQIAQSYIRHGQFDEGILNRIEAGVRAYDPCLSCSTHAVGQMAMEVDLYNAQGELINRLSR
>Mature_472_residues
GQRIIIDPVTRIEGHAKISIHLDDHGDVAETRFHVTEFRGFERFCIGRPFWEMPGITARICGICPVSHLLASARAGDGIL
AVLIPPAAEKLRRLMNLAQIIQSHALSFFHLSGPDLMLGFDSDPAQRNIFGLIAADPDVARNGIRLRQFGQEVIEMLGGR
KIHPAWAVPGGVRSGLSASSRDHIRAKLPEMLTIACDALGRLKQLSAKYQREIETFGVFPSLYLGMVGPGGRWMHYGGKL
RVIDHTGNILIDELDPIDYRDVIGEAVEPWSYLKFPYIRALGYPQGMYRVGPLARINICTTMGAERADAELRELRQAVGP
VIHGSFYYHHARLIEIIAALERMEVLVEDDDLLSTELRADAGVNRHEAVGVSEAPRGTLFHHYRVDEKGLITDVNLIIAT
GQNNLAINRTVGQIAQSYIRHGQFDEGILNRIEAGVRAYDPCLSCSTHAVGQMAMEVDLYNAQGELINRLSR

Specific function: This is one of three E. coli hydrogenases synthesized in response to different physiological conditions. Hyd2 is involved in hydrogen uptake. [C]

COG id: COG3259

COG function: function code C; Coenzyme F420-reducing hydrogenase, alpha subunit

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the [NiFe]/[NiFeSe] hydrogenase large subunit family [H]

Homologues:

Organism=Escherichia coli, GI1789368, Length=130, Percent_Identity=34.6153846153846, Blast_Score=82, Evalue=9e-17,
Organism=Escherichia coli, GI1787207, Length=122, Percent_Identity=31.1475409836066, Blast_Score=75, Evalue=1e-14,
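
Each homologue line is a comma-separated list of key=value fields, with the GI number given as a bare token such as GI1789368. The following sketch parses one such line into a dictionary; the function name and the "GI" key are assumptions made for illustration, not part of any published parser for this database:

```python
from __future__ import annotations


def parse_homologue(line: str) -> dict[str, str]:
    """Split a homologue line into fields; values are kept as strings."""
    fields: dict[str, str] = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # skip the empty piece after the trailing comma
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare token such as GI1789368
    return fields


if __name__ == "__main__":
    line = ("Organism=Escherichia coli, GI1789368, Length=130, "
            "Percent_Identity=34.6153846153846, Blast_Score=82, Evalue=9e-17,")
    hit = parse_homologue(line)
    print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

The identity values look like ratios over the reported length (45/130 is about 34.615 percent and 38/122 about 31.148 percent), although the record does not state how they were derived.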

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001501
- InterPro:   IPR018194 [H]

Pfam domain/function: PF00374 NiFeSe_Hases [H]

EC number: 1.12.1.2 [H]

Molecular weight: Translated: 52459; Mature: 52328
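
The translated and mature values differ by 131, which matches the loss of a single methionine residue. The sketch below reproduces that difference; it assumes the weights are average masses in daltons (the record gives no unit) and uses an approximate average residue mass for Met of 131.19 Da. The constant and function names are illustrative:

```python
# Approximate average mass of a methionine residue in a peptide chain, in Da
# (free methionine, about 149.21 Da, minus one water, about 18.02 Da).
MET_RESIDUE_MASS = 131.19


def mature_mass(translated_mass: float) -> float:
    """Mass after removing the initiator Met; assumes no other processing."""
    return translated_mass - MET_RESIDUE_MASS


if __name__ == "__main__":
    print(round(mature_mass(52459)))  # 52328
```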

Theoretical pI: Translated: 6.83; Mature: 6.83

Prosite motif: PS00508 NI_HGENASE_L_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
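
These percentages are residue counts divided by sequence length. The figures above are consistent with 7 Cys and 13 Met among the 473 translated residues, and with 7 Cys and 12 Met among the 472 mature residues. A minimal sketch of the calculation, assuming one-letter amino acid codes and rounding to one decimal place as in the record (the function name is hypothetical):

```python
from __future__ import annotations


def cys_met_content(seq: str) -> dict[str, float]:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    residues = "".join(seq.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n = len(residues)
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


if __name__ == "__main__":
    # Toy peptide; pass the translated sequence, or the same sequence
    # without its first residue for the mature protein.
    print(cys_met_content("MCAA"))  # {'Cys': 25.0, 'Met': 25.0, 'Cys+Met': 50.0}
```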

Secondary structure:

>Translated Secondary Structure
MGQRIIIDPVTRIEGHAKISIHLDDHGDVAETRFHVTEFRGFERFCIGRPFWEMPGITAR
CCCEEEECCHHHCCCCEEEEEEECCCCCHHHHHHHHHHHCCCHHHHCCCCCHHCCCCHHH
ICGICPVSHLLASARAGDGILAVLIPPAAEKLRRLMNLAQIIQSHALSFFHLSGPDLMLG
HHHHCHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCEEEE
FDSDPAQRNIFGLIAADPDVARNGIRLRQFGQEVIEMLGGRKIHPAWAVPGGVRSGLSAS
CCCCCCCCCEEEEEECCHHHHHCCHHHHHHHHHHHHHHCCCEECCCCCCCCHHHCCCCCC
SRDHIRAKLPEMLTIACDALGRLKQLSAKYQREIETFGVFPSLYLGMVGPGGRWMHYGGK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCCEEEECCE
LRVIDHTGNILIDELDPIDYRDVIGEAVEPWSYLKFPYIRALGYPQGMYRVGPLARINIC
EEEEECCCCEEEECCCCCCHHHHHHHHCCCHHHHCCHHHHHCCCCCCCCCCCCCEEEEEE
TTMGAERADAELRELRQAVGPVIHGSFYYHHARLIEIIAALERMEVLVEDDDLLSTELRA
ECCCCCHHHHHHHHHHHHHCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
DAGVNRHEAVGVSEAPRGTLFHHYRVDEKGLITDVNLIIATGQNNLAINRTVGQIAQSYI
CCCCCHHHCCCCCCCCCCCEEEEEEECCCCCEEEEEEEEEECCCCEEEEHHHHHHHHHHH
RHGQFDEGILNRIEAGVRAYDPCLSCSTHAVGQMAMEVDLYNAQGELINRLSR
HCCCCCHHHHHHHHHHHHHHCHHHCCCHHHHHHEEEEEEEECCCHHHHHHHCC
>Mature Secondary Structure 
GQRIIIDPVTRIEGHAKISIHLDDHGDVAETRFHVTEFRGFERFCIGRPFWEMPGITAR
CCEEEECCHHHCCCCEEEEEEECCCCCHHHHHHHHHHHCCCHHHHCCCCCHHCCCCHHH
ICGICPVSHLLASARAGDGILAVLIPPAAEKLRRLMNLAQIIQSHALSFFHLSGPDLMLG
HHHHCHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCEEEE
FDSDPAQRNIFGLIAADPDVARNGIRLRQFGQEVIEMLGGRKIHPAWAVPGGVRSGLSAS
CCCCCCCCCEEEEEECCHHHHHCCHHHHHHHHHHHHHHCCCEECCCCCCCCHHHCCCCCC
SRDHIRAKLPEMLTIACDALGRLKQLSAKYQREIETFGVFPSLYLGMVGPGGRWMHYGGK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCCEEEECCE
LRVIDHTGNILIDELDPIDYRDVIGEAVEPWSYLKFPYIRALGYPQGMYRVGPLARINIC
EEEEECCCCEEEECCCCCCHHHHHHHHCCCHHHHCCHHHHHCCCCCCCCCCCCCEEEEEE
TTMGAERADAELRELRQAVGPVIHGSFYYHHARLIEIIAALERMEVLVEDDDLLSTELRA
ECCCCCHHHHHHHHHHHHHCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
DAGVNRHEAVGVSEAPRGTLFHHYRVDEKGLITDVNLIIATGQNNLAINRTVGQIAQSYI
CCCCCHHHCCCCCCCCCCCEEEEEEECCCCCEEEEEEEEEECCCCEEEEHHHHHHHHHHH
RHGQFDEGILNRIEAGVRAYDPCLSCSTHAVGQMAMEVDLYNAQGELINRLSR
HCCCCCHHHHHHHHHHHHHHCHHHCCCHHHHHHEEEEEEEECCCHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2188945; 12948488; 2496982 [H]