| Definition | Chloroflexus sp. Y-400-fl chromosome, complete genome. |
|---|---|
| Accession | NC_012032 |
| Length | 5,268,950 |
The map label for this gene is yhcY [H]
Identifier: 222526714
GI number: 222526714
Start: 4343885
End: 4346080
Strand: Direct
Name: yhcY [H]
Synonym: Chy400_3485
Alternate gene names: 222526714
Gene position: 4343885-4346080 (Clockwise)
Preceding gene: 222526713
Following gene: 222526715
Centisome position: 82.44
GC content: 59.93
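The position fields above are related by simple arithmetic. The following is a minimal sketch in Python, not the database's own code. It assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the chromosome length (this assumption reproduces the listed 82.44; using the gene midpoint would not). The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_268_950   # Length
GENE_START = 4_343_885      # Start
GENE_END = 4_346_080        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length.

    Assumption: the record measures from the gene start, not the midpoint.
    """
    return 100.0 * start / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string (spaces ignored)."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(bases.count(b) for b in "GC") / len(bases)


length_bp = gene_length(GENE_START, GENE_END)
centisome_pos = round(centisome(GENE_START, GENOME_LENGTH), 2)
# First 12 bases of the gene sequence, used only as a small demonstration input.
demo_gc = round(gc_percent("ATGACTGACTCT"), 2)
print(length_bp, centisome_pos, demo_gc)
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.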
Gene sequence:
>2196_bases ATGACTGACTCTGCGCATATGGCGGCTGATTCTACTCATCGCACGCTTCCGTTAGGCGATCTGCTGGCATTTGGCGATGC GTTACGTGCCGACGATACGCCCGAAAGCCTGCTGGCTGAGGTTGTCGAGACCTTACGCCGGATCGTCGGTAGTCCGGCAG TCTATGCCCGCTTACGCGACCTCGACAGTGATACGCTCTACGCAGTAGCCTTTGCCGGTGTTGATGCACCACTGGTCGAA CGTTTACGGGCCACACCGATTGGGCCGGCAATCTACCAGCCACTCCTGCGCCCGGAATACCGACAGAGTAGTTCATACCT GATCCCGGCAGCAGTGCTCCCCCCCGATGTGCCGGATACCGAAGCGATTATTCCGGCCTCGGCACTGTTAACGCCGTTGC GCGGGCGGGGAGACCGCTTGATCGGCGTCATTGTGCTGGCATGGCCCGATCAACCCGACCTGGTCACTGTGCGTATGGTC GAAGCGATTGCCCGTCAGGCGGCGCTGGCAGTCGAAAATGTTCGCCTGGCCGAACGGAGCGCCCGCTTGCTGGCGAAAGA GCAACTGCTGGCTGAACTGGGGCGTGCGGTTGGGGCAACGCTTGATTTGGATACTATTTTGCATCAGACGATTGATCGGC TGGCTGCTGCATTCGGTAGTGGCCTGGTCGCACTGCTCGATAATCAAGAGACATTAATGGTCGTGGCAGCGGCACCGCCG CTCGACACATTGATCGGCAGGCAATTGCCGTTACTCTCTGGATCGCTGGCCTGGGTAGTGCAGAGCGGTCAGCCTTTTGT GGTAGACGATTGTCGGTTGCATGCCCCGGATATGGCTCTGTTCGGCCCCGACATCGCGTCGTGTATCATCGCCCCGTTGC GGAGTGGTGGTCGGGTGATCGGGATGCTGAGTGTAGTTAGCCGGCAGGCCGGCGTGTTCAGCGACGAAGATGTTGATCTG CTGGAGGCAATTGCCGCACAGGTGAGTGGGCCGGTCGTCAGCGCACGCCTCTACGCCGAGTCGCAACGACTGGCAGCGCA GGTGCAGAGGCGTGCCGATCAGCTCGCTGTGCTCAACTCGATTGCCCGCATCGTCACCGCAACGCTCGATCTGCGCGAGT CGCTGCCACTGGCAACAGAGCAGATACAGCGCGGTTTTGGTTATCCGCAGGTTGACCTCTTCCTGCTCGAAGAAGAGGCC AATGAGCTGATCCTGGTCGCATCCGCCGGTCGTTATGCACCAGAGCGAGTTGGCTATCGCCAACACATCAATCTGGGACT GGTCGGACGTGCAGCGCGGAGTGGTCGGATTGTGCGCGCCGAGGATGTCGCGGCAGAAGCAGATTATCTCGGTCTCAGCG AGCGGCTCGATATTCGTTCAGAGCTGTGTGTACCGCTGATCTCCAATGGCAAGACGCTGGGAGTGCTCAATATTGAATCG CCCGAACGAGCCGGCCTGACCGAAGAGGATGCAGCGGTACTGGAGACCGTCGCCGATATGCTGGCCGGTGCCGTAGAGAA TGGGCGCCTCTACCAGCGGGCGCAGCAGGCTGCTGCCCTGGAAGAACGCAATCGCCTCGCCCGTGAATTACACGACAGCG TTAGTCAGCAGCTCTTCAGTATGACCCTGACCGCACAGGCCGCCCGATCACAGTTCGAGCGGAACCCGGCTCGCGTCCCG GCCCTGCTCGACCGGCTACACGAGACGGCAACTGCGGCGCTGGCCGAAATGCGTGCCCTAATCTTTCAGTTACGTCCGCC GGCACTGCGCGACCAGGGACTCGTCGCTGCTATTCAGCAACACGCCCAACATCTGGCCCATCGTGAAGGATTGCGGATAG AATTAAATGTAATCGGTGATGAGCGCCACGCGCGCGGCATCGAGCAACCCCTCTTTCGGATTGTTCAGGAGGCGCTCAAT 
AACATTGTGAAGCATGCAGCCGCCCGCAATGTGCAAATTATGCTCGAATTCAATGCCGATCAGGTTGCGATTCGGGTCAT TGACGATGGCAAAGGTTTCGATCCGGCGGCTCGCCCCTCTGGCGAAGGCCGACACCTCGGCTTACTCAGCATGCGCGAAC GCGCAGCAGAATTGGGCGGTTCGTTCAACGTCCATTCTCGTCCTGGGGCCGGTGCCGAGGTTGAGGTTGTCGTGCCACTA CGTGGTCGTCATGGCAATCTTGAACGACCTGAGTAA
Upstream 100 bases:
>100_bases GGCGAGGAACGGCACCCATTGTCACCGAACCGGCTGACGAAGATGAGTTGTAATTGCCGGCGGCATATGCGACAATGTGA ATGATCATAACGAGGTAATC
Downstream 100 bases:
>100_bases CATGAGATGCGAGACAGGTAATACCAGTTGCTTGACATAATCAGCGACGTGGGAAGGGTAGCAAGAAGGGGGTAGGTGGA TGGAACAGATTACGGTGCTG
Product: GAF sensor signal transduction histidine kinase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 731; Mature: 730
Protein sequence:
>731_residues MTDSAHMAADSTHRTLPLGDLLAFGDALRADDTPESLLAEVVETLRRIVGSPAVYARLRDLDSDTLYAVAFAGVDAPLVE RLRATPIGPAIYQPLLRPEYRQSSSYLIPAAVLPPDVPDTEAIIPASALLTPLRGRGDRLIGVIVLAWPDQPDLVTVRMV EAIARQAALAVENVRLAERSARLLAKEQLLAELGRAVGATLDLDTILHQTIDRLAAAFGSGLVALLDNQETLMVVAAAPP LDTLIGRQLPLLSGSLAWVVQSGQPFVVDDCRLHAPDMALFGPDIASCIIAPLRSGGRVIGMLSVVSRQAGVFSDEDVDL LEAIAAQVSGPVVSARLYAESQRLAAQVQRRADQLAVLNSIARIVTATLDLRESLPLATEQIQRGFGYPQVDLFLLEEEA NELILVASAGRYAPERVGYRQHINLGLVGRAARSGRIVRAEDVAAEADYLGLSERLDIRSELCVPLISNGKTLGVLNIES PERAGLTEEDAAVLETVADMLAGAVENGRLYQRAQQAAALEERNRLARELHDSVSQQLFSMTLTAQAARSQFERNPARVP ALLDRLHETATAALAEMRALIFQLRPPALRDQGLVAAIQQHAQHLAHREGLRIELNVIGDERHARGIEQPLFRIVQEALN NIVKHAAARNVQIMLEFNADQVAIRVIDDGKGFDPAARPSGEGRHLGLLSMRERAAELGGSFNVHSRPGAGAEVEVVVPL RGRHGNLERPE
Sequences:
>Translated_731_residues MTDSAHMAADSTHRTLPLGDLLAFGDALRADDTPESLLAEVVETLRRIVGSPAVYARLRDLDSDTLYAVAFAGVDAPLVE RLRATPIGPAIYQPLLRPEYRQSSSYLIPAAVLPPDVPDTEAIIPASALLTPLRGRGDRLIGVIVLAWPDQPDLVTVRMV EAIARQAALAVENVRLAERSARLLAKEQLLAELGRAVGATLDLDTILHQTIDRLAAAFGSGLVALLDNQETLMVVAAAPP LDTLIGRQLPLLSGSLAWVVQSGQPFVVDDCRLHAPDMALFGPDIASCIIAPLRSGGRVIGMLSVVSRQAGVFSDEDVDL LEAIAAQVSGPVVSARLYAESQRLAAQVQRRADQLAVLNSIARIVTATLDLRESLPLATEQIQRGFGYPQVDLFLLEEEA NELILVASAGRYAPERVGYRQHINLGLVGRAARSGRIVRAEDVAAEADYLGLSERLDIRSELCVPLISNGKTLGVLNIES PERAGLTEEDAAVLETVADMLAGAVENGRLYQRAQQAAALEERNRLARELHDSVSQQLFSMTLTAQAARSQFERNPARVP ALLDRLHETATAALAEMRALIFQLRPPALRDQGLVAAIQQHAQHLAHREGLRIELNVIGDERHARGIEQPLFRIVQEALN NIVKHAAARNVQIMLEFNADQVAIRVIDDGKGFDPAARPSGEGRHLGLLSMRERAAELGGSFNVHSRPGAGAEVEVVVPL RGRHGNLERPE >Mature_730_residues TDSAHMAADSTHRTLPLGDLLAFGDALRADDTPESLLAEVVETLRRIVGSPAVYARLRDLDSDTLYAVAFAGVDAPLVER LRATPIGPAIYQPLLRPEYRQSSSYLIPAAVLPPDVPDTEAIIPASALLTPLRGRGDRLIGVIVLAWPDQPDLVTVRMVE AIARQAALAVENVRLAERSARLLAKEQLLAELGRAVGATLDLDTILHQTIDRLAAAFGSGLVALLDNQETLMVVAAAPPL DTLIGRQLPLLSGSLAWVVQSGQPFVVDDCRLHAPDMALFGPDIASCIIAPLRSGGRVIGMLSVVSRQAGVFSDEDVDLL EAIAAQVSGPVVSARLYAESQRLAAQVQRRADQLAVLNSIARIVTATLDLRESLPLATEQIQRGFGYPQVDLFLLEEEAN ELILVASAGRYAPERVGYRQHINLGLVGRAARSGRIVRAEDVAAEADYLGLSERLDIRSELCVPLISNGKTLGVLNIESP ERAGLTEEDAAVLETVADMLAGAVENGRLYQRAQQAAALEERNRLARELHDSVSQQLFSMTLTAQAARSQFERNPARVPA LLDRLHETATAALAEMRALIFQLRPPALRDQGLVAAIQQHAQHLAHREGLRIELNVIGDERHARGIEQPLFRIVQEALNN IVKHAAARNVQIMLEFNADQVAIRVIDDGKGFDPAARPSGEGRHLGLLSMRERAAELGGSFNVHSRPGAGAEVEVVVPLR GRHGNLERPE
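The translated and mature lengths follow from the gene length. The sketch below is illustrative rather than the database's method. It assumes the 2196-base coding sequence includes its stop codon (the listed sequence ends in TAA) and that the mature form differs from the translated form only by loss of the initiator methionine, which is what the two sequences above show. The helper names are hypothetical.

```python
GENE_LENGTH_BP = 2196  # from the ">2196_bases" header


def translated_length(cds_bp: int) -> int:
    """Residue count for a coding sequence that includes its stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


def mature_sequence(translated: str) -> str:
    """Drop a leading initiator Met, if present (assumed only processing step)."""
    return translated[1:] if translated.startswith("M") else translated


n_translated = translated_length(GENE_LENGTH_BP)
# First 20 residues of the translated sequence, used as a short demonstration.
prefix = "MTDSAHMAADSTHRTLPLGD"
mature_prefix = mature_sequence(prefix)
print(n_translated, n_translated - 1, mature_prefix)
```

The same one-residue difference appears in the molecular weight field: 78984 minus 78853 is 131, close to the mass of a single methionine residue.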
Specific function: Member of the two-component regulatory system yhcY/yhcZ. Probably activates yhcZ by phosphorylation [H]
COG id: COG4585
COG function: function code T; Signal transduction histidine kinase
Gene ontology:
Cell location: Integral Membrane Protein. Inner Membrane [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 histidine kinase domain [H]
Homologues:
Organism=Escherichia coli, GI1787474, Length=247, Percent_Identity=29.5546558704453, Blast_Score=93, Evalue=8e-20
Organism=Escherichia coli, GI1788812, Length=226, Percent_Identity=25.2212389380531, Blast_Score=63, Evalue=6e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR003018
- InterPro: IPR011712 [H]
Pfam domain/function: PF01590 GAF; PF02518 HATPase_c; PF07730 HisKA_3 [H]
EC number: 2.7.13.3 [H]
Molecular weight: Translated: 78984; Mature: 78853
Theoretical pI: Translated: 5.14; Mature: 5.14
Prosite motif: PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
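The Cys/Met percentages are residue counts divided by sequence length. A minimal Python sketch of that calculation follows. The function name is hypothetical, and the input is only the first 20 residues of the translated sequence, so the printed numbers are for that fragment and not for the whole protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = protein.replace(" ", "").upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)


# Demonstration input: first 20 residues of the translated protein.
fragment = "MTDSAHMAADSTHRTLPLGD"
mature_fragment = fragment[1:]  # assumes the mature form lacks the initiator Met

for label, seq in (("Translated", fragment), ("Mature", mature_fragment)):
    cys = residue_percent(seq, "C")
    met = residue_percent(seq, "M")
    print(f"{label}: {cys:.1f} %Cys, {met:.1f} %Met, {cys + met:.1f} %Cys+Met")
```

Applied to the full 731- and 730-residue sequences, the same function gives values that can be checked against the percentages listed above. Removing the initiator methionine lowers the Met fraction slightly, which is consistent with the 1.5 versus 1.4 difference between the translated and mature entries.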
Secondary structure:
>Translated Secondary Structure MTDSAHMAADSTHRTLPLGDLLAFGDALRADDTPESLLAEVVETLRRIVGSPAVYARLRD CCCCCCCCCCCCCCCCCHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHH LDSDTLYAVAFAGVDAPLVERLRATPIGPAIYQPLLRPEYRQSSSYLIPAAVLPPDVPDT CCCCCEEEEEECCCCHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCEEEEECCCCCCCCCC EAIIPASALLTPLRGRGDRLIGVIVLAWPDQPDLVTVRMVEAIARQAALAVENVRLAERS CCCCCHHHHHCCCCCCCCEEEEEEEEECCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHH ARLLAKEQLLAELGRAVGATLDLDTILHQTIDRLAAAFGSGLVALLDNQETLMVVAAAPP HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCEEEEEEECCC LDTLIGRQLPLLSGSLAWVVQSGQPFVVDDCRLHAPDMALFGPDIASCIIAPLRSGGRVI HHHHHCCCCCCCCCCEEEEEECCCCEEEECCEECCCCCHHCCCHHHHHHHHHHCCCCCEE GMLSVVSRQAGVFSDEDVDLLEAIAAQVSGPVVSARLYAESQRLAAQVQRRADQLAVLNS HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHH IARIVTATLDLRESLPLATEQIQRGFGYPQVDLFLLEEEANELILVASAGRYAPERVGYR HHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEEEECCCCCEEEEECCCCCCHHHCCHH QHINLGLVGRAARSGRIVRAEDVAAEADYLGLSERLDIRSELCVPLISNGKTLGVLNIES HCCCEEEEECHHCCCCEEEHHHHHHHHHHCCCHHHHCCHHHHCHHHHCCCCEEEEEECCC PERAGLTEEDAAVLETVADMLAGAVENGRLYQRAQQAAALEERNRLARELHDSVSQQLFS CCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH MTLTAQAARSQFERNPARVPALLDRLHETATAALAEMRALIFQLRPPALRDQGLVAAIQQ HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH HAQHLAHREGLRIELNVIGDERHARGIEQPLFRIVQEALNNIVKHAAARNVQIMLEFNAD HHHHHHHHCCCEEEEEEECCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC QVAIRVIDDGKGFDPAARPSGEGRHLGLLSMRERAAELGGSFNVHSRPGAGAEVEVVVPL EEEEEEEECCCCCCCCCCCCCCCCEECHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEE RGRHGNLERPE CCCCCCCCCCC >Mature Secondary Structure TDSAHMAADSTHRTLPLGDLLAFGDALRADDTPESLLAEVVETLRRIVGSPAVYARLRD CCCCCCCCCCCCCCCCHHHHHHHCCHHCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHH LDSDTLYAVAFAGVDAPLVERLRATPIGPAIYQPLLRPEYRQSSSYLIPAAVLPPDVPDT CCCCCEEEEEECCCCHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCEEEEECCCCCCCCCC EAIIPASALLTPLRGRGDRLIGVIVLAWPDQPDLVTVRMVEAIARQAALAVENVRLAERS CCCCCHHHHHCCCCCCCCEEEEEEEEECCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHH ARLLAKEQLLAELGRAVGATLDLDTILHQTIDRLAAAFGSGLVALLDNQETLMVVAAAPP 
HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCEEEEEEECCC LDTLIGRQLPLLSGSLAWVVQSGQPFVVDDCRLHAPDMALFGPDIASCIIAPLRSGGRVI HHHHHCCCCCCCCCCEEEEEECCCCEEEECCEECCCCCHHCCCHHHHHHHHHHCCCCCEE GMLSVVSRQAGVFSDEDVDLLEAIAAQVSGPVVSARLYAESQRLAAQVQRRADQLAVLNS HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHH IARIVTATLDLRESLPLATEQIQRGFGYPQVDLFLLEEEANELILVASAGRYAPERVGYR HHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEEEECCCCCEEEEECCCCCCHHHCCHH QHINLGLVGRAARSGRIVRAEDVAAEADYLGLSERLDIRSELCVPLISNGKTLGVLNIES HCCCEEEEECHHCCCCEEEHHHHHHHHHHCCCHHHHCCHHHHCHHHHCCCCEEEEEECCC PERAGLTEEDAAVLETVADMLAGAVENGRLYQRAQQAAALEERNRLARELHDSVSQQLFS CCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH MTLTAQAARSQFERNPARVPALLDRLHETATAALAEMRALIFQLRPPALRDQGLVAAIQQ HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH HAQHLAHREGLRIELNVIGDERHARGIEQPLFRIVQEALNNIVKHAAARNVQIMLEFNAD HHHHHHHHCCCEEEEEEECCHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC QVAIRVIDDGKGFDPAARPSGEGRHLGLLSMRERAAELGGSFNVHSRPGAGAEVEVVVPL EEEEEEEECCCCCCCCCCCCCCCCEECHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEE RGRHGNLERPE CCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9579061; 9384377 [H]