Definition Corynebacterium glutamicum R chromosome, complete genome.
Accession NC_009342
Length 3,314,179

The map label for this gene is yhgF [C]

Identifier: 145296007

GI number: 145296007

Start: 2118388

End: 2120664

Strand: Reverse

Name: yhgF [C]

Synonym: cgR_1931

Alternate gene names: 145296007

Gene position: 2120664-2118388 (Counterclockwise)
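
The coordinates above can be checked against the sequence and protein lengths reported elsewhere in this record. The following is a minimal sketch, assuming the Start and End values are 1-based and inclusive (the record does not state its coordinate convention); the function name `gene_length` is hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


# Start and End values taken from this record.
length = gene_length(2118388, 2120664)

# A protein-coding gene has one codon per residue plus a stop codon.
codons = length // 3
residues = codons - 1

print(length, codons, residues)  # 2277 759 758
```

Under that assumption the result agrees with the 2277-base gene sequence and the 758-residue translated protein listed in this record.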

Preceding gene: 145296008

Following gene: 145296001

Centisome position: 63.99
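
The centisome value appears to be the position of the gene's first base expressed as a percentage of the chromosome length. This reading is inferred from the numbers in this record, not stated by it. A short sketch under that assumption, with the hypothetical helper `centisome`:

```python
GENOME_LENGTH = 3_314_179  # "Length" field of this record


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


# On the reverse strand the gene begins at the higher coordinate (2120664).
print(round(centisome(2120664), 2))  # 63.99
```

Using the lower coordinate (2118388) instead gives 63.92, which suggests the reported figure is taken from the strand-aware start of the gene.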

GC content: 57.27
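
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows one way such a figure could be recomputed from the gene sequence in this record; the function name `gc_content` is hypothetical, and the record does not say how its own value was derived (for example, whether ambiguous bases were counted).

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    seq = "".join(sequence.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input, not the gene from this record.
print(gc_content("ATGC"))  # 50.0
```

To compare against the reported value, pass the sequence lines under ">2277_bases" joined into one string.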

Gene sequence:

>2277_bases
ATGTCATCAATTTCTCGAACGATTGCTCTCGAACTCGGCGTCAAAGATGAACAAGTCGAGGCCGCCATCAAGCTCTTGGA
TGAAGGAAACACCGTTCCGTTCATCGCCAGGTACCGCAAGGAAATCACTGGGGGACTCGATGATACCCAACTGCGTGACC
TGGAAGAACGCCTCAGTTACCTCCGTGAGCTGGAGGATCGCAAACAAAGCATCCTCGCCGCGATTGAGGAACAAGGCAAA
CTCACCGATGATTTACGCTCGCTGATTTTGGGCTGCGACACAAAGGCTCGCCTGGAGGATCTGTACCTGCCGTTCAAAAA
ACGGCGCAAGACGAAGGCTGATATCGCTAGGGAGGCGGGCCTGGAGGGGCTCGTCGATAAGCTTATCGACGCCCCGTCCC
TCGACGCCGCAGCGCAGGCAGCTGCATTTACGACCGAAGGCTTTGAGGATTCCAAAAAAGTTTTGGATGGCGCTCGCGCC
ATTTTGATTGACCGCTTCGCGCTTGACGCCGATCTGGTGGGCGAGGTGCGTGAGCAAATGTATCGCGCGGGTTCCATGGC
AGCATCCGTGGTGGCGGGCAAGGAGCAGGAAGGCGCAAAGTTCAAGGACTACTTTGAGTTTTCCGAACCTTTTGACAAGC
TTCCATCTCACCGAATTTTGGCGCTGCTGCGCGGTGAAAACGAAGGTGTGCTGAGCCTCAACCTCGATGCGGGCGACGAC
ACAATCTACGAAGGTTTGATCGCCGACCGATTCTCCCTGGACATCCACACTTCTAGCTGGCTGGCTGAGGCTGTGCGCTG
GGGTTGGCGCACCAAACTGTATGTGTCCTCCGGATTGGATGTGCGCATGCGTCTGAAAGAAAAGGCAGAGGAAGGCGCAC
TCGATGTGTTTGCCACCAACCTCCGCGACGTTCTCCTTGCAGCTCCCGCTGGTCAGCGCTCCACAATTGGCCTTGACCCG
GGATTCCGCAACGGTGTGAAAGTAGCTGTCGTGGATTCCACCGGTAAGGATGTTGCCACCACGATCGTCTACCCACACCA
GCCCCAAAACCGCTGGAAGGAAGCCGTATCCGAACTGGCTAATCTGTGCGCGACTCACGGTGTGGAACTCATGGCGATTG
GCAACGGAACCGCCTCGAGGGAAACGGAAAAACTCGCCGGCGAAGTGGCAGACATGATCAAAGCCGCAGGTGGCAAGCGA
CCAACTCCCGTGGTGGTCTCCGAATCGGGCGCATCTGTGTACTCGGCATCACCGATCGCAGCCGAAGAATTCCCCGACAT
GGACGTCTCCCTGCGCGGTGCAGTTTCTATCGCGAGGCGACTCCAGGATCCACTGGCGGAGCTCGTCAAGATTGAGCCCA
AAGCCATCGGAGTCGGCCAGTACCAACACGATGTCAACCAGGTTGCACTGTCCAAAACCCTAGATGGCGTTGTCGAAGAC
GCAGTGAATGCAGTCGGAGTTAACCTCAACACCGCATCCGCACCACTTCTTACCCGAGTTGCCGGAGTGACCTCCACCTT
GGCAAACAACATCGTGGCCTACCGCAACGAAAACGGTGGATTCTCCTCCCGAAAAGAACTGAACAAAGTTCCTCGCCTGG
GACCCAAAGCCTTTGAACAGTGTGCTGGCTTCCTCCGCATTTCTGGATCCACCGACCCTCTCGACGCCTCCGCTGTTCAC
CCCGAGGCCTACCCAGTTGTCCGCAACATCGCGAAGGCTACCGGATTGGATGTCGCAGGATTGATCGGAAACACTGCGGT
GCTTGCCAAATTGAAGCCCGCTGATTTCGCTGATGAACGATTCGGCATCCCCACCGTCACCGACATCATCGCCGAGCTGG
ATAAACCCGGACGCGACCCCCGCCCAGAATTCAAAACCGCCAGCTTCAAAGAAGGCGTGGAGAAAATCTCCGACCTCACA
CCCGGCATGATCCTGGAAGGCACTGTCACCAACGTTGCGGCGTTCGGCGCATTCGTTGACGTGGGAGTGCACCAAGATGG
CCTCGTTCACGTTTCCGCGATGAGCGACAAATTCATCTCCAACCCCCACGAAGTTGTTCGCTCTGGTGAGGTCGTGAAGG
TAAAGGTCATGGAAGTTGACGTCGACCGCAAACGCATCGGCCTTTCCCTCCGCTTGACCGATGAACCCGGTGCCCCAGCT
CCGCAAAAGCGCGGAAACCGACCAGCCAAACAGCAGCGAGCTCCGCAAAAGCAGTCCGCTAAGCCCGCCACAGGTTCCAT
GGCAGATGCTTTGCGACGCGCAGGCCTCGGTGGCTAA

Upstream 100 bases:

>100_bases
GATCTGATTGAAAATCCTGAACTTCCCACCGGCGATGTGGTTTTGCAGGGGCAGGTGATCCTTCGGGGGTCGAGCACACA
TTCCGGGTAGAATTGCCCAA

Downstream 100 bases:

>100_bases
GGCAACTTTCAAACCAAGCGGGAGTGTTCTCAAGCTCCCCTTGGTCTGAATCTCACATTTCGAGAGCGAATTAATGCAGT
ATTAGGCCAAGAACTAATTC

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 758; Mature: 757

Protein sequence:

>758_residues
MSSISRTIALELGVKDEQVEAAIKLLDEGNTVPFIARYRKEITGGLDDTQLRDLEERLSYLRELEDRKQSILAAIEEQGK
LTDDLRSLILGCDTKARLEDLYLPFKKRRKTKADIAREAGLEGLVDKLIDAPSLDAAAQAAAFTTEGFEDSKKVLDGARA
ILIDRFALDADLVGEVREQMYRAGSMAASVVAGKEQEGAKFKDYFEFSEPFDKLPSHRILALLRGENEGVLSLNLDAGDD
TIYEGLIADRFSLDIHTSSWLAEAVRWGWRTKLYVSSGLDVRMRLKEKAEEGALDVFATNLRDVLLAAPAGQRSTIGLDP
GFRNGVKVAVVDSTGKDVATTIVYPHQPQNRWKEAVSELANLCATHGVELMAIGNGTASRETEKLAGEVADMIKAAGGKR
PTPVVVSESGASVYSASPIAAEEFPDMDVSLRGAVSIARRLQDPLAELVKIEPKAIGVGQYQHDVNQVALSKTLDGVVED
AVNAVGVNLNTASAPLLTRVAGVTSTLANNIVAYRNENGGFSSRKELNKVPRLGPKAFEQCAGFLRISGSTDPLDASAVH
PEAYPVVRNIAKATGLDVAGLIGNTAVLAKLKPADFADERFGIPTVTDIIAELDKPGRDPRPEFKTASFKEGVEKISDLT
PGMILEGTVTNVAAFGAFVDVGVHQDGLVHVSAMSDKFISNPHEVVRSGEVVKVKVMEVDVDRKRIGLSLRLTDEPGAPA
PQKRGNRPAKQQRAPQKQSAKPATGSMADALRRAGLGG

Sequences:

>Translated_758_residues
MSSISRTIALELGVKDEQVEAAIKLLDEGNTVPFIARYRKEITGGLDDTQLRDLEERLSYLRELEDRKQSILAAIEEQGK
LTDDLRSLILGCDTKARLEDLYLPFKKRRKTKADIAREAGLEGLVDKLIDAPSLDAAAQAAAFTTEGFEDSKKVLDGARA
ILIDRFALDADLVGEVREQMYRAGSMAASVVAGKEQEGAKFKDYFEFSEPFDKLPSHRILALLRGENEGVLSLNLDAGDD
TIYEGLIADRFSLDIHTSSWLAEAVRWGWRTKLYVSSGLDVRMRLKEKAEEGALDVFATNLRDVLLAAPAGQRSTIGLDP
GFRNGVKVAVVDSTGKDVATTIVYPHQPQNRWKEAVSELANLCATHGVELMAIGNGTASRETEKLAGEVADMIKAAGGKR
PTPVVVSESGASVYSASPIAAEEFPDMDVSLRGAVSIARRLQDPLAELVKIEPKAIGVGQYQHDVNQVALSKTLDGVVED
AVNAVGVNLNTASAPLLTRVAGVTSTLANNIVAYRNENGGFSSRKELNKVPRLGPKAFEQCAGFLRISGSTDPLDASAVH
PEAYPVVRNIAKATGLDVAGLIGNTAVLAKLKPADFADERFGIPTVTDIIAELDKPGRDPRPEFKTASFKEGVEKISDLT
PGMILEGTVTNVAAFGAFVDVGVHQDGLVHVSAMSDKFISNPHEVVRSGEVVKVKVMEVDVDRKRIGLSLRLTDEPGAPA
PQKRGNRPAKQQRAPQKQSAKPATGSMADALRRAGLGG
>Mature_757_residues
SSISRTIALELGVKDEQVEAAIKLLDEGNTVPFIARYRKEITGGLDDTQLRDLEERLSYLRELEDRKQSILAAIEEQGKL
TDDLRSLILGCDTKARLEDLYLPFKKRRKTKADIAREAGLEGLVDKLIDAPSLDAAAQAAAFTTEGFEDSKKVLDGARAI
LIDRFALDADLVGEVREQMYRAGSMAASVVAGKEQEGAKFKDYFEFSEPFDKLPSHRILALLRGENEGVLSLNLDAGDDT
IYEGLIADRFSLDIHTSSWLAEAVRWGWRTKLYVSSGLDVRMRLKEKAEEGALDVFATNLRDVLLAAPAGQRSTIGLDPG
FRNGVKVAVVDSTGKDVATTIVYPHQPQNRWKEAVSELANLCATHGVELMAIGNGTASRETEKLAGEVADMIKAAGGKRP
TPVVVSESGASVYSASPIAAEEFPDMDVSLRGAVSIARRLQDPLAELVKIEPKAIGVGQYQHDVNQVALSKTLDGVVEDA
VNAVGVNLNTASAPLLTRVAGVTSTLANNIVAYRNENGGFSSRKELNKVPRLGPKAFEQCAGFLRISGSTDPLDASAVHP
EAYPVVRNIAKATGLDVAGLIGNTAVLAKLKPADFADERFGIPTVTDIIAELDKPGRDPRPEFKTASFKEGVEKISDLTP
GMILEGTVTNVAAFGAFVDVGVHQDGLVHVSAMSDKFISNPHEVVRSGEVVKVKVMEVDVDRKRIGLSLRLTDEPGAPAP
QKRGNRPAKQQRAPQKQSAKPATGSMADALRRAGLGG

Specific function: Unknown

COG id: COG2183

COG function: function code K; Transcriptional accessory protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 S1 motif domain [H]

Homologues:

Organism=Homo sapiens, GI221136781, Length=773, Percent_Identity=33.6351875808538, Blast_Score=379, Evalue=1e-105,
Organism=Homo sapiens, GI27597090, Length=772, Percent_Identity=22.7979274611399, Blast_Score=108, Evalue=2e-23,
Organism=Escherichia coli, GI87082262, Length=763, Percent_Identity=58.0602883355177, Blast_Score=867, Evalue=0.0,
Organism=Escherichia coli, GI1787140, Length=89, Percent_Identity=38.2022471910112, Blast_Score=64, Evalue=3e-11,
Organism=Caenorhabditis elegans, GI17511129, Length=708, Percent_Identity=27.9661016949153, Blast_Score=223, Evalue=4e-58,
Organism=Caenorhabditis elegans, GI17552892, Length=292, Percent_Identity=30.8219178082192, Blast_Score=89, Evalue=1e-17,
Organism=Drosophila melanogaster, GI62484314, Length=757, Percent_Identity=32.1003963011889, Blast_Score=375, Evalue=1e-104,
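
Each homologue line above is a comma-separated list of `key=value` fields, except for the GI number, which is written as a bare `GI…` token. A minimal parsing sketch, assuming that layout holds for every line; the function name `parse_homologue` and the `"GI"` dictionary key are hypothetical.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dictionary of string fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # The GI number has no '=' sign, so store it under an explicit key.
            fields["GI"] = part[2:]
    return fields


line = ("Organism=Escherichia coli, GI87082262, Length=763, "
        "Percent_Identity=58.0602883355177, Blast_Score=867, Evalue=0.0,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], float(hit["Percent_Identity"]))
```

Values are kept as strings so that callers can decide how to convert them, for instance `float(hit["Evalue"])` for sorting hits by significance.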

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003583
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR003029
- InterPro:   IPR005227
- InterPro:   IPR006641
- InterPro:   IPR022967
- InterPro:   IPR018974
- InterPro:   IPR023097 [H]

Pfam domain/function: PF00575 S1; PF09371 Tex_N [H]

EC number: NA

Molecular weight: Translated: 81562; Mature: 81430

Theoretical pI: Translated: 5.37; Mature: 5.37

Prosite motif: PS50126 S1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
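
The percentages above are residue counts divided by sequence length. In this record the mature sequence is the translated sequence without its initial methionine, which is why the Met figure drops for the mature protein. The sketch below illustrates the calculation on a toy peptide; the names `cys_met_percent` and `mature_sequence` are hypothetical, and rounding to one decimal is an assumption about how the record's values were produced.

```python
def mature_sequence(translated: str) -> str:
    """Drop the initiator methionine, if present."""
    return translated[1:] if translated.startswith("M") else translated


def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a protein sequence (one decimal)."""
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


toy = "MACM"  # toy peptide, not the protein from this record
print(cys_met_percent(toy))                   # {'Cys': 25.0, 'Met': 50.0, 'Cys+Met': 75.0}
print(cys_met_percent(mature_sequence(toy)))  # {'Cys': 33.3, 'Met': 33.3, 'Cys+Met': 66.7}
```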

Secondary structure:

>Translated Secondary Structure
MSSISRTIALELGVKDEQVEAAIKLLDEGNTVPFIARYRKEITGGLDDTQLRDLEERLSY
CCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHH
LRELEDRKQSILAAIEEQGKLTDDLRSLILGCDTKARLEDLYLPFKKRRKTKADIAREAG
HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHHHHHHHHC
LEGLVDKLIDAPSLDAAAQAAAFTTEGFEDSKKVLDGARAILIDRFALDADLVGEVREQM
HHHHHHHHHCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
YRAGSMAASVVAGKEQEGAKFKDYFEFSEPFDKLPSHRILALLRGENEGVLSLNLDAGDD
HHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHCCCCEEEEEEECCCCCEEEEECCCCCH
TIYEGLIADRFSLDIHTSSWLAEAVRWGWRTKLYVSSGLDVRMRLKEKAEEGALDVFATN
HHHHHHHHHHEEEEEECHHHHHHHHHCCCCEEEEEECCCCEEHHHHHHHHCCCHHHHHHH
LRDVLLAAPAGQRSTIGLDPGFRNGVKVAVVDSTGKDVATTIVYPHQPQNRWKEAVSELA
HHHHHEECCCCCCCEECCCCCCCCCEEEEEEECCCCCEEEEEEECCCCHHHHHHHHHHHH
NLCATHGVELMAIGNGTASRETEKLAGEVADMIKAAGGKRPTPVVVSESGASVYSASPIA
HHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCCCC
AEEFPDMDVSLRGAVSIARRLQDPLAELVKIEPKAIGVGQYQHDVNQVALSKTLDGVVED
HHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
AVNAVGVNLNTASAPLLTRVAGVTSTLANNIVAYRNENGGFSSRKELNKVPRLGPKAFEQ
HHHHHCCCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHHHCCCCCHHHHHH
CAGFLRISGSTDPLDASAVHPEAYPVVRNIAKATGLDVAGLIGNTAVLAKLKPADFADER
HCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHCCCEEEEEECCCCCCCCC
FGIPTVTDIIAELDKPGRDPRPEFKTASFKEGVEKISDLTPGMILEGTVTNVAAFGAFVD
CCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHC
VGVHQDGLVHVSAMSDKFISNPHEVVRSGEVVKVKVMEVDVDRKRIGLSLRLTDEPGAPA
CCCCCCCCEEEEEHHHHHHCCHHHHHHCCCEEEEEEEEEECCHHHCCEEEEEECCCCCCC
PQKRGNRPAKQQRAPQKQSAKPATGSMADALRRAGLGG
CHHCCCCCHHHHCCCCHHCCCCCCCHHHHHHHHCCCCC
>Mature Secondary Structure
SSISRTIALELGVKDEQVEAAIKLLDEGNTVPFIARYRKEITGGLDDTQLRDLEERLSY
CCCCCEEEEEECCCHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHH
LRELEDRKQSILAAIEEQGKLTDDLRSLILGCDTKARLEDLYLPFKKRRKTKADIAREAG
HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHHHHHHHHC
LEGLVDKLIDAPSLDAAAQAAAFTTEGFEDSKKVLDGARAILIDRFALDADLVGEVREQM
HHHHHHHHHCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
YRAGSMAASVVAGKEQEGAKFKDYFEFSEPFDKLPSHRILALLRGENEGVLSLNLDAGDD
HHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHCCCCEEEEEEECCCCCEEEEECCCCCH
TIYEGLIADRFSLDIHTSSWLAEAVRWGWRTKLYVSSGLDVRMRLKEKAEEGALDVFATN
HHHHHHHHHHEEEEEECHHHHHHHHHCCCCEEEEEECCCCEEHHHHHHHHCCCHHHHHHH
LRDVLLAAPAGQRSTIGLDPGFRNGVKVAVVDSTGKDVATTIVYPHQPQNRWKEAVSELA
HHHHHEECCCCCCCEECCCCCCCCCEEEEEEECCCCCEEEEEEECCCCHHHHHHHHHHHH
NLCATHGVELMAIGNGTASRETEKLAGEVADMIKAAGGKRPTPVVVSESGASVYSASPIA
HHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCEECCCCCC
AEEFPDMDVSLRGAVSIARRLQDPLAELVKIEPKAIGVGQYQHDVNQVALSKTLDGVVED
HHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
AVNAVGVNLNTASAPLLTRVAGVTSTLANNIVAYRNENGGFSSRKELNKVPRLGPKAFEQ
HHHHHCCCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHHHCCCCCHHHHHH
CAGFLRISGSTDPLDASAVHPEAYPVVRNIAKATGLDVAGLIGNTAVLAKLKPADFADER
HCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHCCCEEEEEECCCCCCCCC
FGIPTVTDIIAELDKPGRDPRPEFKTASFKEGVEKISDLTPGMILEGTVTNVAAFGAFVD
CCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHC
VGVHQDGLVHVSAMSDKFISNPHEVVRSGEVVKVKVMEVDVDRKRIGLSLRLTDEPGAPA
CCCCCCCCEEEEEHHHHHHCCHHHHHHCCCEEEEEEEEEECCHHHCCEEEEEECCCCCCC
PQKRGNRPAKQQRAPQKQSAKPATGSMADALRRAGLGG
CHHCCCCCHHHHCCCCHHCCCCCCCHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10761919 [H]