Definition Psychrobacter cryohalolentis K5 chromosome, complete genome.
Accession NC_007969
Length 3,059,876

The map label for this gene is yhgF [C]

Identifier: 93004967

GI number: 93004967

Start: 158738

End: 161260

Strand: Direct

Name: yhgF [C]

Synonym: Pcryo_0136

Alternate gene names: 93004967

Gene position: 158738-161260 (Clockwise)

Preceding gene: 93004964

Following gene: 93004970

Centisome position: 5.19
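
The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length. That definition is an interpretation that reproduces the reported 5.19; the record does not state it. The function names are illustrative only.

```python
GENOME_LENGTH = 3_059_876        # "Length" field of NC_007969
START, END = 158_738, 161_260    # "Start" and "End" fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100 * start / genome_length, 2)


length = gene_length(START, END)
codons = length // 3     # codon count, including the terminal stop codon
residues = codons - 1    # translated protein length without the stop

print(length, codons, residues, centisome(START, GENOME_LENGTH))
```

Under these assumptions the gene spans 2523 bases, which matches the `>2523_bases` header, and 841 codons minus the stop codon gives the 840 translated residues reported further down.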

GC content: 45.7

Gene sequence:

>2523_bases
ATGAGTAGCACTGATACCTTAACGCCATCTCAAACCTCGACAAATGCACAGGTTTTGGATGCAACCACACACGCGAATAT
TCACGACAAGCTTGCTCGCGCGCTTGGCATTAAGACTGCTCAAGTCAATGCGTTTGTCAAACTGTACGATGAAGGCGCGA
CCGTTCCGTTTATTGCCCGTTACCGCAAAGAAAAGACGCAGAATCTCGACGATGCGCAATTGCGCGCACTAGAGAAGTCT
CTCAATTATGAACGTGATATGGCAACCCGTCGCCTTAAAATTATTGAATTACTGAGCACACAAGGCAATTTGACTGACGA
GCTGCAGACACGTATTGATAATGCAACCTCAAAACTTGAGCTTGAAGACATCTACTTGCCATATCGTCCGCGCCGCCGTT
CACCAGCTGCCAAAGCACGTGCTGCTGGTCTTGATATTGCTGCACAAGCAGTGTTGACGCAAAACGTGACGCCAACCGAT
GCATTAGCAGACTATCAAGTACAATCAAGCATTACTGATGAAAGCGGTAATGAGATTGAAGTTGATTTTAGTGATATCGA
GAAGCAACTTGCTGGTGTACAAGCCATTATCGTAGATGAATGGACGCAAGCATTAGACTTGTTAGATCATTTACGTAGCA
GTTTTGCCAAAACAGCCAGTATCGTATCGAGCGTTGCCAGTGAAGAAAAACGTGAAGTCGGTGAAAAATTCAAAGATTAC
TTTGAGCATAGCGAAAGCCTTGCACGTTTGCCAAATCATCGCTTATTAGCGATGCTACGTGGTCGTCAAGAAAACGTCTT
GGGTCTAAAAATTGAAGGCGAAAATGCGCCTTTTATTGAAAAAATTATTAAGCATTTTGAGATTGATGCTAAAGCGCCAA
CTGAGCGTCAAGAATTTTTGACAGAAGCGGCGAGCAGCTTATGGAAAGACAAATGGCGTCCGCATATCGAGCATCGTTTA
TTGACTGAAAAGCGTCTGACTGCAGAAGCGGCTGCGATTGATGTTTTTGCCAATAATTTACAGCATTTACTGATGTCAGC
ACCTGCTGGTCGCAAAGTTATTTTGGGTGTTGATCCCGGTATCCGTCATGGCGTTAAGATGGCGATTGTTGATGCTCAAG
GTCATGTGATGTTAGATGGCGAAGACAAGCCTGTTATCGCAACGGTTTATCCATTTGCGCCTGATAATAAAATGACTGAA
GCCAAAGTGGTTATCGATGAGTTATTAAGTACTTATAATGTGGATCTAGTGGCTATCGGTAATGGTACGGCAAGCCGCGA
AACAGACGCAATGATTAAAGAGATTTTGGCGGCGAACGAGTCATTAAAAGCTAAAGCAGTCATCGTGAATGAATCTGGCG
CGTCAGTCTATTCTGCCAGTGAGCTTGCTACTGATGAGCTTGGTAATTTAGATGTCTCTGTACGTGGGGCAGTATCTATT
GCTCGTCGTCTACAAGACCCATTGTCAGAGCTGGTCAAGGTAGATCCAAAAGCCATTGGTGTTGGTCAATATCAACATGA
TGTCAATCAAGCTCAGTTGGCAGATAGTCTTGATAAAGTCACACAAGACAGCGTTAACGCCGTTGGTGTTGATGTGAATA
CGGCAAGCCCCGCTATCTTGGCACACATTGCTGGCTTAAATAAGAATGTTGCGCAGCAAATCGTTACCTATCGCAAAGAG
CATGGCGCTTTTGATAGCCGCGAGTCATTGAAAAATGTGCCGCGCTTAGGTGCTAAGACCTTTGAGCAAGCCGCAGGCTT
CTTACGCATACATGATGGTAGCAATCCACTTGATGCAACTGGTGTACATCCAGAAAGCTATGCGCTGGTCGACAGCTTAC
TTGCCCAAACAGGTAAAGCCTTGCCAGAAGTTATCGGTAACGACGGTGTGTTAAACAGTATCGATACCACAGCGCTAGCT
GCTAATGATGAAAATGTTAGCGTAAAAGCGATTTTAGATGAGCTTGCTAAGCCCGCTCGTGACCCACGTCCTGAGTTTAA
GACGGCTAACTTCCGTGAAGATGTGAACAGCATTAAAGATTTGAGCGAAGGCATGACACTAGAAGGGGTTGTTACCAACG
TGACTGCCTTTGGTTGCTTCATCGATGTTGGCGTGCATCAAGATGGTCTGGTTCATATCTCGCAAATGGCCAATGACTTT
GTCGCAGACCCGATGAACCGCGTCAAGCCCGGTGATATCGTCTCGGTCCGTGTCATCTCTATTGACGAAAAACGCGGTCG
TATTGGTTTTAGTATGAAACCTGAAGCTGAAAAACCTGCACGTCCAGCAGCGAAACCTGCAACGACAAATGTTACTAATA
ATGATGAGAATACCAGTCGTCCGCGCAGCAATCGTCCAGCTGGCGATAAGCGTCCGTCTAAGCCTAAGCGTCAACCTAGT
AGCAGCAATAATAGCGACAGTCGCAGTAAGGCGCCTCGTGCAGAAAAAGCAGAATCGCCTAGCAAAATGGGTACTTTCGG
CGCGCTATTGCAAGAAGCAGGTGTGACTAAAGTTAAGAAATAA

Upstream 100 bases:

>100_bases
CGATAAATAACACGATAAATAACGAAAAGATAAAAAGGATTTCTATTCATGCAGCTTTTGCTATAATGCAAAACTACATT
TCTCAACGATTGATACGTCT

Downstream 100 bases:

>100_bases
GACGTCTTATTCTAAATAGAAACAAAAAAGGCAGCCTAATGGCTGCCTTTTTTATTAAATCAAATATATCAATGGTGCTT
TTTAATCACTATATTTATTT

Product: RNA binding S1

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 840; Mature: 839

Protein sequence:

>840_residues
MSSTDTLTPSQTSTNAQVLDATTHANIHDKLARALGIKTAQVNAFVKLYDEGATVPFIARYRKEKTQNLDDAQLRALEKS
LNYERDMATRRLKIIELLSTQGNLTDELQTRIDNATSKLELEDIYLPYRPRRRSPAAKARAAGLDIAAQAVLTQNVTPTD
ALADYQVQSSITDESGNEIEVDFSDIEKQLAGVQAIIVDEWTQALDLLDHLRSSFAKTASIVSSVASEEKREVGEKFKDY
FEHSESLARLPNHRLLAMLRGRQENVLGLKIEGENAPFIEKIIKHFEIDAKAPTERQEFLTEAASSLWKDKWRPHIEHRL
LTEKRLTAEAAAIDVFANNLQHLLMSAPAGRKVILGVDPGIRHGVKMAIVDAQGHVMLDGEDKPVIATVYPFAPDNKMTE
AKVVIDELLSTYNVDLVAIGNGTASRETDAMIKEILAANESLKAKAVIVNESGASVYSASELATDELGNLDVSVRGAVSI
ARRLQDPLSELVKVDPKAIGVGQYQHDVNQAQLADSLDKVTQDSVNAVGVDVNTASPAILAHIAGLNKNVAQQIVTYRKE
HGAFDSRESLKNVPRLGAKTFEQAAGFLRIHDGSNPLDATGVHPESYALVDSLLAQTGKALPEVIGNDGVLNSIDTTALA
ANDENVSVKAILDELAKPARDPRPEFKTANFREDVNSIKDLSEGMTLEGVVTNVTAFGCFIDVGVHQDGLVHISQMANDF
VADPMNRVKPGDIVSVRVISIDEKRGRIGFSMKPEAEKPARPAAKPATTNVTNNDENTSRPRSNRPAGDKRPSKPKRQPS
SSNNSDSRSKAPRAEKAESPSKMGTFGALLQEAGVTKVKK

Sequences:

>Mature_839_residues
SSTDTLTPSQTSTNAQVLDATTHANIHDKLARALGIKTAQVNAFVKLYDEGATVPFIARYRKEKTQNLDDAQLRALEKSL
NYERDMATRRLKIIELLSTQGNLTDELQTRIDNATSKLELEDIYLPYRPRRRSPAAKARAAGLDIAAQAVLTQNVTPTDA
LADYQVQSSITDESGNEIEVDFSDIEKQLAGVQAIIVDEWTQALDLLDHLRSSFAKTASIVSSVASEEKREVGEKFKDYF
EHSESLARLPNHRLLAMLRGRQENVLGLKIEGENAPFIEKIIKHFEIDAKAPTERQEFLTEAASSLWKDKWRPHIEHRLL
TEKRLTAEAAAIDVFANNLQHLLMSAPAGRKVILGVDPGIRHGVKMAIVDAQGHVMLDGEDKPVIATVYPFAPDNKMTEA
KVVIDELLSTYNVDLVAIGNGTASRETDAMIKEILAANESLKAKAVIVNESGASVYSASELATDELGNLDVSVRGAVSIA
RRLQDPLSELVKVDPKAIGVGQYQHDVNQAQLADSLDKVTQDSVNAVGVDVNTASPAILAHIAGLNKNVAQQIVTYRKEH
GAFDSRESLKNVPRLGAKTFEQAAGFLRIHDGSNPLDATGVHPESYALVDSLLAQTGKALPEVIGNDGVLNSIDTTALAA
NDENVSVKAILDELAKPARDPRPEFKTANFREDVNSIKDLSEGMTLEGVVTNVTAFGCFIDVGVHQDGLVHISQMANDFV
ADPMNRVKPGDIVSVRVISIDEKRGRIGFSMKPEAEKPARPAAKPATTNVTNNDENTSRPRSNRPAGDKRPSKPKRQPSS
SNNSDSRSKAPRAEKAESPSKMGTFGALLQEAGVTKVKK

Specific function: Unknown

COG id: COG2183

COG function: function code K; Transcriptional accessory protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 S1 motif domain [H]

Homologues:

Organism=Homo sapiens, GI221136781, Length=783, Percent_Identity=32.0561941251596, Blast_Score=353, Evalue=3e-97,
Organism=Homo sapiens, GI27597090, Length=388, Percent_Identity=25.2577319587629, Blast_Score=89, Evalue=2e-17,
Organism=Escherichia coli, GI87082262, Length=778, Percent_Identity=46.7866323907455, Blast_Score=631, Evalue=0.0,
Organism=Escherichia coli, GI1787140, Length=104, Percent_Identity=36.5384615384615, Blast_Score=70, Evalue=6e-13,
Organism=Caenorhabditis elegans, GI17511129, Length=765, Percent_Identity=28.4967320261438, Blast_Score=247, Evalue=2e-65,
Organism=Caenorhabditis elegans, GI17552892, Length=310, Percent_Identity=25.8064516129032, Blast_Score=66, Evalue=8e-11,
Organism=Drosophila melanogaster, GI62484314, Length=786, Percent_Identity=31.0432569974555, Blast_Score=353, Evalue=2e-97,
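
Each homologue line above is a comma-separated list of `key=value` fields, plus a bare GI identifier and a trailing comma. The following is a minimal Python sketch for turning one such line into a dictionary. The function name `parse_homologue` and the `GI` key are illustrative choices, and the sketch assumes organism names contain no commas, which holds for the entries listed here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:                  # empty piece left by the trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]   # bare GI identifier has no '=' in the source
    # Convert numeric fields; E-values such as '3e-97' and '0.0' both parse as floats.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI87082262, Length=778, "
        "Percent_Identity=46.7866323907455, Blast_Score=631, Evalue=0.0,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Parsed this way, the hits can be sorted by `Blast_Score` or filtered by `Evalue` without further string handling.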

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003583
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR003029
- InterPro:   IPR005227
- InterPro:   IPR006641
- InterPro:   IPR022967
- InterPro:   IPR018974
- InterPro:   IPR023097 [H]

Pfam domain/function: PF00575 S1; PF09371 Tex_N [H]

EC number: NA

Molecular weight: Translated: 91439; Mature: 91308

Theoretical pI: Translated: 5.99; Mature: 5.99

Prosite motif: PS50126 S1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
1.5 %Cys+Met (Mature Protein)
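
The percentages above appear to be residue counts divided by protein length, rounded to one decimal place. The following is a minimal Python sketch under that assumption. The helper name `percent_of` is illustrative, and the stand-in sequences below only mimic the composition tallied by hand from the protein sequence (1 Cys and 13 Met in 840 residues; the mature form lacks the initial Met), with alanine as arbitrary filler.

```python
def percent_of(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    count = sum(sequence.count(r) for r in residues)
    return round(100 * count / len(sequence), 1)


# Stand-in sequences with the same Cys/Met composition as the real protein;
# "A" is filler and not meaningful.
translated = "M" * 13 + "C" + "A" * 826   # 840 residues
mature = "M" * 12 + "C" + "A" * 826       # 839 residues, initial Met removed

for name, seq in (("Translated", translated), ("Mature", mature)):
    print(name, percent_of(seq, "C"), percent_of(seq, "M"), percent_of(seq, "CM"))
```

This also explains why the translated Cys+Met value (1.7) is not the sum of the rounded Cys and Met values (0.1 + 1.5): the combined figure is computed from the raw counts, 14 of 840 residues, before rounding.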

Secondary structure:

>Translated Secondary Structure
MSSTDTLTPSQTSTNAQVLDATTHANIHDKLARALGIKTAQVNAFVKLYDEGATVPFIAR
CCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCHHHHEEEEEEECCCCCCHHHHH
YRKEKTQNLDDAQLRALEKSLNYERDMATRRLKIIELLSTQGNLTDELQTRIDNATSKLE
HHHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCHHCCEE
LEDIYLPYRPRRRSPAAKARAAGLDIAAQAVLTQNVTPTDALADYQVQSSITDESGNEIE
EEEEECCCCCCCCCCHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEE
VDFSDIEKQLAGVQAIIVDEWTQALDLLDHLRSSFAKTASIVSSVASEEKREVGEKFKDY
EEHHHHHHHHHCHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FEHSESLARLPNHRLLAMLRGRQENVLGLKIEGENAPFIEKIIKHFEIDAKAPTERQEFL
HHHHHHHHHCCCHHHHHHHCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCHHHHHH
TEAASSLWKDKWRPHIEHRLLTEKRLTAEAAAIDVFANNLQHLLMSAPAGRKVILGVDPG
HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCC
IRHGVKMAIVDAQGHVMLDGEDKPVIATVYPFAPDNKMTEAKVVIDELLSTYNVDLVAIG
CCCCCEEEEEECCCEEEECCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEEC
NGTASRETDAMIKEILAANESLKAKAVIVNESGASVYSASELATDELGNLDVSVRGAVSI
CCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEHHHHHHHHHCCCCEEHHHHHHH
ARRLQDPLSELVKVDPKAIGVGQYQHDVNQAQLADSLDKVTQDSVNAVGVDVNTASPAIL
HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHH
AHIAGLNKNVAQQIVTYRKEHGAFDSRESLKNVPRLGAKTFEQAAGFLRIHDGSNPLDAT
HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHCCCEEEEECCCCCCCCC
GVHPESYALVDSLLAQTGKALPEVIGNDGVLNSIDTTALAANDENVSVKAILDELAKPAR
CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHCCC
DPRPEFKTANFREDVNSIKDLSEGMTLEGVVTNVTAFGCFIDVGVHQDGLVHISQMANDF
CCCCCCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHH
VADPMNRVKPGDIVSVRVISIDEKRGRIGFSMKPEAEKPARPAAKPATTNVTNNDENTSR
HHCCHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
PRSNRPAGDKRPSKPKRQPSSSNNSDSRSKAPRAEKAESPSKMGTFGALLQEAGVTKVKK
CCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCHHCCCCHHHHHHHHHHHHCCCHHCCC
>Mature Secondary Structure 
SSTDTLTPSQTSTNAQVLDATTHANIHDKLARALGIKTAQVNAFVKLYDEGATVPFIAR
CCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCHHHHEEEEEEECCCCCCHHHHH
YRKEKTQNLDDAQLRALEKSLNYERDMATRRLKIIELLSTQGNLTDELQTRIDNATSKLE
HHHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCHHCCEE
LEDIYLPYRPRRRSPAAKARAAGLDIAAQAVLTQNVTPTDALADYQVQSSITDESGNEIE
EEEEECCCCCCCCCCHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEE
VDFSDIEKQLAGVQAIIVDEWTQALDLLDHLRSSFAKTASIVSSVASEEKREVGEKFKDY
EEHHHHHHHHHCHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FEHSESLARLPNHRLLAMLRGRQENVLGLKIEGENAPFIEKIIKHFEIDAKAPTERQEFL
HHHHHHHHHCCCHHHHHHHCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCHHHHHH
TEAASSLWKDKWRPHIEHRLLTEKRLTAEAAAIDVFANNLQHLLMSAPAGRKVILGVDPG
HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCC
IRHGVKMAIVDAQGHVMLDGEDKPVIATVYPFAPDNKMTEAKVVIDELLSTYNVDLVAIG
CCCCCEEEEEECCCEEEECCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEEC
NGTASRETDAMIKEILAANESLKAKAVIVNESGASVYSASELATDELGNLDVSVRGAVSI
CCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEHHHHHHHHHCCCCEEHHHHHHH
ARRLQDPLSELVKVDPKAIGVGQYQHDVNQAQLADSLDKVTQDSVNAVGVDVNTASPAIL
HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHH
AHIAGLNKNVAQQIVTYRKEHGAFDSRESLKNVPRLGAKTFEQAAGFLRIHDGSNPLDAT
HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHCCCEEEEECCCCCCCCC
GVHPESYALVDSLLAQTGKALPEVIGNDGVLNSIDTTALAANDENVSVKAILDELAKPAR
CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHCCC
DPRPEFKTANFREDVNSIKDLSEGMTLEGVVTNVTAFGCFIDVGVHQDGLVHISQMANDF
CCCCCCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHH
VADPMNRVKPGDIVSVRVISIDEKRGRIGFSMKPEAEKPARPAAKPATTNVTNNDENTSR
HHCCHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
PRSNRPAGDKRPSKPKRQPSSSNNSDSRSKAPRAEKAESPSKMGTFGALLQEAGVTKVKK
CCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCHHCCCCHHHHHHHHHHHHCCCHHCCC
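
The secondary-structure blocks above interleave a line of residues with a line of per-residue states. The following is a minimal Python sketch for separating the two and summarising the state composition. It assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not define; the function names and the short input are illustrative.

```python
from collections import Counter


def split_interleaved(lines: list[str]) -> tuple[str, str]:
    """Join alternating residue/state lines into one sequence and one state string."""
    sequence = "".join(lines[0::2])   # lines 1, 3, 5, ... hold residues
    states = "".join(lines[1::2])     # lines 2, 4, 6, ... hold H/E/C states
    return sequence, states


def state_fractions(states: str) -> dict[str, float]:
    """Fraction of positions assigned to each state symbol."""
    counts = Counter(states)
    return {symbol: n / len(states) for symbol, n in counts.items()}


# Toy input in the same layout as the blocks above (not taken from the record).
toy = ["MSST", "CCHH", "DTLT", "EECC"]
sequence, states = split_interleaved(toy)
print(sequence, states, state_fractions(states))
```

Passing the lines that follow a `>Translated Secondary Structure` or `>Mature Secondary Structure` header, without the header itself, gives the full-length strings for the protein.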

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7542800 [H]