Definition Serratia proteamaculans 568 chromosome, complete genome.
Accession NC_009832
Length 5,448,853

The map label for this gene is yhgF [H]

Identifier: 157372859

GI number: 157372859

Start: 5103204

End: 5105537

Strand: Direct

Name: yhgF [H]

Synonym: Spro_4627

Alternate gene names: 157372859

Gene position: 5103204-5105537 (Clockwise)
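
The Start and End fields can be cross-checked against the reported sequence length. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which is consistent with the 2334-base gene sequence; the function name `gene_length` is illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Start and End fields of this record.
print(gene_length(5103204, 5105537))  # 2334
```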

Preceding gene: 157372858

Following gene: 157372860

Centisome position: 93.66
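
The centisome value appears to be the gene start expressed as a percentage of the chromosome length. This is an assumption about how the field was derived, not something the record documents; under that assumption the reported 93.66 can be reproduced as follows (the function name `centisome` is hypothetical).

```python
def centisome(start: int, genome_length: int) -> float:
    """Position of a gene start as a percentage of the genome length.

    Assumed definition: 100 * start / genome_length.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start field and chromosome length from this record.
print(round(centisome(5103204, 5448853), 2))  # 93.66
```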

GC content (%): 60.11

Gene sequence:

>2334_bases
ATGAATGACCCACTGAGCCGCATTATTGCAACAGAACTGCAGGCCCGGCCGGAGCAAGTTGACTCCGCCATCCGTCTGCT
GGATGAAGGTAATACCGTGCCCTTTATTGCACGCTATCGTAAGGAAGTCACCGGGGGTCTGGACGATACCCAACTGCGCC
AGCTGGAAACCCGCCTGGGTTATCTGCGTGAACTGGAAGACCGCCGTCAGACCATCCTTAAATCAATCGACGAGCAGGGC
AAACTGACCGAACAGCTGGCGGGGGCGATCACCGCCACGCAAAGCAAAACCGAACTTGAAGATCTCTACCTGCCGTACAA
ACCCAAGCGCCGCACCCGTGGGCAGATCGCGATTGAAGCCGGTCTGGAGCCCCTGGCAGACACCCTGTGGCAGGATCCTC
AGCAGCAGCCTGAACAACTGGCCGAAGGCTACGTTGATGCCGACAAGGGCGTAGCGGACGTTAAAGCCGCCCTCGACGGC
GCGCGTTACATTCTGATGGAGCGCTTTGCCGAAGACGCCGCGCTGCTGGCCAAGGTTCGTAATTACCTGTGGAAGCACGC
GCATCTGGTCTCCAAAGTGGTGGAAGGCAAAGAAGAAGCCGGCGCGAAATTCCGCGACTACTTCGATCACCACGAACCTA
TTGCCCAGGTGCCTTCACACCGTGCGCTGGCCATGTTCCGTGGCCGCAACGAAGGCGTGCTGCAACTGGCGCTGAACGCC
GACCCACAGTTTGAAGAAGCCCCGCGCGAAAGCCAGGCGGAACAGATCATCATCAGCCATCTCGATTTGCGCCTGAATAA
CGCCCCGGCAGATGCCTGGCGCAAAGCGGTGGTCAACTGGACCTGGCGCATCAAGGTGTTGCTGCATCTGGAAACCGAAC
TGATGAGCACCCTGCGCGAACGTGCGGAAGATGAAGCAATCAACGTCTTCGCCCGTAACATGCACGATTTGCTGATGGCC
GCACCGGCCGGCATGCGTGCGACCATGGGGCTGGATCCGGGCTTGCGTACCGGGGTGAAAGTGGCGGTGGTGGATGCCAC
CGGCAAGCTGGTCGCCACCGACACCGTCTACCCGCACACCGGCCAGGCCGCCAAAGCCGCGGCCATCGTCGCGGCACTGT
GCATCAAACACAAAGTGGAACTGGTCGCCATCGGCAACGGTACCGCATCGCGTGAGACCGAGCGTTTTTATCTTGATTTG
CAACAGCAATTCGGCGAAGTCAAAGCGCAGAAAGTGATCGTCAGCGAAGCCGGTGCCTCGGTGTATTCCGCCTCCGAACT
GGCGGCGCTGGAGTTCCCGAATCTCGACGTCTCGCTGCGTGGCGCCGTCTCCATCGCCCGTCGCCTGCAGGATCCACTGG
CCGAACTGGTCAAAATCGATCCGAAATCCATCGGTGTTGGTCAGTACCAGCACGATGTCAGCCAAAGCCAACTGGCGAAA
AAACTCGATTCGGTGGTTGAAGACTGCGTAAACGCCGTCGGGGTTGATCTGAACACCGCTTCGGTGCCGCTGCTGACCCG
CGTGGCCGGCCTGACCCGCATGATGGCGCAGAACATCGTTACCTGGCGTGATGAGAATGGCCGTTTCAGCAACCGCGAAC
AGCTGTTGAAAGTCAGCCGCCTGGGGCCAAAAGCCTTTGAGCAGTGCGCTGGCTTCCTGCGTATCAACCACGGCGACAAC
CCGCTGGACGCCTCGACCGTTCACCCGGAAACCTACCCGGTGGTCGAACGCATTCTGGCCGCCACCCGCCAGGCACTGCA
AGATCTGATGGGTAACCCGGCAGCGGTACGCAGCCTGAAGGCCAGCGATTTCACCGACGACAAGTTCGGCGTGCCAACGG
TGACCGACATCCTGAAAGAGCTGGAGAAACCGGGCCGCGATCCGCGTCCGGAATTCAAAACCGCCACCTTCGCCGAGGGC
GTTGAAACCCTGAGCGACCTGCAGCCGGGGATGATTTTGGAAGGTTCAGTGACCAACGTCACCAACTTCGGAGCCTTTGT
CGATATCGGCGTGCATCAGGACGGTCTGGTGCATATCTCCTCGTTGGCGGACAAGTTTGTCGAAGATCCGCACACCGTGG
TGAAAGCCGGTGACATCGTCAAAGTGAAGGTGATGGAAGTGGATCTGCAGCGCAAACGCATCGCGCTGAGCATGCGTCTG
GACGAGCAACCGGGTGAAGGTTCACCACGCCGCGGCGGTAACGCCGCCCCGGCCAGGGACAACGCCAACCGGGCACCGGT
CAATAAGGGCAAACCGCGCGGCAACAACAACACCTCGGCGGGTAACAGCGCCATGGGTGACGCGCTGGCGGCGGCATTCG
GCAAAAAATCTTAA
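
A GC content figure such as the one reported for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is an illustrative name, and the toy input stands in for the full 2334-base sequence, which would be pasted in with its line breaks.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy two-line input, not the gene from this record.
toy = "ATGC\nGGCC"
print(round(gc_content(toy), 2))  # 75.0
```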

Upstream 100 bases:

>100_bases
CATGGCTGCCGAGAGGCGGCCCGCTTCACTGGCATTTTCCCCGCGCAATCCGTATAACTGTCCCCCTGTTTACTAACCAT
TCCCAACAGATACCAGACAT

Downstream 100 bases:

>100_bases
TCAGGCCCTGACCAGGCCTATGGGATGCAGGGCGGAGTTCACTCCGCCCTTTCCTGCCGAACTATCCCCGCAATTTTTTG
ACCCGCCTCAATAAACCTGC

Product: RNA-binding S1 domain-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 777; Mature: 777
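
The residue count follows from the length of the coding sequence: 2334 bases give 778 codons, and the last one (TAA) is a stop codon, leaving 777 residues. A small sketch of that arithmetic, with the hypothetical helper `residues_from_cds`:

```python
def residues_from_cds(cds_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length <= 0 or cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


print(residues_from_cds(2334))  # 777
```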

Protein sequence:

>777_residues
MNDPLSRIIATELQARPEQVDSAIRLLDEGNTVPFIARYRKEVTGGLDDTQLRQLETRLGYLRELEDRRQTILKSIDEQG
KLTEQLAGAITATQSKTELEDLYLPYKPKRRTRGQIAIEAGLEPLADTLWQDPQQQPEQLAEGYVDADKGVADVKAALDG
ARYILMERFAEDAALLAKVRNYLWKHAHLVSKVVEGKEEAGAKFRDYFDHHEPIAQVPSHRALAMFRGRNEGVLQLALNA
DPQFEEAPRESQAEQIIISHLDLRLNNAPADAWRKAVVNWTWRIKVLLHLETELMSTLRERAEDEAINVFARNMHDLLMA
APAGMRATMGLDPGLRTGVKVAVVDATGKLVATDTVYPHTGQAAKAAAIVAALCIKHKVELVAIGNGTASRETERFYLDL
QQQFGEVKAQKVIVSEAGASVYSASELAALEFPNLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQSQLAK
KLDSVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVTWRDENGRFSNREQLLKVSRLGPKAFEQCAGFLRINHGDN
PLDASTVHPETYPVVERILAATRQALQDLMGNPAAVRSLKASDFTDDKFGVPTVTDILKELEKPGRDPRPEFKTATFAEG
VETLSDLQPGMILEGSVTNVTNFGAFVDIGVHQDGLVHISSLADKFVEDPHTVVKAGDIVKVKVMEVDLQRKRIALSMRL
DEQPGEGSPRRGGNAAPARDNANRAPVNKGKPRGNNNTSAGNSAMGDALAAAFGKKS

Sequences:

>Translated_777_residues
MNDPLSRIIATELQARPEQVDSAIRLLDEGNTVPFIARYRKEVTGGLDDTQLRQLETRLGYLRELEDRRQTILKSIDEQG
KLTEQLAGAITATQSKTELEDLYLPYKPKRRTRGQIAIEAGLEPLADTLWQDPQQQPEQLAEGYVDADKGVADVKAALDG
ARYILMERFAEDAALLAKVRNYLWKHAHLVSKVVEGKEEAGAKFRDYFDHHEPIAQVPSHRALAMFRGRNEGVLQLALNA
DPQFEEAPRESQAEQIIISHLDLRLNNAPADAWRKAVVNWTWRIKVLLHLETELMSTLRERAEDEAINVFARNMHDLLMA
APAGMRATMGLDPGLRTGVKVAVVDATGKLVATDTVYPHTGQAAKAAAIVAALCIKHKVELVAIGNGTASRETERFYLDL
QQQFGEVKAQKVIVSEAGASVYSASELAALEFPNLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQSQLAK
KLDSVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVTWRDENGRFSNREQLLKVSRLGPKAFEQCAGFLRINHGDN
PLDASTVHPETYPVVERILAATRQALQDLMGNPAAVRSLKASDFTDDKFGVPTVTDILKELEKPGRDPRPEFKTATFAEG
VETLSDLQPGMILEGSVTNVTNFGAFVDIGVHQDGLVHISSLADKFVEDPHTVVKAGDIVKVKVMEVDLQRKRIALSMRL
DEQPGEGSPRRGGNAAPARDNANRAPVNKGKPRGNNNTSAGNSAMGDALAAAFGKKS
>Mature_777_residues
MNDPLSRIIATELQARPEQVDSAIRLLDEGNTVPFIARYRKEVTGGLDDTQLRQLETRLGYLRELEDRRQTILKSIDEQG
KLTEQLAGAITATQSKTELEDLYLPYKPKRRTRGQIAIEAGLEPLADTLWQDPQQQPEQLAEGYVDADKGVADVKAALDG
ARYILMERFAEDAALLAKVRNYLWKHAHLVSKVVEGKEEAGAKFRDYFDHHEPIAQVPSHRALAMFRGRNEGVLQLALNA
DPQFEEAPRESQAEQIIISHLDLRLNNAPADAWRKAVVNWTWRIKVLLHLETELMSTLRERAEDEAINVFARNMHDLLMA
APAGMRATMGLDPGLRTGVKVAVVDATGKLVATDTVYPHTGQAAKAAAIVAALCIKHKVELVAIGNGTASRETERFYLDL
QQQFGEVKAQKVIVSEAGASVYSASELAALEFPNLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQSQLAK
KLDSVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVTWRDENGRFSNREQLLKVSRLGPKAFEQCAGFLRINHGDN
PLDASTVHPETYPVVERILAATRQALQDLMGNPAAVRSLKASDFTDDKFGVPTVTDILKELEKPGRDPRPEFKTATFAEG
VETLSDLQPGMILEGSVTNVTNFGAFVDIGVHQDGLVHISSLADKFVEDPHTVVKAGDIVKVKVMEVDLQRKRIALSMRL
DEQPGEGSPRRGGNAAPARDNANRAPVNKGKPRGNNNTSAGNSAMGDALAAAFGKKS

Specific function: Unknown

COG id: COG2183

COG function: function code K; Transcriptional accessory protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 S1 motif domain [H]

Homologues:

Organism=Homo sapiens, GI221136781, Length=797, Percent_Identity=35.0062735257215, Blast_Score=411, Evalue=1e-114
Organism=Homo sapiens, GI27597090, Length=776, Percent_Identity=22.5515463917526, Blast_Score=107, Evalue=4e-23
Organism=Escherichia coli, GI87082262, Length=776, Percent_Identity=84.9226804123711, Blast_Score=1318, Evalue=0.0
Organism=Escherichia coli, GI1787140, Length=97, Percent_Identity=41.2371134020619, Blast_Score=73, Evalue=7e-14
Organism=Escherichia coli, GI145693187, Length=78, Percent_Identity=46.1538461538462, Blast_Score=64, Evalue=3e-11
Organism=Caenorhabditis elegans, GI17511129, Length=737, Percent_Identity=27.6797829036635, Blast_Score=226, Evalue=2e-59
Organism=Caenorhabditis elegans, GI17552892, Length=297, Percent_Identity=28.956228956229, Blast_Score=92, Evalue=1e-18
Organism=Saccharomyces cerevisiae, GI6321552, Length=229, Percent_Identity=27.0742358078603, Blast_Score=72, Evalue=4e-13
Organism=Drosophila melanogaster, GI62484314, Length=770, Percent_Identity=30.5194805194805, Blast_Score=351, Evalue=1e-96
Organism=Drosophila melanogaster, GI24640080, Length=765, Percent_Identity=20.7843137254902, Blast_Score=100, Evalue=4e-21
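
Each homologue entry is a comma-separated list of key=value pairs, plus a bare GI identifier. A minimal parsing sketch follows; the function name `parse_homologue` and the choice to store the identifier under a "GI" key are assumptions made for illustration, and an optional trailing comma is tolerated.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue entry into a dictionary of string fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(", "):
        if "=" in part:
            key, _, value = part.partition("=")
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            # The GI identifier is written without an '=' sign.
            fields["GI"] = part[2:]
    return fields


entry = ("Organism=Escherichia coli, GI87082262, Length=776, "
         "Percent_Identity=84.9226804123711, Blast_Score=1318, Evalue=0.0")
record = parse_homologue(entry)
print(record["Organism"], record["GI"], round(float(record["Percent_Identity"]), 2))
# Escherichia coli 87082262 84.92
```

Numeric fields are left as strings so that values such as 1e-114 can be converted with `float` only where needed.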

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR003029
- InterPro:   IPR005227
- InterPro:   IPR006641
- InterPro:   IPR022967
- InterPro:   IPR018974
- InterPro:   IPR023097 [H]

Pfam domain/function: PF00575 S1; PF09371 Tex_N [H]

EC number: NA

Molecular weight (Da): Translated: 85157; Mature: 85157

Theoretical pI: Translated: 6.23; Mature: 6.23

Prosite motif: PS50126 S1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
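
Percentages like these can be recomputed by counting residues in the protein sequence. The sketch below assumes the figures are simple residue frequencies (count divided by sequence length); `residue_percent` is an illustrative name and the toy peptide is not taken from this record.

```python
def residue_percent(protein: str, residue: str) -> float:
    """Percentage of a one-letter residue in a protein sequence.

    Whitespace and line breaks in the sequence are ignored.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * seq.count(residue.upper()) / len(seq)


# Toy 10-residue peptide: 1 Cys and 2 Met.
toy = "MKCAMLAAGA"
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
print(round(cys, 1), round(met, 1), round(cys + met, 1))  # 10.0 20.0 30.0
```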

Secondary structure:

>Translated Secondary Structure
MNDPLSRIIATELQARPEQVDSAIRLLDEGNTVPFIARYRKEVTGGLDDTQLRQLETRLG
CCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH
YLRELEDRRQTILKSIDEQGKLTEQLAGAITATQSKTELEDLYLPYKPKRRTRGQIAIEA
HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCEEEEC
GLEPLADTLWQDPQQQPEQLAEGYVDADKGVADVKAALDGARYILMERFAEDAALLAKVR
CCHHHHHHHHCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NYLWKHAHLVSKVVEGKEEAGAKFRDYFDHHEPIAQVPSHRALAMFRGRNEGVLQLALNA
HHHHHHHHHHHHHHHCHHHHCCHHHHHHHCCCCHHHCCCCCEEHHHCCCCCCEEEEEECC
DPQFEEAPRESQAEQIIISHLDLRLNNAPADAWRKAVVNWTWRIKVLLHLETELMSTLRE
CCCHHHCCCHHHHHHHHHHHHHEEECCCCHHHHHHHHHCEEEEEEEEEEEHHHHHHHHHH
RAEDEAINVFARNMHDLLMAAPAGMRATMGLDPGLRTGVKVAVVDATGKLVATDTVYPHT
HHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCEEEEEECCCCEEEECCCCCCC
GQAAKAAAIVAALCIKHKVELVAIGNGTASRETERFYLDLQQQFGEVKAQKVIVSEAGAS
CCHHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC
VYSASELAALEFPNLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQSQLAK
HHHHHHHHEEECCCCCEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHH
KLDSVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVTWRDENGRFSNREQLLKVSR
HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCEEECCCCCCCCHHHHHHHHH
LGPKAFEQCAGFLRINHGDNPLDASTVHPETYPVVERILAATRQALQDLMGNPAAVRSLK
CCHHHHHHHHCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHCC
ASDFTDDKFGVPTVTDILKELEKPGRDPRPEFKTATFAEGVETLSDLQPGMILEGSVTNV
CCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCC
TNFGAFVDIGVHQDGLVHISSLADKFVEDPHTVVKAGDIVKVKVMEVDLQRKRIALSMRL
CCCCCEEEECCCCCCHHHHHHHHHHHHCCCHHHEECCCEEEEEEEEEHHHHHHEEEEEEE
DEQPGEGSPRRGGNAAPARDNANRAPVNKGKPRGNNNTSAGNSAMGDALAAAFGKKS
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC
>Mature Secondary Structure
MNDPLSRIIATELQARPEQVDSAIRLLDEGNTVPFIARYRKEVTGGLDDTQLRQLETRLG
CCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH
YLRELEDRRQTILKSIDEQGKLTEQLAGAITATQSKTELEDLYLPYKPKRRTRGQIAIEA
HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCEEEEC
GLEPLADTLWQDPQQQPEQLAEGYVDADKGVADVKAALDGARYILMERFAEDAALLAKVR
CCHHHHHHHHCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NYLWKHAHLVSKVVEGKEEAGAKFRDYFDHHEPIAQVPSHRALAMFRGRNEGVLQLALNA
HHHHHHHHHHHHHHHCHHHHCCHHHHHHHCCCCHHHCCCCCEEHHHCCCCCCEEEEEECC
DPQFEEAPRESQAEQIIISHLDLRLNNAPADAWRKAVVNWTWRIKVLLHLETELMSTLRE
CCCHHHCCCHHHHHHHHHHHHHEEECCCCHHHHHHHHHCEEEEEEEEEEEHHHHHHHHHH
RAEDEAINVFARNMHDLLMAAPAGMRATMGLDPGLRTGVKVAVVDATGKLVATDTVYPHT
HHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCEEEEEECCCCEEEECCCCCCC
GQAAKAAAIVAALCIKHKVELVAIGNGTASRETERFYLDLQQQFGEVKAQKVIVSEAGAS
CCHHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC
VYSASELAALEFPNLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQSQLAK
HHHHHHHHEEECCCCCEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHH
KLDSVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVTWRDENGRFSNREQLLKVSR
HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCEEECCCCCCCCHHHHHHHHH
LGPKAFEQCAGFLRINHGDNPLDASTVHPETYPVVERILAATRQALQDLMGNPAAVRSLK
CCHHHHHHHHCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHCC
ASDFTDDKFGVPTVTDILKELEKPGRDPRPEFKTATFAEGVETLSDLQPGMILEGSVTNV
CCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCC
TNFGAFVDIGVHQDGLVHISSLADKFVEDPHTVVKAGDIVKVKVMEVDLQRKRIALSMRL
CCCCCEEEECCCCCCHHHHHHHHHHHHCCCHHHEECCCEEEEEEEEEHHHHHHEEEEEEE
DEQPGEGSPRRGGNAAPARDNANRAPVNKGKPRGNNNTSAGNSAMGDALAAAFGKKS
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9278503; 10493123 [H]