Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is rihA [H]

Identifier: 222523811

GI number: 222523811

Start: 620348

End: 621307

Strand: Reverse

Name: rihA [H]

Synonym: Chy400_0519

Alternate gene names: 222523811

Gene position: 621307-620348 (Counterclockwise)

Preceding gene: 222523814

Following gene: 222523807

Centisome position: 11.79

GC content: 56.25
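
The gene length, centisome position and GC content above follow from the coordinates and the sequence. A minimal sketch, assuming the centisome position is the gene's first base (the End coordinate, since the gene lies on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 5_268_950       # Length field of NC_012032
START, END = 620_348, 621_307   # Start/End fields (1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Number of bases in a 1-based inclusive interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of chromosome length (assumed definition)."""
    return round(100 * position / genome_length, 2)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return round(100 * (seq.count("G") + seq.count("C")) / len(seq), 2)


print(gene_length(START, END))          # 960
print(centisome(END, GENOME_LENGTH))    # 11.79
print(gc_content("ATGAGTCC"))           # 50.0 (first 8 bases of the gene)
```

A GC content of 56.25 over 960 bases corresponds to 540 G or C bases.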

Gene sequence:

>960_bases
ATGAGTCCTTTACCTCGCCGGATCGTGCTTGACACCGATCCAGGTATTGATGATGCCCTGGCAATCTTGCTCGCACTGGC
CTCGCCTGAGATTGAGCTGGTCGGTTTGAGTATTGTGCACGGCAATTGTACGCTGGCCGAGGCCGTAGCCAATGGTCTCT
CGGTACTGGAACTGAGCGGCGGCCACCACATCCCCCTCTTTGTCGGCTGTGATCGACCGTTACTGCGACCACTAACGACT
GCACACGACACCCACGGACAACGTGGTTTGGGGTATGCACAATTGCCACCGGCTCAACTCCAACCGGTCTCTGAACACGC
GGTCGATTTTATCATTCGTACCGCGCTGGAAGCTCCCGGCGAAGTGACGCTGGTTGCAGTTGGCCCCCTGACTAACGTTG
CTCTTGCACTACGTAAAGAACCCCGTCTTGCCGGTGCGTTACGCGAGATCGTCATAATGGGTGGAGCATTACGGGCTGAT
GGGAATGTAACGCCACGGGCCGAATTTAACGTCTACGCTGATCCACACGCTGCCCAGATCGTCTTCTCGTCGGGTGCTCC
GTTGGTGATTATGCCGTGGGATATTACGCGCCTGGTGCGCCTCCACGAGAGTGAAGTCAATCGGTTGGCCCAGGCGGGCA
AGCCGATTGGTCGCTTCATCGCCGATGCTACCCGGTTTTACATCGAGTTTCATCGTCGCTATTTTGGCTATGATGGCTGT
GCAATTAACGATCCGGCTGCACTGGCGCTGGTCTTTCTGCCCGATCTGGCAACCTATGCCGATGTTCATGTGACGGTCGA
GACCTGTAGCCCGTTAACGATGGGCTTTACGGTGGCTGATTTTATGCTAAGCGATGGCCGTCAGCCAAACGCACGGGCGG
TTGTAGAATTCGATACGCCCCGTTTCCTGTCGCTCTTTGTTGAGCGGATGCAGATGCTCGAACAACGCCTGTATGCCTAG
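
The 960 bases are shown in coding orientation (ATG start, TAG stop), so they read as 320 codons: 319 residues plus the stop. A minimal translation sketch using the standard genetic code, whose codon-to-amino-acid assignments are the same in the bacterial table; the helper name translate is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTCCTTTACCTCGCCGG"))  # MSPLPRR (first 7 codons)
print(translate("CTGTATGCCTAG"))           # LYA (last 3 codons, then stop)
```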

Upstream 100 bases:

>100_bases
ATAACCGCCGGACTCTCGTCTTGCCTGTCAGACGTGTATTCACTTGTGCTATAGTAATCAACGATCTTGCACTAGCGTTT
GTTTCCACGCAGGAGGAACA

Downstream 100 bases:

>100_bases
AGAGACATCATGTCGCGTGGGGCAGGTGAAGCCGTTGCCTGCGCCTGCCCCGGATGTCAACCCTGAAAGGGCTGTTCGAC
CAGGCTCGCGTCGGAAGTCA
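
For a gene on the reverse strand, the flanking regions swap sides relative to the forward-strand coordinates. The sketch below assumes that "upstream" and "downstream" are defined relative to the gene's own orientation and that flanking sequences are reported as the reverse complement of the forward strand; neither convention is stated in the record, and the function names are hypothetical. Wrap-around at the chromosome origin is not handled.

```python
def flank_coords(start: int, end: int, strand: str, n: int = 100):
    """Return (upstream, downstream) 1-based inclusive ranges of n bases."""
    if strand == "Reverse":
        # The gene reads from end down to start, so upstream lies above end.
        upstream = (end + 1, end + n)
        downstream = (start - n, start - 1)
    else:
        upstream = (start - n, start - 1)
        downstream = (end + 1, end + n)
    return upstream, downstream


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


up, down = flank_coords(620348, 621307, "Reverse")
print(up)                          # (621308, 621407)
print(down)                        # (620248, 620347)
print(reverse_complement("ATGC"))  # GCAT
```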

Product: Inosine/uridine-preferring nucleoside hydrolase

Products: uracil; ribose; cytosine [C]

Alternate protein names: Cytidine/uridine-specific hydrolase [H]

Number of amino acids: Translated: 319; Mature: 318

Protein sequence:

>319_residues
MSPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSGGHHIPLFVGCDRPLLRPLTT
AHDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPGEVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRAD
GNVTPRAEFNVYADPHAAQIVFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGC
AINDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTPRFLSLFVERMQMLEQRLYA

Sequences:

>Translated_319_residues
MSPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSGGHHIPLFVGCDRPLLRPLTT
AHDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPGEVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRAD
GNVTPRAEFNVYADPHAAQIVFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGC
AINDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTPRFLSLFVERMQMLEQRLYA
>Mature_318_residues
SPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSGGHHIPLFVGCDRPLLRPLTTA
HDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPGEVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRADG
NVTPRAEFNVYADPHAAQIVFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGCA
INDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTPRFLSLFVERMQMLEQRLYA

Specific function: Hydrolyzes cytidine or uridine to ribose and cytosine or uracil, respectively [H]

COG id: COG1957

COG function: function code F; Inosine-uridine nucleoside N-ribohydrolase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the IUNH family. RihA subfamily [H]

Homologues:

Organism=Escherichia coli, GI1786871, Length=305, Percent_Identity=39.344262295082, Blast_Score=218, Evalue=4e-58,
Organism=Escherichia coli, GI1788486, Length=306, Percent_Identity=38.2352941176471, Blast_Score=212, Evalue=3e-56,
Organism=Escherichia coli, GI1786213, Length=305, Percent_Identity=34.7540983606557, Blast_Score=135, Evalue=4e-33,
Organism=Caenorhabditis elegans, GI32565474, Length=198, Percent_Identity=33.3333333333333, Blast_Score=112, Evalue=2e-25,
Organism=Caenorhabditis elegans, GI17565698, Length=305, Percent_Identity=30.1639344262295, Blast_Score=103, Evalue=9e-23,
Organism=Saccharomyces cerevisiae, GI6320608, Length=252, Percent_Identity=31.7460317460317, Blast_Score=110, Evalue=2e-25,
Organism=Drosophila melanogaster, GI24583915, Length=332, Percent_Identity=28.0120481927711, Blast_Score=103, Evalue=1e-22,
Organism=Drosophila melanogaster, GI24641839, Length=309, Percent_Identity=31.0679611650485, Blast_Score=100, Evalue=1e-21,
Organism=Drosophila melanogaster, GI24641837, Length=333, Percent_Identity=27.9279279279279, Blast_Score=93, Evalue=2e-19,
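
Each homologue line above is a comma-separated list of key=value fields, with the GI number given as a bare GI-prefixed token. A minimal parsing sketch for that layout; the function name and the choice of numeric types are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1786871
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    for key in ("Percent_Identity", "Blast_Score", "Evalue"):
        record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1786871, Length=305, "
        "Percent_Identity=39.344262295082, Blast_Score=218, Evalue=4e-58,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
# Escherichia coli 1786871 305 4e-58
```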

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR015910
- InterPro:   IPR023186
- InterPro:   IPR001910
- InterPro:   IPR022975 [H]

Pfam domain/function: PF01156 IU_nuc_hydro [H]

EC number: NA

Molecular weight: Translated: 34612; Mature: 34481

Theoretical pI: Translated: 5.54; Mature: 5.54

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
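
The percentages above can be recomputed by counting C and M residues in each sequence. A minimal sketch, assuming the mature protein is the translated protein with the initiator methionine removed (which is what the two sequences in this record show); the function name is illustrative.

```python
TRANSLATED = (
    "MSPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSGGHHIPLFVGCDRPLLRPLTT"
    "AHDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPGEVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRAD"
    "GNVTPRAEFNVYADPHAAQIVFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGC"
    "AINDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTPRFLSLFVERMQMLEQRLYA"
)
MATURE = TRANSLATED[1:]  # assumption: only the initiator Met is removed


def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


print(len(TRANSLATED), cys_met_content(TRANSLATED))
# 319 {'Cys': 1.3, 'Met': 2.2, 'Cys+Met': 3.4}
print(len(MATURE), cys_met_content(MATURE))
# 318 {'Cys': 1.3, 'Met': 1.9, 'Cys+Met': 3.1}
```

The translated protein contains 4 Cys and 7 Met residues; removing the initiator Met leaves 6 Met in the mature form.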

Secondary structure:

>Translated Secondary Structure
MSPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSG
CCCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEEEEEEECCCHHHHHHHCCCEEEEECC
GHHIPLFVGCDRPLLRPLTTAHDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPG
CCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHCCCC
EVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRADGNVTPRAEFNVYADPHAAQI
CEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHCCCEEECCCCCCCCCEEEEEECCCEEEE
VFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGC
EEECCCCEEEECCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
AINDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTP
CCCCCHHEEEEECCCHHHHEEEEEEEECCCCEEEHHHHHHHHHHCCCCCCCCEEEEECCC
RFLSLFVERMQMLEQRLYA
HHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
SPLPRRIVLDTDPGIDDALAILLALASPEIELVGLSIVHGNCTLAEAVANGLSVLELSG
CCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEEEEEEECCCHHHHHHHCCCEEEEECC
GHHIPLFVGCDRPLLRPLTTAHDTHGQRGLGYAQLPPAQLQPVSEHAVDFIIRTALEAPG
CCCEEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHCCCC
EVTLVAVGPLTNVALALRKEPRLAGALREIVIMGGALRADGNVTPRAEFNVYADPHAAQI
CEEEEEECCHHHHHHHHHCCCHHHHHHHHHHHCCCEEECCCCCCCCCEEEEEECCCEEEE
VFSSGAPLVIMPWDITRLVRLHESEVNRLAQAGKPIGRFIADATRFYIEFHRRYFGYDGC
EEECCCCEEEECCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
AINDPAALALVFLPDLATYADVHVTVETCSPLTMGFTVADFMLSDGRQPNARAVVEFDTP
CCCCCHHEEEEECCCHHHHEEEEEEEECCCCEEEHHHHHHHHHHCCCCCCCCEEEEECCC
RFLSLFVERMQMLEQRLYA
HHHHHHHHHHHHHHHHHCC
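
The secondary structure blocks alternate a line of residues with a line of per-residue states. A minimal sketch for joining those lines and summarising the state composition, assuming the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not define; the function names are hypothetical.

```python
from collections import Counter


def pair_blocks(lines: list) -> tuple:
    """Join alternating residue/state lines into two full-length strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def ss_composition(states: str) -> dict:
    """Percentage of each state (H, E, C), rounded to one decimal place."""
    counts = Counter(states)
    return {s: round(100 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# First 10 residues of the translated protein and their predicted states.
residues, states = pair_blocks(["MSPLPRRIVL", "CCCCCCEEEE"])
print(ss_composition(states))  # {'H': 0.0, 'E': 40.0, 'C': 60.0}
```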

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: uridine; H2O; cytidine [C]

Specific reaction: uridine + H2O = uracil + ribose; cytidine + H2O = cytosine + ribose [C]

General reaction: Pyrimidine-specific nucleoside hydrolase [C]

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA