The gene/protein map for NC_012588 is currently unavailable.
Definition Sulfolobus islandicus M.14.25 chromosome, complete genome.
Accession NC_012588
Length 2,608,832

Click here to switch to the map view.

The map label for this gene is codB [H]

Identifier: 227826916

GI number: 227826916

Start: 502749

End: 504110

Strand: Direct

Name: codB [H]

Synonym: M1425_0557

Alternate gene names: 227826916

Gene position: 502749-504110 (Clockwise)

Preceding gene: 227826915

Following gene: 227826917

Centisome position: 19.27

GC content: 37.22

Gene sequence:

>1362_bases
ATGACTGGAAAGGAAGAAATTAGCTCAAAATATGATGACTATTCACTGAAGGAAGTTCCGAAAGACTCCAGATATGGCTT
CTTTAACGTTTTTCTAGTATTTTCGTCTGTTTATGGTGCAATAGCTGTAATATGGGCTGGAGGAGCACTAGGTTATGGAC
TCACATTTTCTCAAGCTATAATAGCAGTATTGTCGGGAACAGTAGTATTAGGCATCTTAGGTTCATTGACTGCAGCTGTG
GGAGCTTATAGTGGTCTCTCAACTTATGTTATGTGGAGACACCCCTTAGGAAGATGGGGAGGTAAAGTTGCTGGATTATT
GCTGATAACTATAACTACGGGAATAGGATGGTATGCAGTCGAAACGTGGCTATTTGGTATAGTAATGAGTGAGATATTCC
CAAATAATCCTTTCTTTTCTGTTGGAGTAGCTGCCATTTGGGGAGGAATTTTGATGACAATAATGACATATGTAGGCTAT
AGGATGCTGTCTTTCCTAAGCTACTTTACAATTCCATTTCATATATGGCTGATAGCAATAGGAATAGCAATAGTGTTAGC
ACTTAAAGGTGGATTTCACACAGTTATGGCAGCAGTTCCAACAAGTCATATGAGCTTACTTGATGGTATATCCGCTACCA
TAGGATTATATAGCGCTGGGACTATAATTTCTCCCGATATCTCCAGATTTGCCAAATCAGCGAAAGATGCTGGCTATGCA
TGGTTTGCTCATATAATTTTCCTATATCCATTCTTAATATTGGGAGGAGTAGCAATAGTGTTAGCAACTGGTTCCTATTT
AATAACTAACGCGATGCTAGAGTTAGGTATGGGAGTTGGTGTTTTATTAATTATAGTTTTCGGTCAATTTATAATAAATA
CTGACAATCTATATAGTGGTTCCTTATCATTAGTTAACCTAATTCCAATGAGACGTGAAATCGCCTCCATAATTAACGGT
GTCATAGGTACTGCAATTGCAGCATATGTTGGATTCTCAGCAGGTTCGTCGATAACTCCCTTTGAGAACTTTATCTCTTT
GTTAGGAGACTTCTTACCTGCAATGGGAGGAATTGTACTAGCTGACTTTTACATTGTCAAGAAATACATTAATAAGATCC
AAGATCCTCATAAACGGTATCAGTTCGTACCAAATAATAAGTATTATAATATAAATATTGCAGGAATATTAGCCTTAGCA
TTAGGTTCAATAGTAGGTTACTTCGTAAATGCCGGTATACCAGCAATAAACTCCTTAGTTACCGGCTTCTTATCCTACAT
TATAATATATTACATTATCAACGCATTAGGTAAGAGTCCAGAAATACTGCCTTTCAATTACGAAGGAGGGATATTAAGAT
GA

Upstream 100 bases:

>100_bases
AGTCGACTGAATAACAGTGTTAATAATTGTTATTTTCTATTGAAACGATTTCTTCTAAACTATCATAATGTTTTTATTCT
ATATTTAAACATTAGAAAAT

Downstream 100 bases:

>100_bases
ACTTATTTGATAACTATAAAATATTTACTATATCCAACGTTATAATGGGATTAGTGTTTAGTGCACTGTATTTCATTACA
ACTGGGTTTATACAATACTA

Product: permease for cytosine/purines uracil thiamine allantoin

Products: Proton [Cytoplasm]; cytosine [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 453; Mature: 452

Protein sequence:

>453_residues
MTGKEEISSKYDDYSLKEVPKDSRYGFFNVFLVFSSVYGAIAVIWAGGALGYGLTFSQAIIAVLSGTVVLGILGSLTAAV
GAYSGLSTYVMWRHPLGRWGGKVAGLLLITITTGIGWYAVETWLFGIVMSEIFPNNPFFSVGVAAIWGGILMTIMTYVGY
RMLSFLSYFTIPFHIWLIAIGIAIVLALKGGFHTVMAAVPTSHMSLLDGISATIGLYSAGTIISPDISRFAKSAKDAGYA
WFAHIIFLYPFLILGGVAIVLATGSYLITNAMLELGMGVGVLLIIVFGQFIINTDNLYSGSLSLVNLIPMRREIASIING
VIGTAIAAYVGFSAGSSITPFENFISLLGDFLPAMGGIVLADFYIVKKYINKIQDPHKRYQFVPNNKYYNINIAGILALA
LGSIVGYFVNAGIPAINSLVTGFLSYIIIYYIINALGKSPEILPFNYEGGILR

Sequences:

>Translated_453_residues
MTGKEEISSKYDDYSLKEVPKDSRYGFFNVFLVFSSVYGAIAVIWAGGALGYGLTFSQAIIAVLSGTVVLGILGSLTAAV
GAYSGLSTYVMWRHPLGRWGGKVAGLLLITITTGIGWYAVETWLFGIVMSEIFPNNPFFSVGVAAIWGGILMTIMTYVGY
RMLSFLSYFTIPFHIWLIAIGIAIVLALKGGFHTVMAAVPTSHMSLLDGISATIGLYSAGTIISPDISRFAKSAKDAGYA
WFAHIIFLYPFLILGGVAIVLATGSYLITNAMLELGMGVGVLLIIVFGQFIINTDNLYSGSLSLVNLIPMRREIASIING
VIGTAIAAYVGFSAGSSITPFENFISLLGDFLPAMGGIVLADFYIVKKYINKIQDPHKRYQFVPNNKYYNINIAGILALA
LGSIVGYFVNAGIPAINSLVTGFLSYIIIYYIINALGKSPEILPFNYEGGILR
>Mature_452_residues
TGKEEISSKYDDYSLKEVPKDSRYGFFNVFLVFSSVYGAIAVIWAGGALGYGLTFSQAIIAVLSGTVVLGILGSLTAAVG
AYSGLSTYVMWRHPLGRWGGKVAGLLLITITTGIGWYAVETWLFGIVMSEIFPNNPFFSVGVAAIWGGILMTIMTYVGYR
MLSFLSYFTIPFHIWLIAIGIAIVLALKGGFHTVMAAVPTSHMSLLDGISATIGLYSAGTIISPDISRFAKSAKDAGYAW
FAHIIFLYPFLILGGVAIVLATGSYLITNAMLELGMGVGVLLIIVFGQFIINTDNLYSGSLSLVNLIPMRREIASIINGV
IGTAIAAYVGFSAGSSITPFENFISLLGDFLPAMGGIVLADFYIVKKYINKIQDPHKRYQFVPNNKYYNINIAGILALAL
GSIVGYFVNAGIPAINSLVTGFLSYIIIYYIINALGKSPEILPFNYEGGILR

Specific function: Required for cytosine transport into the cell [H]

COG id: COG1457

COG function: function code F; Purine-cytosine permease and related proteins

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the purine-cytosine permease family [H]

Homologues:

Organism=Escherichia coli, GI1786530, Length=433, Percent_Identity=26.0969976905312, Blast_Score=129, Evalue=5e-31,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001248
- InterPro:   IPR012681 [H]

Pfam domain/function: PF02133 Transp_cyt_pur [H]

EC number: NA

Molecular weight: Translated: 48784; Mature: 48653

Theoretical pI: Translated: 9.02; Mature: 9.02

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTGKEEISSKYDDYSLKEVPKDSRYGFFNVFLVFSSVYGAIAVIWAGGALGYGLTFSQAI
CCCHHHHHHCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHH
IAVLSGTVVLGILGSLTAAVGAYSGLSTYVMWRHPLGRWGGKVAGLLLITITTGIGWYAV
HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCHHHCCHHHHHHHHHHHHHCCHHHHH
ETWLFGIVMSEIFPNNPFFSVGVAAIWGGILMTIMTYVGYRMLSFLSYFTIPFHIWLIAI
HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GIAIVLALKGGFHTVMAAVPTSHMSLLDGISATIGLYSAGTIISPDISRFAKSAKDAGYA
HHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCEECCCHHHHHHHHCCCHHH
WFAHIIFLYPFLILGGVAIVLATGSYLITNAMLELGMGVGVLLIIVFGQFIINTDNLYSG
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCC
SLSLVNLIPMRREIASIINGVIGTAIAAYVGFSAGSSITPFENFISLLGDFLPAMGGIVL
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
ADFYIVKKYINKIQDPHKRYQFVPNNKYYNINIAGILALALGSIVGYFVNAGIPAINSLV
HHHHHHHHHHHHHCCHHHHCEECCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCHHHHHHH
TGFLSYIIIYYIINALGKSPEILPFNYEGGILR
HHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC
>Mature Secondary Structure 
TGKEEISSKYDDYSLKEVPKDSRYGFFNVFLVFSSVYGAIAVIWAGGALGYGLTFSQAI
CCHHHHHHCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHH
IAVLSGTVVLGILGSLTAAVGAYSGLSTYVMWRHPLGRWGGKVAGLLLITITTGIGWYAV
HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCCHHHCCHHHHHHHHHHHHHCCHHHHH
ETWLFGIVMSEIFPNNPFFSVGVAAIWGGILMTIMTYVGYRMLSFLSYFTIPFHIWLIAI
HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GIAIVLALKGGFHTVMAAVPTSHMSLLDGISATIGLYSAGTIISPDISRFAKSAKDAGYA
HHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCEECCCHHHHHHHHCCCHHH
WFAHIIFLYPFLILGGVAIVLATGSYLITNAMLELGMGVGVLLIIVFGQFIINTDNLYSG
HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCC
SLSLVNLIPMRREIASIINGVIGTAIAAYVGFSAGSSITPFENFISLLGDFLPAMGGIVL
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
ADFYIVKKYINKIQDPHKRYQFVPNNKYYNINIAGILALALGSIVGYFVNAGIPAINSLV
HHHHHHHHHHHHHCCHHHHCEECCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCHHHHHHH
TGFLSYIIIYYIINALGKSPEILPFNYEGGILR
HHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; cytosine [Periplasm] [C]

Specific reaction: Proton [Periplasm] + cytosine [Periplasm] = Proton [Cytoplasm] + cytosine [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]