| Definition | Nocardioides sp. JS614 chromosome, complete genome. |
|---|---|
| Accession | NC_008699 |
| Length | 4,985,871 |
The map label for this gene is cbs [H]
Identifier: 119715184
GI number: 119715184
Start: 985091
End: 986479
Strand: Direct
Name: cbs [H]
Synonym: Noca_0939
Alternate gene names: 119715184
Gene position: 985091-986479 (Clockwise)
Preceding gene: 119715182
Following gene: 119715185
Centisome position: 19.76
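The gene length and centisome position can be reproduced from the coordinates listed above. The following is a minimal sketch, assuming the centisome value is the start coordinate expressed as a percentage of the genome length (an assumption that happens to match the listed 19.76); the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values taken from this record.
print(gene_length(985091, 986479))   # 1389
print(centisome(985091, 4985871))    # 19.76
```

The computed length of 1389 agrees with the `>1389_bases` header of the gene sequence.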
GC content: 71.35
Gene sequence:
>1389_bases GTGAACTCACTCCTCGACCTGATCGGCAACACCCCGCTGCTGAGGCTCTCGACGTCCATGGGCTCCCTGAACGGCGCGAA GGGACCGATCGTCCTCGCCAAGGTGGAGTACCTCAACCCCGGCGGCTCCGTGAAGGACCGCATCGCCACCCGGATGATCG AGGCGGCCGAGGCGTCCGGGGAGCTTCAGCCCGGCGGCACCATCGTCGAGCCGACGTCCGGCAACACCGGCGTCGGGCTG GCGATGGTCGCCCAGGCGAAGGGCTACAGGTGCGTCTTCGTCTGCCCGGACAAGGTCAGCGAGGACAAGCGCAACGTGCT GAAGGCGTACGGCGCGGAGGTGGTCGTCTGCCCGACCGCGGTCGAGCCGGAGCACCCCGACTCCTACTACAACGTCTCCG ACCGGCTCGCCTCGCAGCCGGGTGCCTGGAAGCCGGACCAGTACTCCAACCCGCACAACCCGCGGTCGCACTACGAGACG ACCGGCCCGGAGATCTGGGCGCAGACCGAGGGGCGGGTCACCCACTTCGTCGCCGGCGTCGGCACCGGCGGCACCATCAG CGGCACCGGGCGCTACCTCAAGGAGCAGAACTCCTCGGTCCAGGTCATCGGGGCCGACCCGGCGGGCTCGGTCTACTCCG GCGGCACCGGCCGGCCCTACCTCGTCGAGGGAGTGGGCGAGGACTTCTGGCCGGAGGCCTACGATCGCGACGTCGCCGAC CGGATCATCGAGGTCTCCGACGCCGACTCGTTCGCGATGACGCGGCGGCTGGCCCGCGAGGAGGCCCTGCTGGTCGGCGG TTCCTCCGGCATGGCCGTGCACGCGGCGGTCCAGCTCGCCCACGAGCTCGCCGGCACCCCCGAGGGCGAGGACGCGGTGA TCGTCGTACTCCTCCCGGACTCCGGCCGCGGCTACCTCACGAAGGTCTTCAACGACGACTGGCTCGCGCAGTACGGATTC CCGGTCGACGGCGCCGAGCGCTCCGTGCAGTCCGTCGGGGAGGTGCTCCGCGGCAAGAGCGGGCGGCTGCCCGACCTCGT GCACACCCACCCGAACGAGACCATCGCCGAAGCCGTCGCGATCCTCCAGGAGTACAACGTCTCCCAGATGCCGGTCGTGC GCGCGGAGCCTCCGGTGGTGGCCGCCGAGGTCGTCGGATCGGTCTCCGAGCGGACCCTGCTCGACCTGCTGTTCACCGGC TCGGCCAAGCTCACCGACAGCGTCGGCGAGCACATGGCGCCCCCGCTGCCGACGATCGGCTCCACCGAGCCCGCCTCCGA GGCCGTCGCCGCACTCGAGGGCGCCGACGCCCTGTTGGTGCACGAGGACGGCAAGCCCGTCGGCGTCGTCACCCGCCACG ACCTGCTGGCCTACCTCGCGCGCGGCTGA
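A GC content figure such as the one listed above is usually the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from a record laid out like the gene sequence line (a `>` header token followed by space-separated chunks). The helper names are hypothetical, and the example input is a toy string, not the full gene.

```python
def clean_fasta_line(record: str) -> str:
    """Drop a leading '>header' token and all whitespace, return upper-case bases."""
    tokens = record.split()
    if tokens and tokens[0].startswith(">"):
        tokens = tokens[1:]
    return "".join(tokens).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Toy input in the same layout as the record (first 8 bases of the gene only).
toy = clean_fasta_line(">8_bases GTGA ACTC")
print(toy, gc_percent(toy))
```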
Upstream 100 bases:
>100_bases CGGCTTTACCCACGCCGTCACTCTACGGACGCCGGCCAGCGCGGACGCCCGGGGTTTACGAGAAGGTGCGGACCGCCCCT AGATTGGCCGGGTGCAGTAC
Downstream 100 bases:
>100_bases GTCAGTTGGCTGCGGGGTTCGGCTGCTGGCAGCCGAACCCCGAAGCTGCGTCACCGCCTATAGTGAGAACACGTTCTAGT TCTCAGGAGGGCTGATGACG
Product: cystathionine beta-synthase
Products: NA
Alternate protein names: Beta-thionase; Serine sulfhydrase [H]
Number of amino acids: Translated: 462; Mature: 462
Protein sequence:
>462_residues MNSLLDLIGNTPLLRLSTSMGSLNGAKGPIVLAKVEYLNPGGSVKDRIATRMIEAAEASGELQPGGTIVEPTSGNTGVGL AMVAQAKGYRCVFVCPDKVSEDKRNVLKAYGAEVVVCPTAVEPEHPDSYYNVSDRLASQPGAWKPDQYSNPHNPRSHYET TGPEIWAQTEGRVTHFVAGVGTGGTISGTGRYLKEQNSSVQVIGADPAGSVYSGGTGRPYLVEGVGEDFWPEAYDRDVAD RIIEVSDADSFAMTRRLAREEALLVGGSSGMAVHAAVQLAHELAGTPEGEDAVIVVLLPDSGRGYLTKVFNDDWLAQYGF PVDGAERSVQSVGEVLRGKSGRLPDLVHTHPNETIAEAVAILQEYNVSQMPVVRAEPPVVAAEVVGSVSERTLLDLLFTG SAKLTDSVGEHMAPPLPTIGSTEPASEAVAALEGADALLVHEDGKPVGVVTRHDLLAYLARG
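The 462 residues follow from the 1389-base gene: 1389 / 3 = 463 codons, of which the last (TGA) is a stop codon. The sketch below translates the first and last few codons of the gene sequence with the standard codon table. It assumes that an initial GTG or TTG is read as methionine, as is usual for bacterial start codons; the function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed to encode Met when in first position


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGAACTCACTCCTCGACCTG"))  # first 7 codons of the gene: MNSLLDL
print(translate("GCGCGCGGCTGA"))           # last 4 codons of the gene: ARG
print(1389 // 3 - 1)                       # 462 residues after removing the stop codon
```

Both fragments match the start (`MNSLLDL`) and end (`ARG`) of the protein sequence listed above.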
Sequences:
>Translated_462_residues MNSLLDLIGNTPLLRLSTSMGSLNGAKGPIVLAKVEYLNPGGSVKDRIATRMIEAAEASGELQPGGTIVEPTSGNTGVGL AMVAQAKGYRCVFVCPDKVSEDKRNVLKAYGAEVVVCPTAVEPEHPDSYYNVSDRLASQPGAWKPDQYSNPHNPRSHYET TGPEIWAQTEGRVTHFVAGVGTGGTISGTGRYLKEQNSSVQVIGADPAGSVYSGGTGRPYLVEGVGEDFWPEAYDRDVAD RIIEVSDADSFAMTRRLAREEALLVGGSSGMAVHAAVQLAHELAGTPEGEDAVIVVLLPDSGRGYLTKVFNDDWLAQYGF PVDGAERSVQSVGEVLRGKSGRLPDLVHTHPNETIAEAVAILQEYNVSQMPVVRAEPPVVAAEVVGSVSERTLLDLLFTG SAKLTDSVGEHMAPPLPTIGSTEPASEAVAALEGADALLVHEDGKPVGVVTRHDLLAYLARG >Mature_462_residues MNSLLDLIGNTPLLRLSTSMGSLNGAKGPIVLAKVEYLNPGGSVKDRIATRMIEAAEASGELQPGGTIVEPTSGNTGVGL AMVAQAKGYRCVFVCPDKVSEDKRNVLKAYGAEVVVCPTAVEPEHPDSYYNVSDRLASQPGAWKPDQYSNPHNPRSHYET TGPEIWAQTEGRVTHFVAGVGTGGTISGTGRYLKEQNSSVQVIGADPAGSVYSGGTGRPYLVEGVGEDFWPEAYDRDVAD RIIEVSDADSFAMTRRLAREEALLVGGSSGMAVHAAVQLAHELAGTPEGEDAVIVVLLPDSGRGYLTKVFNDDWLAQYGF PVDGAERSVQSVGEVLRGKSGRLPDLVHTHPNETIAEAVAILQEYNVSQMPVVRAEPPVVAAEVVGSVSERTLLDLLFTG SAKLTDSVGEHMAPPLPTIGSTEPASEAVAALEGADALLVHEDGKPVGVVTRHDLLAYLARG
Specific function: Two cysteine synthase enzymes are found. Both catalyze the same reaction. Cysteine synthase B can also use thiosulfate in place of sulfide to give cysteine thiosulfonate as a product. [C]
COG id: COG0031
COG function: function code E; Cysteine synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the cysteine synthase/cystathionine beta-synthase family [H]
Homologues:
Organism=Homo sapiens, GI295821202, Length=477, Percent_Identity=39.832285115304, Blast_Score=296, Evalue=4e-80
Organism=Homo sapiens, GI295821200, Length=477, Percent_Identity=39.832285115304, Blast_Score=296, Evalue=4e-80
Organism=Homo sapiens, GI4557415, Length=477, Percent_Identity=39.832285115304, Blast_Score=296, Evalue=4e-80
Organism=Escherichia coli, GI2367138, Length=319, Percent_Identity=39.1849529780564, Blast_Score=214, Evalue=7e-57
Organism=Escherichia coli, GI1788754, Length=322, Percent_Identity=40.9937888198758, Blast_Score=181, Evalue=7e-47
Organism=Caenorhabditis elegans, GI17534315, Length=330, Percent_Identity=43.9393939393939, Blast_Score=236, Evalue=2e-62
Organism=Caenorhabditis elegans, GI17562970, Length=323, Percent_Identity=42.1052631578947, Blast_Score=228, Evalue=7e-60
Organism=Caenorhabditis elegans, GI17535051, Length=306, Percent_Identity=42.483660130719, Blast_Score=225, Evalue=4e-59
Organism=Caenorhabditis elegans, GI32566674, Length=303, Percent_Identity=43.2343234323432, Blast_Score=224, Evalue=7e-59
Organism=Caenorhabditis elegans, GI115535073, Length=330, Percent_Identity=42.1212121212121, Blast_Score=217, Evalue=9e-57
Organism=Caenorhabditis elegans, GI25147552, Length=328, Percent_Identity=40.5487804878049, Blast_Score=216, Evalue=2e-56
Organism=Caenorhabditis elegans, GI17561720, Length=334, Percent_Identity=38.622754491018, Blast_Score=197, Evalue=7e-51
Organism=Caenorhabditis elegans, GI32566672, Length=232, Percent_Identity=40.5172413793103, Blast_Score=153, Evalue=2e-37
Organism=Caenorhabditis elegans, GI71996324, Length=311, Percent_Identity=23.1511254019293, Blast_Score=81, Evalue=1e-15
Organism=Saccharomyces cerevisiae, GI6321594, Length=501, Percent_Identity=37.3253493013972, Blast_Score=285, Evalue=1e-77
Organism=Saccharomyces cerevisiae, GI6321449, Length=338, Percent_Identity=36.094674556213, Blast_Score=153, Evalue=7e-38
Organism=Drosophila melanogaster, GI24643623, Length=483, Percent_Identity=37.0600414078675, Blast_Score=258, Evalue=7e-69
Organism=Drosophila melanogaster, GI20129101, Length=483, Percent_Identity=37.0600414078675, Blast_Score=258, Evalue=7e-69
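The homologue entries follow a regular `Key=value` pattern, so they can be read into structured records for sorting or filtering. The sketch below is one possible parser, not part of the database itself. The field names in the resulting dictionaries are hypothetical, and the sample string reuses two entries from the list above.

```python
import re

# One hit: Organism=..., GI<digits>, Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per hit; tolerates line breaks between fields."""
    hits = []
    for m in HIT.finditer(text):
        hits.append({
            "organism": m.group("organism").strip(),
            "gi": int(m.group("gi")),
            "length": int(m.group("length")),
            "identity": float(m.group("identity")),
            "score": int(m.group("score")),
            "evalue": float(m.group("evalue")),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI295821202, Length=477, "
    "Percent_Identity=39.832285115304, Blast_Score=296, Evalue=4e-80, "
    "Organism=Drosophila melanogaster, GI24643623, Length=483, \n"
    "Percent_Identity=37.0600414078675, Blast_Score=258, Evalue=7e-69,"
)
for hit in sorted(parse_homologues(sample), key=lambda h: h["evalue"]):
    print(hit["organism"], hit["gi"], round(hit["identity"], 1), hit["evalue"])
```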
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013785
- InterPro: IPR005857
- InterPro: IPR000644
- InterPro: IPR001926 [H]
Pfam domain/function: PF00571 CBS; PF00291 PALP [H]
EC number: 4.2.1.22 [H]
Molecular weight: Translated: 48730; Mature: 48730
Theoretical pI: Translated: 4.58; Mature: 4.58
Prosite motif: PS00901 CYS_SYNTHASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
1.7 %Met (Mature Protein)
2.4 %Cys+Met (Mature Protein)
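Residue percentages like these can be derived by counting one-letter codes in the protein sequence. The sketch below assumes each figure is rounded to one decimal from its own count; this would explain why the combined value (2.4) need not equal the sum of the rounded parts (0.6 + 1.7). The function name is hypothetical and the input is a toy peptide, not the full protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` whose residue is in `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein.upper() if aa in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"  # toy peptide: 1 Met, 1 Cys, 8 Ala
print(residue_percent(toy, "C"))   # Cys
print(residue_percent(toy, "M"))   # Met
print(residue_percent(toy, "CM"))  # Cys+Met, computed from the combined count
```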
Secondary structure:
>Translated Secondary Structure MNSLLDLIGNTPLLRLSTSMGSLNGAKGPIVLAKVEYLNPGGSVKDRIATRMIEAAEASG CCHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHHCCC ELQPGGTIVEPTSGNTGVGLAMVAQAKGYRCVFVCPDKVSEDKRNVLKAYGAEVVVCPTA CCCCCCEEEECCCCCCCCCEEEEECCCCCEEEEECCCCCCHHHHHHHHHCCCCEEEECCC VEPEHPDSYYNVSDRLASQPGAWKPDQYSNPHNPRSHYETTGPEIWAQTEGRVTHFVAGV CCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEEEC GTGGTISGTGRYLKEQNSSVQVIGADPAGSVYSGGTGRPYLVEGVGEDFWPEAYDRDVAD CCCCCCCCCCHHHHCCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCHHCCCHHH RIIEVSDADSFAMTRRLAREEALLVGGSSGMAVHAAVQLAHELAGTPEGEDAVIVVLLPD HHEECCCCHHHHHHHHHHHHCEEEEECCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEECC SGRGYLTKVFNDDWLAQYGFPVDGAERSVQSVGEVLRGKSGRLPDLVHTHPNETIAEAVA CCCCEEEEECCCCHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHH ILQEYNVSQMPVVRAEPPVVAAEVVGSVSERTLLDLLFTGSAKLTDSVGEHMAPPLPTIG HHHHCCCCCCCEEECCCCEEHHHHHCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCC STEPASEAVAALEGADALLVHEDGKPVGVVTRHDLLAYLARG CCCCHHHHHHHHCCCCEEEEECCCCCEEEEEHHHHHHHHHCC >Mature Secondary Structure MNSLLDLIGNTPLLRLSTSMGSLNGAKGPIVLAKVEYLNPGGSVKDRIATRMIEAAEASG CCHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHHCCC ELQPGGTIVEPTSGNTGVGLAMVAQAKGYRCVFVCPDKVSEDKRNVLKAYGAEVVVCPTA CCCCCCEEEECCCCCCCCCEEEEECCCCCEEEEECCCCCCHHHHHHHHHCCCCEEEECCC VEPEHPDSYYNVSDRLASQPGAWKPDQYSNPHNPRSHYETTGPEIWAQTEGRVTHFVAGV CCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEEEC GTGGTISGTGRYLKEQNSSVQVIGADPAGSVYSGGTGRPYLVEGVGEDFWPEAYDRDVAD CCCCCCCCCCHHHHCCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCHHCCCHHH RIIEVSDADSFAMTRRLAREEALLVGGSSGMAVHAAVQLAHELAGTPEGEDAVIVVLLPD HHEECCCCHHHHHHHHHHHHCEEEEECCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEECC SGRGYLTKVFNDDWLAQYGFPVDGAERSVQSVGEVLRGKSGRLPDLVHTHPNETIAEAVA CCCCEEEEECCCCHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHH ILQEYNVSQMPVVRAEPPVVAAEVVGSVSERTLLDLLFTGSAKLTDSVGEHMAPPLPTIG HHHHCCCCCCCEEECCCCEEHHHHHCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCC STEPASEAVAALEGADALLVHEDGKPVGVVTRHDLLAYLARG CCCCHHHHHHHHCCCCEEEEECCCCCEEEEEHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036 [H]