Definition | Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130 chromosome chromosome I, complete sequence. |
---|---|
Accession | NC_005823 |
Length | 4,277,185 |
Click here to switch to the map view.
The map label for this gene is guaB
Identifier: 45657779
GI number: 45657779
Start: 2316145
End: 2317671
Strand: Reverse
Name: guaB
Synonym: LIC11919
Alternate gene names: 45657779
Gene position: 2317671-2316145 (Counterclockwise)
Preceding gene: 45657781
Following gene: 45657778
Centisome position: 54.19
GC content: 40.21
Gene sequence:
>1527_bases ATGTCAAACCAAACTTACCGCGACTCCGAATATTTAGACGGACTATCTGGAGAGGAATTATTCAATCTTCAGATTGGTCT AACTTACAGAGATTTTCTCGTTCTACCTGGATTTATCGACTTCCATCCTTCTGAAGTTGAGCTTGAAACCAGACTGACCA GAAATATAAAATTGAAAAGACCTTTTATCAGTTCTCCTATGGACACAGTTACCGAATCCCAGATGGCGATTGCCCAAGCA CTTATGGGAGGGATCGGGATCATTCATTATAACAATACGATCGAAGAACAGGTTGCACTTGTAGAAAAGGTAAAACGTTT TGAAAACGGATTTATCACAGATCCGGTGATTTTAGGGCCAAAAAACGTCATTCGTGATTTAGATGCAATTAAAGAACGTA AAGGATTTACAGGAATTCCGGTAACTGAGGATGGAACAAGAAATTCTAAACTTATCGGAATCGTAACAAACAGAGATATA GATTTCGAAAAAAATAGAGAAATCACCTTGGACAAAGTGATGACTACCAATCTAATTACTGGAAAAGAGGGGATCACTTT ACAAGACGCAAACGAAATCATCAAAAAATCTAAGATCGGAAAACTTCCGATTGTAGATTCTCAAGGAAAATTAGTATCTC TTGTAAGTCGTTCCGATTTAAAGAAAAATAAAGAATTCCCAGATGCCTCCAAAGACGAAAGAAAAAGGCTTCGTTGTGGA GCTGCAGTTTCTACTTTGCTGGAATCTAGAGATAGAGTGGCTGCGTTGTATGAAGCGGGAGTAGACGTGATCATCATCGA TTCTGCCCAAGGAAATTCTAATTACCAAATCGAAATGATTCAATTCATTAAAAAGGAATTTAAAAATCTAGACATAGTTG CAGGAAACGTAGTGACACGTGCTCAAGCAGAAAATCTAATTCGTGCGGGTGCGGATGGGCTTCGAATTGGAATGGGCCCC GGTTCTATTTGTATTACTCAAGATACAATGGCAGTGGGACGTGCTCAAGCGACTGCAATCTATCAAACTGCAAAACATTC CGCAAAGTATGACGTTCCTGTAATTGCAGATGGTGGAATTTCTAACATAGGAGATATTGCAAACTCACTCGCGATTGGTG CGTCTACGTGTATGATGGGGTTTATGTTTGCCGGAACCACAGAAGCTCCCGGAGAATATTTTTATGAGAACGGGATTCGC CTTAAAAAATACAGAGGAATGGCGTCCATCGAAGCTATGAAAGCAGGTGGAGATAAACGTTATTTTAACGAAGGTCAAAA AGTAAAGGTGGCTCAAGGAGTCAGTGGGTCCGTAGTAGACAGAGGATCGATTTTGAATTTTATTCCTTACCTATCTCAAG GTCTCAGATTATCCTTTCAAGATATGGGTTATAAATCCATACCCGAAATTCATAAAGCTCTTAGGGATGGGAAACTGCGT TTTGAAAGAAGGTCCGAATCCGCACAGGCTCAGGGTTCAGTACATGGACTCTATTCTTTTAGTGCTCCTACTATGAGGGC GGAGTAA
Upstream 100 bases:
>100_bases CTCTCTTTTTTTTCTCTGAATTGTTTCCATAGTTTTTCAAAGCGAACGAGGGGAAATTCTCTTGATCTTAGTTCTCAGTT TACCTTTTATGTACTAGGTA
Downstream 100 bases:
>100_bases TTGGATCGGATTCTTGCGTTTACGTTTGGTTTTTTTGCTCTTGTAGTATTGAGCAAAGAAATAAACTCGCAAGGATCTGC TATAAGAGTCGCTCAGGAAT
Product: IMP dehydrogenase
Products: NA
Alternate protein names: IMP dehydrogenase; IMPD; IMPDH [H]
Number of amino acids: Translated: 508; Mature: 507
Protein sequence:
>508_residues MSNQTYRDSEYLDGLSGEELFNLQIGLTYRDFLVLPGFIDFHPSEVELETRLTRNIKLKRPFISSPMDTVTESQMAIAQA LMGGIGIIHYNNTIEEQVALVEKVKRFENGFITDPVILGPKNVIRDLDAIKERKGFTGIPVTEDGTRNSKLIGIVTNRDI DFEKNREITLDKVMTTNLITGKEGITLQDANEIIKKSKIGKLPIVDSQGKLVSLVSRSDLKKNKEFPDASKDERKRLRCG AAVSTLLESRDRVAALYEAGVDVIIIDSAQGNSNYQIEMIQFIKKEFKNLDIVAGNVVTRAQAENLIRAGADGLRIGMGP GSICITQDTMAVGRAQATAIYQTAKHSAKYDVPVIADGGISNIGDIANSLAIGASTCMMGFMFAGTTEAPGEYFYENGIR LKKYRGMASIEAMKAGGDKRYFNEGQKVKVAQGVSGSVVDRGSILNFIPYLSQGLRLSFQDMGYKSIPEIHKALRDGKLR FERRSESAQAQGSVHGLYSFSAPTMRAE
Sequences:
>Translated_508_residues MSNQTYRDSEYLDGLSGEELFNLQIGLTYRDFLVLPGFIDFHPSEVELETRLTRNIKLKRPFISSPMDTVTESQMAIAQA LMGGIGIIHYNNTIEEQVALVEKVKRFENGFITDPVILGPKNVIRDLDAIKERKGFTGIPVTEDGTRNSKLIGIVTNRDI DFEKNREITLDKVMTTNLITGKEGITLQDANEIIKKSKIGKLPIVDSQGKLVSLVSRSDLKKNKEFPDASKDERKRLRCG AAVSTLLESRDRVAALYEAGVDVIIIDSAQGNSNYQIEMIQFIKKEFKNLDIVAGNVVTRAQAENLIRAGADGLRIGMGP GSICITQDTMAVGRAQATAIYQTAKHSAKYDVPVIADGGISNIGDIANSLAIGASTCMMGFMFAGTTEAPGEYFYENGIR LKKYRGMASIEAMKAGGDKRYFNEGQKVKVAQGVSGSVVDRGSILNFIPYLSQGLRLSFQDMGYKSIPEIHKALRDGKLR FERRSESAQAQGSVHGLYSFSAPTMRAE >Mature_507_residues SNQTYRDSEYLDGLSGEELFNLQIGLTYRDFLVLPGFIDFHPSEVELETRLTRNIKLKRPFISSPMDTVTESQMAIAQAL MGGIGIIHYNNTIEEQVALVEKVKRFENGFITDPVILGPKNVIRDLDAIKERKGFTGIPVTEDGTRNSKLIGIVTNRDID FEKNREITLDKVMTTNLITGKEGITLQDANEIIKKSKIGKLPIVDSQGKLVSLVSRSDLKKNKEFPDASKDERKRLRCGA AVSTLLESRDRVAALYEAGVDVIIIDSAQGNSNYQIEMIQFIKKEFKNLDIVAGNVVTRAQAENLIRAGADGLRIGMGPG SICITQDTMAVGRAQATAIYQTAKHSAKYDVPVIADGGISNIGDIANSLAIGASTCMMGFMFAGTTEAPGEYFYENGIRL KKYRGMASIEAMKAGGDKRYFNEGQKVKVAQGVSGSVVDRGSILNFIPYLSQGLRLSFQDMGYKSIPEIHKALRDGKLRF ERRSESAQAQGSVHGLYSFSAPTMRAE
Specific function: GMP biosynthesis from IMP; first step. [C]
COG id: COG0516
COG function: function code F; IMP dehydrogenase/GMP reductase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains [H]
Homologues:
Organism=Homo sapiens, GI217035146, Length=494, Percent_Identity=56.2753036437247, Blast_Score=577, Evalue=1e-165, Organism=Homo sapiens, GI34328928, Length=494, Percent_Identity=56.2753036437247, Blast_Score=577, Evalue=1e-165, Organism=Homo sapiens, GI34328930, Length=494, Percent_Identity=56.2753036437247, Blast_Score=577, Evalue=1e-165, Organism=Homo sapiens, GI156616279, Length=494, Percent_Identity=56.2753036437247, Blast_Score=577, Evalue=1e-164, Organism=Homo sapiens, GI66933016, Length=494, Percent_Identity=54.6558704453441, Blast_Score=568, Evalue=1e-162, Organism=Homo sapiens, GI217035148, Length=494, Percent_Identity=55.4655870445344, Blast_Score=566, Evalue=1e-161, Organism=Homo sapiens, GI217035152, Length=476, Percent_Identity=56.5126050420168, Blast_Score=560, Evalue=1e-159, Organism=Homo sapiens, GI217035150, Length=494, Percent_Identity=53.0364372469636, Blast_Score=536, Evalue=1e-152, Organism=Homo sapiens, GI156104880, Length=215, Percent_Identity=35.8139534883721, Blast_Score=128, Evalue=1e-29, Organism=Homo sapiens, GI50541956, Length=173, Percent_Identity=39.3063583815029, Blast_Score=118, Evalue=1e-26, Organism=Homo sapiens, GI50541954, Length=173, Percent_Identity=39.3063583815029, Blast_Score=118, Evalue=1e-26, Organism=Homo sapiens, GI50541952, Length=173, Percent_Identity=39.3063583815029, Blast_Score=118, Evalue=1e-26, Organism=Homo sapiens, GI50541948, Length=173, Percent_Identity=39.3063583815029, Blast_Score=118, Evalue=1e-26, Organism=Escherichia coli, GI1788855, Length=478, Percent_Identity=38.0753138075314, Blast_Score=276, Evalue=3e-75, Organism=Escherichia coli, GI1786293, Length=212, Percent_Identity=31.1320754716981, Blast_Score=108, Evalue=1e-24, Organism=Caenorhabditis elegans, GI71994385, Length=527, Percent_Identity=48.7666034155598, Blast_Score=478, Evalue=1e-135, Organism=Caenorhabditis elegans, GI71994389, Length=449, Percent_Identity=49.2204899777283, Blast_Score=408, Evalue=1e-114, Organism=Caenorhabditis elegans, GI17560440, Length=209, Percent_Identity=32.0574162679426, Blast_Score=116, Evalue=3e-26, Organism=Saccharomyces cerevisiae, GI6323464, Length=498, Percent_Identity=56.8273092369478, Blast_Score=561, Evalue=1e-160, Organism=Saccharomyces cerevisiae, GI6322012, Length=497, Percent_Identity=53.9235412474849, Blast_Score=559, Evalue=1e-160, Organism=Saccharomyces cerevisiae, GI6323585, Length=497, Percent_Identity=54.5271629778672, Blast_Score=557, Evalue=1e-159, Organism=Saccharomyces cerevisiae, GI6319352, Length=351, Percent_Identity=52.991452991453, Blast_Score=389, Evalue=1e-109, Organism=Saccharomyces cerevisiae, GI6319353, Length=146, Percent_Identity=54.7945205479452, Blast_Score=154, Evalue=4e-38, Organism=Drosophila melanogaster, GI24641071, Length=500, Percent_Identity=55.4, Blast_Score=572, Evalue=1e-163, Organism=Drosophila melanogaster, GI24641073, Length=500, Percent_Identity=55.4, Blast_Score=572, Evalue=1e-163, Organism=Drosophila melanogaster, GI28571163, Length=441, Percent_Identity=55.5555555555556, Blast_Score=503, Evalue=1e-142,
Paralogues:
None
Copy number: 600 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013785 - InterPro: IPR000644 - InterPro: IPR005990 - InterPro: IPR018529 - InterPro: IPR015875 - InterPro: IPR001093 [H]
Pfam domain/function: PF00571 CBS; PF00478 IMPDH [H]
EC number: =1.1.1.205 [H]
Molecular weight: Translated: 55882; Mature: 55751
Theoretical pI: Translated: 8.65; Mature: 8.65
Prosite motif: PS00487 IMP_DH_GMP_RED
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein) 3.0 %Met (Translated Protein) 3.5 %Cys+Met (Translated Protein) 0.6 %Cys (Mature Protein) 2.8 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSNQTYRDSEYLDGLSGEELFNLQIGLTYRDFLVLPGFIDFHPSEVELETRLTRNIKLKR CCCCCCCCHHHHCCCCCCEEEEEEEECEEEHHEEECCEECCCCCCEEEHHHHCCCEEEEC PFISSPMDTVTESQMAIAQALMGGIGIIHYNNTIEEQVALVEKVKRFENGFITDPVILGP CCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCCCEEECC KNVIRDLDAIKERKGFTGIPVTEDGTRNSKLIGIVTNRDIDFEKNREITLDKVMTTNLIT HHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEHHHHEEHHEEE GKEGITLQDANEIIKKSKIGKLPIVDSQGKLVSLVSRSDLKKNKEFPDASKDERKRLRCG CCCCCEECCHHHHHHHHCCCCCCEECCCCCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHH AAVSTLLESRDRVAALYEAGVDVIIIDSAQGNSNYQIEMIQFIKKEFKNLDIVAGNVVTR HHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCEEEECCCHHH AQAENLIRAGADGLRIGMGPGSICITQDTMAVGRAQATAIYQTAKHSAKYDVPVIADGGI HHHHHHHHCCCCCEEEECCCCCEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCC SNIGDIANSLAIGASTCMMGFMFAGTTEAPGEYFYENGIRLKKYRGMASIEAMKAGGDKR CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCEEEEECCCCHHHHHHHCCCCHH YFNEGQKVKVAQGVSGSVVDRGSILNFIPYLSQGLRLSFQDMGYKSIPEIHKALRDGKLR HHCCCCEEEEECCCCCCEECCCHHHHHHHHHCCCCEEEHHHCCCCCHHHHHHHHHCCCEE FERRSESAQAQGSVHGLYSFSAPTMRAE EHHHCCCHHHCCCEEEEEEECCCCCCCC >Mature Secondary Structure SNQTYRDSEYLDGLSGEELFNLQIGLTYRDFLVLPGFIDFHPSEVELETRLTRNIKLKR CCCCCCCHHHHCCCCCCEEEEEEEECEEEHHEEECCEECCCCCCEEEHHHHCCCEEEEC PFISSPMDTVTESQMAIAQALMGGIGIIHYNNTIEEQVALVEKVKRFENGFITDPVILGP CCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCCCEEECC KNVIRDLDAIKERKGFTGIPVTEDGTRNSKLIGIVTNRDIDFEKNREITLDKVMTTNLIT HHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEHHHHEEHHEEE GKEGITLQDANEIIKKSKIGKLPIVDSQGKLVSLVSRSDLKKNKEFPDASKDERKRLRCG CCCCCEECCHHHHHHHHCCCCCCEECCCCCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHH AAVSTLLESRDRVAALYEAGVDVIIIDSAQGNSNYQIEMIQFIKKEFKNLDIVAGNVVTR HHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCEEEECCCHHH AQAENLIRAGADGLRIGMGPGSICITQDTMAVGRAQATAIYQTAKHSAKYDVPVIADGGI HHHHHHHHCCCCCEEEECCCCCEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCC SNIGDIANSLAIGASTCMMGFMFAGTTEAPGEYFYENGIRLKKYRGMASIEAMKAGGDKR CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCEEEEECCCCHHHHHHHCCCCHH YFNEGQKVKVAQGVSGSVVDRGSILNFIPYLSQGLRLSFQDMGYKSIPEIHKALRDGKLR HHCCCCEEEEECCCCCCEECCCHHHHHHHHHCCCCEEEHHHCCCCCHHHHHHHHHCCCEE FERRSESAQAQGSVHGLYSFSAPTMRAE EHHHCCCHHHCCCEEEEEEECCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9537320 [H]