Definition | Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130 chromosome I, complete sequence. |
---|---|
Accession | NC_005823 |
Length | 4,277,185 |
The map label for this gene is rsgA
Identifier: 45659015
GI number: 45659015
Start: 3910317
End: 3911396
Strand: Reverse
Name: rsgA
Synonym: LIC13193
Alternate gene names: 45659015
Gene position: 3911396-3910317 (Counterclockwise)
Preceding gene: 45659019
Following gene: 45659014
Centisome position: 91.45
GC content: 37.78
Gene sequence:
>1080_bases ATGTCACAACCAAATTCAATCCTAATGTCTTATGGTTGGGATCCGAGCATTTATTTAGAAGAACCGAAACTTCTTGAAGG TTTAAAACCTGGACGTGTACTTGCTGTTTACGGAGAATATTCAAAAATTATAATAGAACAGGGCGAAAAGAAAGGTATTT TTTCTGGTGCTCTGATGGCTTCTGGGGAATCAATTGTAACTGGAGATTGGGTACTCATACGAGAAATTGAAGGAGATGAA CTTTGTATCGTAGAAAAAATTCTTCCCCGAAAAACTTTTCTAAGAAGAAGTAATCCAGGAAAAAGAAAAGGTTCCCAGGC GATTGCGTCAAACATAGATCTTTTATTAGTGATTATGGGTTTAGATAACGATTATAGTCCGAGAAGAATAGAACGTTATT TGTTCTTGGCCAAGGTTAGTGGGGCACAAGTCACGATCGTTTTAAATAAAAAGGATCTTTGTATGGATCCTGAAAATAAA TTTATGGAAATTAAAATGATCGCTGGAGAAACACCGATCGAAATGATTTCAGCTTTGGATCTAAAACAGACTCGAACAAT TTTGCAATGGATCGATCCGGGAAAAACGATCGCATTTTTAGGATCTTCGGGCGCGGGTAAATCTACTATCATTAATTCTT TATTAGGTGGAGAGATTCAAAAAACCAATGAGGTAAAAGTTTCCGATGGAACCGGAAAACATACTACAACTCGCAGAGAA CTGTTTCTTTTACCTTCGGGTGGGGTTCTTATGGACAATCCGGGAATCAGAGAAGTAGGTTTGTTTTCAGAAGGAAGCGA AGACGAACTTGAGGAAGTGTTTCCGGAAATTGCAGTGGCTGCGGAAGAATGTCGTTTTAACGATTGTTCTCATAATGAAG AACCCAATTGCGGAGTTGTAGCGGCCGTAAAAGATGGAAGAATCAGTGAAGCTAGATACTTTTCTTATTTAAAACTTTCG AAAGAATTAATGGCTTATCAGGCCTTGAACGATCCGGAAGAAGCGAGAAAGAAAAAACAAAAAGATAAACAAATGTCCAA AGCTTTACAAAAGAGACTTAAGGATAAGGGTAGAAAATAG
Upstream 100 bases:
>100_bases TTTGGATTCAAAGTTCGAAAGGTTGATGAATAAAATTTATATAGTTGTTTAATTTTATTTTAAAAATCTAAATTTTCAGA ATCTAAAAAGAGGAAAAAAT
Downstream 100 bases:
>100_bases TAGCTTTTACAGATTACGTCACTTCGTGTTTAGGAATTTTTTTATTAAATAATAGATTTGTTAAAAAATTCCATAGTGGC CGATTAACAAAACTGCTTCA
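The coordinate and composition fields above follow from simple arithmetic. The sketch below is a minimal Python illustration, assuming 1-based inclusive coordinates and assuming the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The function names are hypothetical and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, chromosome_length: int) -> float:
    """Position as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * position / chromosome_length, 2)


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Values taken from this record: Start, End, and chromosome Length.
print(gene_length(3910317, 3911396))   # 1080
print(centisome(3911396, 4277185))     # 91.45

# Toy sequence only; paste the full gene sequence to check the GC content field.
print(gc_content("ATGC GGTA"))
```

Passing the 1080-base gene sequence listed above to `gc_content` should give a value comparable to the GC content field, provided the sequence is copied without errors.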
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 359; Mature: 358
Protein sequence:
>359_residues MSQPNSILMSYGWDPSIYLEEPKLLEGLKPGRVLAVYGEYSKIIIEQGEKKGIFSGALMASGESIVTGDWVLIREIEGDE LCIVEKILPRKTFLRRSNPGKRKGSQAIASNIDLLLVIMGLDNDYSPRRIERYLFLAKVSGAQVTIVLNKKDLCMDPENK FMEIKMIAGETPIEMISALDLKQTRTILQWIDPGKTIAFLGSSGAGKSTIINSLLGGEIQKTNEVKVSDGTGKHTTTRRE LFLLPSGGVLMDNPGIREVGLFSEGSEDELEEVFPEIAVAAEECRFNDCSHNEEPNCGVVAAVKDGRISEARYFSYLKLS KELMAYQALNDPEEARKKKQKDKQMSKALQKRLKDKGRK
Sequences:
>Translated_359_residues MSQPNSILMSYGWDPSIYLEEPKLLEGLKPGRVLAVYGEYSKIIIEQGEKKGIFSGALMASGESIVTGDWVLIREIEGDE LCIVEKILPRKTFLRRSNPGKRKGSQAIASNIDLLLVIMGLDNDYSPRRIERYLFLAKVSGAQVTIVLNKKDLCMDPENK FMEIKMIAGETPIEMISALDLKQTRTILQWIDPGKTIAFLGSSGAGKSTIINSLLGGEIQKTNEVKVSDGTGKHTTTRRE LFLLPSGGVLMDNPGIREVGLFSEGSEDELEEVFPEIAVAAEECRFNDCSHNEEPNCGVVAAVKDGRISEARYFSYLKLS KELMAYQALNDPEEARKKKQKDKQMSKALQKRLKDKGRK >Mature_358_residues SQPNSILMSYGWDPSIYLEEPKLLEGLKPGRVLAVYGEYSKIIIEQGEKKGIFSGALMASGESIVTGDWVLIREIEGDEL CIVEKILPRKTFLRRSNPGKRKGSQAIASNIDLLLVIMGLDNDYSPRRIERYLFLAKVSGAQVTIVLNKKDLCMDPENKF MEIKMIAGETPIEMISALDLKQTRTILQWIDPGKTIAFLGSSGAGKSTIINSLLGGEIQKTNEVKVSDGTGKHTTTRREL FLLPSGGVLMDNPGIREVGLFSEGSEDELEEVFPEIAVAAEECRFNDCSHNEEPNCGVVAAVKDGRISEARYFSYLKLSK ELMAYQALNDPEEARKKKQKDKQMSKALQKRLKDKGRK
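The translated and mature sequences above differ only by the initiator methionine (359 versus 358 residues). The following is a minimal sketch of how the gene sequence maps to these two forms, assuming the standard codon assignments and assuming the mature form is obtained simply by removing the leading Met. The helper names `translate` and `strip_initiator_met` are illustrative, not taken from the database.

```python
from itertools import product

# Standard codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA sequence; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def strip_initiator_met(protein: str) -> str:
    """Return the protein without a leading Met (assumed mature form)."""
    return protein[1:] if protein.startswith("M") else protein


# First 27 bases of the gene sequence listed in this record.
prefix = translate("ATGTCACAACCAAATTCAATCCTAATG")
print(prefix)                       # MSQPNSILM
print(strip_initiator_met(prefix))  # SQPNSILM
```

The gene sequence in this record is already given on the coding strand, so no reverse-complement step is needed here even though the gene lies on the reverse strand of the chromosome.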
Specific function: May play a role in 30S ribosomal subunit biogenesis. Unusual circularly permuted GTPase that catalyzes rapid hydrolysis of GTP with a slow catalytic turnover.
COG id: COG1162
COG function: function code R; Predicted GTPases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 engC GTPase domain
Homologues:
Organism=Escherichia coli, GI87082381, Length=300, Percent_Identity=30.67, Blast_Score=127, Evalue=2e-30
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): RSGA_LEPIC (Q72MK0)
Other databases:
- EMBL: AE016823
- RefSeq: YP_003101.1
- ProteinModelPortal: Q72MK0
- SMR: Q72MK0
- GeneID: 2769624
- GenomeReviews: AE016823_GR
- KEGG: lic:LIC13193
- NMPDR: fig|267671.1.peg.3101
- HOGENOM: HBG652450
- OMA: FPAVGDW
- BioCyc: LINT267671:LIC_13193-MONOMER
- HAMAP: MF_01820
- InterPro: IPR010914
- InterPro: IPR016027
- InterPro: IPR004881
Pfam domain/function: PF03193 DUF258; SSF50249 Nucleic_acid_OB
EC number: NA
Molecular weight: Translated: 39939; Mature: 39807
Theoretical pI: Translated: 7.25; Mature: 7.25
Prosite motif: PS50936 ENGC_GTPASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
4.5 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
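The Cys/Met percentages above are residue counts divided by protein length. The sketch below shows the calculation in Python. Counting the listed translated sequence by hand appears to give 5 Cys and 11 Met in 359 residues, so the example uses a synthetic stand-in sequence with that assumed composition rather than the real sequence. The function name `residue_percent` is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` that are any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues)
    return round(100.0 * hits / len(seq), 1)


# Synthetic stand-in with the assumed composition (5 Cys, 11 Met, 359 residues).
translated = "M" * 11 + "C" * 5 + "A" * 343
mature = translated[1:]  # assumed: mature form lacks the initiator Met

print(residue_percent(translated, "C"))   # 1.4
print(residue_percent(translated, "M"))   # 3.1
print(residue_percent(translated, "CM"))  # 4.5
print(residue_percent(mature, "M"))       # 2.8
print(residue_percent(mature, "CM"))      # 4.2
```

Substituting the real translated and mature sequences for the stand-in strings should reproduce the same figures if the hand count is right.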
Secondary structure:
>Translated Secondary Structure
MSQPNSILMSYGWDPSIYLEEPKLLEGLKPGRVLAVYGEYSKIIIEQGEKKGIFSGALMA
CCCCCCEEEEECCCCCEEECCCHHHCCCCCCCEEEEECCCEEEEEECCCCCCCEECHHHC
SGESIVTGDWVLIREIEGDELCIVEKILPRKTFLRRSNPGKRKGSQAIASNIDLLLVIMG
CCCEEEECCEEEEEECCCCCEEEHHHHCCHHHHHHCCCCCCCCCHHHHHCCCCEEEEEEE
LDNDYSPRRIERYLFLAKVSGAQVTIVLNKKDLCMDPENKFMEIKMIAGETPIEMISALD
CCCCCCHHHHHHEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEECCCCHHHHHHHHH
LKQTRTILQWIDPGKTIAFLGSSGAGKSTIINSLLGGEIQKTNEVKVSDGTGKHTTTRRE
HHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEE
LFLLPSGGVLMDNPGIREVGLFSEGSEDELEEVFPEIAVAAEECRFNDCSHNEEPNCGVV
EEEECCCCEEECCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEE
AAVKDGRISEARYFSYLKLSKELMAYQALNDPEEARKKKQKDKQMSKALQKRLKDKGRK
EEECCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
SQPNSILMSYGWDPSIYLEEPKLLEGLKPGRVLAVYGEYSKIIIEQGEKKGIFSGALMA
CCCCCEEEEECCCCCEEECCCHHHCCCCCCCEEEEECCCEEEEEECCCCCCCEECHHHC
SGESIVTGDWVLIREIEGDELCIVEKILPRKTFLRRSNPGKRKGSQAIASNIDLLLVIMG
CCCEEEECCEEEEEECCCCCEEEHHHHCCHHHHHHCCCCCCCCCHHHHHCCCCEEEEEEE
LDNDYSPRRIERYLFLAKVSGAQVTIVLNKKDLCMDPENKFMEIKMIAGETPIEMISALD
CCCCCCHHHHHHEEEEEEECCCEEEEEECCCCCCCCCCCCEEEEEEECCCCHHHHHHHHH
LKQTRTILQWIDPGKTIAFLGSSGAGKSTIINSLLGGEIQKTNEVKVSDGTGKHTTTRRE
HHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEE
LFLLPSGGVLMDNPGIREVGLFSEGSEDELEEVFPEIAVAAEECRFNDCSHNEEPNCGVV
EEEECCCCEEECCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEE
AAVKDGRISEARYFSYLKLSKELMAYQALNDPEEARKKKQKDKQMSKALQKRLKDKGRK
EEECCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC
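The per-residue prediction above can be summarized as runs of consecutive states and as overall state percentages. The following Python sketch assumes the common three-state convention (H for helix, E for strand, C for coil), which the record does not spell out. The function names are illustrative only.

```python
from itertools import groupby


def segments(ss: str) -> list[tuple[str, int]]:
    """Collapse a per-residue state string into (state, run length) pairs."""
    return [(state, len(list(run))) for state, run in groupby(ss)]


def state_fractions(ss: str) -> dict[str, float]:
    """Percentage of residues in each of the three assumed states."""
    if not ss:
        raise ValueError("empty prediction")
    return {s: round(100.0 * ss.count(s) / len(ss), 1) for s in "HEC"}


# First 60 states of the translated-protein prediction in this record.
first_line = "CCCCCCEEEEECCCCCEEECCCHHHCCCCCCCEEEEECCCEEEEEECCCCCCCEECHHHC"
print(segments(first_line)[:4])
print(state_fractions(first_line))
```

Joining all state lines of one prediction into a single string before calling these functions gives whole-protein figures, which can be compared with the "Alpha Beta" structure class reported in this record.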
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: GTP [C]
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA