Definition Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome.
Accession NC_004663
Length 6,260,361

The map label for this gene is rsgA1

Identifier: 29346600

GI number: 29346600

Start: 1485359

End: 1486426

Strand: Reverse

Name: rsgA1

Synonym: BT_1190

Alternate gene names: 29346600

Gene position: 1486426-1485359 (Counterclockwise)

Preceding gene: 29346601

Following gene: 29346596

Centisome position: 23.74
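
The coordinate fields above are mutually consistent, which a short sketch can check. The function names below are illustrative and not part of the record. The formula for the centisome position is an assumption: the listed value of 23.74 is reproduced when the higher coordinate (1486426, the 5' end of this reverse-strand gene) is divided by the genome length.

```python
# Coordinates copied from the record (1-based, inclusive).
GENOME_LENGTH = 6_260_361
START, END = 1_485_359, 1_486_426


def gene_length(start, end):
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the chromosome, to two decimals (assumed formula)."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # matches the ">1068_bases" header
print(centisome(END, GENOME_LENGTH))  # matches "Centisome position: 23.74"
```

Using the lower coordinate (1485359) instead gives 23.73, so the record appears to measure from the gene's start on the reverse strand.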

GC content (%): 42.6

Gene sequence:

>1068_bases
ATGAGTAACGAAAATAATAATCTGTCATTATATGGTTGGAGTGATAATCTATTCCGGCAAAAACAAATCTCACAATATAA
AGATATGTCTCATGGACGAATCATCGTGACACATAAAACCTGTTATGAAGTAGTTGCAGAGGATGGTGTATATTTGTGCG
AGTTAACAGGAAATATGATTTATGGCAGAGTGCCTGATGAATATCCTTGTACGGGCGATTGGGTGATTTTTCAGCCATTC
GATGCAAACAAAGGGATTATAGTTGATATATTGCCCCGTGAACGGGCGTTGTATCGAAAGAAGAACGGAAGGGTAGCCGA
CCGGCAGGCCATTGCTTCTTACGTCGATAAGGCATTTATTGTGCAAAGCCTTGATGATAATTTCAATGTTCGCAGGGCGG
AACGTTTCATTGCCCAGGTAATGGAAGAAAAAATCAAACCGGTATTAGTGCTCAATAAAGCCGATTTAGGTTGTGACAGA
CAGAAAATAGATGAAGCGATCAAACACATTGCCCGTCAGTTTCCTGTATTTATCACAAGTATTCGTCAACCTCAAACGAT
TCTTCGGTTGCGGGAATCCATAACAAAAGGCGAAACAGTTGTGTTTGTCGGCTCTTCGGGTGTCGGCAAGAGTTCTTTGG
TGAATGCCCTTTGCGGGAAATCGGTATTGAATACTTCTGATATAAGCCTGTCTACAGGGAAGGGGCGGCATACTTCGACT
CGTCGGGAAATGGTATTGATGGATGGCTCAGGTGTTTTAATCGACACTCCGGGTGTTCGGGAATTTGGTTTGGCGATTGA
CAATCCCGATTCGCTCACCGAAATGTTTGAAATATCCGACTATGCGGAATCATGCCGTTTCAGCGATTGTAAACATATCG
ACGAGCCGGGTTGTGCTGTTTTAGAGGCGGTACATAATGGTACGTTAGATCATAAGGTATATGAGAGTTATCTGAAACTC
AGACGAGAAGCATGGCACTTCTCCGCTTCCGAACATGAAAAACGTAAAAAGGAGAAATCCTTTACGAAACTCGTAGAAGA
AGTGAAGAAACGCAAGGCTAATTTCTAA
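
A GC content figure such as the one listed above can be recomputed from the FASTA block. The following is a minimal sketch, assuming the value is the plain percentage of G and C bases in the gene sequence; `gc_percent` is a hypothetical helper, and the short input in the usage line is a toy sequence, not data from this record.

```python
def gc_percent(fasta_text):
    """Percentage of G+C bases in a FASTA-style block (header lines are skipped)."""
    bases = "".join(
        line.strip().upper()
        for line in fasta_text.splitlines()
        if line.strip() and not line.startswith(">")
    )
    if not bases:
        raise ValueError("no sequence found")
    gc = bases.count("G") + bases.count("C")
    return round(100 * gc / len(bases), 1)


# Toy input for illustration only: 4 of 8 bases are G or C.
print(gc_percent(">toy\nATGC\nGGAT\n"))
```

Pasting the full 1068-base block into `gc_percent` should give a value comparable to the GC content field.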

Upstream 100 bases:

>100_bases
GAAGTGATTTTCCTATATTATATAACTTTAGTTTTTATACCCGCATACAATAGCGAAAGCATTGTTGTGCTTTTACGTGT
ATGCTATATACATTAAAATA

Downstream 100 bases:

>100_bases
AGAACTATTGAAGCAGGGAATGCCGGTACTTTATCAGACAAGGTGATACTGTATGACGGCTTCTCTGCTTCTTATTGTTT
GGTAAGAGTTTACGGCAACT

Product: putative ATP/GTP-binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 355; Mature: 354
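
The two residue counts follow from the gene length. A minimal sketch, assuming the final codon is a stop codon and that the mature form differs only by removal of the initiator methionine (consistent with the mature sequence in this record starting at S); `protein_lengths` is an illustrative name.

```python
def protein_lengths(cds_bases):
    """Return (translated, mature) residue counts for a coding sequence length."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # the last codon (TAA here) is a stop, not a residue
    mature = translated - 1   # assumption: only the initiator Met is removed
    return translated, mature


print(protein_lengths(1068))  # 1068 bases -> 356 codons -> (355, 354)
```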

Protein sequence:

>355_residues
MSNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMIYGRVPDEYPCTGDWVIFQPF
DANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFIVQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDR
QKIDEAIKHIARQFPVFITSIRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTST
RREMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAVLEAVHNGTLDHKVYESYLKL
RREAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF

Sequences:

>Translated_355_residues
MSNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMIYGRVPDEYPCTGDWVIFQPF
DANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFIVQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDR
QKIDEAIKHIARQFPVFITSIRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTST
RREMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAVLEAVHNGTLDHKVYESYLKL
RREAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF
>Mature_354_residues
SNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMIYGRVPDEYPCTGDWVIFQPFD
ANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFIVQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDRQ
KIDEAIKHIARQFPVFITSIRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTSTR
REMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAVLEAVHNGTLDHKVYESYLKLR
REAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF

Specific function: May play a role in 30S ribosomal subunit biogenesis. Unusual circularly permuted GTPase that catalyzes rapid hydrolysis of GTP with a slow catalytic turnover.

COG id: COG1162

COG function: function code R; Predicted GTPases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 engC GTPase domain

Homologues:

Organism=Escherichia coli, GI87082381, Length=267, Percent_Identity=34.0823970037453, Blast_Score=132, Evalue=5e-32,
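
The homologue entry is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows; `parse_homologue` is a hypothetical helper, and reading `Length` as the alignment length is an assumption. Under that assumption, the long Percent_Identity value corresponds to 91 identical positions out of 267.

```python
def parse_homologue(line):
    """Split one homologue entry into a dict of field name -> string value."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as "GI87082381"
    return record


line = ("Organism=Escherichia coli, GI87082381, Length=267, "
        "Percent_Identity=34.0823970037453, Blast_Score=132, Evalue=5e-32,")
hit = parse_homologue(line)
identical = round(float(hit["Percent_Identity"]) * int(hit["Length"]) / 100)
print(hit["Organism"], hit["GI"], identical)
```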

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): RSGA1_BACTN (Q8A8H7)

Other databases:

- EMBL:   AE015928
- RefSeq:   NP_810103.1
- ProteinModelPortal:   Q8A8H7
- SMR:   Q8A8H7
- GeneID:   1073808
- GenomeReviews:   AE015928_GR
- KEGG:   bth:BT_1190
- NMPDR:   fig|226186.1.peg.1190
- HOGENOM:   HBG652450
- OMA:   TSRELVP
- PhylomeDB:   Q8A8H7
- BioCyc:   BTHE226186:BT_1190-MONOMER
- HAMAP:   MF_01820
- InterPro:   IPR010914
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR004881
- Gene3D:   G3DSA:2.40.50.140
- TIGRFAMs:   TIGR00157

Pfam domain/function: PF03193 DUF258; SSF50249 Nucleic_acid_OB

EC number: NA

Molecular weight (Da): Translated: 40311; Mature: 40180

Theoretical pI: Translated: 8.20; Mature: 8.20

Prosite motif: PS50936 ENGC_GTPASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
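
These percentages can be reproduced by counting residues in the protein sequence given earlier in the record. A minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal; `percent` is an illustrative helper, and the mature form is taken as the translated sequence without its first residue.

```python
# Translated protein sequence copied from the record.
TRANSLATED = (
    "MSNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMIYGRVPDEYPCTGDWVIFQPF"
    "DANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFIVQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDR"
    "QKIDEAIKHIARQFPVFITSIRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTST"
    "RREMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAVLEAVHNGTLDHKVYESYLKL"
    "RREAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF"
)
MATURE = TRANSLATED[1:]  # assumption: initiator Met removed


def percent(seq, residues):
    """Percentage of seq made up of the given one-letter residues, to one decimal."""
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)


for name, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(name, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The translated sequence contains 8 Cys and 7 Met residues; the mature form has one fewer Met.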

Secondary structure:

>Translated Secondary Structure
MSNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMI
CCCCCCCEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHCCCEEEEEEECCEE
YGRVPDEYPCTGDWVIFQPFDANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFI
ECCCCCCCCCCCCEEEEEECCCCCCEEEEECCCHHHHHHHCCCCCHHHHHHHHHHHHHHH
VQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDRQKIDEAIKHIARQFPVFITS
HHHCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCHHHHH
IRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTST
CCCHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCCHHCCCCCCEEECCCCCCCCC
RREMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAV
CCEEEEECCCEEEEECCCCHHHCCEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH
LEAVHNGTLDHKVYESYLKLRREAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF
HHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure 
SNENNNLSLYGWSDNLFRQKQISQYKDMSHGRIIVTHKTCYEVVAEDGVYLCELTGNMI
CCCCCCEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHCCCEEEEEEECCEE
YGRVPDEYPCTGDWVIFQPFDANKGIIVDILPRERALYRKKNGRVADRQAIASYVDKAFI
ECCCCCCCCCCCCEEEEEECCCCCCEEEEECCCHHHHHHHCCCCCHHHHHHHHHHHHHHH
VQSLDDNFNVRRAERFIAQVMEEKIKPVLVLNKADLGCDRQKIDEAIKHIARQFPVFITS
HHHCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCHHHHH
IRQPQTILRLRESITKGETVVFVGSSGVGKSSLVNALCGKSVLNTSDISLSTGKGRHTST
CCCHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCCHHCCCCCCEEECCCCCCCCC
RREMVLMDGSGVLIDTPGVREFGLAIDNPDSLTEMFEISDYAESCRFSDCKHIDEPGCAV
CCEEEEECCCEEEEECCCCHHHCCEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH
LEAVHNGTLDHKVYESYLKLRREAWHFSASEHEKRKKEKSFTKLVEEVKKRKANF
HHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: GTP [C]

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12663928