Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is rho [H]

Identifier: 15890078

GI number: 15890078

Start: 2838347

End: 2839612

Strand: Reverse

Name: rho [H]

Synonym: Atu2833

Alternate gene names: 15890078

Gene position: 2839612-2838347 (Counterclockwise)

Preceding gene: 159185405

Following gene: 15890077

Centisome position: 99.93
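
The coordinate fields above are related by simple arithmetic. The sketch below recomputes the gene length and the centisome position from the Start, End and Length values of this record. It assumes that the centisome is the gene's 5' end (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length; that convention is inferred from the numbers and is not stated by the record. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the replicon length."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 2_841_580            # Length field of NC_003062
START, END = 2_838_347, 2_839_612    # Start and End fields of this record

print(gene_length(START, END))        # 1266
# Assumption: the 5' end of a reverse-strand gene is its End coordinate.
print(centisome(END, GENOME_LENGTH))  # 99.93
```

Using the Start coordinate instead gives a slightly lower percentage, which is why the 5'-end convention is assumed here.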

GC content: 54.82

Gene sequence:

>1266_bases
ATGGCTGAAATGAAGCTTCAGGAACTTAAGAGCAAATCGCCTACCGACTTGCTGACTTTTGCCGAATCCCTGGAAGTTGA
AAATGCGAGCACCATGCGCAAGCAGGAGCTCATGTTCGCAATCCTCAAGATTCTCGCGAGCCAGGACGTCGAAATCATCG
GCGAAGGCGTCGTTGAAGTCCTTCAGGACGGATTCGGGTTCCTGCGTTCTGCAAACGCAAATTACCTGCCCGGTCCAGAC
GACATCTATATTTCGCCCTCGCAGATCCGCCGCTTCTCGCTGAAGACCGGTGACACTGTTGAGGGACCGATCCGTGGTCC
GAAGGAAGGCGAAAGATATTTCGCACTTCTCAAGGTCAATACGATCAATTTCGACGACCCGGAAAAAATTCGTCACAAGG
TTCACTTCGACAATCTGACGCCGCTTTACCCGAATGAGCGCTTCAAGATGGAACTGGAAGTTCCGACATCGAAGGATCTG
TCGTCGCGCGTTATCGACCTCGTCGCACCGCTCGGCAAGGGCCAGCGCGGGTTGATCGTGGCGCCGCCGCGTACCGGTAA
GACGGTACTGTTGCAGAACATTGCTCATTCCATCACGGCGAACCATCCGGAATGCTACTTGATCGTTCTGCTGATCGACG
AGCGTCCGGAAGAAGTGACCGACATGCAGCGTTCGGTGAAGGGTGAGGTTGTGTCCTCCACCTTCGACGAACCGGCTGTA
CGCCACGTCCAGGTCGCCGAAATGGTCATCGAAAAGGCGAAGCGTCTGGTCGAGCATGGCCGTGACGTCGTCATCCTGCT
CGATTCCATCACGCGCCTTGGCCGCGCCTACAACACCGTTGTTCCCTCCTCCGGCAAGGTTCTGACCGGTGGTGTGGATG
CCAACGCCCTGCAGCGCCCGAAGCGTTTCTTTGGTGCTGCGCGTAACATCGAAGAAGGTGGCTCGCTGACAATTATTGCG
ACTGCGCTTATCGATACCGGTAGCCGCATGGATGAAGTCATCTTTGAAGAATTCAAGGGCACCGGCAACTCGGAAATCGT
GCTTGATCGCAAGGTTGCCGACAAGCGTATCTTCCCGGCACTGGATATTCTGAAATCCGGTACGCGTAAGGAAGATTTGC
TGGTGCCGCGTCAGGACCTGCAGAAGATCTTCGTTCTTCGCCGCATTCTGGCGCCGATGGGAACGATGGACGCTATCGAA
TTCCTGATCGACAAGCTGAAGCAGACGAAGACCAATTCGGATTTCTTCGAATCGATGAATACCTGA
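
The GC content field reported earlier in this record can be compared against a value recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; the record does not say how its value was derived, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Paste the full 1266-base sequence here to compare with the GC content field;
# only the first sequence line of the record is shown for brevity.
first_line = (
    "ATGGCTGAAATGAAGCTTCAGGAACTTAAGAGCAAATCGCCTACCGACTTGCTGACTTTTGCCGAATCCCTGGAAGTTGA"
)
print(gc_content(first_line))
```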

Upstream 100 bases:

>100_bases
CGTGGCATGTGAATTATTCAGTCATCTGCGGTATCTCCACAGCATCTGATTCGCGAACACGCCCTCCCCCGACATTACAT
TATCAAACGGTCTTCCCTTC

Downstream 100 bases:

>100_bases
TCCGAACTTTAGTCGGATTTATCATAAGAGCTTTAAAGCCGTATCCGTTCATTCGGGTGCGGCTTGACTGTTTTAGACCG
AGGTGAACGATGCCAGATTC

Product: transcription termination factor Rho

Products: NA

Alternate protein names: ATP-dependent helicase Rho [H]

Number of amino acids: Translated: 421; Mature: 420

Protein sequence:

>421_residues
MAEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEVLQDGFGFLRSANANYLPGPD
DIYISPSQIRRFSLKTGDTVEGPIRGPKEGERYFALLKVNTINFDDPEKIRHKVHFDNLTPLYPNERFKMELEVPTSKDL
SSRVIDLVAPLGKGQRGLIVAPPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVVSSTFDEPAV
RHVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRPKRFFGAARNIEEGGSLTIIA
TALIDTGSRMDEVIFEEFKGTGNSEIVLDRKVADKRIFPALDILKSGTRKEDLLVPRQDLQKIFVLRRILAPMGTMDAIE
FLIDKLKQTKTNSDFFESMNT
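
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it, assuming the standard genetic code (the bacterial table 11 uses the same codon-to-amino-acid assignments); the record does not state which table was used. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGGCTGAAATGAAGCTT"))  # MAEMKL

# 1266 bases are 422 codons; the last one is the stop codon.
print(1266 // 3 - 1)                    # 421
```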

Sequences:

>Translated_421_residues
MAEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEVLQDGFGFLRSANANYLPGPD
DIYISPSQIRRFSLKTGDTVEGPIRGPKEGERYFALLKVNTINFDDPEKIRHKVHFDNLTPLYPNERFKMELEVPTSKDL
SSRVIDLVAPLGKGQRGLIVAPPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVVSSTFDEPAV
RHVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRPKRFFGAARNIEEGGSLTIIA
TALIDTGSRMDEVIFEEFKGTGNSEIVLDRKVADKRIFPALDILKSGTRKEDLLVPRQDLQKIFVLRRILAPMGTMDAIE
FLIDKLKQTKTNSDFFESMNT
>Mature_420_residues
AEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEVLQDGFGFLRSANANYLPGPDD
IYISPSQIRRFSLKTGDTVEGPIRGPKEGERYFALLKVNTINFDDPEKIRHKVHFDNLTPLYPNERFKMELEVPTSKDLS
SRVIDLVAPLGKGQRGLIVAPPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVVSSTFDEPAVR
HVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRPKRFFGAARNIEEGGSLTIIAT
ALIDTGSRMDEVIFEEFKGTGNSEIVLDRKVADKRIFPALDILKSGTRKEDLLVPRQDLQKIFVLRRILAPMGTMDAIEF
LIDKLKQTKTNSDFFESMNT

Specific function: Facilitates transcription termination by a mechanism that involves Rho binding to the nascent RNA, activation of Rho's RNA-dependent ATPase activity, and release of the mRNA from the DNA template [H]

COG id: COG1158

COG function: function code K; Transcription termination factor

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the Rho family [H]

Homologues:

Organism=Escherichia coli, GI1790217, Length=416, Percent_Identity=70.1923076923077, Blast_Score=618, Evalue=1e-178
Organism=Escherichia coli, GI1790170, Length=276, Percent_Identity=24.2753623188406, Blast_Score=62, Evalue=9e-11
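
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below parses one such line into a dictionary and recovers the number of identical positions, assuming that Length is the alignment length over which Percent_Identity was computed; the record does not define these fields, and `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Split a homologue line into a field dictionary (all values as strings)."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:                 # tolerate a trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):  # bare identifier such as GI1790217
            record["GI"] = field[2:]
    return record


line = ("Organism=Escherichia coli, GI1790217, Length=416, "
        "Percent_Identity=70.1923076923077, Blast_Score=618, Evalue=1e-178,")
hit = parse_homologue(line)

# Assumption: Length is the alignment length used for Percent_Identity.
identical = round(float(hit["Percent_Identity"]) * int(hit["Length"]) / 100)
print(hit["Organism"], hit["GI"], identical)  # Escherichia coli 1790217 292
```

Under the same assumption, the second hit works out to 67 identical positions out of 276.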

Paralogues:

None

Copy number: 300 molecules/cell in growth phase, minimal media (based on E. coli) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000194
- InterPro:   IPR003593
- InterPro:   IPR011129
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR011112
- InterPro:   IPR011113
- InterPro:   IPR004665 [H]

Pfam domain/function: PF00006 ATP-synt_ab; PF07498 Rho_N; PF07497 Rho_RNA_bind [H]

EC number: NA

Molecular weight: Translated: 46990; Mature: 46859
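
The translated and mature molecular weights above differ by 131, which matches the loss of a single methionine residue. The sketch below estimates a molecular weight from average residue masses plus one water molecule. The mass table holds standard average values supplied here for illustration; the record does not state which method or table it used, so results for the full sequence may differ slightly from the figures above.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide."""
    protein = "".join(protein.split()).upper()
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein) + WATER


def remove_initiator_met(protein: str) -> str:
    """Assumption: the mature form only lacks the N-terminal methionine."""
    return protein[1:] if protein.startswith("M") else protein


peptide = "MAEMKLQELK"  # first ten residues of the translated protein
delta = molecular_weight(peptide) - molecular_weight(remove_initiator_met(peptide))
print(round(delta, 2))  # 131.19, the mass of one Met residue
```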

Theoretical pI: Translated: 6.05; Mature: 6.05

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
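
The Cys/Met percentages above are residue counts divided by sequence length. A minimal sketch of that calculation follows, assuming one-decimal rounding as shown in the record; `cys_met_content` is an illustrative name, not part of any tool referenced by the record.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys, Met and Cys+Met residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    n = len(protein)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy input: 1 Cys and 2 Met in 4 residues.
print(cys_met_content("MACM"))  # {'Cys': 25.0, 'Met': 50.0, 'Cys+Met': 75.0}
```

Applying the function to the translated sequence and to the same sequence without its first residue gives the two sets of values to compare against the figures listed above.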

Secondary structure:

>Translated Secondary Structure
MAEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEV
CCCCHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LQDGFGFLRSANANYLPGPDDIYISPSQIRRFSLKTGDTVEGPIRGPKEGERYFALLKVN
HHHHHHHHHCCCCCCCCCCCCEEECHHHHHEEECCCCCCCCCCCCCCCCCCEEEEEEEEE
TINFDDPEKIRHKVHFDNLTPLYPNERFKMELEVPTSKDLSSRVIDLVAPLGKGQRGLIV
EECCCCHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHCCCCCCCCEEE
APPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVVSSTFDEPAV
ECCCCCCHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHCCHHHHHCCCCHHH
RHVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRP
HHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHH
KRFFGAARNIEEGGSLTIIATALIDTGSRMDEVIFEEFKGTGNSEIVLDRKVADKRIFPA
HHHHHHHCCCCCCCCEEEEEEEHHCCCCHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHH
LDILKSGTRKEDLLVPRQDLQKIFVLRRILAPMGTMDAIEFLIDKLKQTKTNSDFFESMN
HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHCC
T
C
>Mature Secondary Structure 
AEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEV
CCCHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
LQDGFGFLRSANANYLPGPDDIYISPSQIRRFSLKTGDTVEGPIRGPKEGERYFALLKVN
HHHHHHHHHCCCCCCCCCCCCEEECHHHHHEEECCCCCCCCCCCCCCCCCCEEEEEEEEE
TINFDDPEKIRHKVHFDNLTPLYPNERFKMELEVPTSKDLSSRVIDLVAPLGKGQRGLIV
EECCCCHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHCCCCCCCCEEE
APPRTGKTVLLQNIAHSITANHPECYLIVLLIDERPEEVTDMQRSVKGEVVSSTFDEPAV
ECCCCCCHHHHHHHHHHHCCCCCCEEEEEEEECCCCHHHHHHHHHHCCHHHHHCCCCHHH
RHVQVAEMVIEKAKRLVEHGRDVVILLDSITRLGRAYNTVVPSSGKVLTGGVDANALQRP
HHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHH
KRFFGAARNIEEGGSLTIIATALIDTGSRMDEVIFEEFKGTGNSEIVLDRKVADKRIFPA
HHHHHHHCCCCCCCCEEEEEEEHHCCCCHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHH
LDILKSGTRKEDLLVPRQDLQKIFVLRRILAPMGTMDAIEFLIDKLKQTKTNSDFFESMN
HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHCC
T
C
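
The secondary structure blocks above interleave a line of residues with a line of per-residue states. The sketch below summarises such a block as percentages of each state, assuming the common convention that H is helix, E is strand and C is coil; the record does not define the letters, and `state_fractions` is a hypothetical helper.

```python
from collections import Counter


def state_fractions(block_lines: list) -> dict:
    """Percentage of H, E and C states in an interleaved residue/state block.

    Even-indexed lines are residues, odd-indexed lines are predicted states.
    """
    states = "".join(line.strip() for line in block_lines[1::2])
    if not states:
        raise ValueError("no state lines found")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# First residue/state pair of the translated block in this record.
block = [
    "MAEMKLQELKSKSPTDLLTFAESLEVENASTMRKQELMFAILKILASQDVEIIGEGVVEV",
    "CCCCHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH",
]
print(state_fractions(block))
```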

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8606169 [H]