Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

The map label for this gene is traC [H]

Identifier: 29144542

GI number: 29144542

Start: 4451268

End: 4453217

Strand: Direct

Name: traC [H]

Synonym: t4286

Alternate gene names: 29144542

Gene position: 4451268-4453217 (Clockwise)

Preceding gene: 29144541

Following gene: 29144543

Centisome position: 92.89
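
The length and centisome values in this record can be cross-checked from the coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. That reading reproduces the listed 92.89, but it is an inference from the numbers, not a documented formula. The function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_791_961
START = 4_451_268
END = 4_453_217


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                      # 1950
print(round(centisome(START, GENOME_LENGTH), 2))    # 92.89
```

The computed length of 1950 agrees with the `>1950_bases` header of the gene sequence.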

GC content: 54.51
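
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; the helper name `gc_content` is hypothetical, and the record does not state how ambiguous bases, if any, would be handled.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove line breaks and other whitespace from FASTA-style text.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence in this record: one G and two C.
print(gc_content("ATGAACACAT"))  # 30.0
```

Applying the same function to the full 1950-base gene sequence should give a value to compare with the 54.51 reported here.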

Gene sequence:

>1950_bases
ATGAACACATCGTTACATCCGGATGATATCAGCCGGTTTATCTCCGGCCGACTTATCAGTAGCCTGGCTGCAGGCCAGGT
ACCGTGGCGGGGCACAATTCCCGGGTTACCGGAACACGCGCTTACGGGCGTGCCGTTTACCGGTATTAACGTACTGTTAT
TGTGGCAGGCCATGCAGCAGCGTTCGCTTCGTTCAGGAAGGTGGCTGACCGGAGATGACCTCCGCCAACTGGGCGGTCAG
GTCAGATCCGGTGAAAAGCCAGTCACCCTGGTCCGCTACCGGCCTTCGTTATCGCTTTTCAAGGTAATTAACCCTGAACA
GTGTGATGGTCTGCCGGATACGCTGCAACCGGGGTGGCCACTGCCTCCACGACCTCAGCCGTCACTGAATGTGATCCGCG
ACCTGCTTCAGAACAGTGGGGTTCCCGTGATCCACAGGGACAACGTTTTGCCGGTATACCGGGCATTGCATGACCGGATT
GAGCTTCCACCCGTGGCATCGTATGTCGGAGAGGAGACATACTGGCAGGACATACTGAATCTGCTGGTTCAGGCTACCGG
ACATCCGCAGCGCTTACATCGCTTCGGATTAACAGTGGATACACGTACCGATGAGGTTCATGAGGCCCTGGTTGCAGAAC
TGGGCGCCGCATTTCTCTCGGCTGCTCTGGGATTACCGGGAGCGATGCTATCCAGGCTGGATGTGGCACCCTGGGTGACA
TATTTACAGGGGGACCCGTGGCGTCTCTTTCGGGCTGCGGAAGCAGCGCGAAAGGCGATGATGTGGCTGAAAGAGCGAAG
ACCCTCGATGACGACAGTGGAGATGTGGCAGAAGATGGCCTCATTGATTCTTGAAACGCATTACGGTTTTTCTCTCGATG
ACACCACGCTGGGATGTCGCAGTGTGGTTGAGCGGCATATTGAATGTGGGATCACTCCACTGATGGCGATTAATGCACTG
GCCCGCATTTACCAGTGGGAACGCTACGACCAGCCTCAGCGGTCGCTGTTTATCAACGAAGCAGGACCAGACAGTGAAAT
ACTGACTTTGTCTGAAATTCGTCCCGAACTGCTGACCTGTTACCGCGTGCCTGTGCCATCCGGTATACCGGACAGGAAAG
CGGAAGCGGTGCAGGTCGAAAGTTTACCTTTGCTCCTGGCACCGGCGGTGTCGGAAGGTAAGAGTGCGGCGAATGATGAC
GGGCCTGATGACCCGGATGGTAATGACAATGTTGTGGCGCTGCCCTGGGCTGCCCGCAGGGGAAAGGAGAACCCGCATAT
ACATCGTTTTGTCAGTATCTTTAACGGTATTGCACCTCATGAAAGCCGCTGGCAGGTATTCAGTGATTTCGTCCATATGG
CGGCCTGTTCACTGTACAACGCTGTACATCGGGATCCTGATTTTGAAGCGGACTACATGAGGCGGGTATCCCACTATTCA
GCTGAAGATGCAAACAACATGGCCCGTTTACTGTCAGAAGTTGTTATGGGGCTGGAATTCAGTCCAACAGATTTCCTGGG
GCGAATTTATATGATATCCGGACTGGGAAATTTTCATAACGCACAGTATTTCACGCCTTACAGCGTTTCGTACGCGATGG
CGCGAATGACACTCAGTGACCGTATACCTGAACTTTCCAGCGGGGAACGAGACTTTATTACTGTCAGCGATCCTGCCAGT
GGTGCCGGAAGTATGGTCGTTGCGCTGGCAGAAGCCATGCTGGAGGCGGGATTTAATCCGCAGAAACAGATGGTAGCGTA
CTGTGTCGATATTGACCCGGTGGCCTCGATGATGTGTTACATCCAGCTATCCCTGATGGGTATTCCGGCCATTGTGGCTA
CCGGCAACAGCCTGACCGTGGAGATTAAACGGGAGATGGCAACACCAATGTTTGTACTGGGTCGTTGGCATCACCGGTGG
CAGGCAGATCGGACGCGTAAAGCGGCCTAG

Upstream 100 bases:

>100_bases
CTGGTACCGTAATCACGTACGCATCCTGCGTTACTGAACCTTATCCCGTAGGGAGAGTGACTCCCTGTGGGGGCGCTCTC
CCTTTTTTTCAGGAGAGCAT

Downstream 100 bases:

>100_bases
TTGCGTATTTATCTTCATCCCGACAGGGTTCTCCCTGTGGGGAGAATTCCTGTTCTGAACAGGAGTCACTTATGCTGGCC
AGTTTCTATATTCAACGGCA

Product: hypothetical protein

Products: NA

Alternate protein names: Replication primase [H]

Number of amino acids: Translated: 649; Mature: 649
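
The residue count follows from the gene length: 1950 bases make 650 codons, and the final codon (TAG) is a stop, leaving 649 residues. The sketch below translates a coding sequence with the standard genetic code, which is assumed here to apply to this gene; the function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence in this record.
print(translate("ATGAACACATCGTTACATCCGGATGATATC"))  # MNTSLHPDDI
print(translate("CAGGCAGATCGGACGCGTAAAGCGGCCTAG"))  # QADRTRKAA
```

Both fragments match the start and end of the protein sequence listed in this record.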

Protein sequence:

>649_residues
MNTSLHPDDISRFISGRLISSLAAGQVPWRGTIPGLPEHALTGVPFTGINVLLLWQAMQQRSLRSGRWLTGDDLRQLGGQ
VRSGEKPVTLVRYRPSLSLFKVINPEQCDGLPDTLQPGWPLPPRPQPSLNVIRDLLQNSGVPVIHRDNVLPVYRALHDRI
ELPPVASYVGEETYWQDILNLLVQATGHPQRLHRFGLTVDTRTDEVHEALVAELGAAFLSAALGLPGAMLSRLDVAPWVT
YLQGDPWRLFRAAEAARKAMMWLKERRPSMTTVEMWQKMASLILETHYGFSLDDTTLGCRSVVERHIECGITPLMAINAL
ARIYQWERYDQPQRSLFINEAGPDSEILTLSEIRPELLTCYRVPVPSGIPDRKAEAVQVESLPLLLAPAVSEGKSAANDD
GPDDPDGNDNVVALPWAARRGKENPHIHRFVSIFNGIAPHESRWQVFSDFVHMAACSLYNAVHRDPDFEADYMRRVSHYS
AEDANNMARLLSEVVMGLEFSPTDFLGRIYMISGLGNFHNAQYFTPYSVSYAMARMTLSDRIPELSSGERDFITVSDPAS
GAGSMVVALAEAMLEAGFNPQKQMVAYCVDIDPVASMMCYIQLSLMGIPAIVATGNSLTVEIKREMATPMFVLGRWHHRW
QADRTRKAA

Sequences:

>Translated_649_residues
MNTSLHPDDISRFISGRLISSLAAGQVPWRGTIPGLPEHALTGVPFTGINVLLLWQAMQQRSLRSGRWLTGDDLRQLGGQ
VRSGEKPVTLVRYRPSLSLFKVINPEQCDGLPDTLQPGWPLPPRPQPSLNVIRDLLQNSGVPVIHRDNVLPVYRALHDRI
ELPPVASYVGEETYWQDILNLLVQATGHPQRLHRFGLTVDTRTDEVHEALVAELGAAFLSAALGLPGAMLSRLDVAPWVT
YLQGDPWRLFRAAEAARKAMMWLKERRPSMTTVEMWQKMASLILETHYGFSLDDTTLGCRSVVERHIECGITPLMAINAL
ARIYQWERYDQPQRSLFINEAGPDSEILTLSEIRPELLTCYRVPVPSGIPDRKAEAVQVESLPLLLAPAVSEGKSAANDD
GPDDPDGNDNVVALPWAARRGKENPHIHRFVSIFNGIAPHESRWQVFSDFVHMAACSLYNAVHRDPDFEADYMRRVSHYS
AEDANNMARLLSEVVMGLEFSPTDFLGRIYMISGLGNFHNAQYFTPYSVSYAMARMTLSDRIPELSSGERDFITVSDPAS
GAGSMVVALAEAMLEAGFNPQKQMVAYCVDIDPVASMMCYIQLSLMGIPAIVATGNSLTVEIKREMATPMFVLGRWHHRW
QADRTRKAA
>Mature_649_residues
MNTSLHPDDISRFISGRLISSLAAGQVPWRGTIPGLPEHALTGVPFTGINVLLLWQAMQQRSLRSGRWLTGDDLRQLGGQ
VRSGEKPVTLVRYRPSLSLFKVINPEQCDGLPDTLQPGWPLPPRPQPSLNVIRDLLQNSGVPVIHRDNVLPVYRALHDRI
ELPPVASYVGEETYWQDILNLLVQATGHPQRLHRFGLTVDTRTDEVHEALVAELGAAFLSAALGLPGAMLSRLDVAPWVT
YLQGDPWRLFRAAEAARKAMMWLKERRPSMTTVEMWQKMASLILETHYGFSLDDTTLGCRSVVERHIECGITPLMAINAL
ARIYQWERYDQPQRSLFINEAGPDSEILTLSEIRPELLTCYRVPVPSGIPDRKAEAVQVESLPLLLAPAVSEGKSAANDD
GPDDPDGNDNVVALPWAARRGKENPHIHRFVSIFNGIAPHESRWQVFSDFVHMAACSLYNAVHRDPDFEADYMRRVSHYS
AEDANNMARLLSEVVMGLEFSPTDFLGRIYMISGLGNFHNAQYFTPYSVSYAMARMTLSDRIPELSSGERDFITVSDPAS
GAGSMVVALAEAMLEAGFNPQKQMVAYCVDIDPVASMMCYIQLSLMGIPAIVATGNSLTVEIKREMATPMFVLGRWHHRW
QADRTRKAA

Specific function: Required for autonomous replication in E. coli. Transferred into the recipient cell during bacterial conjugation. Catalyzes the synthesis of short oligoribonucleotide primers with CpA or pCpA at their 5'-termini on a single-stranded template DNA. [H]

COG id: COG4227

COG function: function code L; Antirestriction protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 Toprim domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013610
- InterPro:   IPR006171 [H]

Pfam domain/function: PF08401 DUF1738; PF01751 Toprim [H]

EC number: NA

Molecular weight: Translated: 72353; Mature: 72353

Theoretical pI: Translated: 6.24; Mature: 6.24

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.7 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.7 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)
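
These percentages are residue counts divided by the protein length. For example, 7 cysteines in 649 residues gives about 1.1 %. A minimal sketch of the calculation, with a hypothetical helper name `residue_percent`:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues.upper())
    return 100.0 * hits / len(protein)


# Toy four-residue sequence, used only to show the arithmetic.
print(residue_percent("MCAA", "C"))   # 25.0
print(residue_percent("MCAA", "CM"))  # 50.0
```

Passing the full 649-residue sequence with `"C"`, `"M"`, and `"CM"` and rounding to one decimal place gives values to compare with the three lines above.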

Secondary structure:

>Translated Secondary Structure
MNTSLHPDDISRFISGRLISSLAAGQVPWRGTIPGLPEHALTGVPFTGINVLLLWQAMQQ
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHH
RSLRSGRWLTGDDLRQLGGQVRSGEKPVTLVRYRPSLSLFKVINPEQCDGLPDTLQPGWP
HHHHCCCCCCCHHHHHHCCHHCCCCCCEEEEEECCCCCEEECCCCHHCCCCCCCCCCCCC
LPPRPQPSLNVIRDLLQNSGVPVIHRDNVLPVYRALHDRIELPPVASYVGEETYWQDILN
CCCCCCCHHHHHHHHHHCCCCCEEECCCCHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHH
LLVQATGHPQRLHRFGLTVDTRTDEVHEALVAELGAAFLSAALGLPGAMLSRLDVAPWVT
HHHHCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
YLQGDPWRLFRAAEAARKAMMWLKERRPSMTTVEMWQKMASLILETHYGFSLDDTTLGCR
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
SVVERHIECGITPLMAINALARIYQWERYDQPQRSLFINEAGPDSEILTLSEIRPELLTC
HHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEHHHHCHHHHHE
YRVPVPSGIPDRKAEAVQVESLPLLLAPAVSEGKSAANDDGPDDPDGNDNVVALPWAARR
EECCCCCCCCCCCCCCEEECCCCEEEECCHHCCCCCCCCCCCCCCCCCCCEEEECCHHHC
GKENPHIHRFVSIFNGIAPHESRWQVFSDFVHMAACSLYNAVHRDPDFEADYMRRVSHYS
CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCC
AEDANNMARLLSEVVMGLEFSPTDFLGRIYMISGLGNFHNAQYFTPYSVSYAMARMTLSD
CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHH
RIPELSSGERDFITVSDPASGAGSMVVALAEAMLEAGFNPQKQMVAYCVDIDPVASMMCY
CCCCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHH
IQLSLMGIPAIVATGNSLTVEIKREMATPMFVLGRWHHRWQADRTRKAA
HHHHHHCCCEEEECCCEEEEEEHHHHCCHHHHHHHHHHHHHHCHHCCCC
>Mature Secondary Structure
MNTSLHPDDISRFISGRLISSLAAGQVPWRGTIPGLPEHALTGVPFTGINVLLLWQAMQQ
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHH
RSLRSGRWLTGDDLRQLGGQVRSGEKPVTLVRYRPSLSLFKVINPEQCDGLPDTLQPGWP
HHHHCCCCCCCHHHHHHCCHHCCCCCCEEEEEECCCCCEEECCCCHHCCCCCCCCCCCCC
LPPRPQPSLNVIRDLLQNSGVPVIHRDNVLPVYRALHDRIELPPVASYVGEETYWQDILN
CCCCCCCHHHHHHHHHHCCCCCEEECCCCHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHH
LLVQATGHPQRLHRFGLTVDTRTDEVHEALVAELGAAFLSAALGLPGAMLSRLDVAPWVT
HHHHCCCCHHHHHHHCCEEECCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
YLQGDPWRLFRAAEAARKAMMWLKERRPSMTTVEMWQKMASLILETHYGFSLDDTTLGCR
HHCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
SVVERHIECGITPLMAINALARIYQWERYDQPQRSLFINEAGPDSEILTLSEIRPELLTC
HHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEHHHHCHHHHHE
YRVPVPSGIPDRKAEAVQVESLPLLLAPAVSEGKSAANDDGPDDPDGNDNVVALPWAARR
EECCCCCCCCCCCCCCEEECCCCEEEECCHHCCCCCCCCCCCCCCCCCCCEEEECCHHHC
GKENPHIHRFVSIFNGIAPHESRWQVFSDFVHMAACSLYNAVHRDPDFEADYMRRVSHYS
CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCC
AEDANNMARLLSEVVMGLEFSPTDFLGRIYMISGLGNFHNAQYFTPYSVSYAMARMTLSD
CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHH
RIPELSSGERDFITVSDPASGAGSMVVALAEAMLEAGFNPQKQMVAYCVDIDPVASMMCY
CCCCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCHHHHHHHH
IQLSLMGIPAIVATGNSLTVEIKREMATPMFVLGRWHHRWQADRTRKAA
HHHHHHCCCEEEECCCEEEEEEHHHHCCHHHHHHHHHHHHHHCHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1818755 [H]