Definition Erythrobacter litoralis HTCC2594 chromosome, complete genome.
Accession NC_007722
Length 3,052,398

The map label for this gene is traI [H]

Identifier: 85375767

GI number: 85375767

Start: 2959510

End: 2962065

Strand: Direct

Name: traI [H]

Synonym: ELI_14700

Alternate gene names: 85375767

Gene position: 2959510-2962065 (Clockwise)

Preceding gene: 85375766

Following gene: 85375769

Centisome position: 96.96
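
Note: the coordinates above are consistent with one another. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene start expressed as a percentage of the genome length (the record does not state its definition). The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
length = gene_length(2959510, 2962065)
position = centisome(2959510, 3052398)
print(length, round(position, 2))  # 2556 96.96
```

Under these assumptions the result agrees with the 2556-base gene sequence and the centisome position of 96.96 reported here.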

GC content: 59.55
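
Note: GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that convention (the record does not say how its value was computed) and uses a hypothetical helper name. It is shown on the first 18 bases of the gene only; the full sequence from the Gene sequence field can be passed in the same way.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 18 bases of the gene sequence (10 of them are G or C).
first_bases = "ATGGGCAGCAAAGACGGT"
print(round(gc_content(first_bases), 2))  # 55.56
```

Whitespace and line breaks are ignored, so a multi-line FASTA body can be joined and passed directly.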

Gene sequence:

>2556_bases
ATGGGCAGCAAAGACGGTTACGGAAGGGTCGCAACCGACAACCTCACAATTGGTCTTTTTCAGCATGATACCAACCGCAA
TCAGGAGCCAAACCTGCATTTTCACGCGGTCGTAGCCAACGTCACGCAAGGCCCCGACGGCAAGTGGCGGGCGCTGCGCA
ATGACAAGCTCTGGTCTTTCAACACGCTGCTCAATTCGATGACGATGGCCCACTTTCGCATGGCAGTCGAGAAGATGGGA
TACGAAGCCGGACCGGTCGGCAAGCACGGCAATTTCGAAGCGGCCGGCATCACACGCCAGCAGCTCATGGCGTTCTCTTC
CCGCCGCGAAGAGGTCCTAGATGCGGTCCGCCAAATCGGCGAAAACACCCCCAAGGCGCGTGACGTCGCCGTGCTTGCCT
CAAGGAAAAGCAAGGAACCCGTGAGAGACCGTGAAGGCCTGCTTGGAGAGTGGAAGCAGAGCGCAGAAGAGGCTAGGCTC
GATCTGCAAACTATCATCGATGCATCCGAGATGCGGGCAGCAGCCAAGACGATTGCCAACTCAAAGGCTGAAAGCCTTCT
CCAGCGCGGCCTGGCAAAGTTGCGAGAATTTGCGCAGCGGATCAAAGGCGATCCAGCAGACCCCCTGATCCCCGCTCACG
TGCTCAAGACAGATGCGCCCACTATCGCGGCTGCGCAGGCTGTAGCCTCTGCGGTGCGCCACCTCTCCCAGCGCGAAGCG
GCTTTTCCGCGCGAAGGTTTGCTCAAAGCAGCATTGGACTTCGGACTACCAACAACAGTGGACCGTGTCGAAAAACAGGT
GAACGCGCTGGTTCGGCAGGGCGCGCTGGTGCGCGGTAAAGGGGCCCAGTCTGGTTGGCTCGCTAGTAAAGAGGCTCTGC
AGCTTGAAGGGGCAATCCTTGCCGGTGTCGACCAGGGACGCGGTGCGGCTGCGCAAATTTTTGAACGCAAGGACGCCGTC
ACGCGCGTTCAGGCCGTCTCCGCAATCAACCACGGCATCACGCTCAATCCGGGTCAGGAAGAAGCGGCAAGCCTCGTACT
TTCATCGCGCGACCGGATTGTCGCGATCCAGGGCGTTGCCGGCGCCGGGAAGAGCAGCGTAATGAAGCCGGTCGCCCAGC
TTTTGCGAGAGGAGGGCAAACAAGTGCTGGGTCTCGCGGTCCAGAACACACTCGTTCAGATGCTCGAGCGCGACACAGGC
ATCCATTCGATGACTATCGTGCGATTCCTGTCGCAATGGGACCGCTTGCTGCGCGAACCTGGCAACGCAGCTTTGCTGCA
TGAGGCGAAAAGCGCGCTCGGTGACCACGTGATTGTTCTCGATGAAGCCTCGATGGTCTCGAACGAAGACAAAGCGAAGC
TGGTCCGCCTTGCAAACCTCGCAGAAGTCCAAAGGCTAGCGCTCGTGGGCGATCGCCAGCAGCTCGGTGCGGTTGATGCA
GGCAAACCCTTTGATCTCGTCCAGCAGGCCGGGATCGAGCGCGCGATCATGGAAGAGAATCTGCGCGGACGCGACTCTGT
CTTGCGGCGCGCCCAAGCCGCTGCGCAGGCAGGGCGTATCGATGATGCGCTGAAAGCGCTCGCGCCAACCACCATCGAGG
CCAAAGGGGACAGCGCAATCGTGGCCGCCGAGCGATGGCTTTCGCTTAGTCCGGCGGAACGCGCGAGGACGTCCATCTAT
GCTTCCGGACGCGCCTTGCGCTCCGCCGTCAACGATGCCGTACAGCAAGGGCTCAAGGCGAATGGCGAGCTTGGACAGCG
TGCTGCACGACTGACTGTTCACTCTCGGGTCAACGCGACACGGGAGGAGCTTCGGTACATGGGCACTTACAGCGCCGGCA
TGGTGCTGAATGTCCGCTCGCGCGACAGTTCCCAGAAACTGTCGAAGGGCGACTACACCGTCAAATCCATCGACCACACC
CGAAAGCGCGTGATGCTCGAAGACAGGAAGGGACGGAAGCATACATTCAGCCCCACGCGTCTTCGCCCAGGTGGGAGCGA
CGACCGTTTCTCATTGTTCGAACGCAAGTCGTTCAGGCTTTTCGAAGGCGACAAGATCCGTTGGACCGACAACGATCATA
AGCGCGCACTTTTCAATGCCGATCAGGCGAAGATTGTAGGGGTCGATGCCAAGGGCGTTACCGTCGAAACCTCAGCGGGC
AACGAGCTTCGCTTGGCGCGGGGCGATCCCATGTTGAAGCGTCTTGATCTGGCTTATGCCTTGAACGCACATATGGCGCA
GGGACTGACCTCTGATCGCGGCATTGCTGTAATGGATAGCCGCGAGCGCAATCTTTCTAATCAGCAAACCTTCCTCGTCA
CCGTGACGAGGCTGCGTGACGGTCTCACATTGGTTGCTGACAGCGCAGAGAAACTCGGCCGGGCAATCAAGGCCAACAGC
GGCGAGAAAAGCTCGGCGCTAGAAGTGACGCAGCGACTGAAGTCAGCGGCGGCAAAAGGCACAGCGCAAGACAAGACCAA
TGAGAGCGCTGCGCCAGCAAAGGATCCGCCGGAACTCACAAAGGAGCGAGTAAAGCCCTTTGAAATAGGGATTTGA

Upstream 100 bases:

>100_bases
CCTCGCCCTCGCTGGCGGGGACAAGCGGATAATTGAAGCTTATCGCGAGGCCGTGGTCGAAACATTACGTTGGGCCGAGG
CCAACGCAGCGCAGACCCGG

Downstream 100 bases:

>100_bases
CCTAGCGGGAGTAATTTAAATCGCGCGACCAAACCAGATCACTCGGCCGACCACCTGGATTTCGGACCGGTCGACATCGT
CCCAGCTTGGATAAGCCGCA

Product: hypothetical protein

Products: NA

Alternate protein names: DNA helicase I [H]

Number of amino acids: Translated: 851; Mature: 850
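
Note: the two counts follow from the gene length. The sketch below assumes that the final codon is a stop codon and that the mature form is the translated form with the initiator methionine removed, which is what the sequences in this record suggest. Function names are illustrative.

```python
def translated_length(n_bases: int) -> int:
    """Residue count for a coding sequence whose last codon is a stop."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1


def mature_sequence(translated: str) -> str:
    """Drop the initiator Met, if present (assumed meaning of 'Mature')."""
    return translated[1:] if translated.startswith("M") else translated


n_translated = translated_length(2556)  # 852 codons minus the stop codon
n_terminus = "MGSKDGYGRV"               # first 10 residues of the protein
print(n_translated, mature_sequence(n_terminus))  # 851 GSKDGYGRV
```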

Protein sequence:

>851_residues
MGSKDGYGRVATDNLTIGLFQHDTNRNQEPNLHFHAVVANVTQGPDGKWRALRNDKLWSFNTLLNSMTMAHFRMAVEKMG
YEAGPVGKHGNFEAAGITRQQLMAFSSRREEVLDAVRQIGENTPKARDVAVLASRKSKEPVRDREGLLGEWKQSAEEARL
DLQTIIDASEMRAAAKTIANSKAESLLQRGLAKLREFAQRIKGDPADPLIPAHVLKTDAPTIAAAQAVASAVRHLSQREA
AFPREGLLKAALDFGLPTTVDRVEKQVNALVRQGALVRGKGAQSGWLASKEALQLEGAILAGVDQGRGAAAQIFERKDAV
TRVQAVSAINHGITLNPGQEEAASLVLSSRDRIVAIQGVAGAGKSSVMKPVAQLLREEGKQVLGLAVQNTLVQMLERDTG
IHSMTIVRFLSQWDRLLREPGNAALLHEAKSALGDHVIVLDEASMVSNEDKAKLVRLANLAEVQRLALVGDRQQLGAVDA
GKPFDLVQQAGIERAIMEENLRGRDSVLRRAQAAAQAGRIDDALKALAPTTIEAKGDSAIVAAERWLSLSPAERARTSIY
ASGRALRSAVNDAVQQGLKANGELGQRAARLTVHSRVNATREELRYMGTYSAGMVLNVRSRDSSQKLSKGDYTVKSIDHT
RKRVMLEDRKGRKHTFSPTRLRPGGSDDRFSLFERKSFRLFEGDKIRWTDNDHKRALFNADQAKIVGVDAKGVTVETSAG
NELRLARGDPMLKRLDLAYALNAHMAQGLTSDRGIAVMDSRERNLSNQQTFLVTVTRLRDGLTLVADSAEKLGRAIKANS
GEKSSALEVTQRLKSAAAKGTAQDKTNESAAPAKDPPELTKERVKPFEIGI

Sequences:

>Mature_850_residues
GSKDGYGRVATDNLTIGLFQHDTNRNQEPNLHFHAVVANVTQGPDGKWRALRNDKLWSFNTLLNSMTMAHFRMAVEKMGY
EAGPVGKHGNFEAAGITRQQLMAFSSRREEVLDAVRQIGENTPKARDVAVLASRKSKEPVRDREGLLGEWKQSAEEARLD
LQTIIDASEMRAAAKTIANSKAESLLQRGLAKLREFAQRIKGDPADPLIPAHVLKTDAPTIAAAQAVASAVRHLSQREAA
FPREGLLKAALDFGLPTTVDRVEKQVNALVRQGALVRGKGAQSGWLASKEALQLEGAILAGVDQGRGAAAQIFERKDAVT
RVQAVSAINHGITLNPGQEEAASLVLSSRDRIVAIQGVAGAGKSSVMKPVAQLLREEGKQVLGLAVQNTLVQMLERDTGI
HSMTIVRFLSQWDRLLREPGNAALLHEAKSALGDHVIVLDEASMVSNEDKAKLVRLANLAEVQRLALVGDRQQLGAVDAG
KPFDLVQQAGIERAIMEENLRGRDSVLRRAQAAAQAGRIDDALKALAPTTIEAKGDSAIVAAERWLSLSPAERARTSIYA
SGRALRSAVNDAVQQGLKANGELGQRAARLTVHSRVNATREELRYMGTYSAGMVLNVRSRDSSQKLSKGDYTVKSIDHTR
KRVMLEDRKGRKHTFSPTRLRPGGSDDRFSLFERKSFRLFEGDKIRWTDNDHKRALFNADQAKIVGVDAKGVTVETSAGN
ELRLARGDPMLKRLDLAYALNAHMAQGLTSDRGIAVMDSRERNLSNQQTFLVTVTRLRDGLTLVADSAEKLGRAIKANSG
EKSSALEVTQRLKSAAAKGTAQDKTNESAAPAKDPPELTKERVKPFEIGI

Specific function: TraI has been identified as DNA helicase I; it also has site-specific nicking activity at oriT. DNA helicase I is a potent DNA-dependent ATPase [H]

COG id: COG0507

COG function: function code L; ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To traI of plasmid IncFII R100 [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014059
- InterPro:   IPR014129
- InterPro:   IPR009767
- InterPro:   IPR014862 [H]

Pfam domain/function: PF07057 TraI; PF08751 TrwC [H]

EC number: 3.6.4.12 [H]

Molecular weight: Translated: 92589; Mature: 92458

Theoretical pI: Translated: 10.52; Mature: 10.52

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
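
Note: these percentages are residue counts divided by sequence length. A minimal sketch with a hypothetical helper, shown on the first 10 residues only; either full sequence from the Sequences field can be passed in the same way.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(1 for aa in protein if aa in residues.upper())
    return 100.0 * hits / len(protein)


n_terminus = "MGSKDGYGRV"  # first 10 residues of the translated protein
print(residue_percent(n_terminus, "C"))   # 0.0
print(residue_percent(n_terminus, "M"))   # 10.0
print(residue_percent(n_terminus, "CM"))  # 10.0
```

For scale, 2.1 % of 851 residues is about 18 methionines, and removing the initiator Met gives about 2.0 % of 850.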

Secondary structure:

>Translated Secondary Structure
MGSKDGYGRVATDNLTIGLFQHDTNRNQEPNLHFHAVVANVTQGPDGKWRALRNDKLWSF
CCCCCCCCEEEECCEEEEEEEECCCCCCCCCEEEEEEEEECCCCCCCCEEEECCCCCCHH
NTLLNSMTMAHFRMAVEKMGYEAGPVGKHGNFEAAGITRQQLMAFSSRREEVLDAVRQIG
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC
ENTPKARDVAVLASRKSKEPVRDREGLLGEWKQSAEEARLDLQTIIDASEMRAAAKTIAN
CCCCCHHHHHHHHHCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHH
SKAESLLQRGLAKLREFAQRIKGDPADPLIPAHVLKTDAPTIAAAQAVASAVRHLSQREA
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHH
AFPREGLLKAALDFGLPTTVDRVEKQVNALVRQGALVRGKGAQSGWLASKEALQLEGAIL
CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCHHHHHCCEEE
AGVDQGRGAAAQIFERKDAVTRVQAVSAINHGITLNPGQEEAASLVLSSRDRIVAIQGVA
ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEECCCCHHHHHHHHHCCCCCEEEEECCC
GAGKSSVMKPVAQLLREEGKQVLGLAVQNTLVQMLERDTGIHSMTIVRFLSQWDRLLREP
CCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
GNAALLHEAKSALGDHVIVLDEASMVSNEDKAKLVRLANLAEVQRLALVGDRQQLGAVDA
CCHHHHHHHHHHCCCEEEEEECHHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCC
GKPFDLVQQAGIERAIMEENLRGRDSVLRRAQAAAQAGRIDDALKALAPTTIEAKGDSAI
CCCHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCEEECCCCCEE
VAAERWLSLSPAERARTSIYASGRALRSAVNDAVQQGLKANGELGQRAARLTVHSRVNAT
EEHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
REELRYMGTYSAGMVLNVRSRDSSQKLSKGDYTVKSIDHTRKRVMLEDRKGRKHTFSPTR
HHHHHHHCCCCCCEEEEECCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC
LRPGGSDDRFSLFERKSFRLFEGDKIRWTDNDHKRALFNADQAKIVGVDAKGVTVETSAG
CCCCCCCCHHHHHHHCCCEEECCCEEEECCCCCCHHHCCCCCCEEEEECCCCEEEECCCC
NELRLARGDPMLKRLDLAYALNAHMAQGLTSDRGIAVMDSRERNLSNQQTFLVTVTRLRD
CEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCHHCCCCCCCEEEEEHHHHHC
GLTLVADSAEKLGRAIKANSGEKSSALEVTQRLKSAAAKGTAQDKTNESAAPAKDPPELT
CCEEEECCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHH
KERVKPFEIGI
HHHCCCCCCCC
>Mature Secondary Structure 
GSKDGYGRVATDNLTIGLFQHDTNRNQEPNLHFHAVVANVTQGPDGKWRALRNDKLWSF
CCCCCCCEEEECCEEEEEEEECCCCCCCCCEEEEEEEEECCCCCCCCEEEECCCCCCHH
NTLLNSMTMAHFRMAVEKMGYEAGPVGKHGNFEAAGITRQQLMAFSSRREEVLDAVRQIG
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC
ENTPKARDVAVLASRKSKEPVRDREGLLGEWKQSAEEARLDLQTIIDASEMRAAAKTIAN
CCCCCHHHHHHHHHCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHH
SKAESLLQRGLAKLREFAQRIKGDPADPLIPAHVLKTDAPTIAAAQAVASAVRHLSQREA
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHH
AFPREGLLKAALDFGLPTTVDRVEKQVNALVRQGALVRGKGAQSGWLASKEALQLEGAIL
CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCHHHHHCCEEE
AGVDQGRGAAAQIFERKDAVTRVQAVSAINHGITLNPGQEEAASLVLSSRDRIVAIQGVA
ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEECCCCHHHHHHHHHCCCCCEEEEECCC
GAGKSSVMKPVAQLLREEGKQVLGLAVQNTLVQMLERDTGIHSMTIVRFLSQWDRLLREP
CCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC
GNAALLHEAKSALGDHVIVLDEASMVSNEDKAKLVRLANLAEVQRLALVGDRQQLGAVDA
CCHHHHHHHHHHCCCEEEEEECHHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCC
GKPFDLVQQAGIERAIMEENLRGRDSVLRRAQAAAQAGRIDDALKALAPTTIEAKGDSAI
CCCHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCEEECCCCCEE
VAAERWLSLSPAERARTSIYASGRALRSAVNDAVQQGLKANGELGQRAARLTVHSRVNAT
EEHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH
REELRYMGTYSAGMVLNVRSRDSSQKLSKGDYTVKSIDHTRKRVMLEDRKGRKHTFSPTR
HHHHHHHCCCCCCEEEEECCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC
LRPGGSDDRFSLFERKSFRLFEGDKIRWTDNDHKRALFNADQAKIVGVDAKGVTVETSAG
CCCCCCCCHHHHHHHCCCEEECCCEEEECCCCCCHHHCCCCCCEEEEECCCCEEEECCCC
NELRLARGDPMLKRLDLAYALNAHMAQGLTSDRGIAVMDSRERNLSNQQTFLVTVTRLRD
CEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCHHCCCCCCCEEEEEHHHHHC
GLTLVADSAEKLGRAIKANSGEKSSALEVTQRLKSAAAKGTAQDKTNESAAPAKDPPELT
CCEEEECCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHH
KERVKPFEIGI
HHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2163400; 7915817; 2680768; 2164585; 8736534 [H]