Definition Nitrosospira multiformis ATCC 25196 chromosome, complete genome.
Accession NC_007614
Length 3,184,243

The map label for this gene is cho [H]

Identifier: 82703123

GI number: 82703123

Start: 2300907

End: 2302298

Strand: Direct

Name: cho [H]

Synonym: Nmul_A2002

Alternate gene names: 82703123

Gene position: 2300907-2302298 (Clockwise)

Preceding gene: 82703122

Following gene: 82703127

Centisome position: 72.26
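The length and centisome fields can be cross-checked from the coordinates listed in this record. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and assumes the centisome value is the gene start expressed as a percentage of the chromosome length (the record does not state the formula). The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values copied from this record.
length = gene_length(2300907, 2302298)
position = centisome(2300907, 3184243)
print(length, position)  # 1392 72.26
```

Both values agree with the record: 1392 bases and a centisome position of 72.26.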

GC content: 58.41

Gene sequence:

>1392_bases
TTGCTTCCTTTTCCCTACGTCATCCTGGACCTGGAAACCACGGGCGGCACGCCCCTGCATGACCGCATCATCGAGATTGC
GCTCATTCGTTTCGAGGAAGGAATGGAAAGCGAGCGTTGGGAAACGCTCGTTAATCCAGGCATATCCATTCCGCCTTTCA
TCACGCATCTGACCGGTATCAGCAACGGGATGGTAAAGGATGCGCCTTCCTTCGGGGATATCGCCCACCGGCTCTACGGT
TTTCTTGATGGAGCGGTGCTGGCAGCGCATAACGTCCGCTTCGACTACGGATTCCTGAAGAACGAGTACCGGCGCATGGG
CGCGCTGCTCCAGCACAGGGTATTGTGCACGGCCCGGCTTTCGCGCAAGCTCTATCCTCAGCACAAGGGTCATGGACTCG
ATGCCATCATGCAGCGTCATGGGCTAAAGACCGAGATGCGCCACCGGGCGATGGGGGACGTGGAGCTCGTCGCAGCCTAT
CTTGAGATGGCAAGGCGCGAGCTGGGTGCCCGGGAGGTACAAGAAGCGGCAGCCATCCTGCTGAAAGACCCAAGCCTGCC
CGCAGGCCTGGATGCTTCGATTCTGGACCAGATTCCGGACAGGCCGGGTGTCTATTTCTTCTACGGCAAAAACGGCCTTC
CCCTCTATATCGGCAAGAGCGTGACGCTGCGCTCCCGGGTCATGTCCCATTTCAGCGGCGACCATGCTTCGTTCAGCGAC
ATGCGCATTGCCCAGGAGGTCGAGCGGGTCGAATGGATGGAAACAGCCGGGGAACTGGGCGCACTGCTGCTGGAGTCGAG
GTTAATCAAGGAGCATCATCCCATCCACAACAAGCGATTGCGCCGCTCCCGCACGCTCTTTTCCCTGAAGCTGGGCGACG
ACTCGTACGAGGCTCCCCTGGTGAATATCGTGACGGAAGAGGATATCCACCCTGAGGTGTTCGGCGATCTGTACGGGCTC
TTCCGCTCGAAAACAAAAGCGGTTGATGCACTGCGCGAGGTTGTCCGGGAGAACAGGTTGTGTCCCCGGGTGGTAGGCCT
TGAGAATGGGAAGGGCGCGTGTTTCGCACACCAGTTGAAACGCTGTAACGGCGTCTGCGCGGGCAAGGAAGTGCCGCAAC
TGCATTACCTGCGCCTGAAACAGGCGCTGCTCCCCCTTAAGCTCAAATCATGGCCCTATCCCGGCAGGATCGGCATACGG
GAATACAATGCGTCGTCCGGCCGATCGGAAGTGCATGTCTTCCACTACTGGTGCCATCTGGGAACGGTGGACAACGAAGC
CGGCCTGGACGATGTGCTGGGGACGCGCTCATCCATGAAGTTTGATCTCGATACCTACAAGCTGCTCCTAAAGACCCTGG
GAAAGCAAACAGAGGTGATTACGTTTGGATAG
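
The GC content field can in principle be recomputed from the gene sequence block above. This is a minimal sketch, assuming the sequence is supplied as FASTA-style text (a ">" header line followed by wrapped sequence lines) and that GC content means the percentage of G and C bases over all bases. The helper names are hypothetical, and the demo uses a toy sequence rather than the gene itself.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of one FASTA-style record, skipping '>' headers."""
    lines = [ln.strip() for ln in text.splitlines()]
    return "".join(ln for ln in lines if ln and not ln.startswith(">")).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to two decimals."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input, not the gene from this record.
toy = read_fasta_sequence(">8_bases\nGGCC\nAATT\n")
print(gc_content(toy))  # 50.0
```

Pasting the full 1392-base block into `read_fasta_sequence` and passing the result to `gc_content` should give a value comparable to the GC content listed for this gene.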

Upstream 100 bases:

>100_bases
GCAGCATCCCAGGATTACCATGACTACCCTGACATTGTGCCCTCTTTGCTGGGCGATTGATCCGGGAATTTAGAAACCTA
TCCTCACTCCTGTCCCAACC

Downstream 100 bases:

>100_bases
ACTGTTTCTGGTGCGAAAATGGCTCTCGTATGCAGCCTTCATGCGCGCTCAAAACTTCCCTCCAGCCACAGCGCCATGAG
CTGCTTGGAAATGTCGCGCA

Product: DNA polymerase III subunit epsilon

Products: diphosphate; DNAn+1

Alternate protein names: Endonuclease cho; UvrC homolog protein [H]

Number of amino acids: Translated: 463; Mature: 463
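
The amino acid count follows from the gene length: 1392 bases make 464 codons, and the final codon (TAG in the gene sequence of this record) is a stop codon, which leaves 463 residues. The gene starts with TTG, an alternative start codon that is read as methionine at the initiation position, consistent with the protein sequence beginning with M. The sketch below checks this arithmetic; the function name is hypothetical and the check assumes a single uninterrupted reading frame.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def protein_length(cds: str) -> int:
    """Number of residues encoded by a coding sequence that ends in a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[-1] not in STOP_CODONS:
        raise ValueError("coding sequence does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("internal stop codon found")
    return len(codons) - 1  # the stop codon is not translated


# First three codons of the gene in this record plus its stop codon (toy input).
print(protein_length("TTGCTTCCTTAG"))  # 3

# Same arithmetic applied to the full gene length from this record.
print(1392 // 3 - 1)  # 463
```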

Protein sequence:

>463_residues
MLPFPYVILDLETTGGTPLHDRIIEIALIRFEEGMESERWETLVNPGISIPPFITHLTGISNGMVKDAPSFGDIAHRLYG
FLDGAVLAAHNVRFDYGFLKNEYRRMGALLQHRVLCTARLSRKLYPQHKGHGLDAIMQRHGLKTEMRHRAMGDVELVAAY
LEMARRELGAREVQEAAAILLKDPSLPAGLDASILDQIPDRPGVYFFYGKNGLPLYIGKSVTLRSRVMSHFSGDHASFSD
MRIAQEVERVEWMETAGELGALLLESRLIKEHHPIHNKRLRRSRTLFSLKLGDDSYEAPLVNIVTEEDIHPEVFGDLYGL
FRSKTKAVDALREVVRENRLCPRVVGLENGKGACFAHQLKRCNGVCAGKEVPQLHYLRLKQALLPLKLKSWPYPGRIGIR
EYNASSGRSEVHVFHYWCHLGTVDNEAGLDDVLGTRSSMKFDLDTYKLLLKTLGKQTEVITFG

Sequences:

>Translated_463_residues
MLPFPYVILDLETTGGTPLHDRIIEIALIRFEEGMESERWETLVNPGISIPPFITHLTGISNGMVKDAPSFGDIAHRLYG
FLDGAVLAAHNVRFDYGFLKNEYRRMGALLQHRVLCTARLSRKLYPQHKGHGLDAIMQRHGLKTEMRHRAMGDVELVAAY
LEMARRELGAREVQEAAAILLKDPSLPAGLDASILDQIPDRPGVYFFYGKNGLPLYIGKSVTLRSRVMSHFSGDHASFSD
MRIAQEVERVEWMETAGELGALLLESRLIKEHHPIHNKRLRRSRTLFSLKLGDDSYEAPLVNIVTEEDIHPEVFGDLYGL
FRSKTKAVDALREVVRENRLCPRVVGLENGKGACFAHQLKRCNGVCAGKEVPQLHYLRLKQALLPLKLKSWPYPGRIGIR
EYNASSGRSEVHVFHYWCHLGTVDNEAGLDDVLGTRSSMKFDLDTYKLLLKTLGKQTEVITFG
>Mature_463_residues
MLPFPYVILDLETTGGTPLHDRIIEIALIRFEEGMESERWETLVNPGISIPPFITHLTGISNGMVKDAPSFGDIAHRLYG
FLDGAVLAAHNVRFDYGFLKNEYRRMGALLQHRVLCTARLSRKLYPQHKGHGLDAIMQRHGLKTEMRHRAMGDVELVAAY
LEMARRELGAREVQEAAAILLKDPSLPAGLDASILDQIPDRPGVYFFYGKNGLPLYIGKSVTLRSRVMSHFSGDHASFSD
MRIAQEVERVEWMETAGELGALLLESRLIKEHHPIHNKRLRRSRTLFSLKLGDDSYEAPLVNIVTEEDIHPEVFGDLYGL
FRSKTKAVDALREVVRENRLCPRVVGLENGKGACFAHQLKRCNGVCAGKEVPQLHYLRLKQALLPLKLKSWPYPGRIGIR
EYNASSGRSEVHVFHYWCHLGTVDNEAGLDDVLGTRSSMKFDLDTYKLLLKTLGKQTEVITFG

Specific function: Incises the DNA at the 3' side of a lesion during nucleotide excision repair. Incises the DNA farther away from the lesion than uvrC. Not able to incise the 5' site of a lesion. When a lesion remains because uvrC is not able to induce the 3' incision, cho

COG id: COG0322

COG function: function code L; Nuclease subunit of the excinuclease complex

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 uvrC C-terminal domain [H]

Homologues:

Organism=Escherichia coli, GI1788037, Length=295, Percent_Identity=40, Blast_Score=190, Evalue=1e-49,
Organism=Escherichia coli, GI87081999, Length=204, Percent_Identity=27.9411764705882, Blast_Score=64, Evalue=2e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000305 [H]

Pfam domain/function: PF01541 GIY-YIG [H]

EC number: 2.7.7.7

Molecular weight: Translated: 52234; Mature: 52234

Theoretical pI: Translated: 8.36; Mature: 8.36

Prosite motif: PS50164 UVRC_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
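
The Cys/Met percentages above can be recomputed from a protein sequence by counting residues. This is a minimal sketch, assuming one-letter amino acid codes and a percentage over the full sequence length rounded to one decimal (the rounding is inferred from the values shown in this record). The function name is hypothetical, and the demo uses a toy peptide.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(protein), 1)


# Toy peptide, not the protein from this record.
peptide = "MACM"
print(residue_percent(peptide, "C"))   # 25.0
print(residue_percent(peptide, "M"))   # 50.0
print(residue_percent(peptide, "CM"))  # 75.0
```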

Secondary structure:

>Translated Secondary Structure
MLPFPYVILDLETTGGTPLHDRIIEIALIRFEEGMESERWETLVNPGISIPPFITHLTGI
CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCHHHHHHHHC
SNGMVKDAPSFGDIAHRLYGFLDGAVLAAHNVRFDYGFLKNEYRRMGALLQHRVLCTARL
CCCCCCCCCCHHHHHHHHHHHHCCHHHHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHH
SRKLYPQHKGHGLDAIMQRHGLKTEMRHRAMGDVELVAAYLEMARRELGAREVQEAAAIL
HHHHCCCCCCCCHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHHCHHHHHHHHHHE
LKDPSLPAGLDASILDQIPDRPGVYFFYGKNGLPLYIGKSVTLRSRVMSHFSGDHASFSD
EECCCCCCCCCHHHHHHCCCCCCEEEEECCCCCEEEECCCHHHHHHHHHHHCCCCCCHHH
MRIAQEVERVEWMETAGELGALLLESRLIKEHHPIHNKRLRRSRTLFSLKLGDDSYEAPL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCEEEEEECCCCCCCCE
VNIVTEEDIHPEVFGDLYGLFRSKTKAVDALREVVRENRLCPRVVGLENGKGACFAHQLK
EEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHH
RCNGVCAGKEVPQLHYLRLKQALLPLKLKSWPYPGRIGIREYNASSGRSEVHVFHYWCHL
HHCCCCCCCCCCHHHHHHHHHHHCCCHHCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEC
GTVDNEAGLDDVLGTRSSMKFDLDTYKLLLKTLGKQTEVITFG
CCCCCCCCCHHHHCCCCCCEECHHHHHHHHHHHCCCCEEEEEC
>Mature Secondary Structure
MLPFPYVILDLETTGGTPLHDRIIEIALIRFEEGMESERWETLVNPGISIPPFITHLTGI
CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCHHHHHHHHC
SNGMVKDAPSFGDIAHRLYGFLDGAVLAAHNVRFDYGFLKNEYRRMGALLQHRVLCTARL
CCCCCCCCCCHHHHHHHHHHHHCCHHHHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHH
SRKLYPQHKGHGLDAIMQRHGLKTEMRHRAMGDVELVAAYLEMARRELGAREVQEAAAIL
HHHHCCCCCCCCHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHHCHHHHHHHHHHE
LKDPSLPAGLDASILDQIPDRPGVYFFYGKNGLPLYIGKSVTLRSRVMSHFSGDHASFSD
EECCCCCCCCCHHHHHHCCCCCCEEEEECCCCCEEEECCCHHHHHHHHHHHCCCCCCHHH
MRIAQEVERVEWMETAGELGALLLESRLIKEHHPIHNKRLRRSRTLFSLKLGDDSYEAPL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCEEEEEECCCCCCCCE
VNIVTEEDIHPEVFGDLYGLFRSKTKAVDALREVVRENRLCPRVVGLENGKGACFAHQLK
EEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHH
RCNGVCAGKEVPQLHYLRLKQALLPLKLKSWPYPGRIGIREYNASSGRSEVHVFHYWCHL
HHCCCCCCCCCCHHHHHHHHHHHCCCHHCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEC
GTVDNEAGLDDVLGTRSSMKFDLDTYKLLLKTLGKQTEVITFG
CCCCCCCCCHHHHCCCCCCEECHHHHHHHHHHHCCCCEEEEEC
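
The per-residue prediction strings above can be summarised as the share of each state. This sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which this record does not define explicitly. The function name is hypothetical, and the demo uses a toy string.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    ss = "".join(ss.split()).upper()  # drop line breaks and spaces
    if not ss:
        raise ValueError("empty secondary-structure string")
    unknown = set(ss) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state symbols: {sorted(unknown)}")
    counts = Counter(ss)
    return {state: round(100.0 * counts[state] / len(ss), 1) for state in "HEC"}


# Toy input, not the prediction from this record.
print(ss_composition("HH\nEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Only the state lines should be passed in; the amino acid lines that alternate with them in the block above contain other letters and are rejected by the symbol check.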

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: deoxynucleoside triphosphate; DNAn

Specific reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1)

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11677608; 12644504 [H]