Definition | Shewanella sp. ANA-3 chromosome 1, complete sequence. |
---|---|
Accession | NC_008577 |
Length | 4,972,204 |
The map label for this gene is 117919382
Identifier: 117919382
GI number: 117919382
Start: 1093373
End: 1094032
Strand: Direct
Name: 117919382
Synonym: Shewana3_0933
Alternate gene names: NA
Gene position: 1093373-1094032 (Clockwise)
Preceding gene: 117919379
Following gene: 117919383
Centisome position: 21.99
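The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the gene length is `End - Start + 1` (inclusive coordinates) and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Both are common conventions, but the record itself does not state them. The function names are illustrative only.

```python
# Assumed conventions (not stated in the record):
#   gene length = end - start + 1 (inclusive, 1-based coordinates)
#   centisome   = 100 * start / chromosome length

def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start position as a percentage of the chromosome length."""
    return round(100.0 * start / chromosome_length, 2)


# Values taken from this record.
print(gene_length(1093373, 1094032))   # 660
print(centisome(1093373, 4972204))     # 21.99
```

Under these assumptions the results agree with the 660-base gene sequence and the reported centisome position of 21.99.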
GC content: 47.73
Gene sequence:
>660_bases ATGAAACAAGTCGATTTAGGACTCGAACCTGAGGCCAATCTTGAAGCTGATAGCGGCATCTCTTGCGCCCATCATCAAGA TGCGGCACAAGTCGAGCCGCCTATCACCTTAGTTAGGGATTATTTGAATGTGGCGCAGCAGAATGCCCTAATCAAGGAGG CCGAATCTTACCCTCTGAACCGCCCACAAATTCAAGTATTTGGGGAATATCATGCCATTCCAAGGCAGCAGGTTTGGTAT GGCGATTTGGGATGCGACTATTTATATTCGGGGCTATTTATTCGTGCTTTGCCTTGGCCTAAGTATTTGCAAAAATTGCG TGACAAGTTGCAGCGGGATTTTGGCCTGGGCAGCAATGGTGTGTTAGTGAATCGCTATGCCGATGGTCAAGATTGCATGG GCGCCCACAGCGATGATGAGCCCGAGATTGCAAGCGGCAGTGATATCGCCTCTATCAGCCTTGGCGCCTCGCGGGATTTT GTGATTAAACATAAACACAGCAAGGTGAAATACACGATTAGCTTACACAGTGGTGACTTACTGATTATGCACTGGCCAAT GCAGCAAGATTGGTTACACAGCGTGCCTAAGCGCTTAAAGGTTAAGGAACCCCGCTGGAACTACACCTTTAGGCAATTGA TTGTTAATTATCATGGCTGA
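The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming a plain count over the sequence with whitespace ignored. The helper name `gc_content` is hypothetical, and the example input is only the first 20 bases of the gene, used for illustration.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# First 20 bases of the gene sequence, for illustration only.
prefix = "ATGAAACAAG TCGATTTAGG"
print(gc_content(prefix))
```

Applied to the full 660-base sequence, the reported value of 47.73 would correspond to 315 G or C bases (315 / 660 = 47.73%).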
Upstream 100 bases:
>100_bases CCCCTTATCGTGCGATGCTTGCCTGAGGTTGTATATCTGCTCGTGCATTTATGGGTGATTGGATGGGGTCAATCAAGTCG TCAGTGGTATAAGGTACTGT
Downstream 100 bases:
>100_bases TTGAAAGCTAAATTATTTCATACTGTTGCTAAATTGGCTCAGTTTAAAGCTGCAAAGCTGACGTTGGGATTTGATCTTTT GTACTTGAAGCAGTACTCTG
Product: DNA-N1-methyladenine dioxygenase
Products: NA
Alternate protein names: Alkylated DNA Repair Protein; DNA Repair System Specific For Alkylated DNA; DNA-N1-Methyladenine Dioxygenase; 2OG-Fe(II) Oxygenase Superfamily Protein; 2OG-Fe(II) Oxygenase Family Oxidoreductase; Oxidoreductase 2OG-Fe(II) Oxygenase Family; Alkylated DNA Repair Protein-Like Protein; Oxidoreductase 2OG-Fe(II) Oxygenase Family Protein; Alkylated DNA Repair Protein AlkB; CRISPR-Associated Family Protein; DNA Repair System Specific For Alkylated DNA Protein; 2OG-Fe(II) Oxygenase Family Protein; DNA Repair System Protein
Number of amino acids: Translated: 219; Mature: 219
Protein sequence:
>219_residues MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLNRPQIQVFGEYHAIPRQQVWY GDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNGVLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDF VIKHKHSKVKYTISLHSGDLLIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG
Sequences:
>Translated_219_residues MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLNRPQIQVFGEYHAIPRQQVWY GDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNGVLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDF VIKHKHSKVKYTISLHSGDLLIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG
>Mature_219_residues MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLNRPQIQVFGEYHAIPRQQVWY GDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNGVLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDF VIKHKHSKVKYTISLHSGDLLIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG
Specific function: Unknown
COG id: COG3145
COG function: function code L; Alkylated DNA repair protein
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI224451107, Length=175, Percent_Identity=44, Blast_Score=149, Evalue=2e-36
Organism=Homo sapiens, GI48717226, Length=175, Percent_Identity=44, Blast_Score=149, Evalue=2e-36
Organism=Homo sapiens, GI224451103, Length=175, Percent_Identity=44, Blast_Score=149, Evalue=2e-36
Organism=Homo sapiens, GI21040275, Length=156, Percent_Identity=37.1794871794872, Blast_Score=94, Evalue=7e-20
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 24969; Mature: 24969
Theoretical pI: Translated: 6.86; Mature: 6.86
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
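These percentages can be reproduced from the protein sequence given earlier in the record. The sketch below assumes that each value is simply the residue count divided by the total length, rounded to one decimal place. The helper name `residue_percent` is hypothetical.

```python
# Protein sequence copied from this record (219 residues).
PROTEIN = (
    "MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLN"
    "RPQIQVFGEYHAIPRQQVWYGDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNG"
    "VLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDFVIKHKHSKVKYTISLHSGDL"
    "LIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions holding any of the given one-letter residues."""
    hits = sum(1 for aa in sequence if aa in residues)
    return round(100.0 * hits / len(sequence), 1)


print(residue_percent(PROTEIN, "C"))    # 1.4  (3 Cys)
print(residue_percent(PROTEIN, "M"))    # 1.8  (4 Met)
print(residue_percent(PROTEIN, "CM"))   # 3.2  (7 Cys+Met)
```

Because the translated and mature sequences are identical in this record, the same figures apply to both.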
Secondary structure:
>Translated Secondary Structure
MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC
RPQIQVFGEYHAIPRQQVWYGDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNG
CCEEEEEECCCCCCCCEEEECCCCHHHHHHCHHEEECCCHHHHHHHHHHHHHHHCCCCCC
VLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDFVIKHKHSKVKYTISLHSGDL
EEEEEECCCHHHCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCEEEEEEEECCCCE
LIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG
EEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCC
>Mature Secondary Structure
MKQVDLGLEPEANLEADSGISCAHHQDAAQVEPPITLVRDYLNVAQQNALIKEAESYPLN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC
RPQIQVFGEYHAIPRQQVWYGDLGCDYLYSGLFIRALPWPKYLQKLRDKLQRDFGLGSNG
CCEEEEEECCCCCCCCEEEECCCCHHHHHHCHHEEECCCHHHHHHHHHHHHHHHCCCCCC
VLVNRYADGQDCMGAHSDDEPEIASGSDIASISLGASRDFVIKHKHSKVKYTISLHSGDL
EEEEEECCCHHHCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCEEEEEEEECCCCE
LIMHWPMQQDWLHSVPKRLKVKEPRWNYTFRQLIVNYHG
EEEECCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA