Definition | Geobacter metallireducens GS-15 chromosome, complete genome. |
---|---|
Accession | NC_007517 |
Length | 3,997,420 |
Click here to switch to the map view.
The map label for this gene is 78222447
Identifier: 78222447
GI number: 78222447
Start: 1390389
End: 1392110
Strand: Direct
Name: 78222447
Synonym: Gmet_1233
Alternate gene names: NA
Gene position: 1390389-1392110 (Clockwise)
Preceding gene: 78222442
Following gene: 78222448
Centisome position: 34.78
GC content: 55.23
Gene sequence:
>1722_bases ATGAAAAAACGTATTGTCGTGGCATTATTCCTGGCTCTTTCCTTTCTTCCGGGATGCGCCACAAACGGCGCGGGAAAGCC TCTTCCTGCCAACGAGCATTCATTTCAGCCGACCGTCAATATTGCCGGATCCAGGGCTCTCTACATTTATTCATTGTCGC GTCTCCGCGAGCTGGACGGGGATTTCGAGGGTGCGCTCACGCTCCTGAATGGTGCCATTGAGGCTGACCCCAACTCCGCG TTTCTTCATACGGCAGCCGCCGAGATCTATCTGAAATCGGGCAAACTCGACGATGCGCTCCGTGCATGTGAAAACGCCAT CCGCGTCGATCCCGGGTTCCGTCCGGCACGGATCATTGCCGGGACCATCCTGGCAAACCTCAAGCGGGACAAGGAAGCTA TTGTCCATCTCAGCAAGGCCATTGAGCTTGATCCGACCAAGGAGGATGCCTATCTCCACCTGGCCATCTCCTATGTCCGA ACCTTCGATTACGAGCAGGCGGTTAACACCCTCAAGTCCCTGATCAAGATCAACCCCGAGTCATCGCTGGGCTACTATTA CCTGGGCAAGACCTACGACCAGATGAAGCTCCAGAAGGAGGCTGCCAACTATTACAAGAAGGCCATCGAGATCAAGCCCG ATTTCGAGCAGGCCATAATCGACCTGGGGATTTCCCAGGAGGGGCTCGGCCTCTATGATGATGCCATCGCCACCTACAAG CGGCTGCTTGAGACAAATCCGTTCAACATGAACGTGCTGCAGCACCTGGTGCAGCTTTACCTCCAGCAGCAGCGGCTGGA GGACGCGCTCCCACTGCTCATCGTGATGAAGGATCGGGGTGTCGGCGGCCTGGAAACCCAGCGCAAGATCGGCCTCATCT ATATGGAACTGGAGCGCTATGACGAGGCAATCGCCGAATTCGAGCAGATCCTTGCCCGGGAACCCAAGGCCCACCAGATT CGCTTCTACATTGCGAGCGCTTACGAGGAGAAAGAGGAGTTCGACAAGGCCATAGAGGAATTCAGTAAGATTCCTCCGGG AACCGCCAATTACGTTGAGGCCTTGGGGCACATCGCTTTCATGTACCGGGATCAGGAAAAACCTGAGAAGGGAATCCAGA TCCTGACGGATGCCATTACTGCCAATCCCGATAAATTGGACCTCTATCTCTATCTGGCCGGTCTGTACGAATCAATGGAT AAGTTCTCAGAAGGACTTGCCGTTCTCAAGGGGGTTGAAGGGAAATTTGCCGAGGACCCGAGGCTCCACTTCCGGATGGG AACCATCCTCGACAAAATGGGGAACAAGGAGGAGTCCATCGCCCGGATGAAGCGGGTTATCGCCATTACGCCCGACGATG CCCAGGCCCTCAACTATCTGGGCTATACCTACGCCGAGATGGGCATCAAGCTGGATGAAGCCCTCCAGTACCTCAAGAAG GCCGTGGCGCTTCGTCCCAACGACGGTTTTATTCTCGACAGTCTCGGCTGGGTCTACTTCAAGATGAAGCGTTATGACGA GGCCGTGCCGCTCCTGGAGCGGTCGCTCAAGGTCGTGGAGGACGATCTGACGGTCATGGAGCACCTGGCCGACGCCTATG CAGCCAACCATGAATACCGCAATGCCTTGAAACTTTACAAAAAGATCCTCGACGCCGACCCGGGTCGCAAGGATATCGCC GAAAAGAAGAAAAAGGTCAGGGCGGAAAGTCTGGAAAAATGA
Upstream 100 bases:
>100_bases TGAACACTTATTTCACTTTACCATATTCGGAACAACGGAGTAGATGCGGGGCTTTCGGGCGTCTTTCCCGCTTCAGGCAG ACGTAGGCACGGGTAACTGC
Downstream 100 bases:
>100_bases CGGCACGGCTGTTCTGCTGTCTTGCGGCACTGGTTACCCTTGTTTCGTGCAGCGGCGGTACGCCTCCTGCGGAACTGCCG CGGCGTGGAGCAGGGGCCCC
Product: intermediate filament protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 573; Mature: 573
Protein sequence:
>573_residues MKKRIVVALFLALSFLPGCATNGAGKPLPANEHSFQPTVNIAGSRALYIYSLSRLRELDGDFEGALTLLNGAIEADPNSA FLHTAAAEIYLKSGKLDDALRACENAIRVDPGFRPARIIAGTILANLKRDKEAIVHLSKAIELDPTKEDAYLHLAISYVR TFDYEQAVNTLKSLIKINPESSLGYYYLGKTYDQMKLQKEAANYYKKAIEIKPDFEQAIIDLGISQEGLGLYDDAIATYK RLLETNPFNMNVLQHLVQLYLQQQRLEDALPLLIVMKDRGVGGLETQRKIGLIYMELERYDEAIAEFEQILAREPKAHQI RFYIASAYEEKEEFDKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLYLAGLYESMD KFSEGLAVLKGVEGKFAEDPRLHFRMGTILDKMGNKEESIARMKRVIAITPDDAQALNYLGYTYAEMGIKLDEALQYLKK AVALRPNDGFILDSLGWVYFKMKRYDEAVPLLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKILDADPGRKDIA EKKKKVRAESLEK
Sequences:
>Translated_573_residues MKKRIVVALFLALSFLPGCATNGAGKPLPANEHSFQPTVNIAGSRALYIYSLSRLRELDGDFEGALTLLNGAIEADPNSA FLHTAAAEIYLKSGKLDDALRACENAIRVDPGFRPARIIAGTILANLKRDKEAIVHLSKAIELDPTKEDAYLHLAISYVR TFDYEQAVNTLKSLIKINPESSLGYYYLGKTYDQMKLQKEAANYYKKAIEIKPDFEQAIIDLGISQEGLGLYDDAIATYK RLLETNPFNMNVLQHLVQLYLQQQRLEDALPLLIVMKDRGVGGLETQRKIGLIYMELERYDEAIAEFEQILAREPKAHQI RFYIASAYEEKEEFDKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLYLAGLYESMD KFSEGLAVLKGVEGKFAEDPRLHFRMGTILDKMGNKEESIARMKRVIAITPDDAQALNYLGYTYAEMGIKLDEALQYLKK AVALRPNDGFILDSLGWVYFKMKRYDEAVPLLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKILDADPGRKDIA EKKKKVRAESLEK >Mature_573_residues MKKRIVVALFLALSFLPGCATNGAGKPLPANEHSFQPTVNIAGSRALYIYSLSRLRELDGDFEGALTLLNGAIEADPNSA FLHTAAAEIYLKSGKLDDALRACENAIRVDPGFRPARIIAGTILANLKRDKEAIVHLSKAIELDPTKEDAYLHLAISYVR TFDYEQAVNTLKSLIKINPESSLGYYYLGKTYDQMKLQKEAANYYKKAIEIKPDFEQAIIDLGISQEGLGLYDDAIATYK RLLETNPFNMNVLQHLVQLYLQQQRLEDALPLLIVMKDRGVGGLETQRKIGLIYMELERYDEAIAEFEQILAREPKAHQI RFYIASAYEEKEEFDKAIEEFSKIPPGTANYVEALGHIAFMYRDQEKPEKGIQILTDAITANPDKLDLYLYLAGLYESMD KFSEGLAVLKGVEGKFAEDPRLHFRMGTILDKMGNKEESIARMKRVIAITPDDAQALNYLGYTYAEMGIKLDEALQYLKK AVALRPNDGFILDSLGWVYFKMKRYDEAVPLLERSLKVVEDDLTVMEHLADAYAANHEYRNALKLYKKILDADPGRKDIA EKKKKVRAESLEK
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 5 TPR repeats [H]
Homologues:
Organism=Homo sapiens, GI32307148, Length=503, Percent_Identity=23.2604373757455, Blast_Score=92, Evalue=2e-18, Organism=Homo sapiens, GI32307150, Length=496, Percent_Identity=22.1774193548387, Blast_Score=92, Evalue=2e-18, Organism=Homo sapiens, GI83415184, Length=533, Percent_Identity=20.0750469043152, Blast_Score=71, Evalue=2e-12, Organism=Homo sapiens, GI301336134, Length=406, Percent_Identity=20.4433497536946, Blast_Score=71, Evalue=3e-12, Organism=Caenorhabditis elegans, GI115532690, Length=526, Percent_Identity=23.0038022813688, Blast_Score=89, Evalue=6e-18, Organism=Caenorhabditis elegans, GI115532692, Length=425, Percent_Identity=23.5294117647059, Blast_Score=85, Evalue=9e-17, Organism=Caenorhabditis elegans, GI25147174, Length=258, Percent_Identity=22.8682170542636, Blast_Score=76, Evalue=6e-14, Organism=Drosophila melanogaster, GI17647755, Length=404, Percent_Identity=22.029702970297, Blast_Score=84, Evalue=2e-16, Organism=Drosophila melanogaster, GI24585827, Length=404, Percent_Identity=22.029702970297, Blast_Score=84, Evalue=2e-16, Organism=Drosophila melanogaster, GI24585829, Length=404, Percent_Identity=22.029702970297, Blast_Score=84, Evalue=2e-16,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011717 - InterPro: IPR013026 - InterPro: IPR011990 - InterPro: IPR013105 - InterPro: IPR019734 [H]
Pfam domain/function: PF07719 TPR_2; PF07721 TPR_4 [H]
EC number: NA
Molecular weight: Translated: 64769; Mature: 64769
Theoretical pI: Translated: 5.62; Mature: 5.62
Prosite motif: PS50005 TPR L=RR ; PS50293 TPR_REGION ; PS00226 IF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein) 2.3 %Met (Translated Protein) 2.6 %Cys+Met (Translated Protein) 0.3 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 2.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKKRIVVALFLALSFLPGCATNGAGKPLPANEHSFQPTVNIAGSRALYIYSLSRLRELDG CCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEHHHHHHHHHCC DFEGALTLLNGAIEADPNSAFLHTAAAEIYLKSGKLDDALRACENAIRVDPGFRPARIIA CHHHHHHHHHCCCCCCCCCHHHHHHHHHHHEECCCCHHHHHHHHHHEECCCCCCHHHHHH GTILANLKRDKEAIVHLSKAIELDPTKEDAYLHLAISYVRTFDYEQAVNTLKSLIKINPE HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHHHHHCCHHHHHHHHHHHHCCCCC SSLGYYYLGKTYDQMKLQKEAANYYKKAIEIKPDFEQAIIDLGISQEGLGLYDDAIATYK CCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHH RLLETNPFNMNVLQHLVQLYLQQQRLEDALPLLIVMKDRGVGGLETQRKIGLIYMELERY HHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHCEEEEHHHHH DEAIAEFEQILAREPKAHQIRFYIASAYEEKEEFDKAIEEFSKIPPGTANYVEALGHIAF HHHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH MYRDQEKPEKGIQILTDAITANPDKLDLYLYLAGLYESMDKFSEGLAVLKGVEGKFAEDP HHCCCCCCHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC RLHFRMGTILDKMGNKEESIARMKRVIAITPDDAQALNYLGYTYAEMGIKLDEALQYLKK CHHHHHHHHHHHHCCHHHHHHHHHHHEEECCCHHHHHHHHHHHHHHHCCCHHHHHHHHHH AVALRPNDGFILDSLGWVYFKMKRYDEAVPLLERSLKVVEDDLTVMEHLADAYAANHEYR HHCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH NALKLYKKILDADPGRKDIAEKKKKVRAESLEK HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MKKRIVVALFLALSFLPGCATNGAGKPLPANEHSFQPTVNIAGSRALYIYSLSRLRELDG CCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEEEEHHHHHHHHHCC DFEGALTLLNGAIEADPNSAFLHTAAAEIYLKSGKLDDALRACENAIRVDPGFRPARIIA CHHHHHHHHHCCCCCCCCCHHHHHHHHHHHEECCCCHHHHHHHHHHEECCCCCCHHHHHH GTILANLKRDKEAIVHLSKAIELDPTKEDAYLHLAISYVRTFDYEQAVNTLKSLIKINPE HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHHHHHCCHHHHHHHHHHHHCCCCC SSLGYYYLGKTYDQMKLQKEAANYYKKAIEIKPDFEQAIIDLGISQEGLGLYDDAIATYK CCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHH RLLETNPFNMNVLQHLVQLYLQQQRLEDALPLLIVMKDRGVGGLETQRKIGLIYMELERY HHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHCEEEEHHHHH DEAIAEFEQILAREPKAHQIRFYIASAYEEKEEFDKAIEEFSKIPPGTANYVEALGHIAF HHHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH MYRDQEKPEKGIQILTDAITANPDKLDLYLYLAGLYESMDKFSEGLAVLKGVEGKFAEDP HHCCCCCCHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC RLHFRMGTILDKMGNKEESIARMKRVIAITPDDAQALNYLGYTYAEMGIKLDEALQYLKK CHHHHHHHHHHHHCCHHHHHHHHHHHEEECCCHHHHHHHHHHHHHHHCCCHHHHHHHHHH AVALRPNDGFILDSLGWVYFKMKRYDEAVPLLERSLKVVEDDLTVMEHLADAYAANHEYR HHCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH NALKLYKKILDADPGRKDIAEKKKKVRAESLEK HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7883699; 10984043 [H]