Definition | Cupriavidus metallidurans CH34 megaplasmid, complete sequence. |
---|---|
Accession | NC_007974 |
Length | 2,580,084 bp |
The map label for this gene is 94312756
Identifier: 94312756
GI number: 94312756
Start: 367549
End: 369537
Strand: Direct
Name: 94312756
Synonym: Rmet_3826
Alternate gene names: NA
Gene position: 367549-369537 (Clockwise)
Preceding gene: 94312753
Following gene: 94312757
Centisome position: 14.25
GC content: 65.51
Gene sequence:
>1989_bases ATGTCCGACCAACCCCTCGTTGCCCGCCGCCGTGCCCTGAAAATCCTTGGTGGAGCCCCGATGCTCCCGATCGGCGCGGC CGCCGCGAGTCTTGCGGGTGGGATGACCGCGATGGAGGCCGCGTTCGCCGCCGCGCCCCGGCGCAAAGCTGGCGCGGCCT TTACCTCGGCAACGTTCACGGGCATGGCCGCGCCGTCGCTCGACAATCCGGCGACGATGGCCACGACGACGGTTGGCTCG TCGATCAAGGTGTCTTACAGCGATGACAGCGCGCAGGAATATGCGCTGGCATACCAGCCCTTCTTCGTCACCGGCGACAT GCTGCCCGATGGCAAGGGCGGCAAGGTGCTGGCAGGCGGTTACTACGATATCCATAACCGCGCGATCATGGATACCTCGG TGCCCGGCAAGACGCGCCAGTTCTTCTCGGACTGCCCGGACGGCAGCTCGCTGCTGACCGTGCGCGGCGCGCGCGTGCCC GGCGTGAAGGGCAATACGGTGTTCGCCGTGGTGCAATTCGAATACACCACGCGCGACCAGGCCAACGGTGACATGTACGG CATGCTGCCGTCGCCGATCGCCGTCATCACGCTCGACCAGGACCCGAAGACCGGCAAACTCACTCCGGTGCGCTATCACA ACGTGGATACGTCGTCCGTGAACGGCCTGTGGATCACCTGTGGCGCGAGCCTGTCGCCGTGGAACACGCACCTGTCGAGC GAGGAGTACGAGCCGGACGCGGTCAAGGCCGCCACCGACAAGCGTCTGCGTGGCTTCAGCCAGAACCTGTATGGCGACGC AGACCGCGCCAACCCCTACCACTACGGCCATCTCCCCGAAGTCACCGTGCACGCCGACGGCACGGGTTCGATCCGCAAGC ATTACTGCCTGGGCCGCATCTCGCACGAACTCGTGCAGGTGATGCCGGACGAACGCACCGTACTGATGGGCGACGACGCC ACCAATGGCGGCCTGTTCATGTTCATCGCCGATCGTCCGCGCGATCTGTCGTCGGGCAAGCTCTATGTGGCCAAGTGGCA CCAGACTTCCGGCACGGGGCCGGGCGCGGCCACGCTGTCGTGGATCGCGCTGGGCAGCGCGACGAGCGACGAGATCCGCA AGCTGGTGGATAGTGGCATCAAGGTCTCCGACATCATGGACGTCAAGACCAAGGACCCGGCTGATAGCGCTTACACGCGC ATCCTGATCGATGGCAAGCCGAACTGGGTGAAGGTCGCGCCGGGCCAGGAGAAGGCGGCGGCGTTCCTGGAGACTCATCG TTACGCGGCGCTTGTGGGTGGCAGCCTGGGCTTCACCAAGATGGAAGGGACCACGGTCAATATCGCGGACCGCAAGGCGT ACTCGGCGATCTCGCGTATCGAATCGAGCATGTTGGCCGGCAACGCGGCGAACGCGGGCGATATCCGCGTGGAAGGTCCC TACTCCGGCGCCGTGTACGAACTGAACCTGCGTGGCGCCCAGACTGACCATGCTGGCGCGCGCATCGACAGCGAATGGGT GCCGGTCGACATGGCACCGGTGCCGGCACTGGTATCCGAAGACCTCGGCGGCGGACCCAGGAAGGCGCAGGATGCGCTCG GCAACTTCGCCAACCCCGACAAGGTGGCCACGCCGGACAACCTCAAGTTCTCGGAGTCGCTGCGCACGCTGTTCGTCGGC GAAGACAGCAACACGCACGTCAACAACTTCCTGTGGGCGTACAACGTCGACACGAAGCAGCTGTCGCGCGTGCTGTCGTG CCCGGCCGGCGCGGAATCGACCGGACTGCACGCGGTGGACGAGATCAATGGCTGGACGTACATCATGAGCAACTTCCAGC ATCCTGGCGACTGGGAAGCGGGGCTGCACGACAAGGTCAAGGCACAGCTCGACCCGCTGGTCCGCGCCAACTATAAGGAC 
CGGTTTGGCGCGGCGGTGGGCTACCTGACGGCCACGCCGGCAGCGATTCGCCTCGGCAAGCGGGCCTGA
Upstream 100 bases:
>100_bases AAAAACTGTCATGCTGGGTGCTTATCATCCGCCACGGGGCGTTCCGTCCACATTGCCCCGGCGCCCGATTCGCATACCTA TCCACCCACATAGAACCATC
Downstream 100 bases:
>100_bases CGCCGCGTTCCGGCCGGTCCTGCTACCCTGTGGGCTGCGCGCCCGGCATCCGACCGGGTGGCAGCCCCACGAGGAACCCC ATGACCGCTGATCCGACCGC
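The positional fields above (gene position, centisome position, GC content) can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive, that the centisome position is the start coordinate expressed as a percentage of the replicon length, and that GC content is the percentage of G and C among the A/C/G/T bases. The record does not state these definitions.

```python
def gene_length(start, end):
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def centisome(start, replicon_length):
    """Start coordinate as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * start / replicon_length, 2)


def gc_content(seq):
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (such as the spaces that break
    the sequence into blocks in this record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


if __name__ == "__main__":
    # Values taken from this record: Start, End, and the megaplasmid Length.
    print(gene_length(367549, 369537))
    print(centisome(367549, 2580084))
```

With the values from this record, `gene_length` agrees with the `>1989_bases` header and `centisome` agrees with the reported 14.25. Under the same assumptions, passing the gene sequence to `gc_content` should give a value comparable to the reported 65.51.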
Product: hypothetical protein
Products: NA
Alternate protein names: Phosphatase; Exported Alkaline Phosphatase; Phosphatase-Like Protein; Cell Surface Protein; Twin-Arginine Translocation Pathway Signal; Phosphatase Or Zinc Metalloprotease With Signal Peptide; PKD Domain Protein; Lipoprotein
Number of amino acids: Translated: 662; Mature: 661
Protein sequence:
>662_residues MSDQPLVARRRALKILGGAPMLPIGAAAASLAGGMTAMEAAFAAAPRRKAGAAFTSATFTGMAAPSLDNPATMATTTVGS SIKVSYSDDSAQEYALAYQPFFVTGDMLPDGKGGKVLAGGYYDIHNRAIMDTSVPGKTRQFFSDCPDGSSLLTVRGARVP GVKGNTVFAVVQFEYTTRDQANGDMYGMLPSPIAVITLDQDPKTGKLTPVRYHNVDTSSVNGLWITCGASLSPWNTHLSS EEYEPDAVKAATDKRLRGFSQNLYGDADRANPYHYGHLPEVTVHADGTGSIRKHYCLGRISHELVQVMPDERTVLMGDDA TNGGLFMFIADRPRDLSSGKLYVAKWHQTSGTGPGAATLSWIALGSATSDEIRKLVDSGIKVSDIMDVKTKDPADSAYTR ILIDGKPNWVKVAPGQEKAAAFLETHRYAALVGGSLGFTKMEGTTVNIADRKAYSAISRIESSMLAGNAANAGDIRVEGP YSGAVYELNLRGAQTDHAGARIDSEWVPVDMAPVPALVSEDLGGGPRKAQDALGNFANPDKVATPDNLKFSESLRTLFVG EDSNTHVNNFLWAYNVDTKQLSRVLSCPAGAESTGLHAVDEINGWTYIMSNFQHPGDWEAGLHDKVKAQLDPLVRANYKD RFGAAVGYLTATPAAIRLGKRA
Sequences:
>Translated_662_residues MSDQPLVARRRALKILGGAPMLPIGAAAASLAGGMTAMEAAFAAAPRRKAGAAFTSATFTGMAAPSLDNPATMATTTVGS SIKVSYSDDSAQEYALAYQPFFVTGDMLPDGKGGKVLAGGYYDIHNRAIMDTSVPGKTRQFFSDCPDGSSLLTVRGARVP GVKGNTVFAVVQFEYTTRDQANGDMYGMLPSPIAVITLDQDPKTGKLTPVRYHNVDTSSVNGLWITCGASLSPWNTHLSS EEYEPDAVKAATDKRLRGFSQNLYGDADRANPYHYGHLPEVTVHADGTGSIRKHYCLGRISHELVQVMPDERTVLMGDDA TNGGLFMFIADRPRDLSSGKLYVAKWHQTSGTGPGAATLSWIALGSATSDEIRKLVDSGIKVSDIMDVKTKDPADSAYTR ILIDGKPNWVKVAPGQEKAAAFLETHRYAALVGGSLGFTKMEGTTVNIADRKAYSAISRIESSMLAGNAANAGDIRVEGP YSGAVYELNLRGAQTDHAGARIDSEWVPVDMAPVPALVSEDLGGGPRKAQDALGNFANPDKVATPDNLKFSESLRTLFVG EDSNTHVNNFLWAYNVDTKQLSRVLSCPAGAESTGLHAVDEINGWTYIMSNFQHPGDWEAGLHDKVKAQLDPLVRANYKD RFGAAVGYLTATPAAIRLGKRA >Mature_661_residues SDQPLVARRRALKILGGAPMLPIGAAAASLAGGMTAMEAAFAAAPRRKAGAAFTSATFTGMAAPSLDNPATMATTTVGSS IKVSYSDDSAQEYALAYQPFFVTGDMLPDGKGGKVLAGGYYDIHNRAIMDTSVPGKTRQFFSDCPDGSSLLTVRGARVPG VKGNTVFAVVQFEYTTRDQANGDMYGMLPSPIAVITLDQDPKTGKLTPVRYHNVDTSSVNGLWITCGASLSPWNTHLSSE EYEPDAVKAATDKRLRGFSQNLYGDADRANPYHYGHLPEVTVHADGTGSIRKHYCLGRISHELVQVMPDERTVLMGDDAT NGGLFMFIADRPRDLSSGKLYVAKWHQTSGTGPGAATLSWIALGSATSDEIRKLVDSGIKVSDIMDVKTKDPADSAYTRI LIDGKPNWVKVAPGQEKAAAFLETHRYAALVGGSLGFTKMEGTTVNIADRKAYSAISRIESSMLAGNAANAGDIRVEGPY SGAVYELNLRGAQTDHAGARIDSEWVPVDMAPVPALVSEDLGGGPRKAQDALGNFANPDKVATPDNLKFSESLRTLFVGE DSNTHVNNFLWAYNVDTKQLSRVLSCPAGAESTGLHAVDEINGWTYIMSNFQHPGDWEAGLHDKVKAQLDPLVRANYKDR FGAAVGYLTATPAAIRLGKRA
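The translated and mature lengths follow from the gene length. A minimal sketch, assuming the 1,989-base gene ends in a single stop codon (the sequence above ends in TGA) and that the mature form differs only by removal of the initiator methionine (the mature sequence above starts at SDQ rather than MSDQ). The function name is hypothetical.

```python
def protein_lengths(gene_len):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the coding sequence includes one terminal stop codon and that
    the mature protein only loses the initiator methionine.
    """
    if gene_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = gene_len // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumption)
    return translated, mature


if __name__ == "__main__":
    print(protein_lengths(1989))
```

For this gene the result matches the reported counts of 662 translated and 661 mature residues.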
Specific function: Unknown
COG id: COG3211
COG function: function code R; Predicted phosphatase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 70653; Mature: 70522
Theoretical pI: Translated: 6.68; Mature: 6.68
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
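The Cys/Met percentages can be recomputed from a protein sequence by simple counting. The sketch below is not the database's own method, and the function name is hypothetical. To stay short it uses a synthetic stand-in sequence rather than the real protein: the stand-in has the same length (662) and the same Cys and Met counts (4 and 18) as a manual count of the translated sequence above gives, and every other residue is filled with alanine.

```python
def residue_percent(protein, residues):
    """Percentage of positions in protein occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


# Synthetic stand-in, NOT the real protein: initiator Met, 4 Cys,
# 17 further Met, and alanine filler up to 662 residues.
standin_translated = "M" + "C" * 4 + "M" * 17 + "A" * 640
# Mature form assumed to lack only the initiator Met.
standin_mature = standin_translated[1:]

if __name__ == "__main__":
    for label, seq in (("Translated", standin_translated),
                       ("Mature", standin_mature)):
        print(label,
              residue_percent(seq, "C"),
              residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```

Replacing the stand-in with the translated and mature sequences listed under Sequences should reproduce the reported values, since only the counts and the lengths enter the calculation.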
Secondary structure:
>Translated Secondary Structure
MSDQPLVARRRALKILGGAPMLPIGAAAASLAGGMTAMEAAFAAAPRRKAGAAFTSATFT
CCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCCCCCEEEHHEEC
GMAAPSLDNPATMATTTVGSSIKVSYSDDSAQEYALAYQPFFVTGDMLPDGKGGKVLAGG
CCCCCCCCCCCEEEEEECCCEEEEEECCCCCHHEEEEECCEEEECCCCCCCCCCEEEECC
YYDIHNRAIMDTSVPGKTRQFFSDCPDGSSLLTVRGARVPGVKGNTVFAVVQFEYTTRDQ
EEECCCCEEEECCCCCHHHHHHHCCCCCCCEEEEECCCCCCCCCCEEEEEEEEEEECCCC
ANGDMYGMLPSPIAVITLDQDPKTGKLTPVRYHNVDTSSVNGLWITCGASLSPWNTHLSS
CCCCEEEECCCCEEEEEECCCCCCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCCCCC
EEYEPDAVKAATDKRLRGFSQNLYGDADRANPYHYGHLPEVTVHADGTGSIRKHYCLGRI
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCEEEEEECCCCCHHHHHHHHHH
SHELVQVMPDERTVLMGDDATNGGLFMFIADRPRDLSSGKLYVAKWHQTSGTGPGAATLS
HHHHHHHCCCCCEEEECCCCCCCCEEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCEEE
WIALGSATSDEIRKLVDSGIKVSDIMDVKTKDPADSAYTRILIDGKPNWVKVAPGQEKAA
EEEECCCCHHHHHHHHHCCCEEEEEEECCCCCCCCCCEEEEEECCCCCEEEECCCHHHHH
AFLETHRYAALVGGSLGFTKMEGTTVNIADRKAYSAISRIESSMLAGNAANAGDIRVEGP
HHHHHHHEEEEECCCCCCEEECCCEEEECHHHHHHHHHHHHHHHHCCCCCCCCCEEEECC
YSGAVYELNLRGAQTDHAGARIDSEWVPVDMAPVPALVSEDLGGGPRKAQDALGNFANPD
CCCEEEEEEECCCCCCCCCCEECCCCCEECCCCCCHHHHHCCCCCCCHHHHHHCCCCCCC
KVATPDNLKFSESLRTLFVGEDSNTHVNNFLWAYNVDTKQLSRVLSCPAGAESTGLHAVD
CCCCCCCCCHHCCCEEEEEECCCCCCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCEEHH
EINGWTYIMSNFQHPGDWEAGLHDKVKAQLDPLVRANYKDRFGAAVGYLTATPAAIRLGK
HCCCEEEEEECCCCCCCCCCCCHHHHHHHHCHHHHCCCCHHHHHHHHHHCCCHHHHHCCC
RA
CC
>Mature Secondary Structure
SDQPLVARRRALKILGGAPMLPIGAAAASLAGGMTAMEAAFAAAPRRKAGAAFTSATFT
CCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCCCCCEEEHHEEC
GMAAPSLDNPATMATTTVGSSIKVSYSDDSAQEYALAYQPFFVTGDMLPDGKGGKVLAGG
CCCCCCCCCCCEEEEEECCCEEEEEECCCCCHHEEEEECCEEEECCCCCCCCCCEEEECC
YYDIHNRAIMDTSVPGKTRQFFSDCPDGSSLLTVRGARVPGVKGNTVFAVVQFEYTTRDQ
EEECCCCEEEECCCCCHHHHHHHCCCCCCCEEEEECCCCCCCCCCEEEEEEEEEEECCCC
ANGDMYGMLPSPIAVITLDQDPKTGKLTPVRYHNVDTSSVNGLWITCGASLSPWNTHLSS
CCCCEEEECCCCEEEEEECCCCCCCCCCEEEEECCCCCCCCCEEEEECCCCCCCCCCCCC
EEYEPDAVKAATDKRLRGFSQNLYGDADRANPYHYGHLPEVTVHADGTGSIRKHYCLGRI
CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCEEEEEECCCCCHHHHHHHHHH
SHELVQVMPDERTVLMGDDATNGGLFMFIADRPRDLSSGKLYVAKWHQTSGTGPGAATLS
HHHHHHHCCCCCEEEECCCCCCCCEEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCEEE
WIALGSATSDEIRKLVDSGIKVSDIMDVKTKDPADSAYTRILIDGKPNWVKVAPGQEKAA
EEEECCCCHHHHHHHHHCCCEEEEEEECCCCCCCCCCEEEEEECCCCCEEEECCCHHHHH
AFLETHRYAALVGGSLGFTKMEGTTVNIADRKAYSAISRIESSMLAGNAANAGDIRVEGP
HHHHHHHEEEEECCCCCCEEECCCEEEECHHHHHHHHHHHHHHHHCCCCCCCCCEEEECC
YSGAVYELNLRGAQTDHAGARIDSEWVPVDMAPVPALVSEDLGGGPRKAQDALGNFANPD
CCCEEEEEEECCCCCCCCCCEECCCCCEECCCCCCHHHHHCCCCCCCHHHHHHCCCCCCC
KVATPDNLKFSESLRTLFVGEDSNTHVNNFLWAYNVDTKQLSRVLSCPAGAESTGLHAVD
CCCCCCCCCHHCCCEEEEEECCCCCCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCEEHH
EINGWTYIMSNFQHPGDWEAGLHDKVKAQLDPLVRANYKDRFGAAVGYLTATPAAIRLGK
HCCCEEEEEECCCCCCCCCCCCHHHHHHHHCHHHHCCCCHHHHHHHHHHCCCHHHHHCCC
RA
CC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA