Definition: Chlorobaculum parvum NCIB 8327 chromosome, complete genome.
Accession: NC_011027
Length: 2,289,249 bp

The map label for this gene is cphA [H]

Identifier: 193213072

GI number: 193213072

Start: 1558527

End: 1560263

Strand: Direct

Name: cphA [H]

Synonym: Cpar_1425

Alternate gene names: 193213072

Gene position: 1558527-1560263 (Clockwise)

Preceding gene: 193213071

Following gene: 193213073

Centisome position: 68.08
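
The positional fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length. That definition is an assumption that happens to reproduce the listed value, not something this record states. The function names are illustrative only.

```python
GENOME_LENGTH = 2_289_249           # "Length" field for NC_011027
START, END = 1_558_527, 1_560_263   # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)

length = gene_length(START, END)
codons = length // 3                # includes the terminal stop codon
print(length, codons, codons - 1, centisome(START, GENOME_LENGTH))
```

Under these assumptions the gene spans 1737 bases, that is 579 codons including the stop codon, which agrees with the 1737-base gene sequence and the 578-residue translated protein listed in this record.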

GC content: 58.03
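
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows that calculation. `gc_percent` is a hypothetical helper, and the record does not say how its own figure was computed (for example, whether ambiguous bases are handled differently).

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)

# First ten bases of the gene sequence in this record, for illustration.
example = "ATGACAAGAA"
print(gc_percent(example))
```

Passing the full 1737-base gene sequence instead of the ten-base excerpt gives a value that can be compared with the listed figure.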

Gene sequence:

>1737_bases
ATGACAAGAAAACACCGAAGTGAAGAGACTCTGGTACCGATGCAGTCCCCCAGCATGAAAAGCTGGGGGCAACCGGCAGA
GTTACCAAAGCAGAACGTGATCATCGAGTGCGGATGGGGGCGCGTCATTTTCGGCCACACCTTTCACGACAACACCCGGA
TCGCCGAAATACTGCGTCAGGAAAAAGAGGGCTTTCGTGACATCGCCCTCTATCTGCGCGATCCGCAGGTAGTGCTCTCG
TACGCGCCGCAGAACCTGTTCATCGACCCGTCGTATACCTTCCGCTTGTGGCTCGATGACTACACTCCGCTACAGAAGTC
GAGTGGCATGTTTTCGGTTCGTCAGCTCGATCCGGCAAAGGATATCGACCGAGTCAACTGGATCTACAACGTGCACAACA
TGGTACCTGCCGATCCGGAATTTTTGAGAGAAATCGCCGCAAAGCAGTGCATCGACTACTGGGTCGCCGTGGACGATGAT
ACGGATCAGGTCATCGCCGTGTGCATGACCATCGACCACAAGGCAGCATTCGACGATCCGGAAAACGGGTCGAGTCTCTG
GGCGCTGGCGGTCGATCCGCAAGCGAGGCATTCGGGACTCGGGGTTCAGGTGGTGCAGACGGTGGCGGAACATTACAAAG
CCAAAGGCCGGAGCTTCGTTGACCTTTCGGTGCTGCACAGCAACGGCGCGGCCATCGAGATGTACAAAAAGCTCGGCTTC
GTGCAGATTCCGGTCTTCACCGTCAAGAACAAAAACGCGATCAACGAGCGGCTCTTCTCCGGCCCGCAGCCTGAAGCGGA
GCTGAATCCTTACTCGACCATCATCATCAATGAGGCTCGCCGCCGTGGGATTCGCGTTGATGTGCTCGATCCGGTCGATA
ACTATTTCCGGCTTTCGTGGGGTGGAGCGAGCGTGGTGTGCCGCGAATCGCTGACCGAGTTGACCTCGGCCATCGCCATG
AGCCGCTGCGCCGACAAGCAGACCACGCACCGCATGCTCTCCGCCGCCGGGCTAAGGGTTCCGGATCAGCAGGTCGCCAC
CAGCCCGGAAGAAAACATCCGCTTCCTCGAAAAACACGGGCATCTGGTGGTCAAGCCAGCCGACAGCGAACAGGGCAAGG
GCATCACGGTTGGGATCACCACTACCGAAGAGCTTGAACAGGCGATCAAAACCGCCGGAGCGATCTGCTCCAAGGTGCTG
CTCGAAGAGATGGTCGAAGGCGCGGATCTGCGCATCATCGTGATCGACTACGAAGTGGTTGCCGCCGCGGTCAGACGACC
TCCGAAAATCACCGGTGACGGGCAGCACTCGATTCTCGACCTCGTCAAAAAGCAGAGCCGCCGCAGGGAAAAGGCATCGC
AGGGCGAAAGCCGGATTCCCATCGACGACGAGCTTCAGCGCACCATCGGCCTGAAAGGCTACAAACTCGACGACATCCTG
CCGAAAGGCGAGGAGCTCGAAGTGCGCAAAACGGCCAACCTGCACACCGGCGGTACGATTCACGACGTCACCGACCAGCT
TCATCCTGAGCTTGGCAAGGCCGCCTGTAAAGCCGCTGACATTCTCGGCATTCCGGTTACCGGACTCGATTTTCTTGTAT
CATCGCCCGAGAAGAGCGATTATGTCATTATCGAAGCCAACGAACGGCCGGGGCTGGCCAACCACGAGCCGCAACCGACC
GCCGAGCGCTTCATCGATTTTCTGTTTCCGCAAAGCATTGCAAGGGCGATTTCATGA

Upstream 100 bases:

>100_bases
ACATGCTGCTCGATGCGCCGAAGGAACACATCACACCGTTGCGCGGCTCGAAGCTCTGGCAGATCACGCTTCTCGAATAC
TGGTTACAGGAGCAGGGACT

Downstream 100 bases:

>100_bases
TCAACATCGATACCGATTACCTCCGCTCCATTCTGTTCAAGATGCTCCAGATTCCCAGCCCGACGGGCTACACCGACGAG
ATCGTCCACTTCGTCGGGCG

Product: GNAT-family acetyltransferase

Products: NA

Alternate protein names: Cyanophycin synthase [H]

Number of amino acids: Translated: 578; Mature: 577

Protein sequence:

>578_residues
MTRKHRSEETLVPMQSPSMKSWGQPAELPKQNVIIECGWGRVIFGHTFHDNTRIAEILRQEKEGFRDIALYLRDPQVVLS
YAPQNLFIDPSYTFRLWLDDYTPLQKSSGMFSVRQLDPAKDIDRVNWIYNVHNMVPADPEFLREIAAKQCIDYWVAVDDD
TDQVIAVCMTIDHKAAFDDPENGSSLWALAVDPQARHSGLGVQVVQTVAEHYKAKGRSFVDLSVLHSNGAAIEMYKKLGF
VQIPVFTVKNKNAINERLFSGPQPEAELNPYSTIIINEARRRGIRVDVLDPVDNYFRLSWGGASVVCRESLTELTSAIAM
SRCADKQTTHRMLSAAGLRVPDQQVATSPEENIRFLEKHGHLVVKPADSEQGKGITVGITTTEELEQAIKTAGAICSKVL
LEEMVEGADLRIIVIDYEVVAAAVRRPPKITGDGQHSILDLVKKQSRRREKASQGESRIPIDDELQRTIGLKGYKLDDIL
PKGEELEVRKTANLHTGGTIHDVTDQLHPELGKAACKAADILGIPVTGLDFLVSSPEKSDYVIIEANERPGLANHEPQPT
AERFIDFLFPQSIARAIS

Sequences:

>Translated_578_residues
MTRKHRSEETLVPMQSPSMKSWGQPAELPKQNVIIECGWGRVIFGHTFHDNTRIAEILRQEKEGFRDIALYLRDPQVVLS
YAPQNLFIDPSYTFRLWLDDYTPLQKSSGMFSVRQLDPAKDIDRVNWIYNVHNMVPADPEFLREIAAKQCIDYWVAVDDD
TDQVIAVCMTIDHKAAFDDPENGSSLWALAVDPQARHSGLGVQVVQTVAEHYKAKGRSFVDLSVLHSNGAAIEMYKKLGF
VQIPVFTVKNKNAINERLFSGPQPEAELNPYSTIIINEARRRGIRVDVLDPVDNYFRLSWGGASVVCRESLTELTSAIAM
SRCADKQTTHRMLSAAGLRVPDQQVATSPEENIRFLEKHGHLVVKPADSEQGKGITVGITTTEELEQAIKTAGAICSKVL
LEEMVEGADLRIIVIDYEVVAAAVRRPPKITGDGQHSILDLVKKQSRRREKASQGESRIPIDDELQRTIGLKGYKLDDIL
PKGEELEVRKTANLHTGGTIHDVTDQLHPELGKAACKAADILGIPVTGLDFLVSSPEKSDYVIIEANERPGLANHEPQPT
AERFIDFLFPQSIARAIS
>Mature_577_residues
TRKHRSEETLVPMQSPSMKSWGQPAELPKQNVIIECGWGRVIFGHTFHDNTRIAEILRQEKEGFRDIALYLRDPQVVLSY
APQNLFIDPSYTFRLWLDDYTPLQKSSGMFSVRQLDPAKDIDRVNWIYNVHNMVPADPEFLREIAAKQCIDYWVAVDDDT
DQVIAVCMTIDHKAAFDDPENGSSLWALAVDPQARHSGLGVQVVQTVAEHYKAKGRSFVDLSVLHSNGAAIEMYKKLGFV
QIPVFTVKNKNAINERLFSGPQPEAELNPYSTIIINEARRRGIRVDVLDPVDNYFRLSWGGASVVCRESLTELTSAIAMS
RCADKQTTHRMLSAAGLRVPDQQVATSPEENIRFLEKHGHLVVKPADSEQGKGITVGITTTEELEQAIKTAGAICSKVLL
EEMVEGADLRIIVIDYEVVAAAVRRPPKITGDGQHSILDLVKKQSRRREKASQGESRIPIDDELQRTIGLKGYKLDDILP
KGEELEVRKTANLHTGGTIHDVTDQLHPELGKAACKAADILGIPVTGLDFLVSSPEKSDYVIIEANERPGLANHEPQPTA
ERFIDFLFPQSIARAIS

Specific function: Catalyzes the ATP-dependent polymerization of arginine and aspartate to multi-L-arginyl-poly-L-aspartic acid (cyanophycin; a water-insoluble reserve polymer) [H]

COG id: COG1181

COG function: function code M; D-alanine-D-alanine ligase and related ATP-grasp enzymes

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 ATP-grasp domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011761
- InterPro:   IPR013651
- InterPro:   IPR013815
- InterPro:   IPR013816
- InterPro:   IPR011810
- InterPro:   IPR004101
- InterPro:   IPR013221 [H]

Pfam domain/function: PF02875 Mur_ligase_C; PF08245 Mur_ligase_M; PF08443 RimK [H]

EC number: 6.3.2.29; 6.3.2.30 [H]

Molecular weight: Translated: 64469; Mature: 64338

Theoretical pI: Translated: 5.88; Mature: 5.88

Prosite motif: PS50975 ATP_GRASP

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
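
The percentages above follow from residue counts divided by sequence length. The sketch below shows that calculation. `residue_percent` is a hypothetical helper, and rounding to one decimal place is an assumption chosen to match the precision shown here.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any letter in `residues`."""
    seq = "".join(sequence.split()).upper()
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)

# Toy sequence for illustration only (not taken from the record).
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

A manual count of the translated sequence in this record gives 7 Cys and 10 Met in 578 residues, that is 7/578 ≈ 1.2 % and 10/578 ≈ 1.7 %. Dropping the initiator Met leaves 9 Met in 577 residues, about 1.6 %, consistent with the mature-protein figures.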

Secondary structure:

>Translated Secondary Structure
MTRKHRSEETLVPMQSPSMKSWGQPAELPKQNVIIECGWGRVIFGHTFHDNTRIAEILRQ
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCEEEEEEEECCCHHHHHHHHH
EKEGFRDIALYLRDPQVVLSYAPQNLFIDPSYTFRLWLDDYTPLQKSSGMFSVRQLDPAK
HHHCHHEEEEEECCCEEEEEECCCCEEECCCCEEEEEECCCCCCCCCCCCEEEECCCCCH
DIDRVNWIYNVHNMVPADPEFLREIAAKQCIDYWVAVDDDTDQVIAVCMTIDHKAAFDDP
HHHHHHHEEECCCCCCCCHHHHHHHHHHHHHHEEEEECCCCHHEEEEEEEECCCCCCCCC
ENGSSLWALAVDPQARHSGLGVQVVQTVAEHYKAKGRSFVDLSVLHSNGAAIEMYKKLGF
CCCCEEEEEEECCCHHHCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEHHHHHCCE
VQIPVFTVKNKNAINERLFSGPQPEAELNPYSTIIINEARRRGIRVDVLDPVDNYFRLSW
EEEEEEEECCCCHHHHHHHCCCCCCCCCCCCEEEEEEHHHHCCCEEEEECCCCCEEEEEC
GGASVVCRESLTELTSAIAMSRCADKQTTHRMLSAAGLRVPDQQVATSPEENIRFLEKHG
CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHCC
HLVVKPADSEQGKGITVGITTTEELEQAIKTAGAICSKVLLEEMVEGADLRIIVIDYEVV
CEEEECCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHH
AAAVRRPPKITGDGQHSILDLVKKQSRRREKASQGESRIPIDDELQRTIGLKGYKLDDIL
HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCHHHCC
PKGEELEVRKTANLHTGGTIHDVTDQLHPELGKAACKAADILGIPVTGLDFLVSSPEKSD
CCCCCEEEEEECCCCCCCCHHHHHHHHCHHHHHHHHHHHHEECCCCCCHHHHCCCCCCCC
YVIIEANERPGLANHEPQPTAERFIDFLFPQSIARAIS
EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TRKHRSEETLVPMQSPSMKSWGQPAELPKQNVIIECGWGRVIFGHTFHDNTRIAEILRQ
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCEEEEEEEECCCHHHHHHHHH
EKEGFRDIALYLRDPQVVLSYAPQNLFIDPSYTFRLWLDDYTPLQKSSGMFSVRQLDPAK
HHHCHHEEEEEECCCEEEEEECCCCEEECCCCEEEEEECCCCCCCCCCCCEEEECCCCCH
DIDRVNWIYNVHNMVPADPEFLREIAAKQCIDYWVAVDDDTDQVIAVCMTIDHKAAFDDP
HHHHHHHEEECCCCCCCCHHHHHHHHHHHHHHEEEEECCCCHHEEEEEEEECCCCCCCCC
ENGSSLWALAVDPQARHSGLGVQVVQTVAEHYKAKGRSFVDLSVLHSNGAAIEMYKKLGF
CCCCEEEEEEECCCHHHCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEHHHHHCCE
VQIPVFTVKNKNAINERLFSGPQPEAELNPYSTIIINEARRRGIRVDVLDPVDNYFRLSW
EEEEEEEECCCCHHHHHHHCCCCCCCCCCCCEEEEEEHHHHCCCEEEEECCCCCEEEEEC
GGASVVCRESLTELTSAIAMSRCADKQTTHRMLSAAGLRVPDQQVATSPEENIRFLEKHG
CCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHCC
HLVVKPADSEQGKGITVGITTTEELEQAIKTAGAICSKVLLEEMVEGADLRIIVIDYEVV
CEEEECCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHH
AAAVRRPPKITGDGQHSILDLVKKQSRRREKASQGESRIPIDDELQRTIGLKGYKLDDIL
HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCHHHCC
PKGEELEVRKTANLHTGGTIHDVTDQLHPELGKAACKAADILGIPVTGLDFLVSSPEKSD
CCCCCEEEEEECCCCCCCCHHHHHHHHCHHHHHHHHHHHHEECCCCCCHHHHCCCCCCCC
YVIIEANERPGLANHEPQPTAERFIDFLFPQSIARAIS
EEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA