| Definition | Anaeromyxobacter dehalogenans 2CP-1 chromosome, complete genome. |
|---|---|
| Accession | NC_011891 |
| Length | 5,029,329 |
The map label for this gene is 220917043
Identifier: 220917043
GI number: 220917043
Start: 2191241
End: 2193085
Strand: Direct
Name: 220917043
Synonym: A2cp1_1939
Alternate gene names: NA
Gene position: 2191241-2193085 (Clockwise)
Preceding gene: 220917042
Following gene: 220917051
Centisome position: 43.57
GC content: 70.35
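The centisome position and GC content above are simple ratios. The sketch below shows one way they could be reproduced. It assumes the centisome value is the gene start coordinate expressed as a percentage of the chromosome length, which matches the figures in this record but is not stated by the source. The function names `centisome` and `gc_percent` are hypothetical.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * start / genome_length, 2)


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence; whitespace is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Start coordinate and chromosome length taken from this record.
print(centisome(2191241, 5029329))  # 43.57
```

Passing the gene sequence listed below to `gc_percent` would be the natural check against the reported GC content.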
Gene sequence:
>1845_bases ATGACGACACTCCCGGTCTTGCAGCTCGCAGCCGTCCTCCTCGCCGCGACGCCCTCCGCGCCGGCGGCGCCCCCGCAGCC CGCAGCCGCCGCGAAGCCCACCGCCGCCGACGCGAAGGCGTTCGTGGGCGAGGCGAACGCCGAGCTGAAGAAGCTCTGGA TCCGCTGGTCCACCGCGGAGTGGATCAAGTCCACCTACATCACCGACGACACCGAGCGGAACGCCGCGGCCCTCAACGAG GACCAGATGGCCTACCTGACCGACGCGATCCGCGGCGCGGTCCGGTTCCAGGGCGTCGGCGCCGACCCCGACACCGAGCG CATGCTGATGCTGCTGCGCATCGCCTCGCCGCTCCCGGCGCCGAGCGACGCCGCGCAGCGCGCCGAGCTGGCGCAGCTCG CCGCGAAGCTCGAGGGCATGTACGGCAAGGGCAAGTGGTGCGGCGCGCCCGGCAGCGCGGCGCGCGGCGCCGAGAAGTGC CGCGACCTGCAGGATCTCGAGCAGGTGATGGCCAAGAGCCGCGACTACGACGCGCTGCTCGACGCCTGGACCGGCTGGCA CACCATCTCGCGCGACATGCGCCCGCTCTACGAGCGGATGGTGACCATCTCCAACGCGGGCGCGCGCGAGATCGGCTTCC GCGACCTCGGCGACCTGTGGCGCAGCGGCTACGACATGCCGCCCGACGCGTTCGAGGCCGACACCGACCGGCTCTGGGCG CAGGTGAAGCCGTTCTACGAGGAGCTGCACTGCTACGTCCGCAGCAAGCTGCAGAAGGCCTACGGCAAGGCGCGGGTGCC GGACGGCAAGCCCATCCCGGCGCACCTGCTCGGCAACATGTGGGCGCAGGACTGGGCCAACCTCTACCCGCTGGTCGAGC CGTACCACGGGCAGCCGTCGCTCGACGTGGACGCGGCGCTGCGCCGGCAGAAGACCGATCCCATCAAGATGGTGAAGCTG GGCGAGGCGTTCTTCACCTCGCTGGGCTTCGAGCCGCTCCCGAAGACGTTCTGGGAGCGCTCGCAGTTCACGAAGCCGCG CGACCGCGACGTGGTGTGCCACGCCAGCGCCTGGGACGTGACCTACTCCGCCGACCTGCGCATCAAGATGTGCATCCGGC CGACCGAGGAGGACCTCGTCACCATCCACCACGAGCTGGGCCACAACTTCTACCAGCGCGCCTACGTGCACCTGCCGGTG CTGTTCCAGAACGGCGCGAACGACGGCTTCCACGAGGCCATCGGCGACGCGATCGCGCTGTCCATCACCCCCGGGTACCT GAAGCAGGTCGGCCTGATCCAGACCGTGCCGAAGGACGAGAAGGGCGTGGTGAACGTGCAGATGAAGCGCGCGCTCGAGA AGGTCGCGTTCCTGCCGTTCGGCAAGCTCATCGACCAGTGGCGCTGGGACGTGTTCTCCGGCAAGACGCCGGCGTCGCGC TACAACGCGGCCTGGTGGGAGCTGCGCCGCAAGTACCAGGGCGTGGACGCGCCGGTCTCGCGCAGCGAGGCGGACTTCGA TCCGGGCGCGAAGTACCACGTGCCCGCGAACGTGCCCTACACGCGCTACTTCCTCGCGCACGTGTACCAGTTCCAGTTCC ACAAGGCGCTGTGCGAGGCCGCCGGGTGGAAGGGGCCGCTGCACGAGTGCTCGATCTACGGCTCGAAGGCCGCCGGCAAG AAGCTCGACGCCATGCTGGCCATGGGCGCGTCGAGGCCGTGGCCCGAGGCCTACGCCGCGCTGACCGGCTCGCGCCAGGC GGACGCGTCCGCCATGCTCGAGTACTTCGCGCCGCTGCGCGCCTGGCTCCGGAAGCAGATCGCCGGCCAGTCCTGCGGCT GGTAG
Upstream 100 bases:
>100_bases GCGCGTCCCGCGCGGTCACGTCCGGCGCGCTCGCGTCTCAGGAACGGTCCGGTCCCGCCATGCTGGCGGGCCCGTCGACC CCGTCCTAGGATCCGCGCGC
Downstream 100 bases:
>100_bases GCGGCTGGTCGGCGGGGCCGCGGCCGCGGTGCGGCCCGACGGCCCGTGGACGAGCGCACGCCCCGGCGACCGAGCGGTCG CCGGGGCGCGTCGTCATCGA
Product: peptidyl-dipeptidase A
Products: NA
Alternate protein names: Zinc-Dependent Metallopeptidase; Angiotensin-Converting; Dipeptidyl Carboxypeptidase; Angiotensin-Converting Family Protein; Dipeptidyl Carboxydipeptidase Family; Zinc Metallopeptidase Family Protein; Angiotensin-Converting Peptidyl Dipeptidase Protein; Peptidyl-Dipeptidase Dcp; Dipeptidyl Carboxydipeptidase Family Protein
Number of amino acids: Translated: 614; Mature: 613
Protein sequence:
>614_residues MTTLPVLQLAAVLLAATPSAPAAPPQPAAAAKPTAADAKAFVGEANAELKKLWIRWSTAEWIKSTYITDDTERNAAALNE DQMAYLTDAIRGAVRFQGVGADPDTERMLMLLRIASPLPAPSDAAQRAELAQLAAKLEGMYGKGKWCGAPGSAARGAEKC RDLQDLEQVMAKSRDYDALLDAWTGWHTISRDMRPLYERMVTISNAGAREIGFRDLGDLWRSGYDMPPDAFEADTDRLWA QVKPFYEELHCYVRSKLQKAYGKARVPDGKPIPAHLLGNMWAQDWANLYPLVEPYHGQPSLDVDAALRRQKTDPIKMVKL GEAFFTSLGFEPLPKTFWERSQFTKPRDRDVVCHASAWDVTYSADLRIKMCIRPTEEDLVTIHHELGHNFYQRAYVHLPV LFQNGANDGFHEAIGDAIALSITPGYLKQVGLIQTVPKDEKGVVNVQMKRALEKVAFLPFGKLIDQWRWDVFSGKTPASR YNAAWWELRRKYQGVDAPVSRSEADFDPGAKYHVPANVPYTRYFLAHVYQFQFHKALCEAAGWKGPLHECSIYGSKAAGK KLDAMLAMGASRPWPEAYAALTGSRQADASAMLEYFAPLRAWLRKQIAGQSCGW
Sequences:
>Translated_614_residues MTTLPVLQLAAVLLAATPSAPAAPPQPAAAAKPTAADAKAFVGEANAELKKLWIRWSTAEWIKSTYITDDTERNAAALNE DQMAYLTDAIRGAVRFQGVGADPDTERMLMLLRIASPLPAPSDAAQRAELAQLAAKLEGMYGKGKWCGAPGSAARGAEKC RDLQDLEQVMAKSRDYDALLDAWTGWHTISRDMRPLYERMVTISNAGAREIGFRDLGDLWRSGYDMPPDAFEADTDRLWA QVKPFYEELHCYVRSKLQKAYGKARVPDGKPIPAHLLGNMWAQDWANLYPLVEPYHGQPSLDVDAALRRQKTDPIKMVKL GEAFFTSLGFEPLPKTFWERSQFTKPRDRDVVCHASAWDVTYSADLRIKMCIRPTEEDLVTIHHELGHNFYQRAYVHLPV LFQNGANDGFHEAIGDAIALSITPGYLKQVGLIQTVPKDEKGVVNVQMKRALEKVAFLPFGKLIDQWRWDVFSGKTPASR YNAAWWELRRKYQGVDAPVSRSEADFDPGAKYHVPANVPYTRYFLAHVYQFQFHKALCEAAGWKGPLHECSIYGSKAAGK KLDAMLAMGASRPWPEAYAALTGSRQADASAMLEYFAPLRAWLRKQIAGQSCGW
>Mature_613_residues TTLPVLQLAAVLLAATPSAPAAPPQPAAAAKPTAADAKAFVGEANAELKKLWIRWSTAEWIKSTYITDDTERNAAALNED QMAYLTDAIRGAVRFQGVGADPDTERMLMLLRIASPLPAPSDAAQRAELAQLAAKLEGMYGKGKWCGAPGSAARGAEKCR DLQDLEQVMAKSRDYDALLDAWTGWHTISRDMRPLYERMVTISNAGAREIGFRDLGDLWRSGYDMPPDAFEADTDRLWAQ VKPFYEELHCYVRSKLQKAYGKARVPDGKPIPAHLLGNMWAQDWANLYPLVEPYHGQPSLDVDAALRRQKTDPIKMVKLG EAFFTSLGFEPLPKTFWERSQFTKPRDRDVVCHASAWDVTYSADLRIKMCIRPTEEDLVTIHHELGHNFYQRAYVHLPVL FQNGANDGFHEAIGDAIALSITPGYLKQVGLIQTVPKDEKGVVNVQMKRALEKVAFLPFGKLIDQWRWDVFSGKTPASRY NAAWWELRRKYQGVDAPVSRSEADFDPGAKYHVPANVPYTRYFLAHVYQFQFHKALCEAAGWKGPLHECSIYGSKAAGKK LDAMLAMGASRPWPEAYAALTGSRQADASAMLEYFAPLRAWLRKQIAGQSCGW
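The residue counts above follow from the gene coordinates. A minimal sketch of that arithmetic, assuming the coordinates are inclusive, that the open reading frame ends in a single stop codon (the gene sequence above ends in TAG), and that the mature form differs only by removal of the initiator methionine (the mature sequence above starts at the second residue). The function name `protein_lengths` is hypothetical.

```python
def protein_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (gene length in bases, translated residues, mature residues).

    Assumes inclusive coordinates, one terminal stop codon, and a mature
    protein that lacks only the initiator methionine.
    """
    gene_length = end - start + 1
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    codons = gene_length // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return gene_length, translated, mature


# Start and End coordinates taken from this record.
print(protein_lengths(2191241, 2193085))  # (1845, 614, 613)
```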
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
| Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
|---|---|---|---|---|---|
| Homo sapiens | GI23238214 | 585 | 43.5897435897436 | 479 | 1e-135 |
| Homo sapiens | GI4503273 | 585 | 43.5897435897436 | 477 | 1e-134 |
| Homo sapiens | GI295844837 | 591 | 39.424703891709 | 401 | 1e-112 |
| Homo sapiens | GI11225609 | 601 | 36.1064891846922 | 390 | 1e-108 |
| Caenorhabditis elegans | GI71985287 | 467 | 31.6916488222698 | 241 | 7e-64 |
| Drosophila melanogaster | GI24584232 | 592 | 38.0067567567568 | 425 | 1e-119 |
| Drosophila melanogaster | GI17137008 | 592 | 38.0067567567568 | 425 | 1e-119 |
| Drosophila melanogaster | GI85724942 | 570 | 37.3684210526316 | 392 | 1e-109 |
| Drosophila melanogaster | GI17137262 | 459 | 42.7015250544662 | 380 | 1e-105 |
| Drosophila melanogaster | GI24762773 | 483 | 27.536231884058 | 183 | 3e-46 |
| Drosophila melanogaster | GI22026846 | 460 | 27.3913043478261 | 178 | 8e-45 |
| Drosophila melanogaster | GI28574153 | 490 | 25.3061224489796 | 149 | 6e-36 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 68525; Mature: 68394
Theoretical pI: Translated: 8.19; Mature: 8.19
Prosite motif: PS00142 ZINC_PROTEASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
3.8 %Cys+Met (Mature Protein)
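The Cys/Met percentages are residue counts divided by protein length. A short sketch of that calculation, assuming one-decimal rounding as displayed in this record. The function names `percent` and `cys_met_content` are hypothetical, and the counts used in the example (8 Cys and 16 Met in the translated protein, 15 Met in the mature protein) are a manual tally of the sequences listed earlier rather than values stated by the source.

```python
def percent(count: int, length: int) -> float:
    """Share of `count` residues in a protein of `length` residues, in percent."""
    if length <= 0:
        raise ValueError("length must be positive")
    return round(100.0 * count / length, 1)


def cys_met_content(protein: str) -> dict[str, float]:
    """Cys, Met and combined Cys+Met percentages for a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()  # drop the spaces between sequence chunks
    cys = seq.count("C")
    met = seq.count("M")
    return {
        "Cys": percent(cys, len(seq)),
        "Met": percent(met, len(seq)),
        "Cys+Met": percent(cys + met, len(seq)),
    }


# Translated protein: 614 residues, tallied as 8 Cys and 16 Met.
print(percent(8, 614), percent(16, 614), percent(24, 614))  # 1.3 2.6 3.9
# Mature protein: 613 residues, one Met fewer.
print(percent(8, 613), percent(15, 613), percent(23, 613))  # 1.3 2.4 3.8
```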
Secondary structure:
>Translated Secondary Structure
MTTLPVLQLAAVLLAATPSAPAAPPQPAAAAKPTAADAKAFVGEANAELKKLWIRWSTAE
CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHCCCHHH
WIKSTYITDDTERNAAALNEDQMAYLTDAIRGAVRFQGVGADPDTERMLMLLRIASPLPA
HHHHHCCCCCCCCCHHHCCCHHHHHHHHHHHHHHEEECCCCCCCHHHHHHHHHHHCCCCC
PSDAAQRAELAQLAAKLEGMYGKGKWCGAPGSAARGAEKCRDLQDLEQVMAKSRDYDALL
CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHH
DAWTGWHTISRDMRPLYERMVTISNAGAREIGFRDLGDLWRSGYDMPPDAFEADTDRLWA
HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCHHHHHHHHHCCCCCCCCCCCCCHHHHHH
QVKPFYEELHCYVRSKLQKAYGKARVPDGKPIPAHLLGNMWAQDWANLYPLVEPYHGQPS
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC
LDVDAALRRQKTDPIKMVKLGEAFFTSLGFEPLPKTFWERSQFTKPRDRDVVCHASAWDV
CCHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCCE
TYSADLRIKMCIRPTEEDLVTIHHELGHNFYQRAYVHLPVLFQNGANDGFHEAIGDAIAL
EEECCCEEEEEECCCHHHHHHHHHHHCHHHHHHHHHEEEEEEECCCCCCHHHHHCCEEEE
SITPGYLKQVGLIQTVPKDEKGVVNVQMKRALEKVAFLPFGKLIDQWRWDVFSGKTPASR
EECHHHHHHCCHHHCCCCCCCCEEHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCHH
YNAAWWELRRKYQGVDAPVSRSEADFDPGAKYHVPANVPYTRYFLAHVYQFQFHKALCEA
HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHH
AGWKGPLHECSIYGSKAAGKKLDAMLAMGASRPWPEAYAALTGSRQADASAMLEYFAPLR
CCCCCCCHHHHCCCCHHCCHHHHHHHHHCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHH
AWLRKQIAGQSCGW
HHHHHHHCCCCCCC
>Mature Secondary Structure
TTLPVLQLAAVLLAATPSAPAAPPQPAAAAKPTAADAKAFVGEANAELKKLWIRWSTAE
CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHCCCHHH
WIKSTYITDDTERNAAALNEDQMAYLTDAIRGAVRFQGVGADPDTERMLMLLRIASPLPA
HHHHHCCCCCCCCCHHHCCCHHHHHHHHHHHHHHEEECCCCCCCHHHHHHHHHHHCCCCC
PSDAAQRAELAQLAAKLEGMYGKGKWCGAPGSAARGAEKCRDLQDLEQVMAKSRDYDALL
CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHH
DAWTGWHTISRDMRPLYERMVTISNAGAREIGFRDLGDLWRSGYDMPPDAFEADTDRLWA
HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCHHHHHHHHHCCCCCCCCCCCCCHHHHHH
QVKPFYEELHCYVRSKLQKAYGKARVPDGKPIPAHLLGNMWAQDWANLYPLVEPYHGQPS
HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC
LDVDAALRRQKTDPIKMVKLGEAFFTSLGFEPLPKTFWERSQFTKPRDRDVVCHASAWDV
CCHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCCE
TYSADLRIKMCIRPTEEDLVTIHHELGHNFYQRAYVHLPVLFQNGANDGFHEAIGDAIAL
EEECCCEEEEEECCCHHHHHHHHHHHCHHHHHHHHHEEEEEEECCCCCCHHHHHCCEEEE
SITPGYLKQVGLIQTVPKDEKGVVNVQMKRALEKVAFLPFGKLIDQWRWDVFSGKTPASR
EECHHHHHHCCHHHCCCCCCCCEEHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCHH
YNAAWWELRRKYQGVDAPVSRSEADFDPGAKYHVPANVPYTRYFLAHVYQFQFHKALCEA
HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHH
AGWKGPLHECSIYGSKAAGKKLDAMLAMGASRPWPEAYAALTGSRQADASAMLEYFAPLR
CCCCCCCHHHHCCCCHHCCHHHHHHHHHCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHH
AWLRKQIAGQSCGW
HHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA