Definition | Burkholderia mallei NCTC 10247 chromosome II, complete genome. |
---|---|
Accession | NC_009079 |
Length | 2,352,693 |
Click here to switch to the map view.
The map label for this gene is ddpA [H]
Identifier: 126445727
GI number: 126445727
Start: 1004670
End: 1006262
Strand: Reverse
Name: ddpA [H]
Synonym: BMA10247_A1065
Alternate gene names: 126445727
Gene position: 1006262-1004670 (Counterclockwise)
Preceding gene: 126446538
Following gene: 126446324
Centisome position: 42.77
GC content: 65.85
Gene sequence:
>1593_bases ATGAAGCACATGCTGTCCAAGCTCGCGGCGAGCGCCGCACTCGCCGCGCTGGCCCCGGTGCTGGCCCCCGCGCACGCGGC CACGCCGCCCGGCATCTTCGTGATCGCGACGCAGCTCGGCGAATTCACGACGCTCGACCCGAGCGAAATCTACGAGCTCG TGCCGTCCGAATACGTCGCGAACACGTACGAGCGGCTCGTGCGCGTCGACCTGCGCGAACCGTCGAAATTCGAAGGCCGG ATCGCGCAATCGTGGAGCGTCGGCGCGGACGGCCTCACCTACACGTTCAAGCTGCGCACCGGCCTGAAGTTCCACTCGGG CAATCCGGTGACGGCCGACGACGTGGCGTGGTCGCTGCAGCGCACGGTGCTGCTCGACAAAGGGCCGGCCGGCGTGCTCG CGGACCTCGGCCTGACCAAGGACAACGTCGCGCGGAAGGTACGCAAGCTCGACGACACGACCGTGTCGATCGAGACCGAC CGCCGGTACGCGCCGAGCTTCGTGCTGAACGTGCTGAGCGCGGACCCGGCATCGATCGTCGACAAGCAGTTGCTGCTCTC GCACGAGAAGAACGGCGACTTCGGCAATGCATGGCTGAAGAACGCGGATGCCGGCTCGGGCCCGTACCGGCTCGTCAAGT GGACGCCGAACGAAAGCCTCGTGCTGCAACGCTTCGACGGCTACCGCGCGCCGTATCCGATGAAGCGCATCGTGTTGCGG CACGTGCCGGAAGCGTCCGCGCAGCGCCTGCTGCTCGAGAACGGCGACGTCGACGCCGCGCGCAACCTGAGCCCCGACAG CCTTGCTGCGCTGTCGAAGGCGGGCAAGATCCACGTCGCGTCATGGCCCGTGTCCGCGCTGCTGTACCTGAGCCTGAACA CGAGGAATCCGAATCTCGCGAAGCCCGAGGTGCAGGAAGCGATGAAGTGGCTCGTCGATTACGACGGCATCCAGCGCAAC ATCGTCAGGACGACGTACAAGGTGCATCAGACCTTCCTGCCGGACGGCTTCCTCGGCGCGCTGGACGCGAATCCGTACCG GCAGAACGTCGCGAAGGCGAAGGCGCTGCTCGCGAAGGCCGGCCTGCCGAACGGCTTCGCGGTAACGATGGACATGCCGA ACGATTACCCGTACGTCGAGATCGCGCAGGCGTTGCAGGCGAACTTCGCGCAGGGCGGCATCCAGGTGAAGCTGATTCCG GGCGACGCGAAACAGGCGATCGGCAAGTACCGTGCGCGCCAGCACGACATCTTCATCGGCGAATGGTCGCCGGACTACAT GGACCCGAACAGCAACGCGCGCGGTTTCGCGTGGAATCCCGACAATTCGGACAACGCCAAGCACAAGCTGCTCGCGTGGC GCAACGGCTGGGATGTGCCGCAACTGACCGCGAAGACCGATGCGGCGCTCGCCGAGCCGTCGGCCGCGAAGCGCGCGCAG GACTATCAGGCGCTGCAAAAGGCGGTGCTCGCGAATTCGCCGTTCGTGATCCTGTTCGAGAAGGTCGTGCAGGTTGCGAC GCGGCCGGGTGTCACGGGCCCGGAAATCGGGCCGATCAACGATCTCGTGTCGTATCGGACCTTGAAGAAGTAA
Upstream 100 bases:
>100_bases TGACGCGGCGCAGCGGGGTCTCGCGCCGCGACGGCCGAGGTTTCTGGAACGCGCCCGGCAATGAGAGAATCGAGCTCGAT ACCTTTTCGACGGAGTCTCG
Downstream 100 bases:
>100_bases CCGCGTGGGCGCGGGCGCGCGCCGTCGCGAATCCGGCGGCGCGCTCGGCGGCGGGCGATCTTCGCGGCGATCCGGCCGCC GCGCATCGCTCGCCGCGCGG
Product: solute-binding family 5 protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 530; Mature: 530
Protein sequence:
>530_residues MKHMLSKLAASAALAALAPVLAPAHAATPPGIFVIATQLGEFTTLDPSEIYELVPSEYVANTYERLVRVDLREPSKFEGR IAQSWSVGADGLTYTFKLRTGLKFHSGNPVTADDVAWSLQRTVLLDKGPAGVLADLGLTKDNVARKVRKLDDTTVSIETD RRYAPSFVLNVLSADPASIVDKQLLLSHEKNGDFGNAWLKNADAGSGPYRLVKWTPNESLVLQRFDGYRAPYPMKRIVLR HVPEASAQRLLLENGDVDAARNLSPDSLAALSKAGKIHVASWPVSALLYLSLNTRNPNLAKPEVQEAMKWLVDYDGIQRN IVRTTYKVHQTFLPDGFLGALDANPYRQNVAKAKALLAKAGLPNGFAVTMDMPNDYPYVEIAQALQANFAQGGIQVKLIP GDAKQAIGKYRARQHDIFIGEWSPDYMDPNSNARGFAWNPDNSDNAKHKLLAWRNGWDVPQLTAKTDAALAEPSAAKRAQ DYQALQKAVLANSPFVILFEKVVQVATRPGVTGPEIGPINDLVSYRTLKK
Sequences:
>Translated_530_residues MKHMLSKLAASAALAALAPVLAPAHAATPPGIFVIATQLGEFTTLDPSEIYELVPSEYVANTYERLVRVDLREPSKFEGR IAQSWSVGADGLTYTFKLRTGLKFHSGNPVTADDVAWSLQRTVLLDKGPAGVLADLGLTKDNVARKVRKLDDTTVSIETD RRYAPSFVLNVLSADPASIVDKQLLLSHEKNGDFGNAWLKNADAGSGPYRLVKWTPNESLVLQRFDGYRAPYPMKRIVLR HVPEASAQRLLLENGDVDAARNLSPDSLAALSKAGKIHVASWPVSALLYLSLNTRNPNLAKPEVQEAMKWLVDYDGIQRN IVRTTYKVHQTFLPDGFLGALDANPYRQNVAKAKALLAKAGLPNGFAVTMDMPNDYPYVEIAQALQANFAQGGIQVKLIP GDAKQAIGKYRARQHDIFIGEWSPDYMDPNSNARGFAWNPDNSDNAKHKLLAWRNGWDVPQLTAKTDAALAEPSAAKRAQ DYQALQKAVLANSPFVILFEKVVQVATRPGVTGPEIGPINDLVSYRTLKK >Mature_530_residues MKHMLSKLAASAALAALAPVLAPAHAATPPGIFVIATQLGEFTTLDPSEIYELVPSEYVANTYERLVRVDLREPSKFEGR IAQSWSVGADGLTYTFKLRTGLKFHSGNPVTADDVAWSLQRTVLLDKGPAGVLADLGLTKDNVARKVRKLDDTTVSIETD RRYAPSFVLNVLSADPASIVDKQLLLSHEKNGDFGNAWLKNADAGSGPYRLVKWTPNESLVLQRFDGYRAPYPMKRIVLR HVPEASAQRLLLENGDVDAARNLSPDSLAALSKAGKIHVASWPVSALLYLSLNTRNPNLAKPEVQEAMKWLVDYDGIQRN IVRTTYKVHQTFLPDGFLGALDANPYRQNVAKAKALLAKAGLPNGFAVTMDMPNDYPYVEIAQALQANFAQGGIQVKLIP GDAKQAIGKYRARQHDIFIGEWSPDYMDPNSNARGFAWNPDNSDNAKHKLLAWRNGWDVPQLTAKTDAALAEPSAAKRAQ DYQALQKAVLANSPFVILFEKVVQVATRPGVTGPEIGPINDLVSYRTLKK
Specific function: Part of the ABC transporter complex ddpABCDF, which is probably involved in D,D-dipeptide transport [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm (Potential) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
Organism=Escherichia coli, GI1787762, Length=538, Percent_Identity=31.2267657992565, Blast_Score=182, Evalue=4e-47, Organism=Escherichia coli, GI1789966, Length=544, Percent_Identity=25.5514705882353, Blast_Score=129, Evalue=6e-31, Organism=Escherichia coli, GI1787052, Length=371, Percent_Identity=26.9541778975741, Blast_Score=116, Evalue=4e-27, Organism=Escherichia coli, GI1787551, Length=547, Percent_Identity=25.7769652650823, Blast_Score=98, Evalue=2e-21, Organism=Escherichia coli, GI1789887, Length=564, Percent_Identity=26.063829787234, Blast_Score=89, Evalue=5e-19, Organism=Escherichia coli, GI1789397, Length=554, Percent_Identity=24.5487364620939, Blast_Score=69, Evalue=1e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 58084; Mature: 58084
Theoretical pI: Translated: 9.58; Mature: 9.58
Prosite motif: PS00284 SERPIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 1.3 %Met (Translated Protein) 1.3 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 1.3 %Met (Mature Protein) 1.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKHMLSKLAASAALAALAPVLAPAHAATPPGIFVIATQLGEFTTLDPSEIYELVPSEYVA CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHCCHHHHH NTYERLVRVDLREPSKFEGRIAQSWSVGADGLTYTFKLRTGLKFHSGNPVTADDVAWSLQ HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCEEECCCCCCHHHHHHHHH RTVLLDKGPAGVLADLGLTKDNVARKVRKLDDTTVSIETDRRYAPSFVLNVLSADPASIV HEEEEECCCCCCEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCHHHHH DKQLLLSHEKNGDFGNAWLKNADAGSGPYRLVKWTPNESLVLQRFDGYRAPYPMKRIVLR HHHHHHHCCCCCCCCHHHHCCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCHHHHHHHH HVPEASAQRLLLENGDVDAARNLSPDSLAALSKAGKIHVASWPVSALLYLSLNTRNPNLA HCCCHHHHHHHHCCCCCCHHCCCCHHHHHHHHHCCEEEEECCCCEEEEEEEEECCCCCCC KPEVQEAMKWLVDYDGIQRNIVRTTYKVHQTFLPDGFLGALDANPYRQNVAKAKALLAKA CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECCCCCHHHHHHHHHHHHHHHC GLPNGFAVTMDMPNDYPYVEIAQALQANFAQGGIQVKLIPGDAKQAIGKYRARQHDIFIG CCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCEEEE EWSPDYMDPNSNARGFAWNPDNSDNAKHKLLAWRNGWDVPQLTAKTDAALAEPSAAKRAQ CCCCCCCCCCCCCCCEEECCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHCCCHHHHHHH DYQALQKAVLANSPFVILFEKVVQVATRPGVTGPEIGPINDLVSYRTLKK HHHHHHHHHHCCCCEEHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCC >Mature Secondary Structure MKHMLSKLAASAALAALAPVLAPAHAATPPGIFVIATQLGEFTTLDPSEIYELVPSEYVA CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHCCHHHHH NTYERLVRVDLREPSKFEGRIAQSWSVGADGLTYTFKLRTGLKFHSGNPVTADDVAWSLQ HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCEEECCCCCCHHHHHHHHH RTVLLDKGPAGVLADLGLTKDNVARKVRKLDDTTVSIETDRRYAPSFVLNVLSADPASIV HEEEEECCCCCCEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCHHHHH DKQLLLSHEKNGDFGNAWLKNADAGSGPYRLVKWTPNESLVLQRFDGYRAPYPMKRIVLR HHHHHHHCCCCCCCCHHHHCCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCHHHHHHHH HVPEASAQRLLLENGDVDAARNLSPDSLAALSKAGKIHVASWPVSALLYLSLNTRNPNLA HCCCHHHHHHHHCCCCCCHHCCCCHHHHHHHHHCCEEEEECCCCEEEEEEEEECCCCCCC KPEVQEAMKWLVDYDGIQRNIVRTTYKVHQTFLPDGFLGALDANPYRQNVAKAKALLAKA CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCEECCCCCHHHHHHHHHHHHHHHC GLPNGFAVTMDMPNDYPYVEIAQALQANFAQGGIQVKLIPGDAKQAIGKYRARQHDIFIG CCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCEEEE EWSPDYMDPNSNARGFAWNPDNSDNAKHKLLAWRNGWDVPQLTAKTDAALAEPSAAKRAQ CCCCCCCCCCCCCCCEEECCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHCCCHHHHHHH DYQALQKAVLANSPFVILFEKVVQVATRPGVTGPEIGPINDLVSYRTLKK HHHHHHHHHHCCCCEEHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9097039; 9278503; 9751644 [H]