Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is dtpA [H]
Identifier: 187736087
GI number: 187736087
Start: 1921929
End: 1923284
Strand: Reverse
Name: dtpA [H]
Synonym: Amuc_1598
Alternate gene names: 187736087
Gene position: 1923284-1921929 (Counterclockwise)
Preceding gene: 187736088
Following gene: 187736085
Centisome position: 72.19
GC content: 55.83
Gene sequence:
>1356_bases ATGAGCGCCCTGCCCTCCCAGTCCAGATACATCATCGGAACGGAAGCCTGTGAACGTTTCAGCTTCTACGGCATGAAGTC CATCCTCATGCTGTACATGACGGGCCATCTGCTGATGAGCGACAACTGGGCCACCTCCACCCTTCATATTTTTATGGGAA TGGTTTATCTGCTTCCCCTGGCGGGGGCCTGGCTGGCGGACAAGGTCTGGGGCCGATATAAAACCATTCTTTATATTTCC CTGCTTTACTGCGTAGGACACGGAGTTCTGGCGACGGCGGATCTTTTCCACACCATTGAAGCGCGCCGTTACATTCTCAT GGCGGGCCTGTTCATCATCGCTCTGGGAGCCGGAGGCATCAAGCCATGCGTCTCCGCCTTTATGGGAGACCAGATTCCTA ATAAGTCCCCGCAGTTAATGACCAAGGCTTTCAATGCCTTCTACTGGGCTATCAACCTTGGCTCCTTCTTCTCCTTCCTG GTCATTCCGGCCATGGAACAGCGTTACGGATACAGTTGGGCGTTTGCCGTCCCGGGCCTCTTCATGGGGGTTGCCACCTT CGTCTTCTGGCTGGGCCGCAAAAAATACCACAAAACGCCTCCGGCCCGGAACAGCGGGCAGCCTGGCTTCTGGAAAGTCC TTTTCATCATTCTGTTCCACGGAGGCTGGAAAAACGCAGAACAGCGCTGCGGAACTTCTGCCGTGGAAGACACCCGGCAC ATCCTGAAAATCCTCTCCATCTTCGCCTTTATCATCCCATTCTGGTCCATTTTCGAACAGACGGCTTCTTCCTGGGTATC CCAGGGCAGCAGGATGATTCCTCTTTCCATCCCGCTCCCGGGCGGTTCCTGGTCCATCGGGCCGGCCCAAATCCAGGCGG CCAATCCCATTTTCGTCATGGTGTTCATCCCCCTCATCACCGTATTTGTTTATCCCAGGGTGGCAACGCTTGCAAGGCCC CTGGTGCGCCTCGGAACGGGATTGGCCCTCAGCTCCGCTACATTCCTGATTGTCGCTTTCCTGCAATACCGGCTGGAGGA AGGAACCTCCATGTCCATCGCATGGCAGCTGATTCCTTACTGCGTACTCACCATCTCTGAGATCCTGGTCAGCACCACGG GCCTGGAATTCGCCTATACGCAAGCCCCGGCGCATTTGAAAAGCCTCATCACCAGTTTCTGGAACCTCACTATCTTTGCA GGCAACATGCTGGTGGCCGCAATTACTTTTTTCCTGTCCAACGGAGAATCAGCCAACGCCATTTCCACGGACCGCTTCAT CCTGTACGCCGTGCTCGCCGCCGTGGTGGCGGTCGCCTACTCCTTCCGGGCGCGCAGGTACGGAAAAACGGAATAA
Upstream 100 bases:
>100_bases AGGCCTTGAACACCGTCTGAACGCATTCGCCATGATGGCCTCCGCCCGGAGGCCATCATTTTTATTACTCCTTCTCCATC TCCCTCCTTTTCATCATCCA
Downstream 100 bases:
>100_bases AGAAAGCACTGCCGTTCCCGGGCGCGTGGCCGGGTTTAACCATGTTTTGAAAAAACAATCAGGCCGTAACCAGCATGCGC TCCTCCCATTCCCTCCGGCA
Product: amino acid/peptide transporter
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 451; Mature: 450
Protein sequence:
>451_residues MSALPSQSRYIIGTEACERFSFYGMKSILMLYMTGHLLMSDNWATSTLHIFMGMVYLLPLAGAWLADKVWGRYKTILYIS LLYCVGHGVLATADLFHTIEARRYILMAGLFIIALGAGGIKPCVSAFMGDQIPNKSPQLMTKAFNAFYWAINLGSFFSFL VIPAMEQRYGYSWAFAVPGLFMGVATFVFWLGRKKYHKTPPARNSGQPGFWKVLFIILFHGGWKNAEQRCGTSAVEDTRH ILKILSIFAFIIPFWSIFEQTASSWVSQGSRMIPLSIPLPGGSWSIGPAQIQAANPIFVMVFIPLITVFVYPRVATLARP LVRLGTGLALSSATFLIVAFLQYRLEEGTSMSIAWQLIPYCVLTISEILVSTTGLEFAYTQAPAHLKSLITSFWNLTIFA GNMLVAAITFFLSNGESANAISTDRFILYAVLAAVVAVAYSFRARRYGKTE
Sequences:
>Translated_451_residues MSALPSQSRYIIGTEACERFSFYGMKSILMLYMTGHLLMSDNWATSTLHIFMGMVYLLPLAGAWLADKVWGRYKTILYIS LLYCVGHGVLATADLFHTIEARRYILMAGLFIIALGAGGIKPCVSAFMGDQIPNKSPQLMTKAFNAFYWAINLGSFFSFL VIPAMEQRYGYSWAFAVPGLFMGVATFVFWLGRKKYHKTPPARNSGQPGFWKVLFIILFHGGWKNAEQRCGTSAVEDTRH ILKILSIFAFIIPFWSIFEQTASSWVSQGSRMIPLSIPLPGGSWSIGPAQIQAANPIFVMVFIPLITVFVYPRVATLARP LVRLGTGLALSSATFLIVAFLQYRLEEGTSMSIAWQLIPYCVLTISEILVSTTGLEFAYTQAPAHLKSLITSFWNLTIFA GNMLVAAITFFLSNGESANAISTDRFILYAVLAAVVAVAYSFRARRYGKTE >Mature_450_residues SALPSQSRYIIGTEACERFSFYGMKSILMLYMTGHLLMSDNWATSTLHIFMGMVYLLPLAGAWLADKVWGRYKTILYISL LYCVGHGVLATADLFHTIEARRYILMAGLFIIALGAGGIKPCVSAFMGDQIPNKSPQLMTKAFNAFYWAINLGSFFSFLV IPAMEQRYGYSWAFAVPGLFMGVATFVFWLGRKKYHKTPPARNSGQPGFWKVLFIILFHGGWKNAEQRCGTSAVEDTRHI LKILSIFAFIIPFWSIFEQTASSWVSQGSRMIPLSIPLPGGSWSIGPAQIQAANPIFVMVFIPLITVFVYPRVATLARPL VRLGTGLALSSATFLIVAFLQYRLEEGTSMSIAWQLIPYCVLTISEILVSTTGLEFAYTQAPAHLKSLITSFWNLTIFAG NMLVAAITFFLSNGESANAISTDRFILYAVLAAVVAVAYSFRARRYGKTE
Specific function: Proton-dependent permease that transports di- and tripeptides [H]
COG id: COG3104
COG function: function code E; Dipeptide/tripeptide permease
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the PTR2/POT transporter (TC 2.A.17) family. DtpA subfamily [H]
Homologues:
Organism=Homo sapiens, GI226371746, Length=473, Percent_Identity=30.2325581395349, Blast_Score=186, Evalue=5e-47, Organism=Homo sapiens, GI4827008, Length=376, Percent_Identity=30.8510638297872, Blast_Score=182, Evalue=7e-46, Organism=Homo sapiens, GI226371748, Length=466, Percent_Identity=28.5407725321888, Blast_Score=160, Evalue=2e-39, Organism=Homo sapiens, GI21717816, Length=458, Percent_Identity=26.2008733624454, Blast_Score=132, Evalue=5e-31, Organism=Homo sapiens, GI7706117, Length=531, Percent_Identity=23.728813559322, Blast_Score=99, Evalue=1e-20, Organism=Escherichia coli, GI1787922, Length=429, Percent_Identity=24.7086247086247, Blast_Score=111, Evalue=9e-26, Organism=Escherichia coli, GI1790572, Length=198, Percent_Identity=32.3232323232323, Blast_Score=93, Evalue=4e-20, Organism=Escherichia coli, GI1789911, Length=413, Percent_Identity=23.4866828087167, Blast_Score=86, Evalue=5e-18, Organism=Escherichia coli, GI1786927, Length=202, Percent_Identity=28.7128712871287, Blast_Score=86, Evalue=5e-18, Organism=Caenorhabditis elegans, GI71987453, Length=371, Percent_Identity=35.3099730458221, Blast_Score=204, Evalue=8e-53, Organism=Caenorhabditis elegans, GI17569141, Length=411, Percent_Identity=32.360097323601, Blast_Score=184, Evalue=9e-47, Organism=Caenorhabditis elegans, GI17541704, Length=375, Percent_Identity=30.4, Blast_Score=146, Evalue=3e-35, Organism=Saccharomyces cerevisiae, GI6322946, Length=458, Percent_Identity=25.9825327510917, Blast_Score=122, Evalue=1e-28, Organism=Drosophila melanogaster, GI28571102, Length=445, Percent_Identity=33.0337078651685, Blast_Score=213, Evalue=2e-55, Organism=Drosophila melanogaster, GI28571100, Length=445, Percent_Identity=33.0337078651685, Blast_Score=213, Evalue=2e-55, Organism=Drosophila melanogaster, GI28571098, Length=410, Percent_Identity=33.9024390243902, Blast_Score=208, Evalue=5e-54, Organism=Drosophila melanogaster, GI24639583, Length=367, Percent_Identity=35.4223433242507, Blast_Score=192, Evalue=4e-49, Organism=Drosophila melanogaster, GI24639585, Length=367, Percent_Identity=35.4223433242507, Blast_Score=192, Evalue=5e-49, Organism=Drosophila melanogaster, GI24639581, Length=367, Percent_Identity=35.4223433242507, Blast_Score=192, Evalue=5e-49, Organism=Drosophila melanogaster, GI24645459, Length=473, Percent_Identity=26.215644820296, Blast_Score=113, Evalue=3e-25,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016196 - InterPro: IPR000109 - InterPro: IPR005279 - InterPro: IPR018456 [H]
Pfam domain/function: PF00854 PTR2 [H]
EC number: NA
Molecular weight: Translated: 50099; Mature: 49967
Theoretical pI: Translated: 9.85; Mature: 9.85
Prosite motif: PS01022 PTR2_1 ; PS01023 PTR2_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein) 3.5 %Met (Translated Protein) 4.7 %Cys+Met (Translated Protein) 1.1 %Cys (Mature Protein) 3.3 %Met (Mature Protein) 4.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSALPSQSRYIIGTEACERFSFYGMKSILMLYMTGHLLMSDNWATSTLHIFMGMVYLLPL CCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHHHHHHH AGAWLADKVWGRYKTILYISLLYCVGHGVLATADLFHTIEARRYILMAGLFIIALGAGGI CCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH KPCVSAFMGDQIPNKSPQLMTKAFNAFYWAINLGSFFSFLVIPAMEQRYGYSWAFAVPGL HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH FMGVATFVFWLGRKKYHKTPPARNSGQPGFWKVLFIILFHGGWKNAEQRCGTSAVEDTRH HHHHHHHHHHHCCHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHH ILKILSIFAFIIPFWSIFEQTASSWVSQGSRMIPLSIPLPGGSWSIGPAQIQAANPIFVM HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCCHHEECCCCCHHH VFIPLITVFVYPRVATLARPLVRLGTGLALSSATFLIVAFLQYRLEEGTSMSIAWQLIPY HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH CVLTISEILVSTTGLEFAYTQAPAHLKSLITSFWNLTIFAGNMLVAAITFFLSNGESANA HHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC ISTDRFILYAVLAAVVAVAYSFRARRYGKTE CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC >Mature Secondary Structure SALPSQSRYIIGTEACERFSFYGMKSILMLYMTGHLLMSDNWATSTLHIFMGMVYLLPL CCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHHHHHHH AGAWLADKVWGRYKTILYISLLYCVGHGVLATADLFHTIEARRYILMAGLFIIALGAGGI CCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH KPCVSAFMGDQIPNKSPQLMTKAFNAFYWAINLGSFFSFLVIPAMEQRYGYSWAFAVPGL HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH FMGVATFVFWLGRKKYHKTPPARNSGQPGFWKVLFIILFHGGWKNAEQRCGTSAVEDTRH HHHHHHHHHHHCCHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHH ILKILSIFAFIIPFWSIFEQTASSWVSQGSRMIPLSIPLPGGSWSIGPAQIQAANPIFVM HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCCHHEECCCCCHHH VFIPLITVFVYPRVATLARPLVRLGTGLALSSATFLIVAFLQYRLEEGTSMSIAWQLIPY HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH CVLTISEILVSTTGLEFAYTQAPAHLKSLITSFWNLTIFAGNMLVAAITFFLSNGESANA HHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC ISTDRFILYAVLAAVVAVAYSFRARRYGKTE CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA