Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102 bp

The map label for this gene is yhjA [H]

Identifier: 187735586

GI number: 187735586

Start: 1302165

End: 1303544

Strand: Reverse

Name: yhjA [H]

Synonym: Amuc_1091

Alternate gene names: 187735586

Gene position: 1303544-1302165 (Counterclockwise)
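
The Start and End coordinates can be checked against the 1380-base gene sequence listed in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the function name `span_length` is illustrative and not part of the record.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    lo, hi = sorted((start, end))  # order-independent, so reverse-strand pairs also work
    return hi - lo + 1


# Start and End as reported in this record
print(span_length(1302165, 1303544))  # 1380
```

The same result is obtained when the coordinates are given in reverse-strand order (1303544-1302165).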

Preceding gene: 187735587

Following gene: 187735585

Centisome position: 48.93
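
The centisome value appears to express the gene's location as a percentage of the genome length. The sketch below assumes it is computed from the first coordinate of the gene position (1303544) and the genome length given in the header (2,664,102); the record does not state the formula, so this is an inference that happens to reproduce the reported figure.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100.0 * position / genome_length, 2)


# Assumption: the reverse-strand gene start (1303544) is the reference coordinate
print(centisome(1303544, 2664102))  # 48.93
```

Using the lower coordinate (1302165) instead would give 48.88, which does not match the record.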

GC content: 56.23%
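
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below assumes the value in this record is computed that way over the gene sequence; `gc_content` is an illustrative helper, and the figure of 776 G+C bases is inferred from the reported percentage rather than counted here.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


print(gc_content("ATGC"))  # 50.0

# Inferred, not counted: 776 G+C bases out of 1380 would give the reported value
print(round(100 * 776 / 1380, 2))  # 56.23
```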

Gene sequence:

>1380_bases
ATGAGCAAAACATGTTCCTTCATCAAGGCAGGAGCCATTCTGGGCGGCACGGTCATCGTGACGGCGGCTGTCGCCCCCCT
GTTCCTGCCGAACCAGAATGTCAAGCCACTGACGTCCGCCCAGGCGGCGGAAATTACCGCACAGACCATGAATTCCAAAT
GTGCGGACTGCCACAAGCCCGGCACTCACATTTCCGAACTGGTCAATACTCTTTCCGGAGGCCTGCTGGCGCGCCATATC
AGGGACGGGCAGCGCAGCTACAATATGGAAGAACCTCCCACTGCCGTCACCCTTTCCAAGCTGGAACATGTGCTTCAAAT
CAATTCCATGCCCCCCACTTCCTACACCATGGTGCACTGGGGCAGCACGCTCACTCTCCGGGAGAAAAACGCCATGCTCC
AGTGGATCAAGGATGAGCGCCTGAAAATTTTCGGCGATATGGTAGGAGAGGAATACGCCCTTTCCCCTCTTGCCCCCATT
CCGGACGCCCTCCCCACGGATCCGGCCAAAGTGGCCCTGGGCTACAAGCTTTTTCATGACGTGCGCCTTTCCACGGACAA
TACCGTTTCCTGCGCTTCCTGCCATTCCCTGGAAAAAGCCGGGACGGACAACCTGCCCACTTCCACCGGAGTCCGCGGCC
AGAAAGGCGGCATCAATGCCCCCACCGTTTTCAATGCCGCTTTCCATGCCAAGCAATTTTGGGACGGACGCGCAGCCAAC
CTCCAGGAACAGGCCGGCGGACCGCCCCTGAATCCGGTGGAAATGGGGTACGAACATCCGGATGACTGGAAGAAGATCGC
TGCCAAACTGGACCAGGACACCGCTTTTGCCGCAGAATTCAAAAAGGTTTACCCCCAGGGATTCACCGGAGAGACCATCA
CGAATGCCATCGCGGAATATGAAAAAACTCTTATCACGCCGAACAGCCCGTTTGACCGCTACCTGAAAGGGGATGAAAAC
GCCATCAGCGAGAACGCCAAAAAAGGTTACAAGCTTTTCCTGAAGCTTGGTTGCCAGACCTGCCACACCGGTCCCGCCAT
GGGAGGCCAGTCCTTTGAATACGCCGACCTCAAAGGCGATTTCTTTGCCGGACGCGCCAAGACCAACGACGATAACGGCC
TGATGAATTTCTCCAAAAAGGAATCGGACAGGCACCGCTTCCGTGTTCCGACCCTCCGCAATGTGGAACTCACCTGGCCG
TACATGCATGACGCCTCCGCACAGACTCTGGAGGAAGCCATTACGAAAATGTACCATTACCAGCTCGGTTACGATAAACT
GGACAAGAAGGAAGTGAGACTTCTGGTGGCTTTCCTGAAGACGCTTACCGGAGAATACAACGGCAAACCCGTCCAGGGCG
AAGTTTGCCCTGCCTCCTGA

Upstream 100 bases:

>100_bases
TAACGAAGACCTGAGAAAGGCCGGCGAACTTTAGAATAAATCTTGATTTTTTATCCTTCCCCTTTTACCATCCGGGCAAT
TCCACGTCATTGAGAATCTC

Downstream 100 bases:

>100_bases
ATCTGAAGAAAAGCTCTTTTTCCAGCCGCGCCAATGAACATTCATTGGCGCGGCTTTTTTGCATTTCCCTTCCAGTTCCT
ACGGCAGACACTGCCGGGAA

Product: Cytochrome-c peroxidase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 459; Mature: 458
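
The two residue counts follow from the gene length. The sketch below assumes the 1380 bases include a single stop codon and that the mature form differs only by removal of the initiator methionine, which is consistent with the mature sequence in this record starting at "SKTC". The function name is illustrative.

```python
from typing import Tuple


def protein_lengths(cds_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence length."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = cds_bases // 3 - 1  # the final codon is a stop and codes for no residue
    mature = translated - 1          # assumption: only the initiator Met is removed
    return translated, mature


print(protein_lengths(1380))  # (459, 458)
```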

Protein sequence:

>459_residues
MSKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKPGTHISELVNTLSGGLLARHI
RDGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHWGSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPI
PDALPTDPAKVALGYKLFHDVRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAAN
LQEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEYEKTLITPNSPFDRYLKGDEN
AISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGDFFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWP
YMHDASAQTLEEAITKMYHYQLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS
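
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments (which bacterial translation table 11 shares for amino acids) and translates only the first 18 and last 12 bases of the gene sequence as a spot check; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGCAAAACATGTTCC"))  # MSKTCS (start of the protein)
print(translate("CCTGCCTCCTGA"))        # PAS (end of the protein, then stop)
```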

Sequences:

>Translated_459_residues
MSKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKPGTHISELVNTLSGGLLARHI
RDGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHWGSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPI
PDALPTDPAKVALGYKLFHDVRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAAN
LQEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEYEKTLITPNSPFDRYLKGDEN
AISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGDFFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWP
YMHDASAQTLEEAITKMYHYQLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS
>Mature_458_residues
SKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKPGTHISELVNTLSGGLLARHIR
DGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHWGSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPIP
DALPTDPAKVALGYKLFHDVRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAANL
QEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEYEKTLITPNSPFDRYLKGDENA
ISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGDFFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWPY
MHDASAQTLEEAITKMYHYQLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS

Specific function: Unknown

COG id: COG1858

COG function: function code P; Cytochrome c peroxidase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI1789935, Length=404, Percent_Identity=47.03, Blast_Score=366, Evalue=1e-102

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009056
- InterPro:   IPR003088
- InterPro:   IPR004852 [H]

Pfam domain/function: PF03150 CCP_MauG; PF00034 Cytochrom_C [H]

EC number: 1.11.1.5 [H]

Molecular weight (Da): Translated: 50399; Mature: 50268

Theoretical pI: Translated: 7.35; Mature: 7.35

Prosite motif: PS51007 CYTC L=RR ; PS51008 MULTIHEME_CYTC L=rr

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
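
The percentages above can be recomputed from the protein sequence in this record. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal, and that the mature protein is the translated sequence without its first residue; `residue_percent` is an illustrative helper.

```python
# Translated protein sequence as listed in this record (459 residues)
TRANSLATED = (
    "MSKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKPGTHISELVNTLSGGLLARHI"
    "RDGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHWGSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPI"
    "PDALPTDPAKVALGYKLFHDVRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAAN"
    "LQEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEYEKTLITPNSPFDRYLKGDEN"
    "AISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGDFFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWP"
    "YMHDASAQTLEEAITKMYHYQLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS"
)
MATURE = TRANSLATED[1:]  # assumption: mature form lacks only the initiator Met


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, residue_percent(seq, "C"), residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```

The combined Cys+Met figure is rounded from the combined count, so it need not equal the sum of the two rounded values (for the translated protein, 1.7 + 2.6 is 4.3, while the combined figure is 4.4).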

Secondary structure:

>Translated Secondary Structure
MSKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKP
CCCCHHHHHHCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCHHHHCCCC
GTHISELVNTLSGGLLARHIRDGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHW
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEE
GSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPIPDALPTDPAKVALGYKLFHD
CCEEEEEHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHH
VRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAAN
EEECCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCC
LQEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEY
CHHHCCCCCCCHHHCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHH
EKTLITPNSPFDRYLKGDENAISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGD
HHHCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHCCCCCCCCCCCCEEECCCCC
FFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWPYMHDASAQTLEEAITKMYHY
EEECCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEEEECCCCCCCHHHHHHHHHHHHHHH
QLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS
HCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
>Mature Secondary Structure 
SKTCSFIKAGAILGGTVIVTAAVAPLFLPNQNVKPLTSAQAAEITAQTMNSKCADCHKP
CCCHHHHHHCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCHHHHCCCC
GTHISELVNTLSGGLLARHIRDGQRSYNMEEPPTAVTLSKLEHVLQINSMPPTSYTMVHW
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEEE
GSTLTLREKNAMLQWIKDERLKIFGDMVGEEYALSPLAPIPDALPTDPAKVALGYKLFHD
CCEEEEEHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHH
VRLSTDNTVSCASCHSLEKAGTDNLPTSTGVRGQKGGINAPTVFNAAFHAKQFWDGRAAN
EEECCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCC
LQEQAGGPPLNPVEMGYEHPDDWKKIAAKLDQDTAFAAEFKKVYPQGFTGETITNAIAEY
CHHHCCCCCCCHHHCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHH
EKTLITPNSPFDRYLKGDENAISENAKKGYKLFLKLGCQTCHTGPAMGGQSFEYADLKGD
HHHCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHCCCCCCCCCCCCEEECCCCC
FFAGRAKTNDDNGLMNFSKKESDRHRFRVPTLRNVELTWPYMHDASAQTLEEAITKMYHY
EEECCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEEEECCCCCCCHHHHHHHHHHHHHHH
QLGYDKLDKKEVRLLVAFLKTLTGEYNGKPVQGEVCPAS
HCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8041620; 9278503 [H]