Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is rfaE [C]
Identifier: 187736080
GI number: 187736080
Start: 1914118
End: 1915209
Strand: Reverse
Name: rfaE [C]
Synonym: Amuc_1591
Alternate gene names: 187736080
Gene position: 1915209-1914118 (Counterclockwise)
Preceding gene: 187736081
Following gene: 187736079
Centisome position: 71.89
GC content: 57.78
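The coordinate fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, not the database's own code: the function names are illustrative, and the centisome convention used here (5' end of the gene as a percentage of genome length) is an assumption, although it reproduces the listed value of 71.89.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_664_102            # "Length" of NC_010655
START, END = 1_914_118, 1_915_209    # "Start" / "End" fields
STRAND = "Reverse"                   # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome.

    Assumption: on the reverse strand the 5' end is the larger coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100 * five_prime / genome_length, 2)


length_bp = gene_length(START, END)
residues = length_bp // 3 - 1        # drop the stop codon
position = centisome(START, END, STRAND, GENOME_LENGTH)

print(length_bp, residues, position)  # 1092 363 71.89
```

The results agree with the 1092-base gene sequence, the 363-residue protein, and the centisome position given in this record.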
Gene sequence:
>1092_bases ATGAAAAAAGTTTTCGTCTCCGGCTGCTATGATATCGTCCACGCTGGCCATATCCAGTTCTTTGAAGAAGCCCGCGCTCT GGGCGACTATCTAATCGTCTCTTTCGCTTCGGAACCCGTGCTGTGGCACCACAAGCAGCGCAAGCCGTCCATTCCGGACG AACACAAAAAAGTTCTGCTGGAAAGCCTGCGCATGGTTGACAAGGTTATCCTGGGTACCGGCATGAAAAAGGGCCTGGAT TTTGAAGAAGAGTTTCTTCAGGAAAAACCGGACATCCTGGCTGTGACGGAAGACGACCTTTACAGCGATATCAAGAAGGA GCTGTGCGCCCGCGTCGGAGCCAATTACGTGGTGCTCCCCAAGACGCCCCCCAAATTCACTCCCGTCTCCACCACCATGC TGGTGAACCGCATCAAGGCTCCGTCCGCCGTACCGCTTCGCGTGGACTTTGCCGGGGGATGGCTGGACGTGCCCCGCTAC GCCAGAAAAGGCTCCTATGTGGTCAACTGCGCCATCACCCCCATGGTTTCCCTCTGCGAATGGCCGTATGAAAAACGCTC CGGGCTGGGAGGCAGCGGCGCGTGGGCCATGCTGGAAGGGCGCGATCCGGTCGCGTCCGAACTCGCTCTGGGCGTCGGTT GGCAGGATCCTGCCGTCATTGCGGAAACGGGGTTGTGCGTATGGCGCTCCGGCAGTTCTCCGGTGCTGGATGTCAAAGGC ACGGGTGACTTCCTGGAAGGAAGAATGGCCATTCTATATACGGGTGAAGAACACGACACCCCAAAGATGGCGGATGAACA ACGGGACTACGTACGCATCTCCCAATCCTCCCTCATTGCCCGCACCGGAGTGCTGGAACGCAACATCAATACACTGGCGG CAGGCGTAGCCCTGTACTACAGCGTGCAGCTTGACGAAGGAATGCGCCCCCTGCCGGATATTCCCAATGCCCTTGCAAAA AAATATCTGGGAGGCGGCTACGGGGGATACGCCCTGTACCTTTTCCCCTGCCGGGATGATCGCGACCAGGCTGTCAAGGA CAACCCGGCAATGAAACGGGTGGAGCCGTATTGCCGCCAGCTGTTCAAGTAA
Upstream 100 bases:
>100_bases CTGTAGTCCGCGCCGTTTTTTAACGCATTTTCATCTCGCTCTTGCATCCGGCACGGCAATAGGGCATACTGTTGCAGACT AATTATCTATCATTACACGC
Downstream 100 bases:
>100_bases ATTCCCCCCCTTTCATGTCCCTGATGCGGAACAGCCTGGTTGCCTCCGGCGCCATCTTCGCCTGCCGCCTGACAGGCATG GCCAGGGAAATTGTGTACAC
Product: cytidyltransferase-related domain protein
Products: diphosphate; CDP-glycerol
Alternate protein names: NA
Number of amino acids: Translated: 363; Mature: 363
Protein sequence:
>363_residues MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLLESLRMVDKVILGTGMKKGLD FEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLPKTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRY ARKGSYVVNCAITPMVSLCEWPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYYSVQLDEGMRPLPDIPNALAK KYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQLFK
Sequences:
>Translated_363_residues MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLLESLRMVDKVILGTGMKKGLD FEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLPKTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRY ARKGSYVVNCAITPMVSLCEWPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYYSVQLDEGMRPLPDIPNALAK KYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQLFK
>Mature_363_residues MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLLESLRMVDKVILGTGMKKGLD FEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLPKTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRY ARKGSYVVNCAITPMVSLCEWPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYYSVQLDEGMRPLPDIPNALAK KYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQLFK
Specific function: Lipopolysaccharide core biosynthesis. [C]
COG id: COG0615
COG function: function code MI; Cytidylyltransferase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004821
- InterPro: IPR004820
- InterPro: IPR014729 [H]
Pfam domain/function: PF01467 CTP_transf_2 [H]
EC number: 2.7.7.39
Molecular weight: Translated: 40325; Mature: 40325
Theoretical pI: Translated: 6.78; Mature: 6.78
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.9 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
4.7 %Cys+Met (Mature Protein)
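The Cys/Met percentages can be recomputed directly from the protein sequence given in this record. The snippet below is a minimal sketch under that assumption; `residue_percent` is an illustrative helper, not a function of the database, and rounding to one decimal place is assumed from the way the values are listed.

```python
# Protein sequence copied from the "Protein sequence" field (363 residues).
PROTEIN = (
    "MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLLESLRMVDKVILGTGMKKGLD"
    "FEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLPKTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRY"
    "ARKGSYVVNCAITPMVSLCEWPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG"
    "TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYYSVQLDEGMRPLPDIPNALAK"
    "KYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQLFK"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)


cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
cys_met = residue_percent(PROTEIN, "CM")

print(len(PROTEIN), cys, met, cys_met)  # 363 1.9 2.8 4.7
```

Because the translated and mature proteins are identical here (363 residues each), the same figures apply to both.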
Secondary structure:
>Translated Secondary Structure
MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLL
CCEEEEEHHHHHHHCCHHHHHHHHHHHHHEEEEEECCCCEEECCCCCCCCCCHHHHHHHH
ESLRMVDKVILGTGMKKGLDFEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLP
HHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHCCCEEEEC
KTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRYARKGSYVVNCAITPMVSLCE
CCCCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCHHHCCCCEEEEEEHHHHHHHHC
WPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG
CCCHHHCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCEEECCEEEEECCCCCEEEECC
TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYY
CCCHHCCCEEEEEECCCCCCCCCCCCCHHHEEECHHHHHHHHHHHHHCHHHHHHHEHEEE
SVQLDEGMRPLPDIPNALAKKYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQ
EEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHH
LFK
HCC
>Mature Secondary Structure
MKKVFVSGCYDIVHAGHIQFFEEARALGDYLIVSFASEPVLWHHKQRKPSIPDEHKKVLL
CCEEEEEHHHHHHHCCHHHHHHHHHHHHHEEEEEECCCCEEECCCCCCCCCCHHHHHHHH
ESLRMVDKVILGTGMKKGLDFEEEFLQEKPDILAVTEDDLYSDIKKELCARVGANYVVLP
HHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHCCCEEEEC
KTPPKFTPVSTTMLVNRIKAPSAVPLRVDFAGGWLDVPRYARKGSYVVNCAITPMVSLCE
CCCCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCCCCCHHHCCCCEEEEEEHHHHHHHHC
WPYEKRSGLGGSGAWAMLEGRDPVASELALGVGWQDPAVIAETGLCVWRSGSSPVLDVKG
CCCHHHCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCEEECCEEEEECCCCCEEEECC
TGDFLEGRMAILYTGEEHDTPKMADEQRDYVRISQSSLIARTGVLERNINTLAAGVALYY
CCCHHCCCEEEEEECCCCCCCCCCCCCHHHEEECHHHHHHHHHHHHHCHHHHHHHEHEEE
SVQLDEGMRPLPDIPNALAKKYLGGGYGGYALYLFPCRDDRDQAVKDNPAMKRVEPYCRQ
EEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHH
LFK
HCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: CTP; sn-glycerol 3-phosphate
Specific reaction: CTP + sn-glycerol 3-phosphate = diphosphate + CDP-glycerol
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8688087 [H]