Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is infB [H]

Identifier: 187735457

GI number: 187735457

Start: 1139940

End: 1141991

Strand: Reverse

Name: infB [H]

Synonym: Amuc_0956

Alternate gene names: 187735457

Gene position: 1141991-1139940 (Counterclockwise)

Preceding gene: 187735458

Following gene: 187735456

Centisome position: 42.87
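
The coordinate fields above can be cross-checked with simple arithmetic. The Python sketch below derives the gene length from Start and End and recomputes the centisome position. It assumes the centisome value is the 5' end of the gene (coordinate 1141991, since the gene lies on the reverse strand) expressed as a percentage of the genome length; that convention is inferred from the numbers in this record, not taken from documentation, and the function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 2_664_102
START, END = 1_139_940, 1_141_991  # forward-strand coordinates, inclusive


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals.

    Assumption: the record reports the 5' end of the gene, which for a
    reverse-strand gene is the larger coordinate.
    """
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
print(length)                         # 2052
print(length // 3 - 1)                # 683 codons once the stop codon is excluded
print(centisome(END, GENOME_LENGTH))  # 42.87
```

The length of 2052 bases agrees with the header of the gene sequence block, and 684 codons minus the stop codon gives the 683 translated residues reported further down.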

GC content: 61.21
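
The GC content figure is presumably the percentage of G and C bases in the gene sequence listed below. A minimal Python sketch of that calculation follows; the function name is hypothetical, and it assumes the database counts only unambiguous G and C characters over the full sequence length.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, rounded to two decimals.

    Whitespace and line breaks are ignored so that a wrapped
    FASTA body can be pasted in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input, not the gene itself: 2 of 4 bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Applying the same function to the 2052-base sequence in this record should give a value comparable to the one reported, provided the assumption about the counting method holds.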

Gene sequence:

>2052_bases
ATGCCTAATAAACAAGAAGAGAAAGAACCCAAAAAGGAAGTATTGGATCTGATCGGCGGATCTTCCAAAAAGAAACGCGC
ACCCCAGCCGGCCCCCGCACCGGCGCCCTCCCGCCCTGTTCCAGTCAAGAAAGAAGCTCTCGACTTGCTTTCCGGCAACA
AGAAAAAAGCGTCGCGCGCCGCTGACGCCGCTCCGGCGCCCGCCGCTGCCCCGGCCGCCGCAGAACCCGCCGCGCCCGCA
CCCCAGGAAGAACCCTCCGCAGACGATAAAATCATCAACCTCAAGCCGCCCGTCTCCGTTTCCGAGCTGGCGGGCATGCT
GAAAGCCAAGCCCTTCCAAATCATCAAGGATCTGATGGGCATGGGCATCTTCGCCAACCCGAACACGCCGCTGGATGCGG
ACGCCGTCAGTTCCATCTGCGACCTCCACGGCTACACATTCGCCCGTGAAAAACGGGAAAAGGGCGGCGGCGTGAAAGCC
CAGCAGGAACCCGTCAAGGAACCGGAACCCGTGCCGGTGGTGGAAGAACCCAAAGCCACCCTCATCACGCGCACGCCCAT
CATCACCGTCATGGGCCATGTGGACCACGGCAAAACATCCCTGCTGGACTACATCCGCAAAACGCGCGTGGCCAAGGGGG
AAGCCGGGGGCATTACCCAGCACATCGGCGCCTATACGGTTGACTACAACGGCAGCACGCTCACCTTCCTGGATACGCCG
GGCCATGCCATCTTCACGGAAATGCGCGCCCGCGGCGCGGACGTCACGGACATCGTGGTTCTGGTGGTGGCCGCCAATGA
CGGCATCATGCCCCAGACGCGGGAAGCCATCGCCCACTCCAAGGCAGCCGGCAAAACCATCATCGTCGCCATCAACAAAT
GCGACCTTCCGGCCGCAGACCCCGTCAAGACCAAGAGCGGCCTGATGGAAGAAGGGCTGGTTCCCACGGACTTCGGAGGC
GATGTGGAATGCGTGGAAGTCTCCGCCCTGACCGGGGCCGGCATTGACGACCTTCTCGGCCTTCTCGTCCTCCAGTCAGA
AGTGCTGGAACTCCAGGCCAACCCCAAGGCCAACTGCCGCGCCTCCATCATTGAGGCCCGCGTGGAACCCGGCACGGGCA
GCTCCGCCACGGCCATCGTGGAAAGCGGCACCATCCGCGTCGGAATGCCTTTCATCTGCGGCCCTTACGCCGGCAAGGTG
CGCGCCCTGGTCAACGACCACGGCGAACGCGTCAAAAAGGTAGGGCCCGGCATGCCCGTGGAAATCACCGGCTTCTCTGA
AACCCCGAACGTGGGGGACGAACTGGTGGAAATGGAAAACGAACGCGCCGCCAAAAAGCTTGGGGAAGAACGCCAGGAGG
AACTGCGCAAGCAGCGCCTGGCCCAGCCCCGCAAGGCACGCATGGAAGAACTGCTCGCCATGATGGGGGACGGCACCCAG
AAAGCCCAGCTCAAAATACTCCTGAAGGGGGATGTGCAGGGTTCCGTGGAAGCCATCAGGAAAGCTGTTCTGGACATCCA
GTCGGACAAAGTGGAATGCGTCTTCCTGAACGCCTCCGCCGGCCCCATCTCGGAATCGGACGTTCTGCTGGCCTCCTCCT
CGGACGCCGTCATCCTGGGCTTCAACGTCAAGGTGGAAGCCAACGCCGTCAAACTGCTCAAGCGGGAAGGCGTGCAGGTA
AAACTGTATTCCATCGTTTACGAACTCATTGACCAGGTGCGGGATGCCATGCTGGGCCTTCTGGAACCGGAAACGCGCGA
AACCATCATCGGCCACGCCAAAGTGCTCCAGGTCTTCAAGCTCAACAAGGGCCGTGCGGCAGGCTGCATGGTGGAGGACG
GGAAAATCCTCCGCAGCTGCGAGGCGCGCGTCATCCGCGACAAGACGCCCGTCTTTGACGGTAAAATGTCCACCCTCCGC
CGCTTCCAGGATGAAGTGGAAGAAGTCAAGGCAGGTCTGGAATGCGGCATCCGCCTCGGAGACTTCAACGAATACGAAAC
GGGCGACATCATCGAATGCTACACGCTGGAAAAAATCCAGCAAACGCTGTAA

Upstream 100 bases:

>100_bases
TCCCGCAGAGGAAGCCGCCCAAATTCTGGACAAGGCCCGGGACCTTATCTCCCAATAACCGTTGAGCGCGGCGCTCAACA
TCTAACAACCTTAAATCAAG

Downstream 100 bases:

>100_bases
CCCCCAACAAAAATCATCCAACAGGGACGGAGCTCCTTGCACGCTCCTCCGGGCGTCCGGGAAACCTCCGTCCCTGTCCT
TTCCCTTATTCCTCCTTCTC

Product: translation initiation factor IF-2

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 683; Mature: 682
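
The relationship between the 2052-base gene sequence and the two residue counts can be illustrated with a short translation routine. The sketch below is an assumption-laden example rather than the database's own procedure: it uses the amino acid assignments of the standard genetic code (which the bacterial table shares), stops at the first stop codon, and treats the mature form as the translated form with the initiator methionine removed, as the 683 versus 682 counts suggest.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above, followed by its stop codon.
translated = translate("ATGCCTAATAAACAAGAAGAGTAA")
mature = translated[1:]  # assumption: initiator Met is removed
print(translated)  # MPNKQEE
print(mature)      # PNKQEE
```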

Protein sequence:

>683_residues
MPNKQEEKEPKKEVLDLIGGSSKKKRAPQPAPAPAPSRPVPVKKEALDLLSGNKKKASRAADAAPAPAAAPAAAEPAAPA
PQEEPSADDKIINLKPPVSVSELAGMLKAKPFQIIKDLMGMGIFANPNTPLDADAVSSICDLHGYTFAREKREKGGGVKA
QQEPVKEPEPVPVVEEPKATLITRTPIITVMGHVDHGKTSLLDYIRKTRVAKGEAGGITQHIGAYTVDYNGSTLTFLDTP
GHAIFTEMRARGADVTDIVVLVVAANDGIMPQTREAIAHSKAAGKTIIVAINKCDLPAADPVKTKSGLMEEGLVPTDFGG
DVECVEVSALTGAGIDDLLGLLVLQSEVLELQANPKANCRASIIEARVEPGTGSSATAIVESGTIRVGMPFICGPYAGKV
RALVNDHGERVKKVGPGMPVEITGFSETPNVGDELVEMENERAAKKLGEERQEELRKQRLAQPRKARMEELLAMMGDGTQ
KAQLKILLKGDVQGSVEAIRKAVLDIQSDKVECVFLNASAGPISESDVLLASSSDAVILGFNVKVEANAVKLLKREGVQV
KLYSIVYELIDQVRDAMLGLLEPETRETIIGHAKVLQVFKLNKGRAAGCMVEDGKILRSCEARVIRDKTPVFDGKMSTLR
RFQDEVEEVKAGLECGIRLGDFNEYETGDIIECYTLEKIQQTL

Sequences:

>Translated_683_residues
MPNKQEEKEPKKEVLDLIGGSSKKKRAPQPAPAPAPSRPVPVKKEALDLLSGNKKKASRAADAAPAPAAAPAAAEPAAPA
PQEEPSADDKIINLKPPVSVSELAGMLKAKPFQIIKDLMGMGIFANPNTPLDADAVSSICDLHGYTFAREKREKGGGVKA
QQEPVKEPEPVPVVEEPKATLITRTPIITVMGHVDHGKTSLLDYIRKTRVAKGEAGGITQHIGAYTVDYNGSTLTFLDTP
GHAIFTEMRARGADVTDIVVLVVAANDGIMPQTREAIAHSKAAGKTIIVAINKCDLPAADPVKTKSGLMEEGLVPTDFGG
DVECVEVSALTGAGIDDLLGLLVLQSEVLELQANPKANCRASIIEARVEPGTGSSATAIVESGTIRVGMPFICGPYAGKV
RALVNDHGERVKKVGPGMPVEITGFSETPNVGDELVEMENERAAKKLGEERQEELRKQRLAQPRKARMEELLAMMGDGTQ
KAQLKILLKGDVQGSVEAIRKAVLDIQSDKVECVFLNASAGPISESDVLLASSSDAVILGFNVKVEANAVKLLKREGVQV
KLYSIVYELIDQVRDAMLGLLEPETRETIIGHAKVLQVFKLNKGRAAGCMVEDGKILRSCEARVIRDKTPVFDGKMSTLR
RFQDEVEEVKAGLECGIRLGDFNEYETGDIIECYTLEKIQQTL
>Mature_682_residues
PNKQEEKEPKKEVLDLIGGSSKKKRAPQPAPAPAPSRPVPVKKEALDLLSGNKKKASRAADAAPAPAAAPAAAEPAAPAP
QEEPSADDKIINLKPPVSVSELAGMLKAKPFQIIKDLMGMGIFANPNTPLDADAVSSICDLHGYTFAREKREKGGGVKAQ
QEPVKEPEPVPVVEEPKATLITRTPIITVMGHVDHGKTSLLDYIRKTRVAKGEAGGITQHIGAYTVDYNGSTLTFLDTPG
HAIFTEMRARGADVTDIVVLVVAANDGIMPQTREAIAHSKAAGKTIIVAINKCDLPAADPVKTKSGLMEEGLVPTDFGGD
VECVEVSALTGAGIDDLLGLLVLQSEVLELQANPKANCRASIIEARVEPGTGSSATAIVESGTIRVGMPFICGPYAGKVR
ALVNDHGERVKKVGPGMPVEITGFSETPNVGDELVEMENERAAKKLGEERQEELRKQRLAQPRKARMEELLAMMGDGTQK
AQLKILLKGDVQGSVEAIRKAVLDIQSDKVECVFLNASAGPISESDVLLASSSDAVILGFNVKVEANAVKLLKREGVQVK
LYSIVYELIDQVRDAMLGLLEPETRETIIGHAKVLQVFKLNKGRAAGCMVEDGKILRSCEARVIRDKTPVFDGKMSTLRR
FQDEVEEVKAGLECGIRLGDFNEYETGDIIECYTLEKIQQTL

Specific function: One of the essential components for the initiation of protein synthesis. Protects formylmethionyl-tRNA from spontaneous hydrolysis and promotes its binding to the 30S ribosomal subunits. Also involved in the hydrolysis of GTP during the formation of the 7

COG id: COG0532

COG function: function code J; Translation initiation factor 2 (IF-2; GTPase)

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the IF-2 family [H]

Homologues:

Organism=Homo sapiens, GI53729339, Length=638, Percent_Identity=35.2664576802508, Blast_Score=367, Evalue=1e-101,
Organism=Homo sapiens, GI53729337, Length=638, Percent_Identity=35.2664576802508, Blast_Score=367, Evalue=1e-101,
Organism=Homo sapiens, GI84043963, Length=465, Percent_Identity=29.0322580645161, Blast_Score=132, Evalue=7e-31,
Organism=Homo sapiens, GI157426893, Length=186, Percent_Identity=29.5698924731183, Blast_Score=73, Evalue=1e-12,
Organism=Homo sapiens, GI34147630, Length=141, Percent_Identity=36.1702127659575, Blast_Score=66, Evalue=9e-11,
Organism=Escherichia coli, GI1789559, Length=588, Percent_Identity=44.2176870748299, Blast_Score=496, Evalue=1e-141,
Organism=Escherichia coli, GI1788922, Length=223, Percent_Identity=28.6995515695067, Blast_Score=75, Evalue=2e-14,
Organism=Escherichia coli, GI2367247, Length=207, Percent_Identity=31.8840579710145, Blast_Score=73, Evalue=5e-14,
Organism=Escherichia coli, GI48994988, Length=175, Percent_Identity=34.8571428571429, Blast_Score=73, Evalue=5e-14,
Organism=Escherichia coli, GI1790412, Length=233, Percent_Identity=32.1888412017167, Blast_Score=70, Evalue=6e-13,
Organism=Escherichia coli, GI1789737, Length=233, Percent_Identity=32.1888412017167, Blast_Score=69, Evalue=7e-13,
Organism=Caenorhabditis elegans, GI71994658, Length=556, Percent_Identity=34.3525179856115, Blast_Score=306, Evalue=2e-83,
Organism=Caenorhabditis elegans, GI212656558, Length=587, Percent_Identity=26.0647359454855, Blast_Score=126, Evalue=3e-29,
Organism=Caenorhabditis elegans, GI17557151, Length=243, Percent_Identity=27.5720164609054, Blast_Score=76, Evalue=6e-14,
Organism=Caenorhabditis elegans, GI25141371, Length=222, Percent_Identity=28.3783783783784, Blast_Score=68, Evalue=2e-11,
Organism=Saccharomyces cerevisiae, GI6324550, Length=621, Percent_Identity=35.7487922705314, Blast_Score=343, Evalue=5e-95,
Organism=Saccharomyces cerevisiae, GI6319282, Length=555, Percent_Identity=26.8468468468468, Blast_Score=119, Evalue=1e-27,
Organism=Saccharomyces cerevisiae, GI6324761, Length=127, Percent_Identity=38.5826771653543, Blast_Score=69, Evalue=4e-12,
Organism=Drosophila melanogaster, GI28572034, Length=535, Percent_Identity=39.4392523364486, Blast_Score=350, Evalue=2e-96,
Organism=Drosophila melanogaster, GI24656849, Length=558, Percent_Identity=25.8064516129032, Blast_Score=123, Evalue=4e-28,
Organism=Drosophila melanogaster, GI160714833, Length=73, Percent_Identity=69.8630136986301, Blast_Score=106, Evalue=5e-23,
Organism=Drosophila melanogaster, GI19921738, Length=154, Percent_Identity=38.3116883116883, Blast_Score=75, Evalue=1e-13,
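
Each homologue entry above is a comma-separated list of key=value fields plus a bare GI identifier, so it can be turned into a structured record with a few lines of code. The Python sketch below is one possible parser; the function name and the choice of numeric types are illustrative assumptions, and it would need adjusting for an organism name that itself contains a comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as a string
    # Convert the numeric fields; the types are an assumption about intended use.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789559, Length=588, "
        "Percent_Identity=44.2176870748299, Blast_Score=496, Evalue=1e-141,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
```

With the entries parsed this way, the hits can be sorted by Blast_Score or filtered by Evalue without further string handling.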

Paralogues:

None

Copy number: 1150 Molecules/Cell In: Glucose minimal media [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006847
- InterPro:   IPR000795
- InterPro:   IPR005225
- InterPro:   IPR000178
- InterPro:   IPR015760
- InterPro:   IPR023115
- InterPro:   IPR004161
- InterPro:   IPR009000 [H]

Pfam domain/function: PF00009 GTP_EFTU; PF03144 GTP_EFTU_D2; PF11987 IF-2; PF04760 IF2_N [H]

EC number: NA

Molecular weight: Translated: 73207; Mature: 73076

Theoretical pI: Translated: 5.56; Mature: 5.56

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
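
The percentages above are presumably simple residue fractions of the translated and mature protein sequences. A minimal Python sketch of that calculation is given here; the function name is hypothetical and the one-decimal rounding is an assumption made to match the precision shown in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy peptide, not the protein in this record.
toy = "MCAA"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```

Running the function on the translated sequence and on the same sequence without its first residue would show how removing the initiator methionine lowers the Met figure while leaving the Cys figure essentially unchanged.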

Secondary structure:

>Translated Secondary Structure
MPNKQEEKEPKKEVLDLIGGSSKKKRAPQPAPAPAPSRPVPVKKEALDLLSGNKKKASRA
CCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHH
ADAAPAPAAAPAAAEPAAPAPQEEPSADDKIINLKPPVSVSELAGMLKAKPFQIIKDLMG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHCCCHHHHHHHHHC
MGIFANPNTPLDADAVSSICDLHGYTFAREKREKGGGVKAQQEPVKEPEPVPVVEEPKAT
CCEECCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE
LITRTPIITVMGHVDHGKTSLLDYIRKTRVAKGEAGGITQHIGAYTVDYNGSTLTFLDTP
EEECCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCHHHCCCEEEEECCCEEEEEECC
GHAIFTEMRARGADVTDIVVLVVAANDGIMPQTREAIAHSKAAGKTIIVAINKCDLPAAD
CHHHHHHHHHCCCCCEEEEEEEEECCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCC
PVKTKSGLMEEGLVPTDFGGDVECVEVSALTGAGIDDLLGLLVLQSEVLELQANPKANCR
CCCHHHCHHHCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCH
ASIIEARVEPGTGSSATAIVESGTIRVGMPFICGPYAGKVRALVNDHGERVKKVGPGMPV
HHHHHEECCCCCCCCCEEEEECCEEEECCCCEECCCCHHHHHHHHHHHHHHHHCCCCCCE
EITGFSETPNVGDELVEMENERAAKKLGEERQEELRKQRLAQPRKARMEELLAMMGDGTQ
EEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCC
KAQLKILLKGDVQGSVEAIRKAVLDIQSDKVECVFLNASAGPISESDVLLASSSDAVILG
CEEEEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCEEEECCCCEEEEE
FNVKVEANAVKLLKREGVQVKLYSIVYELIDQVRDAMLGLLEPETRETIIGHAKVLQVFK
EEEEECHHHHHHHHHCCCEEEHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
LNKGRAAGCMVEDGKILRSCEARVIRDKTPVFDGKMSTLRRFQDEVEEVKAGLECGIRLG
CCCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
DFNEYETGDIIECYTLEKIQQTL
CCCCCCCCCEEEEHHHHHHHHCC
>Mature Secondary Structure
PNKQEEKEPKKEVLDLIGGSSKKKRAPQPAPAPAPSRPVPVKKEALDLLSGNKKKASRA
CCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHH
ADAAPAPAAAPAAAEPAAPAPQEEPSADDKIINLKPPVSVSELAGMLKAKPFQIIKDLMG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHCCCHHHHHHHHHC
MGIFANPNTPLDADAVSSICDLHGYTFAREKREKGGGVKAQQEPVKEPEPVPVVEEPKAT
CCEECCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE
LITRTPIITVMGHVDHGKTSLLDYIRKTRVAKGEAGGITQHIGAYTVDYNGSTLTFLDTP
EEECCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCHHHCCCEEEEECCCEEEEEECC
GHAIFTEMRARGADVTDIVVLVVAANDGIMPQTREAIAHSKAAGKTIIVAINKCDLPAAD
CHHHHHHHHHCCCCCEEEEEEEEECCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCC
PVKTKSGLMEEGLVPTDFGGDVECVEVSALTGAGIDDLLGLLVLQSEVLELQANPKANCR
CCCHHHCHHHCCCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCH
ASIIEARVEPGTGSSATAIVESGTIRVGMPFICGPYAGKVRALVNDHGERVKKVGPGMPV
HHHHHEECCCCCCCCCEEEEECCEEEECCCCEECCCCHHHHHHHHHHHHHHHHCCCCCCE
EITGFSETPNVGDELVEMENERAAKKLGEERQEELRKQRLAQPRKARMEELLAMMGDGTQ
EEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCC
KAQLKILLKGDVQGSVEAIRKAVLDIQSDKVECVFLNASAGPISESDVLLASSSDAVILG
CEEEEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCEEEECCCCEEEEE
FNVKVEANAVKLLKREGVQVKLYSIVYELIDQVRDAMLGLLEPETRETIIGHAKVLQVFK
EEEEECHHHHHHHHHCCCEEEHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
LNKGRAAGCMVEDGKILRSCEARVIRDKTPVFDGKMSTLRRFQDEVEEVKAGLECGIRLG
CCCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
DFNEYETGDIIECYTLEKIQQTL
CCCCCCCCCEEEEHHHHHHHHCC
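
The prediction lines above pair each residue with a one-letter state. Assuming the common three-state convention (H for helix, E for strand, C for coil), which this record does not state explicitly, the composition of a prediction can be summarised as in the Python sketch below. The function name is illustrative.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty prediction")
    counts = Counter(s)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state symbols: {sorted(unknown)}")
    return {state: round(100.0 * counts[state] / len(s), 1) for state in "HEC"}


# Toy prediction string: 4 H, 2 E and 2 C out of 8 positions.
print(state_fractions("CCHHHHEE"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Concatenating the state lines of the translated prediction and passing them to this function gives an overall helix, strand and coil breakdown, which can be compared with the "Alpha Beta" structure class reported below.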

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; In phosphorus-containing anhydrides [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11997336 [H]