Definition Helicobacter pylori HPAG1 chromosome, complete genome.
Accession NC_008086
Length 1,596,366

The map label for this gene is pabB [H]

Identifier: 108562720

GI number: 108562720

Start: 306666

End: 308345

Strand: Direct

Name: pabB [H]

Synonym: HPAG1_0295

Alternate gene names: 108562720

Gene position: 306666-308345 (Clockwise)

Preceding gene: 108562719

Following gene: 108562721

Centisome position: 19.21
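
The gene length and centisome position can be rechecked from the coordinates and the chromosome length given above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / chromosome_length


# Values taken from this record: Start, End and chromosome Length.
length = gene_length(306666, 308345)
position = centisome(306666, 1596366)
print(length, round(position, 2))
```

With the values of this record the sketch gives 1680 bases and a position of about 19.21, which agrees with the sequence header and the centisome field.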

GC content: 36.67
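
A GC content value of this kind can be recomputed from the gene sequence listed in this record. The sketch below assumes the value is the percentage of G and C among the unambiguous bases; the function name is hypothetical, and the example call uses only a short toy fragment rather than the full 1680-base sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A, C, G and T bases of a sequence."""
    # Ignore line breaks and any non-ACGT symbols (an assumption of this sketch).
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input; in practice pass the full gene sequence with line breaks included.
print(round(gc_content("ATGATTTTTGGGG"), 2))
```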

Gene sequence:

>1680_bases
ATGATTTTTGGGGATTTTAAATATCAAAAAAGCGTTAAAAAACTCACCGCCACCAATCTTAATGAGCTGAAAAACGCCCT
AGATTTCATCTCTCAAAATAGGGGGAATGGGTATTTTGTGGGGTATCTTTTATATGAAGCGCGTTTAGCGTTTCTAGATG
AAACTTTTCAAAGCCAAACCCCTTTTTTGTATTTTGAACAATTTTTAGAAAGAAAAAAATATTCTTTAGAGCCTTTAAAA
GAGCATGCGTTTTACCCTAAAATCCATAGTTCTTTAGATCAAAAAACTTATTTCAAGCAGTTTAAAGCCGTTAAAGAGCG
TCTCAAAAACGGCGATACCTATCAAGTGAATCTCACGATGGATTTATTTTTAGACACTAAAGCCAAACTAGAGCGTGTTT
TTAAGGAAGTGGTACACAACCAAAACACGCCTTTTAAGGCTTTGATAGAAAACGAGTTTGGGAGCGTTTTAAGCTTTTCG
CCGGAATTGTTTTTTGAATTAGAGTTTTTAGATACAGCGATTAAGATTATTACAAAACCCATGAAAGGCACGATCGCTCG
CTCAAAAAACCCCTTAATAGATGAAAAAAACCGATTGTTTTTGCAAAATGATGACAAAAATAGAAGTGAAAATGTGATGA
TTGTAGATTTACTGCGTAACGATTTGAGCCGTTTAGCCTTAAAAAATAGCGTGAAAGTCAATCAATTGTTTGAAGTCATC
AGCTTGCCTAGCGTGTATCAAATGATAAGCGAGATTGAAGCGAAATTGCCCCTAAAAACCAGTTTGTTTGAGATTTTTAA
GGCGTTGTTCCCTTGCGGCTCTGTGACCGGATGCCCTAAAATAAAAACCATGCAAATCATTGAAAGTTTAGAAAAACGCC
CTAGGGGGGTGTATTGCGGGGCGATAGGCATGGTTGAAGAAAAAAAAGCCCTTTTTAGCGTGCCTATCCGCACTTTAGAA
AAAAGAGCGCATGAAGATTTTTTGCATTTAGGGGTAGGGAGTGGGGTAACTTATAAAAGTAAAGCGCCAAAAGAATATGA
AGAGAGCTTTTTAAAATCCTTTTTTGTGATGCCCAAAATAGAATTTGAGATTGTAGAGACGATGAAAATTATCAAAAGGG
ATCAAAAATTAGAGATTAACAATAAAAACGCCCATAAAGAACGCTTAATGCATAGCGCCCAATACTTTAATTTCAAATGC
GATGAAAATCTTTTAGACTTTGAATTGGAAAAAGAAGGGGTTTTAAGGGTTTTACTCAATAAAAGGGGCAAGCTCATTAA
AGAATACAAAACCTTAGAGCCTTTAAAAAGCCTAGAAATCCGTTTGAGTGAAACCCCCATTGATAAACACAATGATTTTT
TATACCATAAGACCACTTATGCCCCTTTTTATCAAAACGCTCGAGCGCTCATTAAAAAAGGCGTTATTTTTGATGAAATC
TTTTATAACCAGGATCTGGAACTCACTGAGGGCGCTAGGAGCAATCTTGTTTTAGAAATCCATAACAGGCTTTTAACCCC
TTATTTTAGCGCGGGCGCGTTAAACGGGACGGGTGTTGTGGGGTTGTTAAAAAAGGGTCTTGTTGGGCATGCCCCTTTAA
AATTACACGATCTGCAAAAAGCGGCTAAAATCTATTGTATTAACGCGCTATATGGCTTAGTGGAAGTGAAAATCAAATAA

Upstream 100 bases:

>100_bases
CAGAGAAGAAGATCTAGGATTAGAGGGTTTGAGAAGGTCTAAAATGAGCTATAACCCAGTGTTTTTGATAGACAAATACG
AAGCCGTTGCTAAAAATTGA

Downstream 100 bases:

>100_bases
CCATAAAAATAGAGTAACTAAAACCTCATTTTTAGAAATAGGTTACCCAATAGAGCAAAAAAGTTAAAACTCGCCCACAA
TAATGATTAAAGTTTTCACA
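
The upstream and downstream regions above are the bases flanking the gene coordinates. As a rough sketch of how such flanks could be cut from a chromosome sequence held as a string, assuming 1-based inclusive coordinates, a gene on the direct strand, and no wrap-around at the chromosome origin (the function name and parameters are hypothetical):

```python
from typing import Tuple


def flanking(genome: str, start: int, end: int, size: int = 100) -> Tuple[str, str]:
    """Return (upstream, downstream) flanks of a direct-strand feature.

    Coordinates are 1-based and inclusive. Flanks are truncated at the
    ends of the sequence; circular wrap-around is not handled here.
    """
    upstream = genome[max(0, start - 1 - size):start - 1]
    downstream = genome[end:end + size]
    return upstream, downstream


# Toy chromosome: the feature CCC occupies positions 4-6.
print(flanking("AAACCCGGGTTT", 4, 6, size=2))
```

For a gene on the complementary strand the two flanks would swap roles and need reverse-complementing, which this sketch leaves out.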

Product: para-aminobenzoate synthetase

Products: NA

Alternate protein names: ADC synthase; Para-aminobenzoate synthase component I [H]

Number of amino acids: Translated: 559; Mature: 559
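
The residue count is consistent with the 1680-base gene sequence: 560 codons, of which the last is the TAA stop codon. A small sketch of that arithmetic, assuming the coding sequence length includes the stop codon (the function name is illustrative):

```python
def residues_from_cds(cds_length: int) -> int:
    """Number of encoded residues for a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


# Gene length taken from this record.
print(residues_from_cds(1680))
```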

Protein sequence:

>559_residues
MIFGDFKYQKSVKKLTATNLNELKNALDFISQNRGNGYFVGYLLYEARLAFLDETFQSQTPFLYFEQFLERKKYSLEPLK
EHAFYPKIHSSLDQKTYFKQFKAVKERLKNGDTYQVNLTMDLFLDTKAKLERVFKEVVHNQNTPFKALIENEFGSVLSFS
PELFFELEFLDTAIKIITKPMKGTIARSKNPLIDEKNRLFLQNDDKNRSENVMIVDLLRNDLSRLALKNSVKVNQLFEVI
SLPSVYQMISEIEAKLPLKTSLFEIFKALFPCGSVTGCPKIKTMQIIESLEKRPRGVYCGAIGMVEEKKALFSVPIRTLE
KRAHEDFLHLGVGSGVTYKSKAPKEYEESFLKSFFVMPKIEFEIVETMKIIKRDQKLEINNKNAHKERLMHSAQYFNFKC
DENLLDFELEKEGVLRVLLNKRGKLIKEYKTLEPLKSLEIRLSETPIDKHNDFLYHKTTYAPFYQNARALIKKGVIFDEI
FYNQDLELTEGARSNLVLEIHNRLLTPYFSAGALNGTGVVGLLKKGLVGHAPLKLHDLQKAAKIYCINALYGLVEVKIK

Sequences:

>Translated_559_residues
MIFGDFKYQKSVKKLTATNLNELKNALDFISQNRGNGYFVGYLLYEARLAFLDETFQSQTPFLYFEQFLERKKYSLEPLK
EHAFYPKIHSSLDQKTYFKQFKAVKERLKNGDTYQVNLTMDLFLDTKAKLERVFKEVVHNQNTPFKALIENEFGSVLSFS
PELFFELEFLDTAIKIITKPMKGTIARSKNPLIDEKNRLFLQNDDKNRSENVMIVDLLRNDLSRLALKNSVKVNQLFEVI
SLPSVYQMISEIEAKLPLKTSLFEIFKALFPCGSVTGCPKIKTMQIIESLEKRPRGVYCGAIGMVEEKKALFSVPIRTLE
KRAHEDFLHLGVGSGVTYKSKAPKEYEESFLKSFFVMPKIEFEIVETMKIIKRDQKLEINNKNAHKERLMHSAQYFNFKC
DENLLDFELEKEGVLRVLLNKRGKLIKEYKTLEPLKSLEIRLSETPIDKHNDFLYHKTTYAPFYQNARALIKKGVIFDEI
FYNQDLELTEGARSNLVLEIHNRLLTPYFSAGALNGTGVVGLLKKGLVGHAPLKLHDLQKAAKIYCINALYGLVEVKIK
>Mature_559_residues
MIFGDFKYQKSVKKLTATNLNELKNALDFISQNRGNGYFVGYLLYEARLAFLDETFQSQTPFLYFEQFLERKKYSLEPLK
EHAFYPKIHSSLDQKTYFKQFKAVKERLKNGDTYQVNLTMDLFLDTKAKLERVFKEVVHNQNTPFKALIENEFGSVLSFS
PELFFELEFLDTAIKIITKPMKGTIARSKNPLIDEKNRLFLQNDDKNRSENVMIVDLLRNDLSRLALKNSVKVNQLFEVI
SLPSVYQMISEIEAKLPLKTSLFEIFKALFPCGSVTGCPKIKTMQIIESLEKRPRGVYCGAIGMVEEKKALFSVPIRTLE
KRAHEDFLHLGVGSGVTYKSKAPKEYEESFLKSFFVMPKIEFEIVETMKIIKRDQKLEINNKNAHKERLMHSAQYFNFKC
DENLLDFELEKEGVLRVLLNKRGKLIKEYKTLEPLKSLEIRLSETPIDKHNDFLYHKTTYAPFYQNARALIKKGVIFDEI
FYNQDLELTEGARSNLVLEIHNRLLTPYFSAGALNGTGVVGLLKKGLVGHAPLKLHDLQKAAKIYCINALYGLVEVKIK

Specific function: Catalyzes the biosynthesis of 4-amino-4-deoxychorismate (ADC) from chorismate and glutamine [H]

COG id: COG0147

COG function: function code EH; Anthranilate/para-aminobenzoate synthases component I

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the anthranilate synthase component I family [H]

Homologues:

Organism=Escherichia coli, GI1788114, Length=295, Percent_Identity=35.2542372881356, Blast_Score=186, Evalue=4e-48
Organism=Escherichia coli, GI1787518, Length=266, Percent_Identity=25.187969924812, Blast_Score=81, Evalue=2e-16
Organism=Saccharomyces cerevisiae, GI6320935, Length=265, Percent_Identity=27.1698113207547, Blast_Score=114, Evalue=3e-26
Organism=Saccharomyces cerevisiae, GI6324361, Length=338, Percent_Identity=27.5147928994083, Blast_Score=96, Evalue=2e-20
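
The homologue lines above follow a regular comma-separated layout, so they can be read into records for filtering or sorting. The parser below is only a sketch based on the four lines shown here: it assumes fields are separated by commas, that the GI field is the one without an equals sign, and that organism names contain no commas. The function name is hypothetical.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    record = {}
    # Tolerate an optional trailing comma at the end of the line.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields; key names are those used in the record.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", float), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788114, Length=295, "
        "Percent_Identity=35.2542372881356, Blast_Score=186, Evalue=4e-48,")
print(parse_homologue(line))
```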

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005801
- InterPro:   IPR019999
- InterPro:   IPR006805
- InterPro:   IPR015890
- InterPro:   IPR005802 [H]

Pfam domain/function: PF04715 Anth_synt_I_N; PF00425 Chorismate_bind [H]

EC number: 2.6.1.85 [H]

Molecular weight: Translated: 64627; Mature: 64627

Theoretical pI: Translated: 9.62; Mature: 9.62

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
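
Percentages like these can be recomputed from the protein sequence by counting residues. The following sketch assumes each value is the share of sequence positions occupied by the named residues; the function name is illustrative, and the example uses a toy peptide rather than the 559-residue sequence of this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    # Remove line breaks and spaces so wrapped FASTA-style text can be passed in.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in seq if aa in wanted)
    return 100.0 * hits / len(seq)


# Toy peptide; C = cysteine, M = methionine in one-letter code.
toy = "MACM"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```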

Secondary structure:

>Translated Secondary Structure
MIFGDFKYQKSVKKLTATNLNELKNALDFISQNRGNGYFVGYLLYEARLAFLDETFQSQT
CCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHHHHHHHHHCCC
PFLYFEQFLERKKYSLEPLKEHAFYPKIHSSLDQKTYFKQFKAVKERLKNGDTYQVNLTM
CHHHHHHHHHHHCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEE
DLFLDTKAKLERVFKEVVHNQNTPFKALIENEFGSVLSFSPELFFELEFLDTAIKIITKP
EEEECCHHHHHHHHHHHHCCCCCCHHHHHHHHCCHHHCCCHHHEEEHHHHHHHHHHHHCC
MKGTIARSKNPLIDEKNRLFLQNDDKNRSENVMIVDLLRNDLSRLALKNSVKVNQLFEVI
CCCHHHCCCCCCCCCCCCEEEECCCCCCCCCEEEHHHHHHHHHHHHHHCCCCHHHHHHHH
SLPSVYQMISEIEAKLPLKTSLFEIFKALFPCGSVTGCPKIKTMQIIESLEKRPRGVYCG
CCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEE
AIGMVEEKKALFSVPIRTLEKRAHEDFLHLGVGSGVTYKSKAPKEYEESFLKSFFVMPKI
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCH
EFEIVETMKIIKRDQKLEINNKNAHKERLMHSAQYFNFKCDENLLDFELEKEGVLRVLLN
HHHHHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHH
KRGKLIKEYKTLEPLKSLEIRLSETPIDKHNDFLYHKTTYAPFYQNARALIKKGVIFDEI
HCCHHHHHHHHHCCHHHHHEEECCCCCCCCCCEEEEECCCCHHHHHHHHHHHCCCHHHHH
FYNQDLELTEGARSNLVLEIHNRLLTPYFSAGALNGTGVVGLLKKGLVGHAPLKLHDLQK
HCCCCCCCCCCCCCCEEHEEHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHH
AAKIYCINALYGLVEVKIK
HHHHHHHHHHHEEEEEEEC
>Mature Secondary Structure
MIFGDFKYQKSVKKLTATNLNELKNALDFISQNRGNGYFVGYLLYEARLAFLDETFQSQT
CCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHHHHHHHHHCCC
PFLYFEQFLERKKYSLEPLKEHAFYPKIHSSLDQKTYFKQFKAVKERLKNGDTYQVNLTM
CHHHHHHHHHHHCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEE
DLFLDTKAKLERVFKEVVHNQNTPFKALIENEFGSVLSFSPELFFELEFLDTAIKIITKP
EEEECCHHHHHHHHHHHHCCCCCCHHHHHHHHCCHHHCCCHHHEEEHHHHHHHHHHHHCC
MKGTIARSKNPLIDEKNRLFLQNDDKNRSENVMIVDLLRNDLSRLALKNSVKVNQLFEVI
CCCHHHCCCCCCCCCCCCEEEECCCCCCCCCEEEHHHHHHHHHHHHHHCCCCHHHHHHHH
SLPSVYQMISEIEAKLPLKTSLFEIFKALFPCGSVTGCPKIKTMQIIESLEKRPRGVYCG
CCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEE
AIGMVEEKKALFSVPIRTLEKRAHEDFLHLGVGSGVTYKSKAPKEYEESFLKSFFVMPKI
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCH
EFEIVETMKIIKRDQKLEINNKNAHKERLMHSAQYFNFKCDENLLDFELEKEGVLRVLLN
HHHHHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHH
KRGKLIKEYKTLEPLKSLEIRLSETPIDKHNDFLYHKTTYAPFYQNARALIKKGVIFDEI
HCCHHHHHHHHHCCHHHHHEEECCCCCCCCCCEEEEECCCCHHHHHHHHHHHCCCHHHHH
FYNQDLELTEGARSNLVLEIHNRLLTPYFSAGALNGTGVVGLLKKGLVGHAPLKLHDLQK
HCCCCCCCCCCCCCCEEHEEHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHH
AAKIYCINALYGLVEVKIK
HHHHHHHHHHHEEEEEEEC
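
The state strings above can be summarised as a composition per state. This sketch assumes the common convention that H, E and C stand for helix, strand and coil (the record itself does not define the symbols) and takes only the state lines, not the interleaved residue lines, as input. The function name is hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each state symbol in a secondary-structure string."""
    # Drop whitespace so several state lines can be joined and passed at once.
    symbols = "".join(states.split()).upper()
    if not symbols:
        raise ValueError("empty state string")
    counts = Counter(symbols)
    return {symbol: 100.0 * n / len(symbols) for symbol, n in counts.items()}


# Toy state string, not taken from the record.
print(ss_composition("HHEC"))
```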

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 6330050; 9097040; 9278503; 7896119; 2251281; 7592344 [H]