Definition Xanthobacter autotrophicus Py2 chromosome, complete genome.
Accession NC_009720
Length 5,308,934

Click here to switch to the map view.

The map label for this gene is aoxB [H]

Identifier: 154247873

GI number: 154247873

Start: 4371052

End: 4373514

Strand: Direct

Name: aoxB [H]

Synonym: Xaut_3950

Alternate gene names: 154247873

Gene position: 4371052-4373514 (Clockwise)

Preceding gene: 154247872

Following gene: 154247875

Centisome position: 82.33

GC content: 65.25

Gene sequence:

>2463_bases
ATGGCTTACAAGCGCCAGATCGGCCAATTGCCCATCATTCCCGCCAATGCGACCGTCCACAACGTCGTCTGCCACTATTG
CATCGTCGGCTGCGGCTACAAGGCCTATAGCTGGGACGCCCGATACGAGGGCGGCACGGCGCCGGGCGAGAACGCCTTCG
GCGTCGATCTCTCCAGGCAACAGCCGGCAGAGACCGCAGCCTGGTATGCGCCGTCCATGTACAACATCGTCAGGCAGAAC
GGGCGAGACGTGCACATCGTCATCAAGCCCGACAAGGCGTGCGTGGTGAATTCCGGCCTCGGCTCCGTGCGCGGCGCGCG
GATCGCCGAGATGAGCTATTCGCGCCAGCGCAACACCCAGCTCCAGCGCCTCACCGATCCCCAGGTCTGGCGCTACGGAC
AGCTCCAGCCCACGAGCTGGGACGACGCCCTCGACCTCGTCGCCCGCGTCACCGCAGCGGTGATCGCTGAGCAGGGCGAG
GACGGCCTGTTCGTGTCCGCCTTCGACCACGGCGGCGCGGGCGGCGGATACGAGAACACCTGGGGCACCGGAAAGCTCTA
TTTCGGCGCCATGAAGGTAAAGAACATCCGCATCCACAATCGCCCGGCCTACAATTCGGAGGTCCACGGCTCGCGCGACA
TGGGGGTGGGCGAGCTCAACAATTGCTACGAGGACGCGGAGCTCGCCGACACCCTCGTAGCGGTGGGCACGAACGCGCTG
GAAACCCAGACCAACTATTTCCTGAACCACTGGGTGCCCAACCTGCGCGGAACCTCCCTCGACAAGAAGAAGGCGGAGTT
CGGCAGCGAGCCGGTGGCCAAGGCGCGCATCGTCATCGTCGATCCGCGCCGCACCGTCACCGTGAATGCCAGCGAGGTGG
AGGCGGGCAAGGAGAACGTGCTCCATCTCGCCCTCAATTCCGGCACCGACCTGATCCTGTTCAACGCCTGGCTCACCTAC
GCGGCGGAGAAGGGGTGGATCGACAAGGGTTTCATCGCGGCCTCCACGAAGGACTTCGACAAGGCCCTGGCGGCCAACAA
GGTGAGCGTGGCGGAAGCCGCTCGCGCCACCGGCCTCAGCGAGGCGGACATCGTCAAGGCAGTCACCTGGATCGGCGAGC
CCAAGGCCGGCGGGGCACGTCGGCGGACCATGTTCGCCTACGAGAAGGGCCTCATCTGGGGCAATGACAACTACCGCACC
AACCAGGCGCTGGTGAATCTCGCCCTCGCCACCGGCAATATCGGGCGTCCGGGCGGCGGCTGCGTGCGCATGGGCGGCCA
CCAGGAGGGCTATTGCCGCCCGTCCGACGCCCATGTGGGCCGGCCCGCCGCCTATGTGGACAAGCTGCTGATCGAGGGAA
AGGGCGGCGTGCACCACATCTGGGGCTGCGACCACTACAAGACCACGCTCAACGCCATGGCCTTCAAGCGCGCCTACAAG
ATGCGCACGGACCTCGTCAAGGACGCCATGGCCAGCGTGCCCTACGGCGACCGCGATGCCATGGTGGCCGCCATCTTGGG
CGCCATCCGTAAGGGCGGGCTGTTCAGCGTCGACGTGGATATCGTGCCGACCCATATCGGCGAGGCCGCCCATGTGATGC
TGCCCGCGGCCACCTCCGGCGAGATGAACCTCACCTCCATGAACGGCGAGCGCCGCATGCGCCTCACCGAGCGCTACATG
GACCCACCCGGGCAGGCCATGCCCGACTGCCTCATCGCCGCGCGGATCGCCAACCACATGGAGCGCGTTCTGCGCGCCAC
GGGGAAGGCGGAGGCGGCGGACAAGTTCAAGGGCTTCGACTGGAAGAGCGAGGAAGACGCCTTCATGGACGGCTACGCCA
AGAACGAAAAGGGCGGCGCGTTCGTCACCTACGATCGCCTGCGGACCATGGGCACTAACGGCTTCCAGGAGCCCGCCACG
GCCTTCGCAGACGGCAAGATCGTCGGCACCAAGCGGCTGTTCGCCGATGGCAAGTTCAACAAGCCGGACGGCAAGGCGGT
GTTCGCCGAAACCAGATGGCGCGGGCTCCAGGCCCCCGGCAAGCAGGCGGAGAAGGACAAGTTCGCCTTCCTCATCAATA
ACGGGCGGGCGAACCTCGTCTGGCAGAGCGCCTATCTTGACGTGGAGAATGAGCTGGTCATGGATCGCTGGCCCTATCCC
TTCATCGAGATGAACCCGCAGGACATGGCCGAGCTTGGCCTCAAGAGCGGAGATCTTGTGGAGGTCTACAACGAGAACGG
CTCCACCCAGGCCATGGCCTATCCCACCCCCACGGCGAAGCGGAAGGAGACTTTCATGCTGTTCGGCTTCCCGACCGGCG
TGCAGGGCAATGTGGTGTCCGCGGGGGTCAACGAGGACATCATCCCCAACTACAAGCAGACGTGGGGCAACATCCGAAAG
ATCGCCGACGCACCCGAAGGCGTGCGGCACCTGACCTTCAAGTCGAAGGAATATCCCGCCTGA

Upstream 100 bases:

>100_bases
AATACGCGCTGCGTGTGGACGCGAAGGGCGACATCTATGCCGAAGGCGTGGATGAGCTGCTCTACGGCCGCCTCTCCAAC
GTGCTCTGAGGGAGGACACC

Downstream 100 bases:

>100_bases
CGGCGGGTACCATCGGCCGGCGCCATGGCGCCGGCCGAGACCGCACCCTCATCCGTGGCGCCACACTTTCACCGATGACA
CGACGAGGATCAGCGCCAGG

Product: arsenite oxidase large subunit

Products: NA

Alternate protein names: AOI [H]

Number of amino acids: Translated: 820; Mature: 819

Protein sequence:

>820_residues
MAYKRQIGQLPIIPANATVHNVVCHYCIVGCGYKAYSWDARYEGGTAPGENAFGVDLSRQQPAETAAWYAPSMYNIVRQN
GRDVHIVIKPDKACVVNSGLGSVRGARIAEMSYSRQRNTQLQRLTDPQVWRYGQLQPTSWDDALDLVARVTAAVIAEQGE
DGLFVSAFDHGGAGGGYENTWGTGKLYFGAMKVKNIRIHNRPAYNSEVHGSRDMGVGELNNCYEDAELADTLVAVGTNAL
ETQTNYFLNHWVPNLRGTSLDKKKAEFGSEPVAKARIVIVDPRRTVTVNASEVEAGKENVLHLALNSGTDLILFNAWLTY
AAEKGWIDKGFIAASTKDFDKALAANKVSVAEAARATGLSEADIVKAVTWIGEPKAGGARRRTMFAYEKGLIWGNDNYRT
NQALVNLALATGNIGRPGGGCVRMGGHQEGYCRPSDAHVGRPAAYVDKLLIEGKGGVHHIWGCDHYKTTLNAMAFKRAYK
MRTDLVKDAMASVPYGDRDAMVAAILGAIRKGGLFSVDVDIVPTHIGEAAHVMLPAATSGEMNLTSMNGERRMRLTERYM
DPPGQAMPDCLIAARIANHMERVLRATGKAEAADKFKGFDWKSEEDAFMDGYAKNEKGGAFVTYDRLRTMGTNGFQEPAT
AFADGKIVGTKRLFADGKFNKPDGKAVFAETRWRGLQAPGKQAEKDKFAFLINNGRANLVWQSAYLDVENELVMDRWPYP
FIEMNPQDMAELGLKSGDLVEVYNENGSTQAMAYPTPTAKRKETFMLFGFPTGVQGNVVSAGVNEDIIPNYKQTWGNIRK
IADAPEGVRHLTFKSKEYPA

Sequences:

>Translated_820_residues
MAYKRQIGQLPIIPANATVHNVVCHYCIVGCGYKAYSWDARYEGGTAPGENAFGVDLSRQQPAETAAWYAPSMYNIVRQN
GRDVHIVIKPDKACVVNSGLGSVRGARIAEMSYSRQRNTQLQRLTDPQVWRYGQLQPTSWDDALDLVARVTAAVIAEQGE
DGLFVSAFDHGGAGGGYENTWGTGKLYFGAMKVKNIRIHNRPAYNSEVHGSRDMGVGELNNCYEDAELADTLVAVGTNAL
ETQTNYFLNHWVPNLRGTSLDKKKAEFGSEPVAKARIVIVDPRRTVTVNASEVEAGKENVLHLALNSGTDLILFNAWLTY
AAEKGWIDKGFIAASTKDFDKALAANKVSVAEAARATGLSEADIVKAVTWIGEPKAGGARRRTMFAYEKGLIWGNDNYRT
NQALVNLALATGNIGRPGGGCVRMGGHQEGYCRPSDAHVGRPAAYVDKLLIEGKGGVHHIWGCDHYKTTLNAMAFKRAYK
MRTDLVKDAMASVPYGDRDAMVAAILGAIRKGGLFSVDVDIVPTHIGEAAHVMLPAATSGEMNLTSMNGERRMRLTERYM
DPPGQAMPDCLIAARIANHMERVLRATGKAEAADKFKGFDWKSEEDAFMDGYAKNEKGGAFVTYDRLRTMGTNGFQEPAT
AFADGKIVGTKRLFADGKFNKPDGKAVFAETRWRGLQAPGKQAEKDKFAFLINNGRANLVWQSAYLDVENELVMDRWPYP
FIEMNPQDMAELGLKSGDLVEVYNENGSTQAMAYPTPTAKRKETFMLFGFPTGVQGNVVSAGVNEDIIPNYKQTWGNIRK
IADAPEGVRHLTFKSKEYPA
>Mature_819_residues
AYKRQIGQLPIIPANATVHNVVCHYCIVGCGYKAYSWDARYEGGTAPGENAFGVDLSRQQPAETAAWYAPSMYNIVRQNG
RDVHIVIKPDKACVVNSGLGSVRGARIAEMSYSRQRNTQLQRLTDPQVWRYGQLQPTSWDDALDLVARVTAAVIAEQGED
GLFVSAFDHGGAGGGYENTWGTGKLYFGAMKVKNIRIHNRPAYNSEVHGSRDMGVGELNNCYEDAELADTLVAVGTNALE
TQTNYFLNHWVPNLRGTSLDKKKAEFGSEPVAKARIVIVDPRRTVTVNASEVEAGKENVLHLALNSGTDLILFNAWLTYA
AEKGWIDKGFIAASTKDFDKALAANKVSVAEAARATGLSEADIVKAVTWIGEPKAGGARRRTMFAYEKGLIWGNDNYRTN
QALVNLALATGNIGRPGGGCVRMGGHQEGYCRPSDAHVGRPAAYVDKLLIEGKGGVHHIWGCDHYKTTLNAMAFKRAYKM
RTDLVKDAMASVPYGDRDAMVAAILGAIRKGGLFSVDVDIVPTHIGEAAHVMLPAATSGEMNLTSMNGERRMRLTERYMD
PPGQAMPDCLIAARIANHMERVLRATGKAEAADKFKGFDWKSEEDAFMDGYAKNEKGGAFVTYDRLRTMGTNGFQEPATA
FADGKIVGTKRLFADGKFNKPDGKAVFAETRWRGLQAPGKQAEKDKFAFLINNGRANLVWQSAYLDVENELVMDRWPYPF
IEMNPQDMAELGLKSGDLVEVYNENGSTQAMAYPTPTAKRKETFMLFGFPTGVQGNVVSAGVNEDIIPNYKQTWGNIRKI
ADAPEGVRHLTFKSKEYPA

Specific function: Involved in the detoxification of arsenic. Oxidizes As(III)O3(3-) (arsenite) to the somewhat less toxic As(V)O4(3-) (arsenate) [H]

COG id: COG0243

COG function: function code C; Anaerobic dehydrogenases, typically selenocysteine-containing

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the prokaryotic molybdopterin-containing oxidoreductase family [H]

Homologues:

Organism=Escherichia coli, GI3868721, Length=703, Percent_Identity=22.3328591749644, Blast_Score=103, Evalue=4e-23,
Organism=Escherichia coli, GI1788534, Length=522, Percent_Identity=24.7126436781609, Blast_Score=86, Evalue=7e-18,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014066
- InterPro:   IPR009010
- InterPro:   IPR006657
- InterPro:   IPR006656 [H]

Pfam domain/function: PF00384 Molybdopterin; PF01568 Molydop_binding [H]

EC number: =1.20.98.1 [H]

Molecular weight: Translated: 89867; Mature: 89735

Theoretical pI: Translated: 8.28; Mature: 8.28

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MAYKRQIGQLPIIPANATVHNVVCHYCIVGCGYKAYSWDARYEGGTAPGENAFGVDLSRQ
CCCHHHHCCCCEEECCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCC
QPAETAAWYAPSMYNIVRQNGRDVHIVIKPDKACVVNSGLGSVRGARIAEMSYSRQRNTQ
CCCHHHHHCCCHHHHHHHCCCCEEEEEECCCCEEEEECCCCCCCCCEEHHHHHHHHHCCH
LQRLTDPQVWRYGQLQPTSWDDALDLVARVTAAVIAEQGEDGLFVSAFDHGGAGGGYENT
HHHCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCC
WGTGKLYFGAMKVKNIRIHNRPAYNSEVHGSRDMGVGELNNCYEDAELADTLVAVGTNAL
CCCCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHH
ETQTNYFLNHWVPNLRGTSLDKKKAEFGSEPVAKARIVIVDPRRTVTVNASEVEAGKENV
HHHHHHHHHHCCCCCCCCCCCHHHHHCCCCCCCCEEEEEECCCEEEEEEHHHHCCCCCCE
LHLALNSGTDLILFNAWLTYAAEKGWIDKGFIAASTKDFDKALAANKVSVAEAARATGLS
EEEEECCCCCEEEEEHHHHHHHCCCCCCCCEEEECCCHHHHHHHCCCHHHHHHHHHCCCC
EADIVKAVTWIGEPKAGGARRRTMFAYEKGLIWGNDNYRTNQALVNLALATGNIGRPGGG
HHHHHHHHHHCCCCCCCCCHHHEEEEEECCEEECCCCCCCCCEEEEEEEECCCCCCCCCC
CVRMGGHQEGYCRPSDAHVGRPAAYVDKLLIEGKGGVHHIWGCDHYKTTLNAMAFKRAYK
EEEECCCCCCCCCCCCCCCCCCHHHHHHHHEECCCCEEEEECCCHHHHHHHHHHHHHHHH
MRTDLVKDAMASVPYGDRDAMVAAILGAIRKGGLFSVDVDIVPTHIGEAAHVMLPAATSG
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCEEEEEECCCCC
EMNLTSMNGERRMRLTERYMDPPGQAMPDCLIAARIANHMERVLRATGKAEAADKFKGFD
CEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCC
WKSEEDAFMDGYAKNEKGGAFVTYDRLRTMGTNGFQEPATAFADGKIVGTKRLFADGKFN
CCCCCCHHHCCCCCCCCCCEEEEHHHHHHCCCCCCCCCHHHHCCCEEEECEEEEECCCCC
KPDGKAVFAETRWRGLQAPGKQAEKDKFAFLINNGRANLVWQSAYLDVENELVMDRWPYP
CCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEEEECCCCCEEECCCCCC
FIEMNPQDMAELGLKSGDLVEVYNENGSTQAMAYPTPTAKRKETFMLFGFPTGVQGNVVS
EEEECHHHHHHHCCCCCCEEEEECCCCCEEEEEECCCCCCCCCEEEEEECCCCCCCCEEE
AGVNEDIIPNYKQTWGNIRKIADAPEGVRHLTFKSKEYPA
CCCCCCCCCCHHHHHHHHHHHHCCCCCHHEEEECCCCCCC
>Mature Secondary Structure 
AYKRQIGQLPIIPANATVHNVVCHYCIVGCGYKAYSWDARYEGGTAPGENAFGVDLSRQ
CCHHHHCCCCEEECCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCC
QPAETAAWYAPSMYNIVRQNGRDVHIVIKPDKACVVNSGLGSVRGARIAEMSYSRQRNTQ
CCCHHHHHCCCHHHHHHHCCCCEEEEEECCCCEEEEECCCCCCCCCEEHHHHHHHHHCCH
LQRLTDPQVWRYGQLQPTSWDDALDLVARVTAAVIAEQGEDGLFVSAFDHGGAGGGYENT
HHHCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCC
WGTGKLYFGAMKVKNIRIHNRPAYNSEVHGSRDMGVGELNNCYEDAELADTLVAVGTNAL
CCCCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHH
ETQTNYFLNHWVPNLRGTSLDKKKAEFGSEPVAKARIVIVDPRRTVTVNASEVEAGKENV
HHHHHHHHHHCCCCCCCCCCCHHHHHCCCCCCCCEEEEEECCCEEEEEEHHHHCCCCCCE
LHLALNSGTDLILFNAWLTYAAEKGWIDKGFIAASTKDFDKALAANKVSVAEAARATGLS
EEEEECCCCCEEEEEHHHHHHHCCCCCCCCEEEECCCHHHHHHHCCCHHHHHHHHHCCCC
EADIVKAVTWIGEPKAGGARRRTMFAYEKGLIWGNDNYRTNQALVNLALATGNIGRPGGG
HHHHHHHHHHCCCCCCCCCHHHEEEEEECCEEECCCCCCCCCEEEEEEEECCCCCCCCCC
CVRMGGHQEGYCRPSDAHVGRPAAYVDKLLIEGKGGVHHIWGCDHYKTTLNAMAFKRAYK
EEEECCCCCCCCCCCCCCCCCCHHHHHHHHEECCCCEEEEECCCHHHHHHHHHHHHHHHH
MRTDLVKDAMASVPYGDRDAMVAAILGAIRKGGLFSVDVDIVPTHIGEAAHVMLPAATSG
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCEEEEEECCCCC
EMNLTSMNGERRMRLTERYMDPPGQAMPDCLIAARIANHMERVLRATGKAEAADKFKGFD
CEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCC
WKSEEDAFMDGYAKNEKGGAFVTYDRLRTMGTNGFQEPATAFADGKIVGTKRLFADGKFN
CCCCCCHHHCCCCCCCCCCEEEEHHHHHHCCCCCCCCCHHHHCCCEEEECEEEEECCCCC
KPDGKAVFAETRWRGLQAPGKQAEKDKFAFLINNGRANLVWQSAYLDVENELVMDRWPYP
CCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEEEECCCCCEEECCCCCC
FIEMNPQDMAELGLKSGDLVEVYNENGSTQAMAYPTPTAKRKETFMLFGFPTGVQGNVVS
EEEECHHHHHHHCCCCCCEEEEECCCCCEEEEEECCCCCCCCCEEEEEECCCCCCCCEEE
AGVNEDIIPNYKQTWGNIRKIADAPEGVRHLTFKSKEYPA
CCCCCCCCCCHHHHHHHHHHHHCCCCCHHEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11250197; 1331097 [H]