Definition Xanthobacter autotrophicus Py2 chromosome, complete genome.
Accession NC_009720
Length 5,308,934

The map label for this gene is 154247631

Identifier: 154247631

GI number: 154247631

Start: 4110782

End: 4112653

Strand: Direct

Name: 154247631

Synonym: Xaut_3706

Alternate gene names: NA

Gene position: 4110782-4112653 (Clockwise)

Preceding gene: 154247630

Following gene: 154247632

Centisome position: 77.43
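
The gene length and centisome position can be derived from the coordinates above. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the chromosome length. That definition is an assumption, not something this record states, although it reproduces the listed value. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


length = gene_length(4110782, 4112653)
print(length)                        # 1872
print(length // 3 - 1)               # 623 residues after dropping the stop codon
print(centisome(4110782, 5308934))   # 77.43
```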

GC content: 65.54
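
A GC content figure of this kind is usually the percentage of G and C bases in the sequence. The sketch below is a minimal illustration under that assumption. The function name and the two-decimal rounding are choices made here, not details taken from this record. The usage line runs on the first 12 bases of the gene sequence only, so its output is not the full-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 12 bases of the gene sequence: 8 of 12 are G or C.
print(gc_content("ATGGCGAAGCCG"))  # 66.67
```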

Gene sequence:

>1872_bases
ATGGCGAAGCCGACCAAGAAGTACCTGCTCGAGCTGGTGGCGCAGCACGGGCCCGAGATCGCAAAGGCGTTCAAGGCATC
CATCACGTCGATCAAGGGCTCGGCGCAGGTCGGCGCCCTGACCACGATGTTGGAGAAGGGAGACATCAACGGCGCCCTGC
TGGCCATCGGTCTCGATCCGGCGGCGTTCCGGAAGCTGGACCAGTCGATTTCCGATGCCTTTGAGGATGGGGGTGGGAAG
ACGGCCCAGACCATCCCGCCGGTGAAAACCCCGGACGGGCTGCGCATGGTGGTGCAGTTCAATGCCCGGAACGACCGCGC
CGAGCAGTGGCTGAAGGACCAATCCTCGACGTTGATCCGCGAGATCATTGAAGACCAGCGGCAGGTGATCCGGGCCGTTC
TCGCCGACGGCATGGCCCGCGGGCTTAACCCGCGCGACGTTGCCCTGGACCTGGTCGGGCGTCTCGACCCCGTGACGAGG
ACGCGCCAGGGCGGCGTGATCGGGCTGACCTCACAGCAACTGGATTATGTCCAGCGCTACCGGTCCGAACTGCTTTCCGG
AGATCCGGCCCAGCTTCGCAAAGCGCTGGCGCGGGTGCGGCGCGACCGCAAGTTCGACGGTGCGATCCGCAAGGCCATCG
CCGCCGGCAAGCCGATCGACGCCGCCGCGGCGGAGAAGATGGTCGGCCGATATTCCGACCGGCTGCTGCTCCTGCGCGGC
GAGACCATCGGCCGGACCGAGGCAATGACGGCCCTGCACCAGTCGCAGGAGGAGGCATACCAGCAGGCGATCGACAAGGG
AGCCCTGCAGAGCAACCACGTCCGCTATTTCTGGCGAACGGCGGCGGACGAGCGGGTGCGCCACTCGCACGCGATGATCC
CCGGCATGAACGAGAAGGGGGCACCGCACGGCACGCCGTTCTCGTCCCCGCTGGGCCCGATCCGCTTTCCCGGCGATCCC
AACGCCTCTGCGGCCAACAGGATCGGATGCCGCTGCTGGCGCGAGGCAAAGGTTGACTTCCTCGCCCAGGCCGTGGAGGC
AGAGCAGAAGCCGAAGCCAGCACCTAAGCCGAAGGCGCCGCCCAAACCGAAGAAGCCGCTGCCACCGGAACCGGTCCTGC
TGCCTCAGCGCTATGGCCCGGACGGGGCGGCGTGGCCCCGTGTGCCGTCGGAACTTGCAGATCGCGTCGCTGATGAACTG
CTCACTCCGACCCATCCGGCTTGGCGCAAGGAGATCGAGACCATCCTCAAGAGTGCGCCCGCGCCGTACAAGCGGCTTTC
GATTGGGCAGGCTGGGCTCATCAACTGGTACACCGGCAGTGGCTACCGCCGGCTGAACAAGGATCTGCGGGAAGGGACGG
GGAATTGGCTCACCCCGATGATCTCCGATGCGCTGAATGGCGCGCTGGACGCCGTCACCCGGAAGGCAACCGGCCTCGTG
ACCAGAGGTCTCAACGTCTACAGCGATGCCGAAATCACGTCGACCTATGCTGTCGGCGCCGCTGTCGAGTTCCCGGCCTT
CACGTCGACGGGCCGAGGGTTTTCCTTCGGTGGCAATGTCCGGATGATCATCGATGGATCGTCCGGCGTGGACGTGAAGC
CCCTGTCTCGCTTTCGTAATGAGAACGAGGTGTTGTTCAAGGCGGGCACGCGGTTTATCGTGACCGGCATGGCAAAGCGC
GGGAACGTCTACGAGATCACGCTCAAGGAGGTCGCGGACGACAAAAAGTCCGACGATCGCGTCATGCTGGGCGGAGGGTT
CTCGGACGAGGCGAAGCGGATCTTCGATCAGTGGCGTGAGCCGCAATCGGACGCGGAACACCGACTGGCGGACAAGATGC
AGTCTGGTGGCGCCGGGCGGCGCATTCGCTGA

Upstream 100 bases:

>100_bases
CGGCTGAGCGCGATCTGCGCGCCGCCGAGGAAGCCGGCGCGTCCTTGGGCGACACCGGGCAACCGGCGGCCGATACCCAG
GCCGATCCGGCCGCTCCCTG

Downstream 100 bases:

>100_bases
CTGGACCGGAAATCGCTGAGAAATGGCCCGCTTCGGCGGGCTTTTTCATTGGGGAGGCGGGTGTGGCTGACCAGAACCTC
AGCTTCGCAGCCCAGGTCGA

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 623; Mature: 622

Protein sequence:

>623_residues
MAKPTKKYLLELVAQHGPEIAKAFKASITSIKGSAQVGALTTMLEKGDINGALLAIGLDPAAFRKLDQSISDAFEDGGGK
TAQTIPPVKTPDGLRMVVQFNARNDRAEQWLKDQSSTLIREIIEDQRQVIRAVLADGMARGLNPRDVALDLVGRLDPVTR
TRQGGVIGLTSQQLDYVQRYRSELLSGDPAQLRKALARVRRDRKFDGAIRKAIAAGKPIDAAAAEKMVGRYSDRLLLLRG
ETIGRTEAMTALHQSQEEAYQQAIDKGALQSNHVRYFWRTAADERVRHSHAMIPGMNEKGAPHGTPFSSPLGPIRFPGDP
NASAANRIGCRCWREAKVDFLAQAVEAEQKPKPAPKPKAPPKPKKPLPPEPVLLPQRYGPDGAAWPRVPSELADRVADEL
LTPTHPAWRKEIETILKSAPAPYKRLSIGQAGLINWYTGSGYRRLNKDLREGTGNWLTPMISDALNGALDAVTRKATGLV
TRGLNVYSDAEITSTYAVGAAVEFPAFTSTGRGFSFGGNVRMIIDGSSGVDVKPLSRFRNENEVLFKAGTRFIVTGMAKR
GNVYEITLKEVADDKKSDDRVMLGGGFSDEAKRIFDQWREPQSDAEHRLADKMQSGGAGRRIR

Sequences:

>Translated_623_residues
MAKPTKKYLLELVAQHGPEIAKAFKASITSIKGSAQVGALTTMLEKGDINGALLAIGLDPAAFRKLDQSISDAFEDGGGK
TAQTIPPVKTPDGLRMVVQFNARNDRAEQWLKDQSSTLIREIIEDQRQVIRAVLADGMARGLNPRDVALDLVGRLDPVTR
TRQGGVIGLTSQQLDYVQRYRSELLSGDPAQLRKALARVRRDRKFDGAIRKAIAAGKPIDAAAAEKMVGRYSDRLLLLRG
ETIGRTEAMTALHQSQEEAYQQAIDKGALQSNHVRYFWRTAADERVRHSHAMIPGMNEKGAPHGTPFSSPLGPIRFPGDP
NASAANRIGCRCWREAKVDFLAQAVEAEQKPKPAPKPKAPPKPKKPLPPEPVLLPQRYGPDGAAWPRVPSELADRVADEL
LTPTHPAWRKEIETILKSAPAPYKRLSIGQAGLINWYTGSGYRRLNKDLREGTGNWLTPMISDALNGALDAVTRKATGLV
TRGLNVYSDAEITSTYAVGAAVEFPAFTSTGRGFSFGGNVRMIIDGSSGVDVKPLSRFRNENEVLFKAGTRFIVTGMAKR
GNVYEITLKEVADDKKSDDRVMLGGGFSDEAKRIFDQWREPQSDAEHRLADKMQSGGAGRRIR
>Mature_622_residues
AKPTKKYLLELVAQHGPEIAKAFKASITSIKGSAQVGALTTMLEKGDINGALLAIGLDPAAFRKLDQSISDAFEDGGGKT
AQTIPPVKTPDGLRMVVQFNARNDRAEQWLKDQSSTLIREIIEDQRQVIRAVLADGMARGLNPRDVALDLVGRLDPVTRT
RQGGVIGLTSQQLDYVQRYRSELLSGDPAQLRKALARVRRDRKFDGAIRKAIAAGKPIDAAAAEKMVGRYSDRLLLLRGE
TIGRTEAMTALHQSQEEAYQQAIDKGALQSNHVRYFWRTAADERVRHSHAMIPGMNEKGAPHGTPFSSPLGPIRFPGDPN
ASAANRIGCRCWREAKVDFLAQAVEAEQKPKPAPKPKAPPKPKKPLPPEPVLLPQRYGPDGAAWPRVPSELADRVADELL
TPTHPAWRKEIETILKSAPAPYKRLSIGQAGLINWYTGSGYRRLNKDLREGTGNWLTPMISDALNGALDAVTRKATGLVT
RGLNVYSDAEITSTYAVGAAVEFPAFTSTGRGFSFGGNVRMIIDGSSGVDVKPLSRFRNENEVLFKAGTRFIVTGMAKRG
NVYEITLKEVADDKKSDDRVMLGGGFSDEAKRIFDQWREPQSDAEHRLADKMQSGGAGRRIR

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 68195; Mature: 68063

Theoretical pI: Translated: 10.31; Mature: 10.31

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
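
Percentages like these can be computed by counting residues and dividing by the protein length. The sketch below assumes one-decimal rounding and takes the mature form to be the translated sequence without its initiator Met, as the 623 and 622 residue counts suggest. The function name is illustrative, and the usage lines run on the first 20 residues only rather than the full protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * hits / len(protein), 1)


# First 20 residues of the translated sequence, for illustration only.
fragment = "MAKPTKKYLLELVAQHGPEI"
print(residue_percent(fragment, "C"))       # 0.0
print(residue_percent(fragment, "M"))       # 5.0
print(residue_percent(fragment[1:], "M"))   # 0.0 once the initiator Met is removed
```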

Secondary structure:

>Translated Secondary Structure
MAKPTKKYLLELVAQHGPEIAKAFKASITSIKGSAQVGALTTMLEKGDINGALLAIGLDP
CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCH
AAFRKLDQSISDAFEDGGGKTAQTIPPVKTPDGLRMVVQFNARNDRAEQWLKDQSSTLIR
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHH
EIIEDQRQVIRAVLADGMARGLNPRDVALDLVGRLDPVTRTRQGGVIGLTSQQLDYVQRY
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHCCCCCEEEECHHHHHHHHHH
RSELLSGDPAQLRKALARVRRDRKFDGAIRKAIAAGKPIDAAAAEKMVGRYSDRLLLLRG
HHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEEC
ETIGRTEAMTALHQSQEEAYQQAIDKGALQSNHVRYFWRTAADERVRHSHAMIPGMNEKG
CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHCCCCCCCCCC
APHGTPFSSPLGPIRFPGDPNASAANRIGCRCWREAKVDFLAQAVEAEQKPKPAPKPKAP
CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
PKPKKPLPPEPVLLPQRYGPDGAAWPRVPSELADRVADELLTPTHPAWRKEIETILKSAP
CCCCCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC
APYKRLSIGQAGLINWYTGSGYRRLNKDLREGTGNWLTPMISDALNGALDAVTRKATGLV
CCHHHHCCCCCCEEEEECCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
TRGLNVYSDAEITSTYAVGAAVEFPAFTSTGRGFSFGGNVRMIIDGSSGVDVKPLSRFRN
HCCCCCCCCCHHHHHHHHCCEEECCCCCCCCCCEEECCCEEEEEECCCCCCCHHHHHHCC
ENEVLFKAGTRFIVTGMAKRGNVYEITLKEVADDKKSDDRVMLGGGFSDEAKRIFDQWRE
CCCEEEECCCEEEEEECCCCCCEEEEEHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCC
PQSDAEHRLADKMQSGGAGRRIR
CCHHHHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
AKPTKKYLLELVAQHGPEIAKAFKASITSIKGSAQVGALTTMLEKGDINGALLAIGLDP
CCCHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCH
AAFRKLDQSISDAFEDGGGKTAQTIPPVKTPDGLRMVVQFNARNDRAEQWLKDQSSTLIR
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHH
EIIEDQRQVIRAVLADGMARGLNPRDVALDLVGRLDPVTRTRQGGVIGLTSQQLDYVQRY
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHCCCCCEEEECHHHHHHHHHH
RSELLSGDPAQLRKALARVRRDRKFDGAIRKAIAAGKPIDAAAAEKMVGRYSDRLLLLRG
HHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEEC
ETIGRTEAMTALHQSQEEAYQQAIDKGALQSNHVRYFWRTAADERVRHSHAMIPGMNEKG
CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHCCCCCCCCCC
APHGTPFSSPLGPIRFPGDPNASAANRIGCRCWREAKVDFLAQAVEAEQKPKPAPKPKAP
CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
PKPKKPLPPEPVLLPQRYGPDGAAWPRVPSELADRVADELLTPTHPAWRKEIETILKSAP
CCCCCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC
APYKRLSIGQAGLINWYTGSGYRRLNKDLREGTGNWLTPMISDALNGALDAVTRKATGLV
CCHHHHCCCCCCEEEEECCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
TRGLNVYSDAEITSTYAVGAAVEFPAFTSTGRGFSFGGNVRMIIDGSSGVDVKPLSRFRN
HCCCCCCCCCHHHHHHHHCCEEECCCCCCCCCCEEECCCEEEEEECCCCCCCHHHHHHCC
ENEVLFKAGTRFIVTGMAKRGNVYEITLKEVADDKKSDDRVMLGGGFSDEAKRIFDQWRE
CCCEEEECCCEEEEEECCCCCCEEEEEHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCC
PQSDAEHRLADKMQSGGAGRRIR
CCHHHHHHHHHHHHCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA