Definition Xanthobacter autotrophicus Py2 chromosome, complete genome.
Accession NC_009720
Length 5,308,934


The map label for this gene is 154247634

Identifier: 154247634

GI number: 154247634

Start: 4114573

End: 4115925

Strand: Direct

Name: 154247634

Synonym: Xaut_3709

Alternate gene names: NA

Gene position: 4114573-4115925 (Clockwise)

Preceding gene: 154247633

Following gene: 154247635

Centisome position: 77.5
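
The length and centisome fields can be cross-checked against the coordinates in this record. The sketch below is illustrative only: it assumes the centisome position is the gene start expressed as a percentage of the chromosome length, and that coordinates are 1-based and inclusive. The function and constant names are hypothetical and not part of the source database.

```python
# Values copied from this record.
GENOME_LENGTH = 5_308_934  # "Length" field for NC_009720
GENE_START = 4_114_573     # "Start" field
GENE_END = 4_115_925       # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))               # 1353
print(round(centisome(GENE_START, GENOME_LENGTH), 1))  # 77.5
```

Under these assumptions the results agree with the ">1353_bases" header and the centisome value listed above.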

GC content: 67.7
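
A GC content figure like this one can be recomputed from a nucleotide string. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the gene sequence. The helper name `gc_content` is hypothetical, and the demonstration uses only the first 30 bases of the gene sequence, so it does not reproduce the full-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 30 bases of the gene sequence in this record (demonstration only).
first_30 = "ATGGCTGGTGAAAACGACGTGCTGGAGCAG"
print(round(gc_content(first_30), 1))
```

To check the listed value, pass the full 1353-base sequence with line breaks removed.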

Gene sequence:

>1353_bases
ATGGCTGGTGAAAACGACGTGCTGGAGCAGATCCAGCGCGAGGTGAAGGGCTTTGGCGACAACGTCTCCGGGCTGAAGTC
CTCGATGGAGAAGGACCTCGCCGACGTGCGCAAGCTCGCCGAGGAGGCGAAGAAGTCCGCCGATCGTCCGGAGGTGAAGG
CGCAGATCGACGCGCTCACCACCTCCGTTTCCGAGAAGCACGCCGCGATCGAGAAGACCGTCGCCGAGCTGAAGGCCCAG
GCCGACCAGGTCGCGACCGCCATGCGCCGGGCGCCCCTCGGCGATCCCAAGGAGCAGGGCGAGGAGCAGAAGCACGCCCT
GGCCTATTTCGAGACCAAGGCGGCCGCCGCCGGCTCGCTGAAGGTGGGCAGCCCCATCGACCCGGCCGGCCTCGACATGG
ACGGCTATCGGGCGTGGACCAAGTCGTTCCCGCTCTACCTGCGCCGGGATGACAAGCCGATCGAGGCGAAGGCCCTCTCC
GTGGGCTCCGACCCGAACGGCGGCTATCTCGTGCCGACCGCGACCTCGGCGCGGATCCTGACCCGCATCTGGGAGACCTC
CCCGCTGCGCCAGCTTTCGACCGTCGAGACGATCGGGACCGACAAGATCGAAATCCCGATCGATGACGACGAGGCCTCGG
CGGGCTGGGTCGGGGAGACCGAGGGCCGACCGGAGACCGGAACCCCGGCGATCGGCGTCCAGACGATCCCGGTCTTCGAG
ATCTATGCGAAGCCCCGGGCGACGCAGTCGATGCTCGAGGATGCCTCGATCAACATCGAGGGATGGCTCGCCACCAAGAT
CTCGGACAAGTTCGCCCGGATCGAGGCCTCCGCCTTCATCGTCGGCAACGGGGTCAAGAAGCCCCGCGGCATCCTGACCT
ATCCGGCTGCGCCGGCCGGAACTTACGCGCGGGGCAAGATCCTGCAGGTGAACTCCGGTCATGCGACCAACATCACCGCC
GACGGCCTCGTGAACATGACCTTCTCGCTGAAGGAGGCCTACCTCGCCAACGGGTCCTGGCTGATGAAGCGTGGAACCGT
AGGCTCGGTGGCGCTGCTCAAGGATGCTCAGGGCCAGTACCTGTGGCGCCCGGGGCTGGAGGCGGGCAAGCCGTCCATCC
TGCTCGGCTATCCGGTGCGGCAGGCCGACGACATGCCGGTGGTGGGCGCCGGCGCGCTCCCGATCGCCTTCGGTGACTTC
CGGGCCGGCTACACCGTCGTGGACCGTCTCGGCATCACCACCCTGCGCGACCCCTACAGCGCCAAGCCGTTCGTGGAGTT
CTACTCCCGCCGGCGCGTGGGCGGCGACGTGACGAACTTCGAGGCCTTCGCCCTCATGGTCGTGAGCGCCTGA

Upstream 100 bases:

>100_bases
AGCCTCGGGATGAGGACGGCGCGATGAACGACCTGCTGCGGACCATCAAGGGCATCCGGGCCGGTCTCTGACATCACACC
GCCGAAACAGGAGCACCCCT

Downstream 100 bases:

>100_bases
TCCTGAGCTGACCGGGGGCCGCCGCGAGCGGCCCTCTCTCTTCATTCCCGCCCGGAGGAGGGCTTCCCAATGCATGGTCT
TCTCAACAACGCCGAAGTGC

Product: HK97 family phage major capsid protein

Products: NA

Alternate protein names: HK97 Family Phage Capsid Protein; Phage Capsid Protein HK; HK97 Family Capsid Protein; Capsid Protein; Capsid Protein Of Prophage; HK97 Family Phage Prohead Protease/Phage Capsid Protein; Phage Capsid Family; Phage Prohead Protease; Phage Capsid Protein; Prophage; Phage Capsid Protein Gp; Bacteriophage Protein; Phage Capsid Protein-Like Protein; Phage Prohead Protease And Phage Capsid Protein; Phage Phi-C31 Capsid; Capsid C; Phage Capsid Family Protein

Number of amino acids: Translated: 450; Mature: 449

Protein sequence:

>450_residues
MAGENDVLEQIQREVKGFGDNVSGLKSSMEKDLADVRKLAEEAKKSADRPEVKAQIDALTTSVSEKHAAIEKTVAELKAQ
ADQVATAMRRAPLGDPKEQGEEQKHALAYFETKAAAAGSLKVGSPIDPAGLDMDGYRAWTKSFPLYLRRDDKPIEAKALS
VGSDPNGGYLVPTATSARILTRIWETSPLRQLSTVETIGTDKIEIPIDDDEASAGWVGETEGRPETGTPAIGVQTIPVFE
IYAKPRATQSMLEDASINIEGWLATKISDKFARIEASAFIVGNGVKKPRGILTYPAAPAGTYARGKILQVNSGHATNITA
DGLVNMTFSLKEAYLANGSWLMKRGTVGSVALLKDAQGQYLWRPGLEAGKPSILLGYPVRQADDMPVVGAGALPIAFGDF
RAGYTVVDRLGITTLRDPYSAKPFVEFYSRRRVGGDVTNFEAFALMVVSA

Sequences:

>Mature_449_residues
AGENDVLEQIQREVKGFGDNVSGLKSSMEKDLADVRKLAEEAKKSADRPEVKAQIDALTTSVSEKHAAIEKTVAELKAQA
DQVATAMRRAPLGDPKEQGEEQKHALAYFETKAAAAGSLKVGSPIDPAGLDMDGYRAWTKSFPLYLRRDDKPIEAKALSV
GSDPNGGYLVPTATSARILTRIWETSPLRQLSTVETIGTDKIEIPIDDDEASAGWVGETEGRPETGTPAIGVQTIPVFEI
YAKPRATQSMLEDASINIEGWLATKISDKFARIEASAFIVGNGVKKPRGILTYPAAPAGTYARGKILQVNSGHATNITAD
GLVNMTFSLKEAYLANGSWLMKRGTVGSVALLKDAQGQYLWRPGLEAGKPSILLGYPVRQADDMPVVGAGALPIAFGDFR
AGYTVVDRLGITTLRDPYSAKPFVEFYSRRRVGGDVTNFEAFALMVVSA

Specific function: Unknown

COG id: COG4653

COG function: function code R; Predicted phage phi-C31 gp36 major capsid-like protein

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 48379; Mature: 48248

Theoretical pI: Translated: 5.64; Mature: 5.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
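
The percentages above can be reproduced from the protein sequence in this record. The sketch below assumes the mature protein is the translated protein with the initiator methionine removed, which is consistent with the 450 and 449 residue counts listed here. The helper name `residue_percent` is hypothetical.

```python
# Translated protein sequence copied from this record (450 residues).
TRANSLATED = (
    "MAGENDVLEQIQREVKGFGDNVSGLKSSMEKDLADVRKLAEEAKKSADRPEVKAQIDALTTSVSEKHAAIEKTVAELKAQ"
    "ADQVATAMRRAPLGDPKEQGEEQKHALAYFETKAAAAGSLKVGSPIDPAGLDMDGYRAWTKSFPLYLRRDDKPIEAKALS"
    "VGSDPNGGYLVPTATSARILTRIWETSPLRQLSTVETIGTDKIEIPIDDDEASAGWVGETEGRPETGTPAIGVQTIPVFE"
    "IYAKPRATQSMLEDASINIEGWLATKISDKFARIEASAFIVGNGVKKPRGILTYPAAPAGTYARGKILQVNSGHATNITA"
    "DGLVNMTFSLKEAYLANGSWLMKRGTVGSVALLKDAQGQYLWRPGLEAGKPSILLGYPVRQADDMPVVGAGALPIAFGDF"
    "RAGYTVVDRLGITTLRDPYSAKPFVEFYSRRRVGGDVTNFEAFALMVVSA"
)

# Assumption: the mature form lacks only the initiator Met.
MATURE = TRANSLATED[1:]


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues, to 1 decimal."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        residue_percent(seq, "C"),   # %Cys
        residue_percent(seq, "M"),   # %Met
        residue_percent(seq, "CM"),  # %Cys+Met
    )
# Translated 0.0 2.0 2.0
# Mature 0.0 1.8 1.8
```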

Secondary structure:

>Translated Secondary Structure
MAGENDVLEQIQREVKGFGDNVSGLKSSMEKDLADVRKLAEEAKKSADRPEVKAQIDALT
CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH
TSVSEKHAAIEKTVAELKAQADQVATAMRRAPLGDPKEQGEEQKHALAYFETKAAAAGSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCHHHHHHHHHHHHHHHCCCCE
KVGSPIDPAGLDMDGYRAWTKSFPLYLRRDDKPIEAKALSVGSDPNGGYLVPTATSARIL
ECCCCCCCCCCCCCHHHHHHHCCCEEEECCCCCCCEEEEECCCCCCCCEEEECCCHHHHH
TRIWETSPLRQLSTVETIGTDKIEIPIDDDEASAGWVGETEGRPETGTPAIGVQTIPVFE
HHHHCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEECEEEE
IYAKPRATQSMLEDASINIEGWLATKISDKFARIEASAFIVGNGVKKPRGILTYPAAPAG
EECCCCHHHHHHHHCCCCEEEEEEEHHHHHHHHEEEEEEEEECCCCCCCCEEEECCCCCC
TYARGKILQVNSGHATNITADGLVNMTFSLKEAYLANGSWLMKRGTVGSVALLKDAQGQY
CCCCCEEEEECCCCCCCEECCCEEEEEEEHHHHHHCCCCEEEECCCCCCEEEEECCCCCE
LWRPGLEAGKPSILLGYPVRQADDMPVVGAGALPIAFGDFRAGYTVVDRLGITTLRDPYS
EECCCCCCCCCCEEEECCCCCCCCCCEEECCCCEEEECCCCCCHHHHHHCCCCEECCCCC
AKPFVEFYSRRRVGGDVTNFEAFALMVVSA
CCHHHHHHHHHCCCCCCCCCEEEEEEEEEC
>Mature Secondary Structure 
AGENDVLEQIQREVKGFGDNVSGLKSSMEKDLADVRKLAEEAKKSADRPEVKAQIDALT
CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH
TSVSEKHAAIEKTVAELKAQADQVATAMRRAPLGDPKEQGEEQKHALAYFETKAAAAGSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCHHHHHHHHHHHHHHHCCCCE
KVGSPIDPAGLDMDGYRAWTKSFPLYLRRDDKPIEAKALSVGSDPNGGYLVPTATSARIL
ECCCCCCCCCCCCCHHHHHHHCCCEEEECCCCCCCEEEEECCCCCCCCEEEECCCHHHHH
TRIWETSPLRQLSTVETIGTDKIEIPIDDDEASAGWVGETEGRPETGTPAIGVQTIPVFE
HHHHCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEECEEEE
IYAKPRATQSMLEDASINIEGWLATKISDKFARIEASAFIVGNGVKKPRGILTYPAAPAG
EECCCCHHHHHHHHCCCCEEEEEEEHHHHHHHHEEEEEEEEECCCCCCCCEEEECCCCCC
TYARGKILQVNSGHATNITADGLVNMTFSLKEAYLANGSWLMKRGTVGSVALLKDAQGQY
CCCCCEEEEECCCCCCCEECCCEEEEEEEHHHHHHCCCCEEEECCCCCCEEEEECCCCCE
LWRPGLEAGKPSILLGYPVRQADDMPVVGAGALPIAFGDFRAGYTVVDRLGITTLRDPYS
EECCCCCCCCCCEEEECCCCCCCCCCEEECCCCEEEECCCCCCHHHHHHCCCCEECCCCC
AKPFVEFYSRRRVGGDVTNFEAFALMVVSA
CCHHHHHHHHHCCCCCCCCCEEEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA