Definition Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession NC_008752
Length 5,352,772

The map label for this gene is alpha-LP [H]

Identifier: 120609767

GI number: 120609767

Start: 1178134

End: 1179393

Strand: Reverse

Name: alpha-LP [H]

Synonym: Aave_1075

Alternate gene names: 120609767

Gene position: 1179393-1178134 (Counterclockwise)
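
The Start, End, and Strand fields appear to map onto the directional gene position as in the minimal Python sketch below. It assumes 1-based inclusive coordinates and that "Counterclockwise" corresponds to the reverse strand; the function name `gene_position` and the "Clockwise" label for forward-strand genes are hypothetical and not taken from this record.

```python
def gene_position(start, end, strand):
    """Return (directional position string, gene length in bases).

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    length = end - start + 1
    if strand.lower() == "reverse":
        # Reverse-strand genes are read from the higher coordinate down.
        return f"{end}-{start} (Counterclockwise)", length
    # Label for forward-strand genes is an assumption for illustration.
    return f"{start}-{end} (Clockwise)", length


position, length = gene_position(1178134, 1179393, "Reverse")
print(position, length)
```

Under these assumptions the computed length agrees with the 1260-base gene sequence given in this record.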

Preceding gene: 120609769

Following gene: 120609766

Centisome position: 22.03
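
The centisome value can be checked with a short Python sketch. It assumes the centisome position is the gene coordinate expressed as a percentage of the chromosome length, and that the higher coordinate (1179393, where this reverse-strand gene begins) is the one used; both points are inferred from the numbers in this record rather than documented here, and the function name `centisome` is hypothetical.

```python
GENOME_LENGTH = 5_352_772  # Length field of NC_008752


def centisome(coordinate, genome_length=GENOME_LENGTH):
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * coordinate / genome_length


# Higher coordinate of the gene (assumed reference point).
print(round(centisome(1_179_393), 2))
# Lower coordinate, for comparison.
print(round(centisome(1_178_134), 2))
```

Only the higher coordinate rounds to the reported 22.03; the lower coordinate gives 22.01.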

GC content: 47.54

Gene sequence:

>1260_bases
ATGATTGCCAAAAAAATGATTCTTTCCATGGTTTCTTTTTTTTCCATTTCGCTCAATGTGACCGCACAGCCCCTCGTGCC
TAGTTCCGAAGAAGCCCCTGCGGTTTTTCATTTCACCAAAGAGCAGGCGTTGGCAGCTTCTGGCTTGGATGTTATTGCCA
CCGAGAAAAACCAAAAGATCGTGGAGGCGTTGGGAAAGAACACGGTTTCCATCAAGGAACTGGCGGGAGATACCTTTGCT
GGCTCATGGATTCATTACGACGAGAACCATGAGGCGCAGCAGTTTTTCGCAACGACCGTGCCTTTGAATCTACATAAAAG
CAGAATGGCAGGTTCTGAGGGAAAAATATTTTTCATTCAAGTCAAATACAGTTACAAGGAACTTGAAACCATTCGAGAAA
AAATATTCGATGCATTCAAGGATTCCGCTAGACTTGGTGATCCTTTAGTGTACGGAATAGGAATAGACGAAGAAAAGAAT
AAACTAATGGTCGCAGGCCGCAAGGAAAATCATAGCCTCATCAGAGCCAGAATCCAACTGCTCGGCATCGAGCCGGACGC
GTATCTACTGGAAGATCAAAACGGGCCTACCACATTGATGGGCACTCTCTTCGGTGGTTCAAAAATCGTCGCCTCAAATA
CCAATAGCCCGATGTATCGGTGTACTGCGGGATTTAACGTGGTGATAGACTCGATTTACCCAGGAACCATCACTTCCGCA
CATTGCATGCACTACAACGAGTATTGGAATAGAGTGTATTTCGACATCGGAACATCCTCTGGTTCGATCAAAGGCGAGGA
AATAGGTCAATTCATGGCCGATGGCTATCCGAACAAATTGGATGCGATTATTTTCGGAAACACCAACTTTGTTCACACAC
TCTACGGAAAAATAATTACTACGCAAAATAGCTTGGTAGGCGTGAAACCGATGGCGCCCTTAGCGCAAAATACGCCTGTG
TGCACATCGGGAGGCACCAATGGTTGGCGGTGCGGTACCCAGGGAGTAACAAACCGGCAAGCCTGGGTAGGGTCCGAGAT
ATTTACCTTTGCCGAAGCTACATTTTGTGGTGCTGGTGGCGACTCCGGCGGTCCTGTGATAAATGACAAGAACAACGCAT
TGGGAGTATATACAGGAGCCAAAGGCAACTATCCGCAAGGAACCTGCGGGGCTGTATTCGGCGGAAACCCTGTCTCCTTT
TTCCAACCCCTCGCCCCTTATTTGGATCGCCACCCCAATGTGGTCTTGATGACAGAGTAA

Upstream 100 bases:

>100_bases
TGCATGATTGAACTTTCCTACACGACAATGCAGTCGTAACGGTGACGCTGCGACCTCATTTCATCGCCTTATCATTCAAT
ATTTACATTCGGGACCAGAC

Downstream 100 bases:

>100_bases
ATTTTGCCATTGGCCAAAATTGTGATCATGCGATTTGCGTTTTTGGCCAAGGCTTCAAATAATTGCTTTACACAGTTGCA
CTATGTATTGAAATTTAAAT

Product: hypothetical protein

Products: NA

Alternate protein names: Alpha-lytic endopeptidase [H]

Number of amino acids: Translated: 419; Mature: 419
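
The residue count follows from the gene length by simple arithmetic, sketched below in Python. The sketch assumes the 1260 bases include one terminal stop codon (the gene sequence above ends in TAA) that is not translated; the function name `residues_from_cds` is hypothetical.

```python
def residues_from_cds(n_bases, has_stop_codon=True):
    """Number of translated residues for a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    # The stop codon terminates translation and adds no residue.
    return codons - 1 if has_stop_codon else codons


print(residues_from_cds(1260))  # 420 codons, minus the stop codon: 419
```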

Protein sequence:

>419_residues
MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKIVEALGKNTVSIKELAGDTFA
GSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQVKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKN
KLMVAGRKENHSLIRARIQLLGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA
HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIITTQNSLVGVKPMAPLAQNTPV
CTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGGDSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSF
FQPLAPYLDRHPNVVLMTE

Sequences:

>Translated_419_residues
MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKIVEALGKNTVSIKELAGDTFA
GSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQVKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKN
KLMVAGRKENHSLIRARIQLLGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA
HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIITTQNSLVGVKPMAPLAQNTPV
CTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGGDSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSF
FQPLAPYLDRHPNVVLMTE
>Mature_419_residues
MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKIVEALGKNTVSIKELAGDTFA
GSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQVKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKN
KLMVAGRKENHSLIRARIQLLGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA
HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIITTQNSLVGVKPMAPLAQNTPV
CTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGGDSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSF
FQPLAPYLDRHPNVVLMTE

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase S1 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009003
- InterPro:   IPR004236
- InterPro:   IPR001316
- InterPro:   IPR018114
- InterPro:   IPR001254 [H]

Pfam domain/function: PF02983 Pro_Al_protease; PF00089 Trypsin [H]

EC number: 3.4.21.12 [H]

Molecular weight: Translated: 45484; Mature: 45484

Theoretical pI: Translated: 6.42; Mature: 6.42

Prosite motif: PS00135 TRYPSIN_SER

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
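
These percentages can be recomputed from the protein sequence with the Python sketch below. It assumes each value is a residue count divided by the total length and rounded to one decimal place; the function name `residue_percent` is hypothetical, and the sequence is copied from the 419-residue entry in this record.

```python
PROTEIN = (
    "MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKIVEALGKNTVSIKELAGDTFA"
    "GSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQVKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKN"
    "KLMVAGRKENHSLIRARIQLLGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA"
    "HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIITTQNSLVGVKPMAPLAQNTPV"
    "CTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGGDSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSF"
    "FQPLAPYLDRHPNVVLMTE"
)


def residue_percent(sequence, residues):
    """Percentage of positions in sequence occupied by any of the given residues."""
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


print(residue_percent(PROTEIN, "C"))   # %Cys
print(residue_percent(PROTEIN, "M"))   # %Met
print(residue_percent(PROTEIN, "CM"))  # %Cys+Met
```

The combined figure is computed from the combined count rather than by adding the two rounded values, which would explain why 1.4 and 2.6 are listed alongside 4.1 rather than 4.0.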

Secondary structure:

>Translated Secondary Structure
MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKI
CCHHHHHHHHHHHHEEEEEEEECCCCCCCCCCCEEEEEEHHHHHHHCCCEEEEECCCHHH
VEALGKNTVSIKELAGDTFAGSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQ
HHHHCCCCEEHHHHCCCCCCCCEEEECCCCCCHHEEEEECCCEECHHHCCCCCCCEEEEE
VKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKNKLMVAGRKENHSLIRARIQL
EECCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEECCCCHHHHEEEEEE
LGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA
EECCCCCEEEECCCCCEEEEEEEECCCEEEEECCCCCEEEEECCCEEEEECCCCCCCCHH
HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIIT
HHEEHHHHCCEEEEEEECCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCEEEEEECEEEE
TQNSLVGVKPMAPLAQNTPVCTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGG
ECCCEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHEEECCCC
DSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSFFQPLAPYLDRHPNVVLMTE
CCCCCEECCCCCEEEEEECCCCCCCCCCCCEEECCCCHHHHHHHHHHHHCCCCEEEEEC
>Mature Secondary Structure
MIAKKMILSMVSFFSISLNVTAQPLVPSSEEAPAVFHFTKEQALAASGLDVIATEKNQKI
CCHHHHHHHHHHHHEEEEEEEECCCCCCCCCCCEEEEEEHHHHHHHCCCEEEEECCCHHH
VEALGKNTVSIKELAGDTFAGSWIHYDENHEAQQFFATTVPLNLHKSRMAGSEGKIFFIQ
HHHHCCCCEEHHHHCCCCCCCCEEEECCCCCCHHEEEEECCCEECHHHCCCCCCCEEEEE
VKYSYKELETIREKIFDAFKDSARLGDPLVYGIGIDEEKNKLMVAGRKENHSLIRARIQL
EECCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEECCCCHHHHEEEEEE
LGIEPDAYLLEDQNGPTTLMGTLFGGSKIVASNTNSPMYRCTAGFNVVIDSIYPGTITSA
EECCCCCEEEECCCCCEEEEEEEECCCEEEEECCCCCEEEEECCCEEEEECCCCCCCCHH
HCMHYNEYWNRVYFDIGTSSGSIKGEEIGQFMADGYPNKLDAIIFGNTNFVHTLYGKIIT
HHEEHHHHCCEEEEEEECCCCCCCHHHHHHHHHCCCCCCEEEEEECCCCEEEEEECEEEE
TQNSLVGVKPMAPLAQNTPVCTSGGTNGWRCGTQGVTNRQAWVGSEIFTFAEATFCGAGG
ECCCEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHEEECCCC
DSGGPVINDKNNALGVYTGAKGNYPQGTCGAVFGGNPVSFFQPLAPYLDRHPNVVLMTE
CCCCCEECCCCCEEEEEECCCCCCCCCCCCEEECCCCHHHHHHHHHHHHCCCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 3053694; 3234766; 5482494; 117110; 3900416; 9724517; 9808037 [H]