Definition: Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession: NC_008752
Length: 5,352,772 bp

The map label for this gene is 120611101

Identifier: 120611101

GI number: 120611101

Start: 2651514

End: 2653082

Strand: Direct

Name: 120611101

Synonym: Aave_2430

Alternate gene names: NA

Gene position: 2651514-2653082 (Clockwise)

Preceding gene: 120611100

Following gene: 120611102

Centisome position: 49.54
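
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state this definition, but it reproduces the reported value. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values taken from this record.
print(gene_length(2651514, 2653082))   # gene length in bases
print(centisome(2651514, 5352772))     # centisome position
```

With the values in this record, the length comes out at 1569 bases, matching the header of the gene sequence, and the centisome position at 49.54.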

GC content: 70.62

Gene sequence:

>1569_bases
ATGACACAGACCGCATCTTCCTCCGCTTCCCTTGTCGAGCTGACCGCCCCCGAGCTGCGCCTGCGGATCGCCCGCCGCGA
AATCTCGCCCGTCGAACTGCTCGAGGCCTGCATCGCACGCATCGAGGCCCTCAACCCCTATGTGAACGCCATCACCGCCA
CCTGCTACGACCGCGCCCGGAACGAGGCGCGCGAGGCGGAGCGGGCAGTGCTGCGCGGCGATGTCCTCGGACTGCTGCAC
GGCCTGCCACTGGGCGTGAAGGACCTGGAGGCCACGCAGGGCCTGCTGACCACCTATGGCTCGCAGATCTACCGCGAGCA
CATTCCCGCGGAAGACAACGTGCTGGTCGCGCGCCTGCGCCGCGCCGGTGCGATCGTCACGGGCAAGACCAACATTCCCG
AGATGGGTGCCGGCGCCAATTCCCGCAACACGGTATGGGGTGCCACAGGCAACCCCTTCGATCCCCGCCTGAACGCGGGG
GGCTCGTCCGGCGGCTCGGCCGCGGCCCTGGCCTGCGACATGCTGCCCGTCTGCACCGGTTCAGACACCGGCGGCTCGCT
GCGCATTCCCGCCGCCAAGTGCGGCGTGGTGGGGTTCCGGCCGTCGCCTGGCGTGGTGCCGAGCTCCCGCAAGCTGCTGG
GCTGGACGCCGATCTCCGTCGTGGGGCCCATGGGCCGCACGGTGGAGGAGGCCTGCCTGCAGCTCGCGGCCACGGCGGGC
ATGTCGGCGGGCGATCCGCTCAGCTACCCGCTGGATCCTGCCGTATTCCTGTCCCTGCCCGAGGTGGACCTCTCCACGCT
GCGCGTCGGGTACACGGAGGATTTCGGTGCCTGTGCCGTGGACGACGGCATCCGTGAGACGTTCCGCGGCAAGATCGCCG
CGATGCGGCACCTGTTCCGCAGTTGCGATCCCCTGTCGCTGGACCTGGGGGACGTGCACCGCTGCTTCGACGTGCTGCGC
GCCGAAGCCTTCGTGGCGGGCACCCGCGAGGCCTACGAGCGCGATCCCGCCAGCCTGGGGCCGAACACCCGCGCCAACTA
CGAGATGGGCGCTGCCATGACGCTCATCGACAGCGCCTGGGCGCAGGCGGAACAGACGCGCATCCTCGCGCGGTTCCAGA
AGGCATTCGAATCGTTCGACATCATCCTTGCCCCGACCACGCCGGTATCCCCGTTCCCGTGGACGGAGCTCTACGCCAGC
CACATCAACGGCGAACCCCAGGCCAACTACTACCGCTGGCTGGCCCTGACCTACGTCACCACGCTGACCACCCATCCCGC
CCTCAGCCTGCCTTGCGGGCGGGATGGCCGCGACATGCCGTTCGGCCTGCAGGTGGTGGGCCGCTTCCGGGACGACCTCG
GCACCCTTGCCATCGGCCGTGCGATGGAACAGTCCTTCGCCGGCATCGAGGGACTGCAGCGCCCGCGGCCGGCGCTGGAC
CGGCTGGCCCCGGTCGAGCCGGCGCTGACTTCCCTCGTCACCACTGCACCCGACGCACGCGATGCACGCGGCGAGCCGCG
CCCGGACGGCTCGGGCAGCCGGGGGGCCGCCGAGGCGTCGGCGGTCTGA
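
The GC content field reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is the percentage of G and C among the unambiguous bases. Line breaks and any non-ACGT characters are ignored, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Small illustration with a wrapped, mixed-case sequence.
print(gc_content("GGCC\nATat"))
```

Pasting the 1569-base sequence (without the `>1569_bases` header line) into the function should give a value close to the reported 70.62, provided the record uses the same definition.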

Upstream 100 bases:

>100_bases
CCCAGCGCGCCGGCATCACCGTCGACTGACCACCTGACCGGTTTTTCCGGTATTTTCAAGCCTCCAGGAGACAACGCCGC
CTTCCAGGGCGGCGCACCAC

Downstream 100 bases:

>100_bases
CGGATTGCGGTCGATCGGCAGGAAAGGAATCGACATGACCGAACGGGTGGACTTCCTCGTGGTGGGGGCCGGCATTGCCG
GCGCCTCGGTCGGCTGGGAA

Product: amidase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 522; Mature: 521
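
The two residue counts follow from the gene length. As a hedged sketch: the coding sequence is assumed to include the stop codon, and the mature form is assumed to differ from the translated form only by removal of the initiator methionine, which is consistent with the Mature sequence in this record starting at the second residue. The function name is illustrative.

```python
def residue_counts(cds_length_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a CDS that includes its stop codon."""
    if cds_length_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = cds_length_bases // 3
    translated = codons - 1      # the stop codon encodes no residue
    mature = translated - 1      # assumed: initiator Met is removed
    return translated, mature


print(residue_counts(1569))
```

For the 1569-base gene this gives 523 codons, hence 522 translated and 521 mature residues.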

Protein sequence:

>522_residues
MTQTASSSASLVELTAPELRLRIARREISPVELLEACIARIEALNPYVNAITATCYDRARNEAREAERAVLRGDVLGLLH
GLPLGVKDLEATQGLLTTYGSQIYREHIPAEDNVLVARLRRAGAIVTGKTNIPEMGAGANSRNTVWGATGNPFDPRLNAG
GSSGGSAAALACDMLPVCTGSDTGGSLRIPAAKCGVVGFRPSPGVVPSSRKLLGWTPISVVGPMGRTVEEACLQLAATAG
MSAGDPLSYPLDPAVFLSLPEVDLSTLRVGYTEDFGACAVDDGIRETFRGKIAAMRHLFRSCDPLSLDLGDVHRCFDVLR
AEAFVAGTREAYERDPASLGPNTRANYEMGAAMTLIDSAWAQAEQTRILARFQKAFESFDIILAPTTPVSPFPWTELYAS
HINGEPQANYYRWLALTYVTTLTTHPALSLPCGRDGRDMPFGLQVVGRFRDDLGTLAIGRAMEQSFAGIEGLQRPRPALD
RLAPVEPALTSLVTTAPDARDARGEPRPDGSGSRGAAEASAV
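
The protein sequence above is the conceptual translation of the gene sequence. The sketch below rebuilds the standard codon table and translates a coding sequence up to the first stop codon. It assumes the standard code is adequate here (the gene starts with ATG, so the alternative start codons of the bacterial table do not matter), and it raises `KeyError` on ambiguous bases. `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks stop codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()   # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGACACAGACCGCATCT"))
```

The first six codons translate to MTQTAS, matching the start of the protein sequence, and the final codons GCG GTC TGA give AV followed by the stop.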

Sequences:

>Translated_522_residues
MTQTASSSASLVELTAPELRLRIARREISPVELLEACIARIEALNPYVNAITATCYDRARNEAREAERAVLRGDVLGLLH
GLPLGVKDLEATQGLLTTYGSQIYREHIPAEDNVLVARLRRAGAIVTGKTNIPEMGAGANSRNTVWGATGNPFDPRLNAG
GSSGGSAAALACDMLPVCTGSDTGGSLRIPAAKCGVVGFRPSPGVVPSSRKLLGWTPISVVGPMGRTVEEACLQLAATAG
MSAGDPLSYPLDPAVFLSLPEVDLSTLRVGYTEDFGACAVDDGIRETFRGKIAAMRHLFRSCDPLSLDLGDVHRCFDVLR
AEAFVAGTREAYERDPASLGPNTRANYEMGAAMTLIDSAWAQAEQTRILARFQKAFESFDIILAPTTPVSPFPWTELYAS
HINGEPQANYYRWLALTYVTTLTTHPALSLPCGRDGRDMPFGLQVVGRFRDDLGTLAIGRAMEQSFAGIEGLQRPRPALD
RLAPVEPALTSLVTTAPDARDARGEPRPDGSGSRGAAEASAV
>Mature_521_residues
TQTASSSASLVELTAPELRLRIARREISPVELLEACIARIEALNPYVNAITATCYDRARNEAREAERAVLRGDVLGLLHG
LPLGVKDLEATQGLLTTYGSQIYREHIPAEDNVLVARLRRAGAIVTGKTNIPEMGAGANSRNTVWGATGNPFDPRLNAGG
SSGGSAAALACDMLPVCTGSDTGGSLRIPAAKCGVVGFRPSPGVVPSSRKLLGWTPISVVGPMGRTVEEACLQLAATAGM
SAGDPLSYPLDPAVFLSLPEVDLSTLRVGYTEDFGACAVDDGIRETFRGKIAAMRHLFRSCDPLSLDLGDVHRCFDVLRA
EAFVAGTREAYERDPASLGPNTRANYEMGAAMTLIDSAWAQAEQTRILARFQKAFESFDIILAPTTPVSPFPWTELYASH
INGEPQANYYRWLALTYVTTLTTHPALSLPCGRDGRDMPFGLQVVGRFRDDLGTLAIGRAMEQSFAGIEGLQRPRPALDR
LAPVEPALTSLVTTAPDARDARGEPRPDGSGSRGAAEASAV

Specific function: Unknown

COG id: COG0154

COG function: function code J; Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the amidase family [H]

Homologues:

Organism=Homo sapiens, GI222831590, Length=513, Percent_Identity=26.3157894736842, Blast_Score=132, Evalue=8e-31,
Organism=Homo sapiens, GI195972892, Length=247, Percent_Identity=34.0080971659919, Blast_Score=122, Evalue=6e-28,
Organism=Homo sapiens, GI166795287, Length=262, Percent_Identity=31.6793893129771, Blast_Score=75, Evalue=1e-13,
Organism=Caenorhabditis elegans, GI17537465, Length=504, Percent_Identity=26.5873015873016, Blast_Score=137, Evalue=1e-32,
Organism=Caenorhabditis elegans, GI17543272, Length=449, Percent_Identity=25.1670378619154, Blast_Score=93, Evalue=4e-19,
Organism=Caenorhabditis elegans, GI17538252, Length=189, Percent_Identity=31.7460317460317, Blast_Score=92, Evalue=7e-19,
Organism=Caenorhabditis elegans, GI17556264, Length=309, Percent_Identity=27.831715210356, Blast_Score=84, Evalue=2e-16,
Organism=Caenorhabditis elegans, GI71990152, Length=317, Percent_Identity=28.7066246056782, Blast_Score=83, Evalue=4e-16,
Organism=Caenorhabditis elegans, GI17538254, Length=201, Percent_Identity=34.3283582089552, Blast_Score=72, Evalue=9e-13,
Organism=Caenorhabditis elegans, GI17556276, Length=200, Percent_Identity=31, Blast_Score=65, Evalue=6e-11,
Organism=Caenorhabditis elegans, GI17556278, Length=200, Percent_Identity=31, Blast_Score=65, Evalue=8e-11,
Organism=Saccharomyces cerevisiae, GI6323950, Length=416, Percent_Identity=24.2788461538462, Blast_Score=87, Evalue=9e-18,
Organism=Saccharomyces cerevisiae, GI6319685, Length=399, Percent_Identity=26.3157894736842, Blast_Score=80, Evalue=7e-16,
Organism=Saccharomyces cerevisiae, GI6320448, Length=232, Percent_Identity=28.0172413793103, Blast_Score=74, Evalue=7e-14,
Organism=Drosophila melanogaster, GI21356731, Length=489, Percent_Identity=27.19836400818, Blast_Score=140, Evalue=3e-33,
Organism=Drosophila melanogaster, GI24648435, Length=498, Percent_Identity=25.9036144578313, Blast_Score=130, Evalue=2e-30,
Organism=Drosophila melanogaster, GI24648437, Length=498, Percent_Identity=25.9036144578313, Blast_Score=130, Evalue=2e-30,
Organism=Drosophila melanogaster, GI45550774, Length=498, Percent_Identity=25.9036144578313, Blast_Score=130, Evalue=3e-30,
Organism=Drosophila melanogaster, GI24652985, Length=508, Percent_Identity=27.3622047244095, Blast_Score=129, Evalue=4e-30,
Organism=Drosophila melanogaster, GI19922090, Length=508, Percent_Identity=27.3622047244095, Blast_Score=129, Evalue=4e-30,
Organism=Drosophila melanogaster, GI24652981, Length=508, Percent_Identity=27.3622047244095, Blast_Score=129, Evalue=4e-30,
Organism=Drosophila melanogaster, GI24652983, Length=508, Percent_Identity=27.3622047244095, Blast_Score=129, Evalue=4e-30,
Organism=Drosophila melanogaster, GI24648113, Length=512, Percent_Identity=26.5625, Blast_Score=109, Evalue=4e-24,
Organism=Drosophila melanogaster, GI24644968, Length=504, Percent_Identity=24.8015873015873, Blast_Score=104, Evalue=1e-22,
Organism=Drosophila melanogaster, GI24648441, Length=421, Percent_Identity=24.9406175771971, Blast_Score=99, Evalue=5e-21,
Organism=Drosophila melanogaster, GI24648439, Length=421, Percent_Identity=24.9406175771971, Blast_Score=99, Evalue=5e-21,
Organism=Drosophila melanogaster, GI161078093, Length=514, Percent_Identity=26.0700389105058, Blast_Score=92, Evalue=7e-19,
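
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. For downstream filtering (for example by E-value), the lines can be parsed into typed records. This is a minimal sketch; the lower-cased key names and the `parse_homologue` function are assumptions made for illustration, and organism names containing commas are not handled.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue                      # skip the empty field after the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip().lower()] = value.strip()
        elif field.startswith("GI"):
            record["gi"] = field[2:]      # keep the GI number as a string
    casts = (("length", int), ("percent_identity", float),
             ("blast_score", int), ("evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Homo sapiens, GI222831590, Length=513, "
        "Percent_Identity=26.3157894736842, Blast_Score=132, Evalue=8e-31,")
print(parse_homologue(line))
```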

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000120
- InterPro:   IPR020556 [H]

Pfam domain/function: PF01425 Amidase [H]

EC number: 3.5.1.4 [H]

Molecular weight: Translated: 55606; Mature: 55474

Theoretical pI: Translated: 5.38; Mature: 5.38

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
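
These percentages are simple residue frequencies. The translated sequence in this record contains 10 Cys and 10 Met among 522 residues, and the mature form drops the initiator Met. The sketch below computes such a percentage for any set of one-letter residue codes, rounded to one decimal place as in the table; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    seq = "".join(protein.split()).upper()   # drop line breaks and spaces
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Synthetic 522-residue example with 10 Cys and 10 Met.
example = "M" * 10 + "C" * 10 + "A" * 502
print(residue_percent(example, "C"), residue_percent(example, "CM"))
```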

Secondary structure:

>Translated Secondary Structure
MTQTASSSASLVELTAPELRLRIARREISPVELLEACIARIEALNPYVNAITATCYDRAR
CCCCCCCCCCEEEECCHHHHHHHHHHCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH
NEAREAERAVLRGDVLGLLHGLPLGVKDLEATQGLLTTYGSQIYREHIPAEDNVLVARLR
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH
RAGAIVTGKTNIPEMGAGANSRNTVWGATGNPFDPRLNAGGSSGGSAAALACDMLPVCTG
HCCEEEECCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHCEECCC
SDTGGSLRIPAAKCGVVGFRPSPGVVPSSRKLLGWTPISVVGPMGRTVEEACLQLAATAG
CCCCCEEECCHHHCCEEECCCCCCCCCCCCCEEECCCHHHHCCCCCHHHHHHHHHHHHCC
MSAGDPLSYPLDPAVFLSLPEVDLSTLRVGYTEDFGACAVDDGIRETFRGKIAAMRHLFR
CCCCCCCCCCCCCCEEEECCCCCHHHEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
SCDPLSLDLGDVHRCFDVLRAEAFVAGTREAYERDPASLGPNTRANYEMGAAMTLIDSAW
CCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHH
AQAEQTRILARFQKAFESFDIILAPTTPVSPFPWTELYASHINGEPQANYYRWLALTYVT
HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHH
TLTTHPALSLPCGRDGRDMPFGLQVVGRFRDDLGTLAIGRAMEQSFAGIEGLQRPRPALD
HHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
RLAPVEPALTSLVTTAPDARDARGEPRPDGSGSRGAAEASAV
HCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
TQTASSSASLVELTAPELRLRIARREISPVELLEACIARIEALNPYVNAITATCYDRAR
CCCCCCCCCEEEECCHHHHHHHHHHCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH
NEAREAERAVLRGDVLGLLHGLPLGVKDLEATQGLLTTYGSQIYREHIPAEDNVLVARLR
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH
RAGAIVTGKTNIPEMGAGANSRNTVWGATGNPFDPRLNAGGSSGGSAAALACDMLPVCTG
HCCEEEECCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHCEECCC
SDTGGSLRIPAAKCGVVGFRPSPGVVPSSRKLLGWTPISVVGPMGRTVEEACLQLAATAG
CCCCCEEECCHHHCCEEECCCCCCCCCCCCCEEECCCHHHHCCCCCHHHHHHHHHHHHCC
MSAGDPLSYPLDPAVFLSLPEVDLSTLRVGYTEDFGACAVDDGIRETFRGKIAAMRHLFR
CCCCCCCCCCCCCCEEEECCCCCHHHEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
SCDPLSLDLGDVHRCFDVLRAEAFVAGTREAYERDPASLGPNTRANYEMGAAMTLIDSAW
CCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHH
AQAEQTRILARFQKAFESFDIILAPTTPVSPFPWTELYASHINGEPQANYYRWLALTYVT
HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHH
TLTTHPALSLPCGRDGRDMPFGLQVVGRFRDDLGTLAIGRAMEQSFAGIEGLQRPRPALD
HHHCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
RLAPVEPALTSLVTTAPDARDARGEPRPDGSGSRGAAEASAV
HCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC
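
The prediction strings above can be summarised as a per-state composition. The sketch assumes the usual three-state alphabet (H for helix, E for strand, C for coil), which the record itself does not define, and `ss_composition` is an illustrative name. Only the state lines should be passed in, not the interleaved amino-acid lines.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C characters in a secondary-structure string."""
    kept = [s for s in states.upper() if s in "HEC"]
    if not kept:
        raise ValueError("no H/E/C characters found")
    counts = Counter(kept)
    return {state: 100.0 * counts[state] / len(kept) for state in "HEC"}


# Small illustration on a ten-residue prediction.
print(ss_composition("HHHHEECCCC"))
```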

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9389475 [H]