Definition Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession NC_008752
Length 5,352,772

The map label for this gene is yejA [C]

Identifier: 120611912

GI number: 120611912

Start: 3596786

End: 3598717

Strand: Reverse

Name: yejA [C]

Synonym: Aave_3254

Alternate gene names: 120611912

Gene position: 3598717-3596786 (Counterclockwise)

Preceding gene: 120611914

Following gene: 120611911

Centisome position: 67.23
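
Note: the gene length and the centisome position can be rechecked from the coordinates listed above. The sketch below is only an illustration; the function names are hypothetical, and it assumes the centisome value is the position of the gene's 5' end (the larger coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length.

```python
GENOME_LENGTH = 5_352_772            # "Length" field of this record
START, END = 3_596_786, 3_598_717    # "Start" and "End" fields of this record


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the 5' end (larger coordinate) is used.
position = centisome(END, GENOME_LENGTH)
print(length, position)
```

Under that assumption the result agrees with the 1932-base sequence header and the centisome value given in the record.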

GC content: 67.65
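
Note: GC content is usually the percentage of G and C bases in the sequence. A minimal sketch follows, assuming that definition; the function name is hypothetical and the fragment is just the first 39 bases of the gene sequence in this record, so its value will differ from the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (2 decimals)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 39 bases of the gene sequence in this record (illustration only).
fragment = "ATGCCGGAGGGGCGCGGCGCGGCCCGTGGCGGAAAGATG"
print(gc_content(fragment))
```

To reproduce the record's value, pass the full 1932-base sequence with line breaks removed.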

Gene sequence:

>1932_bases
ATGCCGGAGGGGCGCGGCGCGGCCCGTGGCGGAAAGATGCGGCGGCCGGGAATCCGACCGGGAAAATCCCCCGGCAGGCG
TGGACGGCCCGCCACCGTAGTGCAGAATTGTCGCATGCGGCTCTGGAAGAAGATCTGTCTGCTGTGGGGAATCTGCCTGT
GTGCGCTGCCGGCCTGGGCCGCGCATGGCTACGCGCTGTGGGACGATCTCAAGTACCCGGCGGGTTTCGCGCATTTCGAC
TACGTCGATCCCGACGCGCCCAAGGGCGGCGAGCTGCGCATGGTGAGCAACCTGCGCTATTCCACCTTCGACAAGTACAA
CCCGTTCACGATGAAGGGGTCGCCGCCGGCCTACCTGTCGGACCTGCTGTTCGAGAGCCTGCTGGCCGGCTCCATGGACG
AGACGGCTTCGGGCTACGGATTGCTGGCCGAGGACGTGCAGGTACCCGAGGACCGCCTCAGCGCCACGTTCCGCCTGCGG
GCCGAAGCGCGCTTCCACGACGGCAGCCCGGTCGAGGCCGCCGACGTGAAGCATACCTACGAGACGCTCGTCGGGCCCCA
TGCGTCGCCGAGCTACGCGACGCTGCTGCAGGAGGTGGCAGGGGTGGATGTGCTGGACCGGCGCACGGTGCGCTTCCGCT
TCAAGCACCCCAACCGCGAGCTGCCGCTCACCGTGGGCAGCCTGCCGATCTTCAGCCGCGCCTGGGGGCGCCAGGCGGAC
GGCAAGGCGAAGCGTTTCGACGAGATCGTGACCGACATCCCCATCGGCAGCGGTCCGTATCGCATCGGCCCGGTGGCGTT
CGGCCGCGACATCACCTACGTGCGCGACCCGCAGTACTGGGGCCGCGACCTGAACGTGAACCGGGGCGCGTACAACTTCG
ACCGCATCACGGTCAAGATCTACAAGGACAACACCGCCCGGCTGGAGGCCGTGAAGGCCGGCGAGTTCGACTTCATGACC
GTATATTCGGCCGGCGACTGGGCGCGCCGCATCGATGGCAAGCTGTTCCGGCAGGGCGTGCTGGTGAAGACGGAACTCAG
GCACCGCCTGCCGGCGGGGTTCCAGAGCTATGTGCTCAACACGCGGCGGCCGATGCTCAAGGACCTGCGGGTGCGTGAAG
CGCTGGGGCTGGCGCTGGACTACGATTGGATGAACCGGCAGATGTTCTACGGCGGCTACCCGCGCGTGGTGGACCTGTTC
GGGAATACCGACTGCCAGGCGACCGGCGTGCCGGGGCCGGAGGAACTGGCGCTGCTGGAACCGTGGCGCGGCAAGGTGCC
GGACAGCGTGTTCGGGCCCATGTACACGCCCCCGGTCACCGAAGGGGCGGGCCATTCCCTGCGCGACAACCTGCGCCGCG
CCCGCCAGTTGCTGGCCGACGCCGGCTGGACCTACCGAGACGGCGCGCTGCGCAACGCGAAGGGCGAGCCCATGGTGATC
GACTACCTCGACAGCAAGGAGGCCGGCGCGCGCATGGTGACGCCCTGGATGCGCAACCTCGAGAAGCTGGGCATCACGCT
GCGCTTCACCTCGGTGGACTTCGCGCTGTACCTGCAGCGCCTGGACAAGTTCGATTACGACATGATCACCCTGGCCTACC
CCGGCACCTACAACCCGGGCCAGGAAATGCTGGAGCTGTTCGGCAGCCGGCGCGCCGACGTGGAGGGCAGCAGCAATTAC
TCGGGGGTGAAGAGCCCGGCGGTCGATGCCCTCGCGATCGCGTTGACGCGGGCGAAATCCAAGGCCGAACTGCTGCCCGC
GTGCCGGGCGCTGGACCGCGTCATCATGCACAGCCACTACCTCATCCCGCAGTGGCAACTGTCCGCGCACCGCATCGTGT
ACAACCAGCAGCGTCTGGCCTACCACGCGCCCATGCCGCCCTATGCGAAAGCCGAGGAGTGGGCGATGTTTTCGTGGTGG
AGCCTGAAGTGA
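
Note: the record places this gene on the reverse strand, yet the sequence above begins with ATG and ends with TGA, which suggests it is shown in coding (5' to 3') orientation. Assuming that is the case, the forward-strand genomic sequence would be its reverse complement. The helper below is a hypothetical sketch of that conversion, demonstrated on the last line of the gene sequence.

```python
# Translation table mapping each base to its complement (case preserved).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


last_line = "AGCCTGAAGTGA"              # final 12 bases of the gene sequence
genomic = reverse_complement(last_line)  # same bases read on the forward strand
print(genomic)
```

Applying the function twice returns the original sequence.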

Upstream 100 bases:

>100_bases
CCGTAGGCGATGGAACGGTTGGACAGCACGCCGGTGATGAGCAGCTTCTTGCCGGTCAGGAAACCCATTGTTGTTCTCCA
GTGAATGCGGTAGGGGTGGA

Downstream 100 bases:

>100_bases
GAGTCCACCCCCCTGAGGCGCTGCGCGCCTTCCCCCCGCTCTCGCCGTGCTGCGCACGTCGGGCAGGGGGACGACGCCGG
TGGCCCGGCGAAGCCGGTTC
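
Note: the record does not state the genomic coordinates of the upstream and downstream windows. The sketch below shows one plausible way to derive them, assuming "upstream" means the 100 bases 5' of the start codon on the coding strand; the function name and the strand labels are hypothetical.

```python
def flanking_ranges(start: int, end: int, strand: str, n: int = 100):
    """Return (upstream, downstream) inclusive genomic coordinate ranges.

    start/end are the smaller/larger genomic coordinates of the gene.
    For a reverse-strand gene, upstream lies beyond the larger coordinate.
    """
    left = (start - n, start - 1)
    right = (end + 1, end + n)
    if strand == "Forward":
        return left, right
    if strand == "Reverse":
        return right, left
    raise ValueError(f"unknown strand: {strand!r}")


upstream, downstream = flanking_ranges(3_596_786, 3_598_717, "Reverse")
print(upstream, downstream)
```

For a reverse-strand gene the bases in these ranges would then be reverse-complemented to read them in the gene's orientation.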

Product: extracellular solute-binding protein

Products: ADP; phosphate; oligopeptides [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 643; Mature: 642

Protein sequence:

>643_residues
MPEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWAAHGYALWDDLKYPAGFAHFD
YVDPDAPKGGELRMVSNLRYSTFDKYNPFTMKGSPPAYLSDLLFESLLAGSMDETASGYGLLAEDVQVPEDRLSATFRLR
AEARFHDGSPVEAADVKHTYETLVGPHASPSYATLLQEVAGVDVLDRRTVRFRFKHPNRELPLTVGSLPIFSRAWGRQAD
GKAKRFDEIVTDIPIGSGPYRIGPVAFGRDITYVRDPQYWGRDLNVNRGAYNFDRITVKIYKDNTARLEAVKAGEFDFMT
VYSAGDWARRIDGKLFRQGVLVKTELRHRLPAGFQSYVLNTRRPMLKDLRVREALGLALDYDWMNRQMFYGGYPRVVDLF
GNTDCQATGVPGPEELALLEPWRGKVPDSVFGPMYTPPVTEGAGHSLRDNLRRARQLLADAGWTYRDGALRNAKGEPMVI
DYLDSKEAGARMVTPWMRNLEKLGITLRFTSVDFALYLQRLDKFDYDMITLAYPGTYNPGQEMLELFGSRRADVEGSSNY
SGVKSPAVDALAIALTRAKSKAELLPACRALDRVIMHSHYLIPQWQLSAHRIVYNQQRLAYHAPMPPYAKAEEWAMFSWW
SLK
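
Note: the protein sequence above can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (for internal codons this gives the same amino acids as the bacterial table); the names are hypothetical. It translates the first 39 bases and the last 12 bases of the gene sequence in this record.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":        # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


n_terminus = translate("ATGCCGGAGGGGCGCGGCGCGGCCCGTGGCGGAAAGATG")
c_terminus = translate("AGCCTGAAGTGA")
print(n_terminus, c_terminus)
```

The two fragments match the first 13 and last 3 residues of the protein sequence listed above.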

Sequences:

>Translated_643_residues
MPEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWAAHGYALWDDLKYPAGFAHFD
YVDPDAPKGGELRMVSNLRYSTFDKYNPFTMKGSPPAYLSDLLFESLLAGSMDETASGYGLLAEDVQVPEDRLSATFRLR
AEARFHDGSPVEAADVKHTYETLVGPHASPSYATLLQEVAGVDVLDRRTVRFRFKHPNRELPLTVGSLPIFSRAWGRQAD
GKAKRFDEIVTDIPIGSGPYRIGPVAFGRDITYVRDPQYWGRDLNVNRGAYNFDRITVKIYKDNTARLEAVKAGEFDFMT
VYSAGDWARRIDGKLFRQGVLVKTELRHRLPAGFQSYVLNTRRPMLKDLRVREALGLALDYDWMNRQMFYGGYPRVVDLF
GNTDCQATGVPGPEELALLEPWRGKVPDSVFGPMYTPPVTEGAGHSLRDNLRRARQLLADAGWTYRDGALRNAKGEPMVI
DYLDSKEAGARMVTPWMRNLEKLGITLRFTSVDFALYLQRLDKFDYDMITLAYPGTYNPGQEMLELFGSRRADVEGSSNY
SGVKSPAVDALAIALTRAKSKAELLPACRALDRVIMHSHYLIPQWQLSAHRIVYNQQRLAYHAPMPPYAKAEEWAMFSWW
SLK
>Mature_642_residues
PEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWAAHGYALWDDLKYPAGFAHFDY
VDPDAPKGGELRMVSNLRYSTFDKYNPFTMKGSPPAYLSDLLFESLLAGSMDETASGYGLLAEDVQVPEDRLSATFRLRA
EARFHDGSPVEAADVKHTYETLVGPHASPSYATLLQEVAGVDVLDRRTVRFRFKHPNRELPLTVGSLPIFSRAWGRQADG
KAKRFDEIVTDIPIGSGPYRIGPVAFGRDITYVRDPQYWGRDLNVNRGAYNFDRITVKIYKDNTARLEAVKAGEFDFMTV
YSAGDWARRIDGKLFRQGVLVKTELRHRLPAGFQSYVLNTRRPMLKDLRVREALGLALDYDWMNRQMFYGGYPRVVDLFG
NTDCQATGVPGPEELALLEPWRGKVPDSVFGPMYTPPVTEGAGHSLRDNLRRARQLLADAGWTYRDGALRNAKGEPMVID
YLDSKEAGARMVTPWMRNLEKLGITLRFTSVDFALYLQRLDKFDYDMITLAYPGTYNPGQEMLELFGSRRADVEGSSNYS
GVKSPAVDALAIALTRAKSKAELLPACRALDRVIMHSHYLIPQWQLSAHRIVYNQQRLAYHAPMPPYAKAEEWAMFSWWS
LK
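
Note: in this record the mature sequence is the translated sequence without its first methionine. The sketch below reproduces the two residue counts from the gene length under that reading; the function names are hypothetical, and the assumption is that the coding sequence includes the stop codon and that the mature form differs only by removal of the initiator Met.

```python
def residue_counts(cds_length_nt: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a CDS length in bases."""
    if cds_length_nt % 3:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_length_nt // 3 - 1   # drop the stop codon
    mature = translated - 1               # drop the initiator methionine
    return translated, mature


def mature_from_translated(protein: str) -> str:
    """Remove a leading initiator methionine, if present."""
    return protein[1:] if protein.startswith("M") else protein


counts = residue_counts(1932)
trimmed = mature_from_translated("MPEGRGAARGGKM")  # first 13 residues only
print(counts, trimmed)
```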

Specific function: Possible binding protein with either transport or enzymatic activity [H]

COG id: COG4166

COG function: function code E; ABC-type oligopeptide transport system, periplasmic component

Gene ontology:

Cell location: Periplasm (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the bacterial solute-binding protein 5 family [H]

Homologues:

Organism=Escherichia coli, GI87082063, Length=557, Percent_Identity=33.572710951526, Blast_Score=323, Evalue=2e-89
Organism=Escherichia coli, GI1787495, Length=566, Percent_Identity=20.8480565371025, Blast_Score=81, Evalue=2e-16
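
Note: the homologue lines follow a simple comma-separated key=value layout, with the GI number as a bare token. A minimal parsing sketch is given below; the function name and the output keys are hypothetical, and the layout is inferred only from the two lines in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]      # bare token such as GI87082063
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI87082063, Length=557, "
        "Percent_Identity=33.572710951526, Blast_Score=323, Evalue=2e-89,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```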

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000914 [H]

Pfam domain/function: PF00496 SBP_bac_5 [H]

EC number: NA

Molecular weight: Translated: 72587; Mature: 72456
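
Note: the two molecular weights differ by about the mass of one methionine residue, which fits the removal of the initiator Met in the mature form. The short check below assumes the listed values are average masses in daltons and uses 131.19 Da as the average mass of a Met residue; the variable names are hypothetical.

```python
MET_RESIDUE_MASS = 131.19   # average mass of a methionine residue, in Da

translated_mw = 72587       # "Translated" value from this record
# Assumption: the mature protein is the translated protein minus one Met.
mature_mw = round(translated_mw - MET_RESIDUE_MASS)
print(mature_mw)
```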

Theoretical pI: Translated: 9.62; Mature: 9.62

Prosite motif: PS00210 HEMOCYANIN_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)
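
Note: these percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows, assuming one-decimal rounding as in the table above; the function name is hypothetical and the example input is only the first 60 residues of the translated protein, so its values differ from the full-length figures.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# First 60 residues of the translated protein (illustration only).
fragment = "MPEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWA"
print(residue_percent(fragment, "C"),
      residue_percent(fragment, "M"),
      residue_percent(fragment, "CM"))
```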

Secondary structure:

>Translated Secondary Structure
MPEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AHGYALWDDLKYPAGFAHFDYVDPDAPKGGELRMVSNLRYSTFDKYNPFTMKGSPPAYLS
HCCEEEHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHCCCCCCCEEECCCCHHHHH
DLLFESLLAGSMDETASGYGLLAEDVQVPEDRLSATFRLRAEARFHDGSPVEAADVKHTY
HHHHHHHHCCCCCCCCCCCCEEHHCCCCCHHHHCEEEEEEEEEECCCCCCCCHHHHHHHH
ETLVGPHASPSYATLLQEVAGVDVLDRRTVRFRFKHPNRELPLTVGSLPIFSRAWGRQAD
HHHHCCCCCCHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEECCCCHHHHHCCCCCC
GKAKRFDEIVTDIPIGSGPYRIGPVAFGRDITYVRDPQYWGRDLNVNRGAYNFDRITVKI
CHHHHHHHHHHHCCCCCCCCEECCEEECCCCEEEECCHHCCCCCCCCCCCEEEEEEEEEE
YKDNTARLEAVKAGEFDFMTVYSAGDWARRIDGKLFRQGVLVKTELRHRLPAGFQSYVLN
EECCCCEEEEEECCCEEEEEEECCCCHHHHHHHHHHHCCCEEHHHHHHHCCCCHHHHHHH
TRRPMLKDLRVREALGLALDYDWMNRQMFYGGYPRVVDLFGNTDCQATGVPGPEELALLE
CCCHHHHHHHHHHHHCCEEECHHHCCEEEECCCCEEEEEECCCCCEECCCCCHHHEEEEC
PWRGKVPDSVFGPMYTPPVTEGAGHSLRDNLRRARQLLADAGWTYRDGALRNAKGEPMVI
CCCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCEEE
DYLDSKEAGARMVTPWMRNLEKLGITLRFTSVDFALYLQRLDKFDYDMITLAYPGTYNPG
EECCCCCCCCEEHHHHHHHHHHCCEEEEEEHHHHHHHHHHHCCCCCCEEEEECCCCCCCH
QEMLELFGSRRADVEGSSNYSGVKSPAVDALAIALTRAKSKAELLPACRALDRVIMHSHY
HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
LIPQWQLSAHRIVYNQQRLAYHAPMPPYAKAEEWAMFSWWSLK
CCCCCCCCCCCEEECCCCEEEECCCCCCCCCCCCEEEEEEECC
>Mature Secondary Structure 
PEGRGAARGGKMRRPGIRPGKSPGRRGRPATVVQNCRMRLWKKICLLWGICLCALPAWA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AHGYALWDDLKYPAGFAHFDYVDPDAPKGGELRMVSNLRYSTFDKYNPFTMKGSPPAYLS
HCCEEEHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHCCCCCCCEEECCCCHHHHH
DLLFESLLAGSMDETASGYGLLAEDVQVPEDRLSATFRLRAEARFHDGSPVEAADVKHTY
HHHHHHHHCCCCCCCCCCCCEEHHCCCCCHHHHCEEEEEEEEEECCCCCCCCHHHHHHHH
ETLVGPHASPSYATLLQEVAGVDVLDRRTVRFRFKHPNRELPLTVGSLPIFSRAWGRQAD
HHHHCCCCCCHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCEEECCCCHHHHHCCCCCC
GKAKRFDEIVTDIPIGSGPYRIGPVAFGRDITYVRDPQYWGRDLNVNRGAYNFDRITVKI
CHHHHHHHHHHHCCCCCCCCEECCEEECCCCEEEECCHHCCCCCCCCCCCEEEEEEEEEE
YKDNTARLEAVKAGEFDFMTVYSAGDWARRIDGKLFRQGVLVKTELRHRLPAGFQSYVLN
EECCCCEEEEEECCCEEEEEEECCCCHHHHHHHHHHHCCCEEHHHHHHHCCCCHHHHHHH
TRRPMLKDLRVREALGLALDYDWMNRQMFYGGYPRVVDLFGNTDCQATGVPGPEELALLE
CCCHHHHHHHHHHHHCCEEECHHHCCEEEECCCCEEEEEECCCCCEECCCCCHHHEEEEC
PWRGKVPDSVFGPMYTPPVTEGAGHSLRDNLRRARQLLADAGWTYRDGALRNAKGEPMVI
CCCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCEEE
DYLDSKEAGARMVTPWMRNLEKLGITLRFTSVDFALYLQRLDKFDYDMITLAYPGTYNPG
EECCCCCCCCEEHHHHHHHHHHCCEEEEEEHHHHHHHHHHHCCCCCCEEEEECCCCCCCH
QEMLELFGSRRADVEGSSNYSGVKSPAVDALAIALTRAKSKAELLPACRALDRVIMHSHY
HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
LIPQWQLSAHRIVYNQQRLAYHAPMPPYAKAEEWAMFSWWSLK
CCCCCCCCCCCEEECCCCEEEECCCCCCCCCCCCEEEEEEECC
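
Note: the prediction strings above can be summarized as per-state percentages. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not state explicitly; the function name is hypothetical and the input is the first prediction line of the translated structure.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# First prediction line of the translated secondary structure.
first_line = "CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH"
print(ss_composition(first_line))
```

Concatenating all prediction lines before calling the function gives the composition for the whole protein.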

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: ATP; oligopeptides [Periplasm]; H2O [C]

Specific reaction: ATP + oligopeptides [Periplasm] + H2O = ADP + phosphate + oligopeptides [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9163424 [H]