Definition Shigella boydii Sb227, complete genome.
Accession NC_007613
Length 4,519,823

The map label for this gene is usg [H]

Identifier: 82544800

GI number: 82544800

Start: 2344531

End: 2345544

Strand: Reverse

Name: usg [H]

Synonym: SBO_2356

Alternate gene names: 82544800

Gene position: 2345544-2344531 (Counterclockwise)

Preceding gene: 82544801

Following gene: 82544799

Centisome position: 51.89
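
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the gene length is End - Start + 1 and that the centisome position is the gene's 5' end (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state either formula, so both are assumptions, and the function names are illustrative.

```python
GENOME_LENGTH = 4_519_823           # "Length" field for NC_007613
START, END = 2_344_531, 2_345_544   # "Start" and "End" fields
STRAND = "Reverse"                  # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome.

    Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100 * five_prime / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(START, END))
    print(centisome(START, END, STRAND, GENOME_LENGTH))
```

Under these assumptions the results agree with the 1014-base gene sequence and the centisome position listed in this record.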

GC content: 55.42
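
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch, assuming the value above was computed that way over the gene sequence (the record does not say how it was derived); `gc_content` is an illustrative name:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


if __name__ == "__main__":
    # First 24 bases of the gene sequence listed in this record.
    print(gc_content("ATGTCTGAAGGCTGGAACATTGCC"))
```

To check the reported figure, pass the full 1014-base gene sequence with its line breaks removed.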

Gene sequence:

>1014_bases
ATGTCTGAAGGCTGGAACATTGCCGTCCTGGGCGCAACTGGCGCTGTGGGCGAAGCCCTGCTTGAAACGCTGGCTGAACG
TCAGTTCCCGGTTGGGGAAATTTATGCACTGGCACGTAACGAAAGCGCAGGCGAACAACTGCGCTTTGGTGGTAAAACAA
TCACCGTGCAGGATGCCGCTGAATTCGACTGGACGCAGGCGCAGCTGGCATTTTTTGTCGCAGGCAAAGAAGCTACCGCT
GCCTGGGTTGAAGAAGCGACCAACTCAGGTTGCCTGGTGATCGACAGCAGCGGACTGTTTGCTCTCGAACCCGACGTACC
GCTGGTGGTGCCGGAAGTAAACCCGTTTGTACTGACCGATTACCGCAACCGGAATGTCATCGCCGTACCAGACAGTCTGA
CCAGCCAGCTGCTGGCGGCACTGAAACCGTTAATCGATCAGGGCGGTTTGTCGCGTATCAGTGTTACCAGCCTGATTTCA
GCCTCCGCCCAGGGCAAAAAAGCGGTCGATGCGTTAGCGGGGCAGAGTGCGAAATTGCTCAACGGCATTCCGATTGACGA
AGAAGATTTCTTCGGTCGTCAACTGGCGTTCAATATGCTGCCGTTGTTGCCGGATAGCGAAGGTAGCGTGCGTGAAGAAC
GTCGTATCGTTGACGAAGTACGCAAAATCCTGCAGGACGAAGGGCTGATGATTTCGGCAAGCGTCGTCCAGGCACCGGTA
TTCTACGGTCATGCCCAGATGGTCAACTTTGAAGCACTGCGTCCGCTGGCGGCAGAAGAAGCGCGTGATGCGTTTGCTCA
GGGCGAAGATATTGTGCTCTCTGAAGAGAACGAATTCCCGACTCAGGTAGGGGATGCTTCGGGTACTCCGCATCTTTCTG
TTGGCTGCGTGCGTAATGACTACGGTATGCCGGAGCAAGTCCAGTTCTGGTCGGTGGCCGATAACGTTCGCTTTGGCGGC
GCGCTGATGGCAGTAAAAATCGCCGAGAAACTGGTGCAGGAGTATCTGTACTAA

Upstream 100 bases:

>100_bases
TGGGTTTTAACGCCGTTCATCATCCGGCACGTTAATCTCTTCTTCATGCTCTCTGCTGTAACATTGGCAGGGAGCTTTGC
TATTTCTGGAGTAAACCACC

Downstream 100 bases:

>100_bases
TGTCCGACCAGCAACAACTGCCAGTTTATAAAATTGCGCTGGGCATTGAGTACGACGGCAGTAAGTATTACGGCTGGCAA
CGGCAGAATGAAGTCCGCAG

Product: semialdehyde dehydrogenase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 337; Mature: 336
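
The two counts follow from the gene sequence: 1014 bases are 338 codons, the last of which is a stop codon, leaving 337 translated residues. The mature count of 336 is consistent with removal of the initiator methionine. The sketch below translates a coding sequence using the standard codon assignments. It is an illustration only, not the pipeline that produced this record, and the names `CODON_TABLE`, `translate` and `mature` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Assumption: the mature form only lacks the initiator methionine."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    print(translate("ATGTCTGAAGGCTGGAACATTGCC"))  # first 8 codons of the gene
    print(translate("GAGTATCTGTACTAA"))           # last 5 codons, including stop
```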

Protein sequence:

>337_residues
MSEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAAEFDWTQAQLAFFVAGKEATA
AWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTDYRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLIS
ASAQGKKAVDALAGQSAKLLNGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV
FYGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRNDYGMPEQVQFWSVADNVRFGG
ALMAVKIAEKLVQEYLY

Sequences:

>Translated_337_residues
MSEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAAEFDWTQAQLAFFVAGKEATA
AWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTDYRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLIS
ASAQGKKAVDALAGQSAKLLNGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV
FYGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRNDYGMPEQVQFWSVADNVRFGG
ALMAVKIAEKLVQEYLY
>Mature_336_residues
SEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAAEFDWTQAQLAFFVAGKEATAA
WVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTDYRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLISA
SAQGKKAVDALAGQSAKLLNGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPVF
YGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRNDYGMPEQVQFWSVADNVRFGGA
LMAVKIAEKLVQEYLY

Specific function: Unknown

COG id: COG0136

COG function: function code E; Aspartate-semialdehyde dehydrogenase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the aspartate-semialdehyde dehydrogenase family [H]

Homologues:

Organism=Escherichia coli, GI1788658, Length=337, Percent_Identity=99.7032640949555, Blast_Score=678, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR012080
- InterPro:   IPR016040
- InterPro:   IPR000534
- InterPro:   IPR012280 [H]

Pfam domain/function: PF01118 Semialdhyde_dh; PF02774 Semialdhyde_dhC [H]

EC number: NA

Molecular weight: Translated: 36336; Mature: 36205

Theoretical pI: Translated: 4.11; Mature: 4.11

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
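
These percentages can be recomputed by counting C and M residues in the translated sequence listed under "Sequences" and, for the mature form, in the same sequence without its first residue. A minimal sketch, assuming each percentage is a residue count divided by sequence length and rounded to one decimal place; `cys_met_percent` is an illustrative name:

```python
# Translated protein sequence as listed in this record (337 residues).
PROTEIN = (
    "MSEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAAEFDWTQAQLAFFVAGKEATA"
    "AWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTDYRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLIS"
    "ASAQGKKAVDALAGQSAKLLNGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV"
    "FYGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRNDYGMPEQVQFWSVADNVRFGG"
    "ALMAVKIAEKLVQEYLY"
)


def cys_met_percent(seq: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(seq)
    cys = 100 * seq.count("C") / n
    met = 100 * seq.count("M") / n
    return round(cys, 1), round(met, 1), round(cys + met, 1)


if __name__ == "__main__":
    print("Translated:", cys_met_percent(PROTEIN))
    print("Mature:    ", cys_met_percent(PROTEIN[1:]))  # initiator Met removed
```

The translated sequence contains 2 Cys and 6 Met residues; the mature form has one Met fewer, which accounts for the drop from 1.8 to 1.5 %Met.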

Secondary structure:

>Translated Secondary Structure
MSEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAA
CCCCCEEEEEECCCHHHHHHHHHHHHCCCCHHHEEEEECCCCCCCCEECCCEEEEEECCC
EFDWTQAQLAFFVAGKEATAAWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTD
CCCCHHEEEEEEEECCCHHHHHHHHCCCCCEEEEECCCEEEECCCCCEEECCCCCEEEEE
YRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLISASAQGKKAVDALAGQSAKLL
CCCCCEEECCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHCCCCHHHH
NGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV
CCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHH
FYGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRND
EECCHHEECHHHHCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEEEEEECC
YGMPEQVQFWSVADNVRFGGALMAVKIAEKLVQEYLY
CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
SEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAA
CCCCEEEEEECCCHHHHHHHHHHHHCCCCHHHEEEEECCCCCCCCEECCCEEEEEECCC
EFDWTQAQLAFFVAGKEATAAWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTD
CCCCHHEEEEEEEECCCHHHHHHHHCCCCCEEEEECCCEEEECCCCCEEECCCCCEEEEE
YRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLISASAQGKKAVDALAGQSAKLL
CCCCCEEECCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHCCCCHHHH
NGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV
CCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHH
FYGHAQMVNFEALRPLAAEEARDAFAQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRND
EECCHHEECHHHHCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEEEEEECC
YGMPEQVQFWSVADNVRFGGALMAVKIAEKLVQEYLY
CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCC
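
Each prediction line pairs with the residue line above it, one state letter per residue. A rough way to summarize such a string is to tabulate the share of each state. The sketch below assumes the common convention that H is helix, E is strand and C is coil; the record does not define the letters, and `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict[str, float]:
    """Percentage of each predicted state (H, E, C) in a prediction string."""
    counts = Counter(prediction)
    n = len(prediction)
    return {state: round(100 * counts[state] / n, 1) for state in "HEC"}


if __name__ == "__main__":
    # First 60-residue prediction line of the translated protein in this record.
    first_line = "CCCCCEEEEEECCCHHHHHHHHHHHHCCCCHHHEEEEECCCCCCCCEECCCEEEEEECCC"
    print(ss_composition(first_line))
```

Concatenating all prediction lines for one form of the protein before calling the function gives the composition for the whole chain.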

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2991861; 9205837; 9278503; 3029016; 2681152 [H]