Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

The map label for this gene is allD [H]

Identifier: 29142725

GI number: 29142725

Start: 2396850

End: 2397899

Strand: Direct

Name: allD [H]

Synonym: t2333

Alternate gene names: 29142725

Gene position: 2396850-2397899 (Clockwise)

Preceding gene: 29142720

Following gene: 29142726

Centisome position: 50.02
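
The gene length and centisome position follow from the coordinates and the chromosome length listed in this record. The sketch below assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state the formula, so treat this as an interpretation. The function names are illustrative.

```python
# Values copied from the record; the formulas are assumptions.
GENOME_LENGTH = 4_791_961   # chromosome length (bp)
GENE_START = 2_396_850      # first base of the gene (1-based, inclusive)
GENE_END = 2_397_899        # last base of the gene (1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length."""
    return round(100.0 * start / genome_length, 2)


length_bp = gene_length(GENE_START, GENE_END)
centisome_pos = centisome(GENE_START, GENOME_LENGTH)
print(length_bp, centisome_pos)
```

Under these assumptions the result agrees with the 1050-base gene sequence and the centisome value given in the record.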

GC content: 49.71

Gene sequence:

>1050_bases
ATGAAAATCAGTCGGGAAACACTCCATCAGCTTATCGAAAATAAGCTTTATAAAGCCGGACTAAAACGTGAGCACGCCGC
CATCGTCGCCGACGTACTGGTTTATGCAGATGCCAGAGGTATTCACTCACATGGCGCCGTACGCGTTGAATATTATGCCG
AACGAATTTCAAAAGGCGGCACCAACCGGGAGCCGACGTTCCGCATTGAGAATACCGGTCCCTGTACGGCGATACTGCAT
GCCGATAATGCTGCCGGACAGGTCGCAGCCAAAATGGGAATGGAGCATGCTATTGAAATAGCCAAAAAAAATGGCGTCGC
GGTTGTCGGCATTAGCAGAATGGGCCATAGCGGCGCCATCTCTTATTTCGTCCGCCAGGCCGCTCGCGAAGGCCTGATTG
GTCTGTCTATCTGTCAGTCCGATCCTATGGTCGTGCCGTTTGGCGGGGCGGATATTTACTATGGCACTAATCCGCTGGCC
TTTGCCGCGCCGAGCGAAGGCGATGACATCATTACCTTTGATATGGCCACCACCGTGCAGGCCTGGGGAAAAGTCCTCGA
TGCACGGTCTCGCAATGAGTCCATTCCGGAGAGTTGGGCCGTCGATAAAAACGGCGCGCCGACACATGATCCTTTTGCCG
TCAATGCGTTATTACCCGCCGCAGGCCCGAAAGGTTACGGCCTGATGATGATGATCGATATTTTGTCCGGCATTCTGCTG
GGGCTGCCGTTTGGTCGCCAGGTCAGTTCGATGTATGAAGATTTACACGCCGGACGCAATTTAGGACAACTTCATCTGGT
CATTAATCCGGCGTTCTTTTCTTCCTGTGAATTATTCCGCAAACATATTAGTCAGACTATGCAGGAACTCAATTCCGTGA
AGCCCGCTCCCGGTTTTAAACAGGTTTATTATCCTGGACAGGATCAGGATATTAAACAGAAAAATGCCGATATGAATGGT
ATCGATATTGTTGATGATATTTATCAATATCTGATTTCCGATGCCCTCTATCTCAAGTCATACGAAACAAAAAATCCCTT
TGCCCAATAA
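
A GC content figure such as the one in this record is conventionally the percentage of G and C bases in the sequence. The sketch below shows that calculation on a short fragment (the first 15 bases of the gene); the function name and the rounding to two decimals are assumptions, not the database's documented method.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc_count / len(bases), 2)


# First 15 bases of the gene sequence, used only as a small example.
fragment = "ATGAAAATCAGTCGG"
fragment_gc = gc_percent(fragment)
print(fragment_gc)
```

Passing the full 1050-base sequence (with line breaks removed) to the same function gives a value to compare with the GC content field.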

Upstream 100 bases:

>100_bases
CTAATTATATTTTTTTGCCTGTCTGGATCACATAATCATTTTATTTTCCCGGTATGTTAATCGCAGTCATGCTTCACACC
GTCGTTAAAAAGGAAGACAG

Downstream 100 bases:

>100_bases
TTATTGAAACAGGAATTTTCTATGATAAAGCATTTTCGCCACGCGATAGAAGAAACATTGCCCTGGCTCTCTTCCATTGG
AGCCGATCCTACGGGTGGAA

Product: ureidoglycolate dehydrogenase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 349; Mature: 349

Protein sequence:

>349_residues
MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGGTNREPTFRIENTGPCTAILH
ADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAISYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLA
FAAPSEGDDIITFDMATTVQAWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL
GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFKQVYYPGQDQDIKQKNADMNG
IDIVDDIYQYLISDALYLKSYETKNPFAQ
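
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds the standard genetic code table and translates up to the first stop codon; it is a minimal illustration rather than the pipeline used to produce this record, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


n_terminus = translate("ATGAAAATCAGTCGG")   # first five codons of the gene
c_terminus = translate("GCCCAATAA")         # last two codons plus the stop
print(n_terminus, c_terminus)
```

The two fragments correspond to the start (MKISR) and end (AQ) of the 349-residue protein; a 1050-base gene holds 349 sense codons plus one stop codon.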

Sequences:

>Translated_349_residues
MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGGTNREPTFRIENTGPCTAILH
ADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAISYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLA
FAAPSEGDDIITFDMATTVQAWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL
GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFKQVYYPGQDQDIKQKNADMNG
IDIVDDIYQYLISDALYLKSYETKNPFAQ
>Mature_349_residues
MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGGTNREPTFRIENTGPCTAILH
ADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAISYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLA
FAAPSEGDDIITFDMATTVQAWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL
GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFKQVYYPGQDQDIKQKNADMNG
IDIVDDIYQYLISDALYLKSYETKNPFAQ

Specific function: Involved in the anaerobic utilization of allantoin [H]

COG id: COG2055

COG function: function code C; Malate/L-lactate dehydrogenases

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the LDH2/MDH2 oxidoreductase family [H]

Homologues:

Organism=Escherichia coli, GI1786727, Length=349, Percent_Identity=86.2464183381089, Blast_Score=641, Evalue=0.0
Organism=Escherichia coli, GI1787020, Length=356, Percent_Identity=27.8089887640449, Blast_Score=121, Evalue=8e-29
Organism=Escherichia coli, GI1790000, Length=333, Percent_Identity=23.7237237237237, Blast_Score=91, Evalue=1e-19
Organism=Caenorhabditis elegans, GI17507179, Length=311, Percent_Identity=32.1543408360129, Blast_Score=152, Evalue=2e-37
Organism=Caenorhabditis elegans, GI17536633, Length=312, Percent_Identity=31.4102564102564, Blast_Score=141, Evalue=5e-34
Organism=Drosophila melanogaster, GI45553211, Length=311, Percent_Identity=31.5112540192926, Blast_Score=141, Evalue=6e-34
Organism=Drosophila melanogaster, GI24667912, Length=311, Percent_Identity=31.5112540192926, Blast_Score=141, Evalue=6e-34
Organism=Drosophila melanogaster, GI24667904, Length=311, Percent_Identity=31.5112540192926, Blast_Score=141, Evalue=7e-34
Organism=Drosophila melanogaster, GI24667908, Length=311, Percent_Identity=31.5112540192926, Blast_Score=141, Evalue=8e-34
Organism=Drosophila melanogaster, GI24658010, Length=303, Percent_Identity=28.0528052805281, Blast_Score=115, Evalue=4e-26
Organism=Drosophila melanogaster, GI24648360, Length=339, Percent_Identity=24.7787610619469, Blast_Score=96, Evalue=3e-20
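
Each homologue entry is a comma-separated list of key=value fields, with the GI number given as a bare `GI<digits>` token. A minimal parsing sketch follows; the function name and the choice of output types are assumptions made for illustration, and an optional trailing comma is tolerated.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dictionary of typed fields."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        key, sep, value = part.partition("=")
        if sep:
            record[key] = value
        elif part.startswith("GI"):
            record["GI"] = part[2:]      # bare token such as GI1786727
    # Convert the numeric fields; the key names come from the record itself.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


best_hit = parse_homologue(
    "Organism=Escherichia coli, GI1786727, Length=349, "
    "Percent_Identity=86.2464183381089, Blast_Score=641, Evalue=0.0,"
)
print(best_hit["Organism"], best_hit["GI"], best_hit["Blast_Score"])
```

Parsed records can then be sorted or filtered, for example by `Blast_Score` or `Evalue`.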

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003767
- InterPro:   IPR017590 [H]

Pfam domain/function: PF02615 Ldh_2 [H]

EC number: 1.1.1.154 [H]

Molecular weight: Translated: 38090; Mature: 38090

Theoretical pI: Translated: 6.74; Mature: 6.74

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.3 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.3 %Cys+Met (Mature Protein)
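
The Cys/Met percentages can be recomputed from the protein sequence by counting residues. The sketch below assumes the figures are residue counts divided by the protein length and rounded to one decimal place; the helper name `residue_percent` is hypothetical.

```python
# Protein sequence copied from this record (349 residues).
PROTEIN = (
    "MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGG"
    "TNREPTFRIENTGPCTAILHADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAI"
    "SYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLAFAAPSEGDDIITFDMATTVQ"
    "AWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL"
    "GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFK"
    "QVYYPGQDQDIKQKNADMNGIDIVDDIYQYLISDALYLKSYETKNPFAQ"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
cys_met = residue_percent(PROTEIN, "CM")
print(cys, met, cys_met)
```

The sequence contains 3 cysteines and 12 methionines, which reproduces the 0.9, 3.4 and 4.3 percent figures listed here.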

Secondary structure:

>Translated Secondary Structure
MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGG
CCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCEEHHHHHHHHHCCC
TNREPTFRIENTGPCTAILHADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAI
CCCCCCEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHH
SYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLAFAAPSEGDDIITFDMATTVQ
HHHHHHHHHCCCEEEEEECCCCEEEEECCCEEEECCCCEEEECCCCCCCEEEEEHHHHHH
AWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL
HHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH
GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFK
CCCCCHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
QVYYPGQDQDIKQKNADMNGIDIVDDIYQYLISDALYLKSYETKNPFAQ
EEECCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
MKISRETLHQLIENKLYKAGLKREHAAIVADVLVYADARGIHSHGAVRVEYYAERISKGG
CCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCEEHHHHHHHHHCCC
TNREPTFRIENTGPCTAILHADNAAGQVAAKMGMEHAIEIAKKNGVAVVGISRMGHSGAI
CCCCCCEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHH
SYFVRQAAREGLIGLSICQSDPMVVPFGGADIYYGTNPLAFAAPSEGDDIITFDMATTVQ
HHHHHHHHHCCCEEEEEECCCCEEEEECCCEEEECCCCEEEECCCCCCCEEEEEHHHHHH
AWGKVLDARSRNESIPESWAVDKNGAPTHDPFAVNALLPAAGPKGYGLMMMIDILSGILL
HHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH
GLPFGRQVSSMYEDLHAGRNLGQLHLVINPAFFSSCELFRKHISQTMQELNSVKPAPGFK
CCCCCHHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC
QVYYPGQDQDIKQKNADMNGIDIVDDIYQYLISDALYLKSYETKNPFAQ
EEECCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]