Definition Shigella flexneri 2a str. 2457T, complete genome.
Accession NC_004741
Length 4,599,354

The map label for this gene is yneH

Identifier: 30063073

GI number: 30063073

Start: 1640697

End: 1641623

Strand: Direct

Name: yneH

Synonym: S1696

Alternate gene names: 30063073

Gene position: 1640697-1641623 (Clockwise)

Preceding gene: 30063070

Following gene: 30063074

Centisome position: 35.67
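
The coordinates above can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and assumes that the centisome position is the gene start expressed as a percentage of the genome length; the record itself does not state the formula, and the function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and genome Length.
print(gene_length(1640697, 1641623))           # 927
print(round(centisome(1640697, 4599354), 2))   # 35.67
```

Under these assumptions both results agree with the 927-base gene sequence and the centisome value listed in this record.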

GC content: 51.78
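
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the value here was computed over the gene sequence only; the function name is illustrative.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a FASTA body) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy example: 4 of 8 bases are G or C.
print(gc_content("GGCC\naatt"))  # 50.0
```

Passing the 927-base gene sequence from this record to `gc_content` should give a value comparable to the one listed, if the same definition was used.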

Gene sequence:

>927_bases
GTGGCAGTCGCCATGGATAATGCAATTTTAGAAAACATCTTGCGGCAAGTGCGGCCGCTCATTGGTCAGGGTAAAGTCGC
GGATTATATTCCGGCGCTGGCTACAGTAGACGGTTCCCGATTGGGGATTGCTATCTGTACCGTTGACGGACAGCTTTTTC
AGGCCGGAGACGCGCAAGAACGTTTTTCCATTCAGTCTATTTCCAAAGTGCTGAGTCTCGTTGTCGCCATGCGTCATTAC
TCCGAAGAGGAAATCTGGCAACGCGTCGGCAAAGATCCGTCTGGATCACCGTTCAATTCTTTAGTGCAACTGGAAATGGA
ACAGGGTATTCCGCGTAATCCGTTCATTAATACCGGTGCGCTGGTGGTCTGCGATATGTTGCAAGGGCGACTAAGCGCAC
CACGGCAACGTATGCTGGAAGTCGTGCGCGGCTTAAGCGGTGTGTCTGATATTTCATACGATACGGTGGTAGCGCGTTCC
GAATTTGAACATTCCGCGCGAAATGCGGCTATCGCCTGGCTGATGAAGTCGTTTGGCAATTTCCATCATGACGTGACAAC
CGTTCTGCAAAACTACTTTCATTACTGCGCTCTGAAAATGAGCTGTGTAGAGCTGGCCCGGACGTTTGTCTTTCTGGCTA
ATCAGGGGAAAGCTATTCATATTGATGAACCTGTGGTGACGCCAATGCAGGCGCGGCAAATTAACGCGCTGATGGCGACC
AGTGGTATGTACCAGAACGCGGGGGAGTTTGCCTGGCGGGTGGGACTACCGGCGAAATCTGGCGTTGGTGGCGGTATTGT
GGCGATTGTTCCGCATGAAATGGCCATCGCAGTCTGGAGTCCGGAACTGGATGATGCAGGTAACTCGCTTGCGGGTATCG
CCGTCCTTGAACAATTGACGAAACAGTTAGGGCGTTCGGTTTATTAA
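
The gene sequence starts with GTG, yet the protein sequence in this record starts with M. The sketch below illustrates why, assuming the standard codon table and that an initiating GTG is read as methionine, as is usual for bacterial start codons. The helper name `translate` is illustrative.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    The first codon is always emitted as M, on the assumption that it is
    a start codon (ATG, GTG, or TTG in bacteria).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First five codons of the gene sequence plus its TAA stop codon.
print(translate("GTGGCAGTCGCCATGTAA"))  # MAVAM
```

The 927 bases correspond to 309 codons, that is 308 residues plus the terminal TAA stop, which is consistent with the translated length listed in this record.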

Upstream 100 bases:

>100_bases
CTGTAATATCCAGACAGTGTGGAAAGACCGGATCTGACGCGACATATTCAGCTCTGATATACTCGCAGGTCTTTTCAGAC
CTGCGGTCCAGGAGTAGAAA

Downstream 100 bases:

>100_bases
TGCAGTCTCTCGATCCACTCTTCGCGCGTTTATCCCGTTCAAAATTTCGCTCTCGCTTTCGTCTGGGCATGAAAGAGCGT
CAGTATTGCCTGGAGAAAGG

Product: glutaminase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 308; Mature: 307

Protein sequence:

>308_residues
MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY
SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS
EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY

Sequences:

>Translated_308_residues
MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY
SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS
EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY
>Mature_307_residues
AVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHYS
EEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSE
FEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMATS
GMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY

Specific function: Unknown

COG id: COG2066

COG function: function code E; Glutaminase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glutaminase family [H]

Homologues:

Organism=Homo sapiens, GI156104878, Length=280, Percent_Identity=37.5, Blast_Score=184, Evalue=1e-46,
Organism=Homo sapiens, GI20336214, Length=294, Percent_Identity=33.6734693877551, Blast_Score=164, Evalue=1e-40,
Organism=Escherichia coli, GI1787804, Length=308, Percent_Identity=99.6753246753247, Blast_Score=634, Evalue=0.0,
Organism=Escherichia coli, GI1786693, Length=286, Percent_Identity=38.1118881118881, Blast_Score=196, Evalue=2e-51,
Organism=Caenorhabditis elegans, GI193204073, Length=309, Percent_Identity=35.5987055016181, Blast_Score=180, Evalue=8e-46,
Organism=Caenorhabditis elegans, GI193204075, Length=311, Percent_Identity=35.3697749196142, Blast_Score=179, Evalue=1e-45,
Organism=Caenorhabditis elegans, GI17507019, Length=291, Percent_Identity=36.0824742268041, Blast_Score=177, Evalue=5e-45,
Organism=Caenorhabditis elegans, GI17532727, Length=293, Percent_Identity=33.4470989761092, Blast_Score=172, Evalue=2e-43,
Organism=Drosophila melanogaster, GI281363241, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47,
Organism=Drosophila melanogaster, GI24653164, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47,
Organism=Drosophila melanogaster, GI281363239, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47,
Organism=Drosophila melanogaster, GI24653162, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47,
Organism=Drosophila melanogaster, GI24653156, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47,
Organism=Drosophila melanogaster, GI24653158, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47,
Organism=Drosophila melanogaster, GI24653166, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=5e-47,
Organism=Drosophila melanogaster, GI116008307, Length=285, Percent_Identity=35.7894736842105, Blast_Score=184, Evalue=6e-47,
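
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows; it assumes organism names contain no commas, and the function name and lower-cased keys are illustrative choices.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.lower()] = value
        elif field.startswith("GI"):
            record["gi"] = field[2:]
    # Convert the numeric fields from text.
    record["length"] = int(record["length"])
    record["percent_identity"] = float(record["percent_identity"])
    record["blast_score"] = int(record["blast_score"])
    record["evalue"] = float(record["evalue"])
    return record


line = ("Organism=Homo sapiens, GI156104878, Length=280, "
        "Percent_Identity=37.5, Blast_Score=184, Evalue=1e-46,")
hit = parse_homologue(line)
print(hit["organism"], hit["gi"], hit["percent_identity"])
```

Parsed records can then be sorted or filtered, for example by `evalue` or `percent_identity`.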

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR012338
- InterPro:   IPR015868 [H]

Pfam domain/function: PF04960 Glutaminase [H]

EC number: 3.5.1.2 [H]

Molecular weight: Translated: 33546; Mature: 33415
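
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water. The sketch below assumes that method; the mass table holds commonly tabulated average values and is not taken from this record, so small differences from the listed figures are possible.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[residue] for residue in seq) + WATER


# Removing an N-terminal Met lowers the mass by one Met residue.
difference = molecular_weight("MAVAM") - molecular_weight("AVAM")
print(round(difference, 2))  # 131.19
```

The difference between the translated and mature weights listed here (33546 - 33415 = 131) is consistent with the loss of a single methionine residue.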

Theoretical pI: Translated: 6.40; Mature: 6.40

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
5.2 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
3.6 %Met     (Mature Protein)
4.9 %Cys+Met (Mature Protein)
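
The percentages above are residue counts divided by sequence length. A minimal sketch of that calculation is given here, with an illustrative function name and a toy peptide rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    return 100.0 * sum(1 for aa in seq if aa in wanted) / len(seq)


toy = "MACM"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Applied to the 308-residue translated sequence and the 307-residue mature sequence, the same function would give the Cys, Met, and Cys+Met figures; the mature Met value is lower because the initiator methionine is absent.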

Secondary structure:

>Translated Secondary Structure
MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQE
CCEEHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEEECCHHHCCCCHHH
RFSIQSISKVLSLVVAMRHYSEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCH
LVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSEFEHSARNAAIAWLMKSFGN
HHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC
FHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHH
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLT
CCCCCCCCCEEEEECCCCCCCCCCCEEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHH
KQLGRSVY
HHHCCCCC
>Mature Secondary Structure 
AVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQE
CEEHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEEECCHHHCCCCHHH
RFSIQSISKVLSLVVAMRHYSEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCH
LVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSEFEHSARNAAIAWLMKSFGN
HHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC
FHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHH
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLT
CCCCCCCCCEEEEECCCCCCCCCCCEEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHH
KQLGRSVY
HHHCCCCC
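
The prediction lines above use one letter per residue. Assuming the common convention that H is helix, E is strand, and C is coil (the record does not define the letters), the composition of a prediction can be summarized as follows; the function name is illustrative.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


# Toy example: two helix, one strand, one coil position.
print(ss_composition("HHEC"))
```

To apply this to the record, concatenate only the prediction lines (every second line of each block) before calling the function.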

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]