Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is yneH
Identifier: 30063073
GI number: 30063073
Start: 1640697
End: 1641623
Strand: Direct
Name: yneH
Synonym: S1696
Alternate gene names: 30063073
Gene position: 1640697-1641623 (Clockwise)
Preceding gene: 30063070
Following gene: 30063074
Centisome position: 35.67
GC content: 51.78
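The coordinate fields above are related by simple arithmetic. A minimal sketch follows, assuming that the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state this formula, but it reproduces the listed value. The function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_599_354
GENE_START = 1_640_697
GENE_END = 1_641_623


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
position = round(centisome(GENE_START, GENOME_LENGTH), 2)
print(length, position)
```

With the values from this record the sketch gives 927 bases and a centisome position of 35.67, matching the gene sequence header and the Centisome position field.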
Gene sequence:
>927_bases GTGGCAGTCGCCATGGATAATGCAATTTTAGAAAACATCTTGCGGCAAGTGCGGCCGCTCATTGGTCAGGGTAAAGTCGC GGATTATATTCCGGCGCTGGCTACAGTAGACGGTTCCCGATTGGGGATTGCTATCTGTACCGTTGACGGACAGCTTTTTC AGGCCGGAGACGCGCAAGAACGTTTTTCCATTCAGTCTATTTCCAAAGTGCTGAGTCTCGTTGTCGCCATGCGTCATTAC TCCGAAGAGGAAATCTGGCAACGCGTCGGCAAAGATCCGTCTGGATCACCGTTCAATTCTTTAGTGCAACTGGAAATGGA ACAGGGTATTCCGCGTAATCCGTTCATTAATACCGGTGCGCTGGTGGTCTGCGATATGTTGCAAGGGCGACTAAGCGCAC CACGGCAACGTATGCTGGAAGTCGTGCGCGGCTTAAGCGGTGTGTCTGATATTTCATACGATACGGTGGTAGCGCGTTCC GAATTTGAACATTCCGCGCGAAATGCGGCTATCGCCTGGCTGATGAAGTCGTTTGGCAATTTCCATCATGACGTGACAAC CGTTCTGCAAAACTACTTTCATTACTGCGCTCTGAAAATGAGCTGTGTAGAGCTGGCCCGGACGTTTGTCTTTCTGGCTA ATCAGGGGAAAGCTATTCATATTGATGAACCTGTGGTGACGCCAATGCAGGCGCGGCAAATTAACGCGCTGATGGCGACC AGTGGTATGTACCAGAACGCGGGGGAGTTTGCCTGGCGGGTGGGACTACCGGCGAAATCTGGCGTTGGTGGCGGTATTGT GGCGATTGTTCCGCATGAAATGGCCATCGCAGTCTGGAGTCCGGAACTGGATGATGCAGGTAACTCGCTTGCGGGTATCG CCGTCCTTGAACAATTGACGAAACAGTTAGGGCGTTCGGTTTATTAA
Upstream 100 bases:
>100_bases CTGTAATATCCAGACAGTGTGGAAAGACCGGATCTGACGCGACATATTCAGCTCTGATATACTCGCAGGTCTTTTCAGAC CTGCGGTCCAGGAGTAGAAA
Downstream 100 bases:
>100_bases TGCAGTCTCTCGATCCACTCTTCGCGCGTTTATCCCGTTCAAAATTTCGCTCTCGCTTTCGTCTGGGCATGAAAGAGCGT CAGTATTGCCTGGAGAAAGG
Product: glutaminase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 308; Mature: 307
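The two residue counts follow from the 927-base gene length. The sketch below assumes that the last codon is a stop codon (the gene sequence ends in TAA) and that the mature form differs from the translated form only by removal of the initiator methionine, as the mature sequence in this record suggests. The helper name is hypothetical.

```python
def protein_lengths(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one terminal stop codon and that the
    mature protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(protein_lengths(927))
```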
Protein sequence:
>308_residues MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY
Sequences:
>Translated_308_residues MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY
>Mature_307_residues AVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHYS EEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSE FEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMATS GMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY
Specific function: Unknown
COG id: COG2066
COG function: function code E; Glutaminase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glutaminase family [H]
Homologues:
Organism=Homo sapiens, GI156104878, Length=280, Percent_Identity=37.5, Blast_Score=184, Evalue=1e-46
Organism=Homo sapiens, GI20336214, Length=294, Percent_Identity=33.6734693877551, Blast_Score=164, Evalue=1e-40
Organism=Escherichia coli, GI1787804, Length=308, Percent_Identity=99.6753246753247, Blast_Score=634, Evalue=0.0
Organism=Escherichia coli, GI1786693, Length=286, Percent_Identity=38.1118881118881, Blast_Score=196, Evalue=2e-51
Organism=Caenorhabditis elegans, GI193204073, Length=309, Percent_Identity=35.5987055016181, Blast_Score=180, Evalue=8e-46
Organism=Caenorhabditis elegans, GI193204075, Length=311, Percent_Identity=35.3697749196142, Blast_Score=179, Evalue=1e-45
Organism=Caenorhabditis elegans, GI17507019, Length=291, Percent_Identity=36.0824742268041, Blast_Score=177, Evalue=5e-45
Organism=Caenorhabditis elegans, GI17532727, Length=293, Percent_Identity=33.4470989761092, Blast_Score=172, Evalue=2e-43
Organism=Drosophila melanogaster, GI281363241, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47
Organism=Drosophila melanogaster, GI24653164, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47
Organism=Drosophila melanogaster, GI281363239, Length=294, Percent_Identity=35.0340136054422, Blast_Score=185, Evalue=4e-47
Organism=Drosophila melanogaster, GI24653162, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47
Organism=Drosophila melanogaster, GI24653156, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47
Organism=Drosophila melanogaster, GI24653158, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=4e-47
Organism=Drosophila melanogaster, GI24653166, Length=294, Percent_Identity=35.0340136054422, Blast_Score=184, Evalue=5e-47
Organism=Drosophila melanogaster, GI116008307, Length=285, Percent_Identity=35.7894736842105, Blast_Score=184, Evalue=6e-47
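The homologue entries above share a fixed field order (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so they can be read into structured records. The following is a minimal sketch using only the Python standard library. The `Homologue` type and `parse_homologues` function are illustrative names, not part of any database tool, and the sample text is two entries copied from this record.

```python
import re
from typing import NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: int
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One record: fields in the order used by this listing, separated by commas.
_RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[0-9.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_homologues(text: str) -> list[Homologue]:
    """Extract every homologue record found in the text."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=int(m["gi"]),
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _RECORD.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI156104878, Length=280, Percent_Identity=37.5, "
    "Blast_Score=184, Evalue=1e-46, "
    "Organism=Escherichia coli, GI1787804, Length=308, "
    "Percent_Identity=99.6753246753247, Blast_Score=634, Evalue=0.0, "
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h.percent_identity)
print(best.organism, best.gi)
```

Because the pattern matches record by record, it works whether the entries are comma-joined on one line or listed one per line.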
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012338 - InterPro: IPR015868 [H]
Pfam domain/function: PF04960 Glutaminase [H]
EC number: 3.5.1.2 [H]
Molecular weight: Translated: 33546; Mature: 33415
Theoretical pI: Translated: 6.40; Mature: 6.40
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
3.9 %Met (Translated Protein)
5.2 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
3.6 %Met (Mature Protein)
4.9 %Cys+Met (Mature Protein)
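The Cys/Met percentages above can be recomputed from the protein sequence by counting residues. The sketch below assumes each figure is the residue count divided by the sequence length, rounded to one decimal place. The record does not document its method, but this reproduces the listed values. The sequence is copied from the Protein sequence field, and the mature form is taken to be the same sequence without the initial methionine.

```python
# Translated protein sequence copied from the record (308 residues).
PROTEIN = (
    "MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY"
    "SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS"
    "EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT"
    "SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY"
)
MATURE = PROTEIN[1:]  # assumption: only the initiator Met is removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", PROTEIN), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```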
Secondary structure:
>Translated Secondary Structure
MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQE
CCEEHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEEECCHHHCCCCHHH
RFSIQSISKVLSLVVAMRHYSEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCH
LVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSEFEHSARNAAIAWLMKSFGN
HHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC
FHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHH
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLT
CCCCCCCCCEEEEECCCCCCCCCCCEEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHH
KQLGRSVY
HHHCCCCC
>Mature Secondary Structure
AVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQE
CEEHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEEEEECCHHHCCCCHHH
RFSIQSISKVLSLVVAMRHYSEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINTGA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCH
LVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARSEFEHSARNAAIAWLMKSFGN
HHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC
FHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHH
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLT
CCCCCCCCCEEEEECCCCCCCCCCCEEEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHH
KQLGRSVY
HHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]