Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is sfcA
Identifier: 30063224
GI number: 30063224
Start: 1819537
End: 1821234
Strand: Direct
Name: sfcA
Synonym: S1879
Alternate gene names: 30063224
Gene position: 1819537-1821234 (Clockwise)
Preceding gene: 30063220
Following gene: 30063225
Centisome position: 39.56
GC content: 51.88
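The positional fields above are related by simple arithmetic. The following is a minimal sketch of how they can be cross-checked. It assumes 1-based inclusive coordinates, and it assumes the centisome position is the gene start expressed as a percentage of the genome length; the record does not state this definition, but it reproduces the listed value. The function names are illustrative and do not belong to any database API.

```python
GENOME_LENGTH = 4_599_354  # "Length" field of NC_004741
GENE_START = 1_819_537     # "Start" field
GENE_END = 1_821_234       # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


length = gene_length(GENE_START, GENE_END)
print(length)                                          # 1698
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 39.56
print(length // 3 - 1)                                 # 565 (codons minus the stop codon)
print(gc_content("ATGGAACC"))                          # 50.0 (toy input: first 8 bases of the gene)
```

Passing the full 1698-base gene sequence to `gc_content` should give a value comparable to the GC content field.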
Gene sequence:
>1698_bases ATGGAACCAAAAACAAAAAAACAGCGTTCGCTTTATATCCCTTACGCTGGCCCTGTACTGCTGGAATTTCCGTTGTTGAA TAAAGGCAGTGCCTTCAGCATGGAAGAACGCCGTAACTTCAACCTGCTGGGGTTACTGCCGGAAGTGGTCGAAACCATCG AAGAACAAGCGGAACGAGCATGGATCCAGTATCAGGGATTCAAAACCGAAATCGACAAACACATCTACCTGCGTAACATC CAGGACACCAACGAAACCCTCTTCTACCGTCTGGTAAACAATCATCTTGATGAGATGATGCCTGTTATTTATACCCCAAC CGTCGGCGCAGCCTGTGAGCGTTTTTCTGAGATCTACCGCCGTTCACGCGGCGTGTTTATCTCTTACCAGAACCGCCACA ATATGGACGATATTCTGCAAAACGTGCCGAACCATAATATTAAAGTGATTGTGGTGACTGACGGTGAACGTATTCTGGGG CTTGGTGACCAGGGCATCGGCGGGATGGGCATTCCGATCGGTAAACTGTCGCTCTATACCGCCTGTGGCGGCATCAGCCC GGCGTATACCCTTCCGGTGGTGCTGGATGTCGGAACGAACAACCAACAGCTGCTTAACGATCCGCTGTATATGGGCTGGC GTAATCCGCGTATCACTGACGATGAATACTATGAATTCGTTGATGAATTTATCCAGGCTGTGAAACAACGCTGGCCGGAC GTGCTGTTGCAGTTTGAAGACTTTGCTCAAAAAAATGCGATGCCGTTACTTAACCGCTATCGCAATGAAATTTGTTCTTT TAACGATGACATTCAGGGCACTGCGGCGGTAACAGTCGGCACACTGATCGCAGCCAGCCGCGCGGCAGGTGGTCAGTTAA GCGAGAAAAAAATCGTCTTCCTTGGCGCAGGTTCAGCGGGATGCGGCATTGCCGAAATGATCATCGCCCAGACCCAGCGC GAAGGATTAAGCGAGGAAGCAGCGCGGCAGAAAGTCTTTATGGTCGATCGCTTTGGCCTGCTGACGGACAAGATGCCGAA CCTGCTGCCTTTCCAGACCAAACTGGTGCAGAAGCGCGAAAACCTCAGTGACTGGGATACCGACAGCGATGTGTTGTCAC TGCTGGATGTGGTGCGCAATGTAAAACCAGATATTCTGATTGGCGTCTCAGGACAGACCGGGCTGTTTACGGAAGAGATC ATCCGTGAGATGCATAAACACTGTCCGCGTCCGATCGTGATGCCGCTGTCTAACCCGACGTCTCGCGTGGAAGCCACACC GCAGGACATTATCGCCTGGACCGAAGGTAACGCGCTGGTCGCCACTGGCAGCCCGTTTAATCCTGTGGTATGGAAAGATA AAATCTACCCTATCGCCCAGTGTAACAACGCCTTTATTTTCCCGGGCATCGGCCTGGGTGTTATTGCTTCCGGCGCGTCA CGTATCACCGATGAGATGCTGATGTCGGCAAGTGAAACGCTTGCTCAGTATTCGCCGCTGGTCCTGAACGGCGAAGGTCT GGTACTACCGGAACTGAAAGATATTCAGAAAGTCTCCCGCGCAATTGCGTTTGCGGTTGGCAAAATGGCGCAGCAGCAAG GCGTGGCGGTCAAAACGTCTGCTGAAGCTTTGCAACAAGCCATTGACGATAATTTCTGGCAAGCCGAATACCGCGACTAC CGCCGTACCTCCATCTAA
Upstream 100 bases:
>100_bases GGTGTTTTTATCTGCTTTATACTTGGGGACGACGCCCTGGCGGTAAAGCAAAGACGATAAAAGCGTGCCAGGGATGGATA TTCAAAAAAGAGTGAGTGAC
Downstream 100 bases:
>100_bases GCCTGCACCCGGTAGTGAAGGCTACCGGGCTATTTCCCTCTCCCTTTTTCAGATCTCATCCATACTGGGTAGTGGCGAAT AAATCTCATTTGCCTCACCT
Product: malate dehydrogenase
Products: NA
Alternate protein names: NAD-ME [H]
Number of amino acids: Translated: 565; Mature: 565
Protein sequence:
>565_residues MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERAWIQYQGFKTEIDKHIYLRNI QDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYRRSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILG LGDQGIGGMGIPIGKLSLYTACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVFLGAGSAGCGIAEMIIAQTQR EGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRENLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEI IREMHKHCPRPIVMPLSNPTSRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS RITDEMLMSASETLAQYSPLVLNGEGLVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTSAEALQQAIDDNFWQAEYRDY RRTSI
Sequences:
>Translated_565_residues MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERAWIQYQGFKTEIDKHIYLRNI QDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYRRSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILG LGDQGIGGMGIPIGKLSLYTACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVFLGAGSAGCGIAEMIIAQTQR EGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRENLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEI IREMHKHCPRPIVMPLSNPTSRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS RITDEMLMSASETLAQYSPLVLNGEGLVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTSAEALQQAIDDNFWQAEYRDY RRTSI >Mature_565_residues MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERAWIQYQGFKTEIDKHIYLRNI QDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYRRSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILG LGDQGIGGMGIPIGKLSLYTACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVFLGAGSAGCGIAEMIIAQTQR EGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRENLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEI IREMHKHCPRPIVMPLSNPTSRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS RITDEMLMSASETLAQYSPLVLNGEGLVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTSAEALQQAIDDNFWQAEYRDY RRTSI
Specific function: Unknown
COG id: COG0281
COG function: function code C; Malic enzyme
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the malic enzymes family [H]
Homologues:
Organism=Homo sapiens, GI4505143, Length=540, Percent_Identity=42.7777777777778, Blast_Score=455, Evalue=1e-128
Organism=Homo sapiens, GI62420882, Length=515, Percent_Identity=44.0776699029126, Blast_Score=436, Evalue=1e-122
Organism=Homo sapiens, GI62420880, Length=515, Percent_Identity=44.0776699029126, Blast_Score=436, Evalue=1e-122
Organism=Homo sapiens, GI239049447, Length=515, Percent_Identity=44.0776699029126, Blast_Score=436, Evalue=1e-122
Organism=Homo sapiens, GI4505145, Length=552, Percent_Identity=40.9420289855072, Blast_Score=419, Evalue=1e-117
Organism=Homo sapiens, GI270265879, Length=463, Percent_Identity=44.060475161987, Blast_Score=391, Evalue=1e-108
Organism=Escherichia coli, GI87081919, Length=565, Percent_Identity=99.646017699115, Blast_Score=1168, Evalue=0.0
Organism=Caenorhabditis elegans, GI17537199, Length=567, Percent_Identity=43.9153439153439, Blast_Score=434, Evalue=1e-122
Organism=Saccharomyces cerevisiae, GI6322823, Length=521, Percent_Identity=45.6813819577735, Blast_Score=437, Evalue=1e-123
Organism=Drosophila melanogaster, GI21356279, Length=516, Percent_Identity=44.7674418604651, Blast_Score=452, Evalue=1e-127
Organism=Drosophila melanogaster, GI281362672, Length=516, Percent_Identity=44.7674418604651, Blast_Score=451, Evalue=1e-127
Organism=Drosophila melanogaster, GI281362674, Length=516, Percent_Identity=44.7674418604651, Blast_Score=451, Evalue=1e-127
Organism=Drosophila melanogaster, GI24646388, Length=512, Percent_Identity=42.1875, Blast_Score=430, Evalue=1e-120
Organism=Drosophila melanogaster, GI24646386, Length=512, Percent_Identity=42.1875, Blast_Score=430, Evalue=1e-120
Organism=Drosophila melanogaster, GI78707242, Length=537, Percent_Identity=38.9199255121043, Blast_Score=382, Evalue=1e-106
Organism=Drosophila melanogaster, GI78707238, Length=537, Percent_Identity=38.9199255121043, Blast_Score=382, Evalue=1e-106
Organism=Drosophila melanogaster, GI281363505, Length=540, Percent_Identity=35.9259259259259, Blast_Score=379, Evalue=1e-105
Organism=Drosophila melanogaster, GI78707236, Length=540, Percent_Identity=35.9259259259259, Blast_Score=379, Evalue=1e-105
Organism=Drosophila melanogaster, GI78707232, Length=507, Percent_Identity=38.2642998027613, Blast_Score=378, Evalue=1e-105
Organism=Drosophila melanogaster, GI78707240, Length=513, Percent_Identity=39.766081871345, Blast_Score=377, Evalue=1e-105
Organism=Drosophila melanogaster, GI281363503, Length=538, Percent_Identity=35.6877323420074, Blast_Score=375, Evalue=1e-104
Organism=Drosophila melanogaster, GI19922384, Length=536, Percent_Identity=31.3432835820896, Blast_Score=275, Evalue=5e-74
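Each homologue entry above follows the same comma-separated pattern (organism, GI number, length, percent identity, BLAST score, E-value), so the field can be read into structured records. The following is a minimal sketch of one way to do that; the `Hit` type, the regular expression, and `parse_hits` are illustrative names introduced here, not part of the database or of any BLAST tooling.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Pattern for one hit as it is written in the Homologues field.
# \s* between fields tolerates both spaces and line breaks.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[Hit]:
    """Return every hit found in the raw Homologues text, in order."""
    return [
        Hit(m["organism"], m["gi"], int(m["length"]), float(m["pid"]),
            int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


# Two entries copied from the Homologues field of this record.
sample = (
    "Organism=Escherichia coli, GI87081919, Length=565, "
    "Percent_Identity=99.646017699115, Blast_Score=1168, Evalue=0.0, "
    "Organism=Saccharomyces cerevisiae, GI6322823, Length=521, "
    "Percent_Identity=45.6813819577735, Blast_Score=437, Evalue=1e-123,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.percent_identity)
print(best.organism, round(best.percent_identity, 1))  # Escherichia coli 99.6
```

Once parsed, the hits can be sorted or filtered, for example by keeping only those above a chosen percent-identity threshold.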
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR015884
- InterPro: IPR012301
- InterPro: IPR012302
- InterPro: IPR001891
- InterPro: IPR016040 [H]
Pfam domain/function: PF00390 malic; PF03949 Malic_M [H]
EC number: 1.1.1.38 [H]
Molecular weight: Translated: 63164; Mature: 63164
Theoretical pI: Translated: 4.95; Mature: 4.95
Prosite motif: PS00331 MALIC_ENZYMES
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERA
CCCCCCCCCEEECCCCCCEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH
WIQYQGFKTEIDKHIYLRNIQDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYR
HHHCCCCHHHCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCEEECCCHHHHHHHHHHHHH
RSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILGLGDQGIGGMGIPIGKLSLYT
HCCCEEEEECCCCCHHHHHHHCCCCCEEEEEEECCCEEEECCCCCCCCCCCCHHHHHHHH
ACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD
HHCCCCCCEEEEEEEEECCCCHHHHCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHCHH
VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVF
HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEE
LGAGSAGCGIAEMIIAQTQREGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRE
EECCCCCCHHHHHHHHHHHHCCCHHHHHHHHEEEHHHHHHHHHCCCCCCCHHHHHHHHHC
NLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEIIREMHKHCPRPIVMPLSNPT
CCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCH
SRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS
HHCCCCHHHEEEEECCCEEEECCCCCCCCEECCCCCCCEECCCEEEECCCCHHHHHCCHH
RITDEMLMSASETLAQYSPLVLNGEGLVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTS
HHHHHHHHHHHHHHHHCCCEEECCCCEECCCHHHHHHHHHHHHHHHHHHHHHCCCEEEHH
AEALQQAIDDNFWQAEYRDYRRTSI
HHHHHHHHCCCHHHHHHHHHHHCCC
>Mature Secondary Structure
MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERA
CCCCCCCCCEEECCCCCCEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH
WIQYQGFKTEIDKHIYLRNIQDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYR
HHHCCCCHHHCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCEEECCCHHHHHHHHHHHHH
RSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILGLGDQGIGGMGIPIGKLSLYT
HCCCEEEEECCCCCHHHHHHHCCCCCEEEEEEECCCEEEECCCCCCCCCCCCHHHHHHHH
ACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD
HHCCCCCCEEEEEEEEECCCCHHHHCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHCHH
VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVF
HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEE
LGAGSAGCGIAEMIIAQTQREGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRE
EECCCCCCHHHHHHHHHHHHCCCHHHHHHHHEEEHHHHHHHHHCCCCCCCHHHHHHHHHC
NLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEIIREMHKHCPRPIVMPLSNPT
CCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEECCCCH
SRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS
HHCCCCHHHEEEEECCCEEEECCCCCCCCEECCCCCCCEECCCEEEECCCCHHHHHCCHH
RITDEMLMSASETLAQYSPLVLNGEGLVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTS
HHHHHHHHHHHHHHHHCCCEEECCCCEECCCHHHHHHHHHHHHHHHHHHHHHCCCEEEHH
AEALQQAIDDNFWQAEYRDYRRTSI
HHHHHHHHCCCHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA