Definition | Chloroflexus sp. Y-400-fl chromosome, complete genome. |
---|---|
Accession | NC_012032 |
Length | 5,268,950 |
The map label for this gene is gltA [H]
Identifier: 222527326
GI number: 222527326
Start: 5091357
End: 5092643
Strand: Reverse
Name: gltA [H]
Synonym: Chy400_4112
Alternate gene names: 222527326
Gene position: 5092643-5091357 (Counterclockwise)
Preceding gene: 222527327
Following gene: 222527322
Centisome position: 96.65
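The coordinates above are mutually consistent. The span is inclusive, so End - Start + 1 gives the 1,287 bases reported under Gene sequence. The centisome value matches the higher coordinate (5092643, the first base of this reverse-strand gene) expressed as a percentage of the 5,268,950-base chromosome. The following is a minimal Python sketch of that arithmetic. It assumes 1-based inclusive coordinates and that the centisome is measured from the gene's first base; the function names are illustrative and do not come from the source database.

```python
GENOME_LENGTH = 5_268_950  # chromosome length from the record header


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position, genome_length=GENOME_LENGTH):
    """Position on the chromosome as a percentage of its total length."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(5091357, 5092643))  # 1287
print(centisome(5092643))             # 96.65
```

Using the lower coordinate (5091357) instead gives 96.63, which is why the sketch assumes the value is taken from the reverse-strand start.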
GC content: 53.77
Gene sequence:
>1287_bases ATGACCAGGAACACTCTCACCGTAACCGACAACCGCACCGGTAAAACGTACGAGATTCCCATCGAGAACAACACCATTCG GGCAACCGATTTACGCCAGATCAAAGTATCGGAAGACGATTTTGGTTTGATGTCCTATGATCCGGCTTACCTCAATACTG CATCCTGTAAGAGTAGCATCACCTACATCGACGGTGATAAGGGGATTCTAGAGTATCGAGGTTATCCAATTGAGCAACTG GCTGAGCAGAGTTCGTATCTTGAGGTTGCGTACCTGCTGCTGTACGGGGAACTCCCCTCAAAGGAACGCCTGGCCTGGTG GGAATACCGGATCAGTCGGCATCTTTTCCTCCACAACAGCCTGGTAGAGCTGATTCAAGCCTTCCGCTACGACGCCCATC CAATGGGTATTCTCATCAGCTCCGTAGCGGCAATGTCCACGCTATACCCCGAAGCCAAGAACATCCACGATCCGGCTGTG CGCGAAAAGCAAATCTGGCGTATTATCGGCCAGATCCCGACAATTGCTGCCTTTGCCTACCGCCACCGCATCGGTCGTCC ATTCAATTTGCCCGATAGCTCGCTCAGCTACACCGCCAACCTGCTGTACATGATGGACTATATGAATCAGCGCGAGTATG AAGTAAATCCAGTGCTGGCCAAAGCGCTGGATGTGCTCTTCATTTTGCATGCTGATCACGAGCAGAACTGCTCTACCTCG GTCATGCGTAGTGTTGGCTCAAGCCACGCTGATCCGTACAATGCGCTGGCCGCTGCCGCAGCCGCTCTCTACGGCCCACT CCACGGTGGTGCCAACGAGGCGGTACTGCGCATGTTGCAGCAGATCGGCCATCCAAAGAACGTGCCGGCATTTATCGAGC GGGTGAAGAAGGGTGAAATGCGTCTCATGGGCTTTGGTCATCGCGTTTACAAGAATTACGATCCGCGGGCCAAGATCATT CGTAAGATCGCTCACGAAGTCTTTGCGGCTACTGCTGCCAACCCGCTGCTCGATGTCGCGATGGAGCTTGAGCGAACTGC ATTGGAAGACGAGTACTTTATCTCGCGCAAGCTCTATCCGAATGTGGATTTCTACAGCGGTCTGATCTATCAGGCACTGC GCTTCCCAATCGAGTACTTCCCCTTCCTGTTCGCAATTCCACGCGCTTCAGGCTGGCTGGCACAGTGGATCGAGATGCTC GAAGACCCCGAGCAGAAGATTACCCGTCCACGACAGGTCTATGTTGGCCCACAGCGGCGGGATTATGTGCCGATTGATCA GCGCTAA
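The GC content figure listed above can in principle be recomputed from the gene sequence as the share of G and C bases. The sketch below is one way to do that in Python, assuming the sequence is supplied in the same layout as this record (an optional `>` header token followed by whitespace-separated blocks). The `gc_content` helper is hypothetical and is shown on a toy input rather than the full gene.

```python
def gc_content(record):
    """Percent G+C in a sequence given as whitespace-separated blocks.

    Tokens starting with '>' (FASTA-style headers) are ignored.
    """
    tokens = record.split()
    seq = "".join(t for t in tokens if not t.startswith(">")).upper()
    if not seq:
        raise ValueError("no sequence data found")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy example: 4 of 8 bases are G or C.
print(gc_content(">8_bases GGCC AATT"))  # 50.0
```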
Upstream 100 bases:
>100_bases TAGAACTACTGTAGTACGGCCTCCATGATGTCGATCACAGTGCGGTAGCACTGATGTATCAATGGCGATGCAGTCGCAGT GAGCATGAAAGAGCATGGCT
Downstream 100 bases:
>100_bases TTGAACGATAATAATTATCAGGGCGGTCGTCAGGCCGCCCTGATGATTCGGTAGTCGGTTGGATGAGGGTGAAGACGCAG GTGGTTGTCGTGGGTGGCGA
Product: citrate synthase I
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 428; Mature: 427
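The residue counts follow from the gene length: 1,287 bases make 429 codons, the last of which (TAA) is a stop codon, leaving 428 translated residues. The mature count of 427 presumably reflects removal of the initiator methionine, since the mature sequence listed in this record begins at the second residue. Below is a minimal Python sketch of the translation step. It assumes the standard genetic code (whose amino-acid assignments are the same as bacterial translation table 11), and the `translate` helper is illustrative rather than the database's own pipeline.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the gene sequence in this record.
print(translate("ATGACCAGGAACACT"))  # MTRNT

# Codons in 1,287 bases, minus the stop codon.
print(1287 // 3 - 1)  # 428
```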
Protein sequence:
>428_residues MTRNTLTVTDNRTGKTYEIPIENNTIRATDLRQIKVSEDDFGLMSYDPAYLNTASCKSSITYIDGDKGILEYRGYPIEQL AEQSSYLEVAYLLLYGELPSKERLAWWEYRISRHLFLHNSLVELIQAFRYDAHPMGILISSVAAMSTLYPEAKNIHDPAV REKQIWRIIGQIPTIAAFAYRHRIGRPFNLPDSSLSYTANLLYMMDYMNQREYEVNPVLAKALDVLFILHADHEQNCSTS VMRSVGSSHADPYNALAAAAAALYGPLHGGANEAVLRMLQQIGHPKNVPAFIERVKKGEMRLMGFGHRVYKNYDPRAKII RKIAHEVFAATAANPLLDVAMELERTALEDEYFISRKLYPNVDFYSGLIYQALRFPIEYFPFLFAIPRASGWLAQWIEML EDPEQKITRPRQVYVGPQRRDYVPIDQR
Sequences:
>Translated_428_residues MTRNTLTVTDNRTGKTYEIPIENNTIRATDLRQIKVSEDDFGLMSYDPAYLNTASCKSSITYIDGDKGILEYRGYPIEQL AEQSSYLEVAYLLLYGELPSKERLAWWEYRISRHLFLHNSLVELIQAFRYDAHPMGILISSVAAMSTLYPEAKNIHDPAV REKQIWRIIGQIPTIAAFAYRHRIGRPFNLPDSSLSYTANLLYMMDYMNQREYEVNPVLAKALDVLFILHADHEQNCSTS VMRSVGSSHADPYNALAAAAAALYGPLHGGANEAVLRMLQQIGHPKNVPAFIERVKKGEMRLMGFGHRVYKNYDPRAKII RKIAHEVFAATAANPLLDVAMELERTALEDEYFISRKLYPNVDFYSGLIYQALRFPIEYFPFLFAIPRASGWLAQWIEML EDPEQKITRPRQVYVGPQRRDYVPIDQR >Mature_427_residues TRNTLTVTDNRTGKTYEIPIENNTIRATDLRQIKVSEDDFGLMSYDPAYLNTASCKSSITYIDGDKGILEYRGYPIEQLA EQSSYLEVAYLLLYGELPSKERLAWWEYRISRHLFLHNSLVELIQAFRYDAHPMGILISSVAAMSTLYPEAKNIHDPAVR EKQIWRIIGQIPTIAAFAYRHRIGRPFNLPDSSLSYTANLLYMMDYMNQREYEVNPVLAKALDVLFILHADHEQNCSTSV MRSVGSSHADPYNALAAAAAALYGPLHGGANEAVLRMLQQIGHPKNVPAFIERVKKGEMRLMGFGHRVYKNYDPRAKIIR KIAHEVFAATAANPLLDVAMELERTALEDEYFISRKLYPNVDFYSGLIYQALRFPIEYFPFLFAIPRASGWLAQWIEMLE DPEQKITRPRQVYVGPQRRDYVPIDQR
Specific function: Tricarboxylic acid cycle. [C]
COG id: COG0372
COG function: function code C; Citrate synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the citrate synthase family [H]
Homologues:
- Organism=Homo sapiens, GI38327625, Length=385, Percent_Identity=26.4935064935065, Blast_Score=116, Evalue=4e-26
- Organism=Escherichia coli, GI1786939, Length=384, Percent_Identity=48.9583333333333, Blast_Score=407, Evalue=1e-115
- Organism=Escherichia coli, GI1786527, Length=380, Percent_Identity=31.0526315789474, Blast_Score=179, Evalue=4e-46
- Organism=Caenorhabditis elegans, GI17555174, Length=387, Percent_Identity=26.3565891472868, Blast_Score=112, Evalue=4e-25
- Organism=Saccharomyces cerevisiae, GI6324328, Length=326, Percent_Identity=26.6871165644172, Blast_Score=97, Evalue=6e-21
- Organism=Saccharomyces cerevisiae, GI6319850, Length=326, Percent_Identity=26.9938650306748, Blast_Score=96, Evalue=8e-21
- Organism=Saccharomyces cerevisiae, GI6325257, Length=396, Percent_Identity=22.979797979798, Blast_Score=84, Evalue=5e-17
- Organism=Drosophila melanogaster, GI24640124, Length=404, Percent_Identity=26.980198019802, Blast_Score=109, Evalue=3e-24
- Organism=Drosophila melanogaster, GI24640126, Length=404, Percent_Identity=26.980198019802, Blast_Score=109, Evalue=4e-24
- Organism=Drosophila melanogaster, GI21356863, Length=395, Percent_Identity=26.0759493670886, Blast_Score=96, Evalue=4e-20
Paralogues:
None
Copy number: 624 Molecules/Cell In: Growth Phase, Glucose-minimal MOPS Media. 2,000 Molecules/Cell In: Glucose minimal media [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016142
- InterPro: IPR016143
- InterPro: IPR002020
- InterPro: IPR016141
- InterPro: IPR019810
- InterPro: IPR010953 [H]
Pfam domain/function: PF00285 Citrate_synt [H]
EC number: 2.3.3.1 [H]
Molecular weight: Translated: 49131; Mature: 49000
Theoretical pI: Translated: 7.17; Mature: 7.17
Prosite motif: PS00480 CITRATE_SYNTHASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.5 %Cys (Translated Protein)
- 3.0 %Met (Translated Protein)
- 3.5 %Cys+Met (Translated Protein)
- 0.5 %Cys (Mature Protein)
- 2.8 %Met (Mature Protein)
- 3.3 %Cys+Met (Mature Protein)
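These percentages are residue counts divided by protein length. Counting by hand in the sequence listed in this record gives 2 Cys and 13 Met among the 428 translated residues (about 0.5 % and 3.0 %), and 12 Met among the 427 mature residues once the initiator methionine is removed (about 2.8 %). The following Python sketch shows the calculation; the `residue_percent` helper is hypothetical and is demonstrated on a toy peptide, with the hand counts checked arithmetically.

```python
def residue_percent(protein, residues):
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


# Toy peptide: 1 Cys and 2 Met in 4 residues.
print(residue_percent("MACM", "C"))   # 25.0
print(residue_percent("MACM", "M"))   # 50.0
print(residue_percent("MACM", "CM"))  # 75.0

# Hand counts for this record (assumed, not computed here): 2 Cys, 13 Met in 428.
print(round(100.0 * 2 / 428, 1), round(100.0 * 13 / 428, 1))  # 0.5 3.0
```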
Secondary structure:
>Translated Secondary Structure
MTRNTLTVTDNRTGKTYEIPIENNTIRATDLRQIKVSEDDFGLMSYDPAYLNTASCKSSI
CCCCEEEEECCCCCCEEEEECCCCEEEECCCEEEEECCCCCCCEECCCCCCCCHHHCCCC
TYIDGDKGILEYRGYPIEQLAEQSSYLEVAYLLLYGELPSKERLAWWEYRISRHLFLHNS
EEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
LVELIQAFRYDAHPMGILISSVAAMSTLYPEAKNIHDPAVREKQIWRIIGQIPTIAAFAY
HHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHH
RHRIGRPFNLPDSSLSYTANLLYMMDYMNQREYEVNPVLAKALDVLFILHADHEQNCSTS
HHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHEEECCCCCCHHHH
VMRSVGSSHADPYNALAAAAAALYGPLHGGANEAVLRMLQQIGHPKNVPAFIERVKKGEM
HHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHCCCE
RLMGFGHRVYKNYDPRAKIIRKIAHEVFAATAANPLLDVAMELERTALEDEYFISRKLYP
EEEECCHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHCCCC
NVDFYSGLIYQALRFPIEYFPFLFAIPRASGWLAQWIEMLEDPEQKITRPRQVYVGPQRR
CCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHCCCCCEEEECCCCC
DYVPIDQR
CCCCCCCC
>Mature Secondary Structure
TRNTLTVTDNRTGKTYEIPIENNTIRATDLRQIKVSEDDFGLMSYDPAYLNTASCKSSI
CCCEEEEECCCCCCEEEEECCCCEEEECCCEEEEECCCCCCCEECCCCCCCCHHHCCCC
TYIDGDKGILEYRGYPIEQLAEQSSYLEVAYLLLYGELPSKERLAWWEYRISRHLFLHNS
EEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
LVELIQAFRYDAHPMGILISSVAAMSTLYPEAKNIHDPAVREKQIWRIIGQIPTIAAFAY
HHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHH
RHRIGRPFNLPDSSLSYTANLLYMMDYMNQREYEVNPVLAKALDVLFILHADHEQNCSTS
HHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHEEECCCCCCHHHH
VMRSVGSSHADPYNALAAAAAALYGPLHGGANEAVLRMLQQIGHPKNVPAFIERVKKGEM
HHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHCCCE
RLMGFGHRVYKNYDPRAKIIRKIAHEVFAATAANPLLDVAMELERTALEDEYFISRKLYP
EEEECCHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHCCCC
NVDFYSGLIYQALRFPIEYFPFLFAIPRASGWLAQWIEMLEDPEQKITRPRQVYVGPQRR
CCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHCCCCCEEEECCCCC
DYVPIDQR
CCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12597275 [H]