| Definition | Chlorobium tepidum TLS, complete genome. |
|---|---|
| Accession | NC_002932 |
| Length | 2,154,946 |
Click here to switch to the map view.
The map label for this gene is 21674830
Identifier: 21674830
GI number: 21674830
Start: 1915697
End: 1917892
Strand: Reverse
Name: 21674830
Synonym: CT2020
Alternate gene names: NA
Gene position: 1917892-1915697 (Counterclockwise)
Preceding gene: 21674835
Following gene: 21674829
Centisome position: 89.0
GC content: 53.28
Gene sequence:
>2196_bases ATGGCTGAACAAGTGAAACCCGCAGGGGTCAAGCCGAAGGGGACAGTGCCTCCTCCCAAAGGGAACGCACCAGCTCCGAA GGCAAATGGCGCGCCAGGCGGAGCATCGGTCATTAAAGAACAGGATGCCGCCAAAATGAGACGTTTTCTGTTTCAGCGAA CGGAAACACGCTCAACGAAGTGGTACCAGATCTTTGATACCGAAAAGCTCGATGATGAGCAGGTGGTTGGTGGCCATCTT GCCTTGCTCGGTGTTCTGGGCTTCATCATGGGGATCTACTACATCTCGGGAATCCAGGTTTTTCCCTGGGGCGCTCCGGG TTTCCATGACAACTGGTTCTACCTGACCATCAAGCCGAGAATGGTTTCGCTCGGCATCGACACCTACAGCACCAAGACTG CCGACCTCGAGGCTGCTGGTGCAAGGCTTCTCGGATGGGCAGCCTTCCATTTCCTTGTCGGTTCTGTGCTCATTTTTGGA GGATGGCGTCACTGGACGCACAATCTGACGAACCCCTTCACTGGCCGTTGCGGTAACTTCCGCGATTTCCGTTTCCTCGG CAAGTTTGGTGATGTGGTTTTCAACGGCACCAGTGCAAAGTCGTACAAAGAGGCTCTTGGGCCACACGCCGTGTACATGT CGCTTCTCTTCCTTGGCTGGGGAATCGTAATGTGGGCTATTCTCGGTTTCGCTCCGATTCCGGATTTCCAGACGATCAAC TCCGAGACCTTCATGTCGTTCGTCTTTGCCGTGATCTTCTTCGCTCTCGGCATTTACTGGTGGAACAACCCTCCGAATGC GGCTATTCACCTCAACGATGACATGAAAGCCGCCTTTTCGGTTCACCTGACCGCTATCGGCTACATCAACATTGCGCTTG GCTGTATTGCTTTCGTGGCTTTCCAGCAGCCGTCATTCGCTCCGTACTACAAGGAACTCGACAAGCTGGTCTTCTACCTC TATGGCGAACCGTTCAATCGTGTCAGCTTCAACTTTGTCGAGCAGGGCGGTAAGGTTATCTCTGGTGCGAAGGAGTTTGC CGACTTCCCTGCCTATGCCATTCTGCCCAAGAGCGGCGAGGCGTTTGGCATGGCAAGGGTTGTCACCAACCTGATTGTCT TCAACCACATTATTTGTGGTGTGCTCTATGTCTTTGCGGGCGTATATCATGGTGGTCAGTATCTCCTCAAGATCCAGCTC AACGGTATGTACAACCAGATCAAGTCGATCTGGATCACCAAGGGGCGTGATCAGGAAGTTCAGGTCAAGATTCTTGGTAC CGTCATGGCGCTTTGCTTCGCGACCATGCTTTCGGTTTATGCTGTTATTGTCTGGAACACCATCTGTGAGCTGAACATTT TCGGCACCAACATCACGATGTCGTTCTACTGGCTCAAGCCGCTGCCGATTTTCCAGTGGATGTTCGCCGATCCGAGCATC AATGACTGGGTGATGGCTCACGTTATCACGGCAGGTTCGCTCTTCTCGCTGATCGCTCTGGTTCGTATCGCTTTCTTCGC TCATACCTCTCCGCTGTGGGATGACTTGGGACTGAAGAAGAACTCTTACTCTTTCCCGTGCCTTGGCCCTGTCTATGGTG GTACTTGCGGTGTCTCTATTCAGGACCAGCTCTGGTTCGCCATGCTCTGGGGTATCAAAGGTCTCTCGGCAGTCTGCTGG TACATCGATGGCGCGTGGATCGCGTCGATGATGTACGGTGTGCCTGCTGCTGATGCAAAGGCTTGGGATTCGATTGCCCA CCTGCATCATCACTACACATCGGGTATCTTCTACTACTTCTGGACAGAGACCGTGACGATCTTCTCCAGCTCGCACCTCT CGACCATTCTTATGATCGGTCACCTGGTGTGGTTCATCAGCTTTGCTGTGTGGTTCGAGGATCGTGGTTCGCGTCTCGAA GGTGCTGACATCCAGACCCGCACTATCCGCTGGTTGGGCAAGAAGTTCCTTAACAGGGACGTTAACTTCCGATTCCCGGT ACTGACCATTTCGGATTCCAAGCTTGCCGGTACCTTCCTGTACTTTGGCGGTACTTTCATGCTCGTCTTCCTGTTCCTTG CCAACGGGTTTTATCAGACCAACTCTCCGTTGCCGCCTCCCGTCAGCCATGCAGCGGTTTCCGGACAGCAGATGCTGGCC CAGCTGGTCGATACGCTGATGAAAATGATTGCGTAA
Upstream 100 bases:
>100_bases CGTGCCGTGACACAGCGTTGACCGCATAGACGGGAAAAAGGTTGCCGGTTTTTTCGGTTTCTTCAACCTGAAAATACGTT CAAACAAAGGAGAACATACA
Downstream 100 bases:
>100_bases CATTTCGGTACGAAGGGGCTTATGCCCCTTCGGCCATCTTGAGCAATAACGTTTACCACGTAAAACAATTTTAGAGAGGG GGGTTTGTATGGCCGAACCT
Product: photosystem P840 reaction center, large subunit
Products: NA
Alternate protein names: Reaction Center Core Polypeptide Psha
Number of amino acids: Translated: 731; Mature: 730
Protein sequence:
>731_residues MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHL ALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFG GWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTIN SETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYL YGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQL NGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSI NDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCW YIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLE GADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLA QLVDTLMKMIA
Sequences:
>Translated_731_residues MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHL ALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFG GWRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTIN SETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYL YGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQL NGMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSI NDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCW YIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLE GADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLA QLVDTLMKMIA >Mature_730_residues AEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTKWYQIFDTEKLDDEQVVGGHLA LLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPRMVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGG WRHWTHNLTNPFTGRCGNFRDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTINS ETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVAFQQPSFAPYYKELDKLVFYLY GEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGEAFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLN GMYNQIKSIWITKGRDQEVQVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSIN DWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSIQDQLWFAMLWGIKGLSAVCWY IDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYFWTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEG ADIQTRTIRWLGKKFLNRDVNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLAQ LVDTLMKMIA
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 81714; Mature: 81583
Theoretical pI: Translated: 8.73; Mature: 8.73
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 4.2 %Cys+Met (Translated Protein) 1.1 %Cys (Mature Protein) 3.0 %Met (Mature Protein) 4.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTK CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC WYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPR EEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCEEEEEEECE MVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGGWRHWTHNLTNPFTGRCGNF EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCC RDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTIN HHHHHHHHHCCEEECCCCHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC SETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVA HHHHHHHHHHHHHHHHHHEEECCCCCCEEEECCCCCCHHEEEEHHHHHHHHHHHHHHHHH FQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGE HCCCCCCHHHHHHHHHEEHEECCCCCCCCHHHHHCCCHHHHCHHHHHCCCCEEEECCCCC AFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLNGMYNQIKSIWITKGRDQEV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHEEEECCCCCEE QVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSI EEHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECEEEEEEEEECCCHHHHHEECCCCC NDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSI CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCH QDQLWFAMLWGIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCEEEEE WTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIRWLGKKFLNRD EEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCC VNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLA CCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCHHHHHH QLVDTLMKMIA HHHHHHHHHHC >Mature Secondary Structure AEQVKPAGVKPKGTVPPPKGNAPAPKANGAPGGASVIKEQDAAKMRRFLFQRTETRSTK CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC WYQIFDTEKLDDEQVVGGHLALLGVLGFIMGIYYISGIQVFPWGAPGFHDNWFYLTIKPR EEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCEEEEEEECE MVSLGIDTYSTKTADLEAAGARLLGWAAFHFLVGSVLIFGGWRHWTHNLTNPFTGRCGNF EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCC RDFRFLGKFGDVVFNGTSAKSYKEALGPHAVYMSLLFLGWGIVMWAILGFAPIPDFQTIN HHHHHHHHHCCEEECCCCHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC SETFMSFVFAVIFFALGIYWWNNPPNAAIHLNDDMKAAFSVHLTAIGYINIALGCIAFVA HHHHHHHHHHHHHHHHHHEEECCCCCCEEEECCCCCCHHEEEEHHHHHHHHHHHHHHHHH FQQPSFAPYYKELDKLVFYLYGEPFNRVSFNFVEQGGKVISGAKEFADFPAYAILPKSGE HCCCCCCHHHHHHHHHEEHEECCCCCCCCHHHHHCCCHHHHCHHHHHCCCCEEEECCCCC AFGMARVVTNLIVFNHIICGVLYVFAGVYHGGQYLLKIQLNGMYNQIKSIWITKGRDQEV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHEEEECCCCCEE QVKILGTVMALCFATMLSVYAVIVWNTICELNIFGTNITMSFYWLKPLPIFQWMFADPSI EEHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECEEEEEEEEECCCHHHHHEECCCCC NDWVMAHVITAGSLFSLIALVRIAFFAHTSPLWDDLGLKKNSYSFPCLGPVYGGTCGVSI CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCH QDQLWFAMLWGIKGLSAVCWYIDGAWIASMMYGVPAADAKAWDSIAHLHHHYTSGIFYYF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCEEEEE WTETVTIFSSSHLSTILMIGHLVWFISFAVWFEDRGSRLEGADIQTRTIRWLGKKFLNRD EEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCC VNFRFPVLTISDSKLAGTFLYFGGTFMLVFLFLANGFYQTNSPLPPPVSHAAVSGQQMLA CCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHCCHHHHHH QLVDTLMKMIA HHHHHHHHHHC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA