| Field | Value |
|---|---|
| Definition | Chloroflexus sp. Y-400-fl chromosome, complete genome. |
| Accession | NC_012032 |
| Length | 5,268,950 |
The map label for this gene is 222524550
Identifier: 222524550
GI number: 222524550
Start: 1628312
End: 1630750
Strand: Direct
Name: 222524550
Synonym: Chy400_1274
Alternate gene names: NA
Gene position: 1628312-1630750 (Clockwise)
Preceding gene: 222524549
Following gene: 222524551
Centisome position: 30.9
GC content: 56.58%
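The positional fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length (the record does not state its formula). The function names are illustrative and not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / chromosome_length


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(sequence.split()).upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from this record.
START, END, CHROMOSOME_LENGTH = 1628312, 1630750, 5268950

print(gene_length(START, END))                        # 2439
print(round(centisome(START, CHROMOSOME_LENGTH), 1))  # 30.9
```

`gc_percent` can be applied to the gene sequence listed in this record to check the GC content field; it strips the spaces that separate the 80-base blocks before counting.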
Gene sequence:
>2439_bases ATGGCCGAGCGGCTGATTTGTATTCACGGCCACTTTTATCAACCACCACGCGAAAACCCGTGGCTTGAAGCAATCGAACA GCAAGACTCGGCGTATCCATACCACGACTGGAATGAGCGGATTACTGCCGAGTGCTATGAACAAAACGCCGCTTCACGGA TTCTGGACAGCCAAAACCAGATTGTTCAGATTGTCAACAACTACAGCCGGATCAGCTTCAACTTTGGGCCAACCCTGCTG ACCTGGCTGGCCGATCACGCCCCGCAGGTGTATCAGGCTATCCTGGATGCCGATCAAGAGAGCCAGCGCTATTTCGGGGC CGGTTCAGCAATGGCCCAGTGCTACAACCACCTGATTATGCCACTGGCGGCCCGCCGTGATAAGGTGACGCAGGTCATCT GGGGGATTCAGGATTTCAAGTACCGGTTTGGCCGTGACCCGGAAGGGATGTGGTTACCGGAAACCGCCGTTGATCTTGAA ACCCTCGATATTATGGCTACGCACGGGATTCGCTTTACCATACTGGCCCCAACCCAGGTCAGCCATGTGCGCAAAATCGG CGAAATGATCTGGCACGATGTCAGCGGTGGGCGGATCGATCCCACCCAGCCGTATCTGGTCAAACTACCGGAAGGACGGT CAATCACGGTCTTCTTCTACGACGGCCCGGTGGCCCGTGCCGTTGCCTTCGAGCGCCTGCTGAGTAGCGGTGTGGGTTTT GCCAACCGTCTGGCCAGTATCTTCAACGATCAGCGTCCGTGGCCGCAATTAGCCCATATCGCCACCGACGGCGAAACATA TGGGCATCATCACCGCCATGGTGATATGGCGCTGGCCTATGCGTTACATTACATCGAGAAAACCGGGCTGGCGCAGTTGA CGAATTACGCCGCCTACTTGCAGCGCTATCGGCCAACCCACGAAGTGCAGATCATCGAGCGAACGTCGTGGAGCTGTGCG CATGGGGTCGGGCGCTGGATGAGCGATTGTGGTTGTAATACCGGTGGCAATCCGGGCTGGAATCAGGCCTGGCGTGCTCC CTTACGCGCTGCCCTCGATTGGCTCCGCGACACGATTGCCCCTCGCTTCGAGGGGTACGCCCGTCGCTTACTCGTCGATC CATGGGCTGCCCGCGATGACTACATCAGCGTCATTTTGAATCGTTCCCCGGAGAACATTGCCGCCTTCATTGGCCGTCAC GGTCGTGGCCGGCTCGATGAGGGTCAGCGGATCGCGGTGCTGAAGCTGATGGAATTGCAGCGCCACTTGATGCTCATGTA TACCTCGTGTGGCTGGTTCTTCGACGATCTGAGCGGGATTGAGACCATTCAGGTCATGATGTACGCCGGACGGGCACTGC AACTGGCGCACGATCTGTTTGGCGAGGATTTCGAGGCAGAATTTCTTGATCGCCTGGCGCAGGCGCGGAGCAATCTACCG GCCCGTGGCAATGGTCGCGATCTCTACGAACGCCATGTACGCCCGGCAATGGTCGACCTGCGTAAAGTGGGCGCTCACTA CGCCATGAAGACCCTCTTCAACGGTGTAGGTGAGCGTGAACAAATCTACGCCTACACAGTTGACCGTGAAGATTACCATC TGTTGCTCGCCGGTAAAGCCCGCCTGGCCCTGGGACGGATTCACATTCAGTCGACGATCACCGGTGAATCGACCCGCCTG AGCTTTGGCGTACTGCATCTCGGCGATCACAACATTTCGGGTGGCGTCCGGGCGTACCAGGGCGAACAGTCATACCGACA ACTGATTGAAGAGTTAAGCGAACTCTTTCTGCGGGCGGATATTCCGGGTGTGATCCGCATGGTTGATCGGAATTTCGGTC AGGAACAGTATTCGCTCAAGCTGCTCTTCGGTGATGAGCAGCGCCAGATTCTGCATCGCATCCTCACCTCAAGTCTGGCC 
GAGGCGGAAGCTGCGTATCGTCAAATCTACGAAAACCATGCCCCACTCATGCGGTTTCTTGCCAGTATAGGTATGCCGGT GCCGCGTGAATTTCAGATCGCCGCCGAATTTGCCATCAATACCGAACTACGCCGCCTGTTCGAGGCTGAGCCGCTCGATT TTGATCGGATCAATAGCCTGTTACGAGAAGCCCAACGGTCGGGCGTTACCCTCGATAGCGACGGACTGAGTTACGCGCTA TCGCGGACAATCCGCTCAATCAGCGAGCAATTCCAGTTAACGCCAGAAGACCGTGGATTGTTGACGCAGCTCGATGCCGC CGTTGGCCTGGCCCGCAACCTGCCTTTCGAGGTTGATGTCTGGCATACGCAAAATGTCTACTACGAGTTGTTGCAAACTG TGTATCCGCAAATGAACATTGAAGCAGGTGAAGGCTATGCTGATGCCCATGCGTGGTTACGCCTCTTTCGATCATTGGGT GTCAAGTTACGCTTTCGCCTGCCACCAGGAGAGCCATGA
Upstream 100 bases:
>100_bases AACCTTCACCAATGGATGGGTAACCTGTGTTGGTCACAGTGCTGTCTTACTCACAGCAGAAGCGTCATAACCGTTTACGA GTATTCTGGGAGCGTGCATC
Downstream 100 bases:
>100_bases TTGATGTAGAACTGCGTGTTCCGCGTGCAACCTATCGCCTACAGTTGAATGCCGACCTGACCTTTGCCGACGTTGCTCGC TACGTCCCCTACTTCGTTGA
Product: glycoside hydrolase family protein
Products: NA
Alternate protein names: Glycoside Hydrolase Family Protein; Glycosyl Hydrolase Family; Glycosy Hydrolase Family Protein; Glycoside Hydrolase; Family; 4-Alpha-Glucanotransferase
Number of amino acids: Translated: 812; Mature: 811
Protein sequence:
>812_residues MAERLICIHGHFYQPPRENPWLEAIEQQDSAYPYHDWNERITAECYEQNAASRILDSQNQIVQIVNNYSRISFNFGPTLL TWLADHAPQVYQAILDADQESQRYFGAGSAMAQCYNHLIMPLAARRDKVTQVIWGIQDFKYRFGRDPEGMWLPETAVDLE TLDIMATHGIRFTILAPTQVSHVRKIGEMIWHDVSGGRIDPTQPYLVKLPEGRSITVFFYDGPVARAVAFERLLSSGVGF ANRLASIFNDQRPWPQLAHIATDGETYGHHHRHGDMALAYALHYIEKTGLAQLTNYAAYLQRYRPTHEVQIIERTSWSCA HGVGRWMSDCGCNTGGNPGWNQAWRAPLRAALDWLRDTIAPRFEGYARRLLVDPWAARDDYISVILNRSPENIAAFIGRH GRGRLDEGQRIAVLKLMELQRHLMLMYTSCGWFFDDLSGIETIQVMMYAGRALQLAHDLFGEDFEAEFLDRLAQARSNLP ARGNGRDLYERHVRPAMVDLRKVGAHYAMKTLFNGVGEREQIYAYTVDREDYHLLLAGKARLALGRIHIQSTITGESTRL SFGVLHLGDHNISGGVRAYQGEQSYRQLIEELSELFLRADIPGVIRMVDRNFGQEQYSLKLLFGDEQRQILHRILTSSLA EAEAAYRQIYENHAPLMRFLASIGMPVPREFQIAAEFAINTELRRLFEAEPLDFDRINSLLREAQRSGVTLDSDGLSYAL SRTIRSISEQFQLTPEDRGLLTQLDAAVGLARNLPFEVDVWHTQNVYYELLQTVYPQMNIEAGEGYADAHAWLRLFRSLG VKLRFRLPPGEP
Sequences:
>Translated_812_residues MAERLICIHGHFYQPPRENPWLEAIEQQDSAYPYHDWNERITAECYEQNAASRILDSQNQIVQIVNNYSRISFNFGPTLL TWLADHAPQVYQAILDADQESQRYFGAGSAMAQCYNHLIMPLAARRDKVTQVIWGIQDFKYRFGRDPEGMWLPETAVDLE TLDIMATHGIRFTILAPTQVSHVRKIGEMIWHDVSGGRIDPTQPYLVKLPEGRSITVFFYDGPVARAVAFERLLSSGVGF ANRLASIFNDQRPWPQLAHIATDGETYGHHHRHGDMALAYALHYIEKTGLAQLTNYAAYLQRYRPTHEVQIIERTSWSCA HGVGRWMSDCGCNTGGNPGWNQAWRAPLRAALDWLRDTIAPRFEGYARRLLVDPWAARDDYISVILNRSPENIAAFIGRH GRGRLDEGQRIAVLKLMELQRHLMLMYTSCGWFFDDLSGIETIQVMMYAGRALQLAHDLFGEDFEAEFLDRLAQARSNLP ARGNGRDLYERHVRPAMVDLRKVGAHYAMKTLFNGVGEREQIYAYTVDREDYHLLLAGKARLALGRIHIQSTITGESTRL SFGVLHLGDHNISGGVRAYQGEQSYRQLIEELSELFLRADIPGVIRMVDRNFGQEQYSLKLLFGDEQRQILHRILTSSLA EAEAAYRQIYENHAPLMRFLASIGMPVPREFQIAAEFAINTELRRLFEAEPLDFDRINSLLREAQRSGVTLDSDGLSYAL SRTIRSISEQFQLTPEDRGLLTQLDAAVGLARNLPFEVDVWHTQNVYYELLQTVYPQMNIEAGEGYADAHAWLRLFRSLG VKLRFRLPPGEP >Mature_811_residues AERLICIHGHFYQPPRENPWLEAIEQQDSAYPYHDWNERITAECYEQNAASRILDSQNQIVQIVNNYSRISFNFGPTLLT WLADHAPQVYQAILDADQESQRYFGAGSAMAQCYNHLIMPLAARRDKVTQVIWGIQDFKYRFGRDPEGMWLPETAVDLET LDIMATHGIRFTILAPTQVSHVRKIGEMIWHDVSGGRIDPTQPYLVKLPEGRSITVFFYDGPVARAVAFERLLSSGVGFA NRLASIFNDQRPWPQLAHIATDGETYGHHHRHGDMALAYALHYIEKTGLAQLTNYAAYLQRYRPTHEVQIIERTSWSCAH GVGRWMSDCGCNTGGNPGWNQAWRAPLRAALDWLRDTIAPRFEGYARRLLVDPWAARDDYISVILNRSPENIAAFIGRHG RGRLDEGQRIAVLKLMELQRHLMLMYTSCGWFFDDLSGIETIQVMMYAGRALQLAHDLFGEDFEAEFLDRLAQARSNLPA RGNGRDLYERHVRPAMVDLRKVGAHYAMKTLFNGVGEREQIYAYTVDREDYHLLLAGKARLALGRIHIQSTITGESTRLS FGVLHLGDHNISGGVRAYQGEQSYRQLIEELSELFLRADIPGVIRMVDRNFGQEQYSLKLLFGDEQRQILHRILTSSLAE AEAAYRQIYENHAPLMRFLASIGMPVPREFQIAAEFAINTELRRLFEAEPLDFDRINSLLREAQRSGVTLDSDGLSYALS RTIRSISEQFQLTPEDRGLLTQLDAAVGLARNLPFEVDVWHTQNVYYELLQTVYPQMNIEAGEGYADAHAWLRLFRSLGV KLRFRLPPGEP
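The residue counts are consistent with the gene length: 2439 bases give 813 codons, the last of which is the TGA stop codon, leaving 812 residues. The sketch below illustrates this, assuming that the mature form differs from the translated form only by removal of the initiator methionine, as the two sequences listed here suggest. The function names are hypothetical.

```python
def translated_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1  # the final codon is the stop codon


def mature_sequence(translated: str) -> str:
    """Drop a leading initiator Met (assumed to be the only processing step)."""
    return translated[1:] if translated.startswith("M") else translated


print(translated_length(2439))        # 812
print(mature_sequence("MAERLICIHG"))  # AERLICIHG
```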
Specific function: Unknown
COG id: COG1449
COG function: function code G; Alpha-amylase/alpha-mannosidase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 92759; Mature: 92628
Theoretical pI: Translated: 6.48; Mature: 6.48
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.9% Cys; 2.3% Met; 3.2% Cys+Met
Mature protein: 0.9% Cys; 2.2% Met; 3.1% Cys+Met
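Composition percentages of this kind are residue counts divided by sequence length. A minimal sketch follows, using only the first 20 residues of the translated sequence as input, so the printed values describe that fragment and not the full protein. `residue_percent` is an illustrative helper, not a function from any library.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = "".join(sequence.split()).upper()
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# First 20 residues of the translated protein in this record.
fragment = "MAERLICIHGHFYQPPRENP"

print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 10.0
```

Applying the same function to the full translated and mature sequences gives the per-form values; the mature form has one fewer Met if only the initiator methionine is removed.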
Secondary structure:
>Translated Secondary Structure MAERLICIHGHFYQPPRENPWLEAIEQQDSAYPYHDWNERITAECYEQNAASRILDSQNQ CCCCEEEEECCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHH IVQIVNNYSRISFNFGPTLLTWLADHAPQVYQAILDADQESQRYFGAGSAMAQCYNHLIM HHHHHHCCCEEEECCCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHH PLAARRDKVTQVIWGIQDFKYRFGRDPEGMWLPETAVDLETLDIMATHGIRFTILAPTQV HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHEECCEEEEEECCHHH SHVRKIGEMIWHDVSGGRIDPTQPYLVKLPEGRSITVFFYDGPVARAVAFERLLSSGVGF HHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEEECCCHHHHHHHHHHHHHCCCH ANRLASIFNDQRPWPQLAHIATDGETYGHHHRHGDMALAYALHYIEKTGLAQLTNYAAYL HHHHHHHHCCCCCCCHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH QRYRPTHEVQIIERTSWSCAHGVGRWMSDCGCNTGGNPGWNQAWRAPLRAALDWLRDTIA HHCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC PRFEGYARRLLVDPWAARDDYISVILNRSPENIAAFIGRHGRGRLDEGQRIAVLKLMELQ HHHHHHHHHHHCCCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCCCCEEHHHHHHHHH RHLMLMYTSCGWFFDDLSGIETIQVMMYAGRALQLAHDLFGEDFEAEFLDRLAQARSNLP HHHHHHHHHCCHHHHHHCCHHHHHHHHHHCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCC ARGNGRDLYERHVRPAMVDLRKVGAHYAMKTLFNGVGEREQIYAYTVDREDYHLLLAGKA CCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEEEEECCC RLALGRIHIQSTITGESTRLSFGVLHLGDHNISGGVRAYQGEQSYRQLIEELSELFLRAD CEEEEEEEEEEEECCCCCEEEEEEEEECCCCCCCCCEECCCHHHHHHHHHHHHHHHHHCC IPGVIRMVDRNFGQEQYSLKLLFGDEQRQILHRILTSSLAEAEAAYRQIYENHAPLMRFL CHHHHHHHHHCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH ASIGMPVPREFQIAAEFAINTELRRLFEAEPLDFDRINSLLREAQRSGVTLDSDGLSYAL HHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCCHHHHH SRTIRSISEQFQLTPEDRGLLTQLDAAVGLARNLPFEVDVWHTQNVYYELLQTVYPQMNI HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHCCCCCC EAGEGYADAHAWLRLFRSLGVKLRFRLPPGEP CCCCCHHHHHHHHHHHHHCCCEEEEECCCCCC >Mature Secondary Structure AERLICIHGHFYQPPRENPWLEAIEQQDSAYPYHDWNERITAECYEQNAASRILDSQNQ CCCEEEEECCCCCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHH IVQIVNNYSRISFNFGPTLLTWLADHAPQVYQAILDADQESQRYFGAGSAMAQCYNHLIM HHHHHHCCCEEEECCCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHH 
PLAARRDKVTQVIWGIQDFKYRFGRDPEGMWLPETAVDLETLDIMATHGIRFTILAPTQV HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHEECCEEEEEECCHHH SHVRKIGEMIWHDVSGGRIDPTQPYLVKLPEGRSITVFFYDGPVARAVAFERLLSSGVGF HHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEEECCCHHHHHHHHHHHHHCCCH ANRLASIFNDQRPWPQLAHIATDGETYGHHHRHGDMALAYALHYIEKTGLAQLTNYAAYL HHHHHHHHCCCCCCCHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH QRYRPTHEVQIIERTSWSCAHGVGRWMSDCGCNTGGNPGWNQAWRAPLRAALDWLRDTIA HHCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC PRFEGYARRLLVDPWAARDDYISVILNRSPENIAAFIGRHGRGRLDEGQRIAVLKLMELQ HHHHHHHHHHHCCCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCCCCEEHHHHHHHHH RHLMLMYTSCGWFFDDLSGIETIQVMMYAGRALQLAHDLFGEDFEAEFLDRLAQARSNLP HHHHHHHHHCCHHHHHHCCHHHHHHHHHHCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCC ARGNGRDLYERHVRPAMVDLRKVGAHYAMKTLFNGVGEREQIYAYTVDREDYHLLLAGKA CCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEEEEECCC RLALGRIHIQSTITGESTRLSFGVLHLGDHNISGGVRAYQGEQSYRQLIEELSELFLRAD CEEEEEEEEEEEECCCCCEEEEEEEEECCCCCCCCCEECCCHHHHHHHHHHHHHHHHHCC IPGVIRMVDRNFGQEQYSLKLLFGDEQRQILHRILTSSLAEAEAAYRQIYENHAPLMRFL CHHHHHHHHHCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH ASIGMPVPREFQIAAEFAINTELRRLFEAEPLDFDRINSLLREAQRSGVTLDSDGLSYAL HHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCCHHHHH SRTIRSISEQFQLTPEDRGLLTQLDAAVGLARNLPFEVDVWHTQNVYYELLQTVYPQMNI HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHCCCCCC EAGEGYADAHAWLRLFRSLGVKLRFRLPPGEP CCCCCHHHHHHHHHHHHHCCCEEEEECCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA