Definition | Candidatus Solibacter usitatus Ellin6076 chromosome, complete genome. |
---|---|
Accession | NC_008536 |
Length | 9,965,640 |
The map label for this gene is 116624437
Identifier: 116624437
GI number: 116624437
Start: 6721422
End: 6722834
Strand: Reverse
Name: 116624437
Synonym: Acid_5359
Alternate gene names: NA
Gene position: 6722834-6721422 (Counterclockwise)
Preceding gene: 116624438
Following gene: 116624435
Centisome position: 67.46
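The coordinate fields above can be cross-checked with a short calculation. This is a minimal sketch, not the database's own method: it assumes 1-based inclusive coordinates, and it assumes the centisome position is the gene's first base (the End coordinate, since the gene lies on the reverse strand) expressed as a percentage of the chromosome length. Neither convention is stated in this record, and the function names are illustrative.

```python
GENOME_LENGTH = 9_965_640          # "Length" field for NC_008536
START, END = 6_721_422, 6_722_834  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
residues = length // 3 - 1  # codons minus the stop codon

print(length)                         # 1413
print(residues)                       # 470
print(centisome(END, GENOME_LENGTH))  # 67.46
```

Under these assumptions the results agree with the 1413-base gene sequence, the 470-residue protein, and the centisome position reported in this record.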
GC content: 66.88
Gene sequence:
>1413_bases GTGAAAGCGGCGGCGGCCGGAAGGGCGCGGCTGCTGCTCACCGTCGCACTGGCACTTTGGACGGCGGCCGGCCTGCGGCC GCCATCGCGGCCCGAGCCGGCACCCGCCGGGGCTCCGCCCCGCGAGTTTTCGGCGGCGCGCGCGATGGCTCACGTGCGAG CGATTGCGCAACGTCCGCATCCATTGAAGTCGGCGGATCACGCTCGGGTGCGCACGTATATCGCCGGTCAGTTCGAGGAA CTCGGAACGCCGGCCGGACTCCAGATTATGCCGGTCACCTTTCGGGGCGACACGATCGTATTGCAGAACCTGGTGGCTCG ACTGGCTGGTTCCGGCAGTACTCGACCGATCATGCTTGCGGCTCATTACGATTCCACGCGGCATGGTCCGGGCGCGGGCG ACGACGCGCATGGCGTCGCGGTGCTGCTGGAGACGCTGCGGGCGTTGCGCGCGGGTCCTCCGTTGCGCAACGATGTGATC TTCCTGGTCACCGACGGGGAGGAGGCGGGATTGCTGGGGGCGTCGGCGTTCGCCAAAGAACATCCGTGGCGCCAGGAACC CGGCGTGGTGCTGAACTTCGAAGCGCGCGGCACCGGCGGGCAGGCCACGATGTTCGAGACCAGCGCGGGCAACGAGTGGC TGATCCGCAACCTGCAGGCCGCGGCACCGTGGGCCAATGCCACTTCCTTCGCGTATGAGGTGTATCGCCGGATGCCCAAC GATACCGACCTGACGGTGTTCAAACGCGCCGGGCTCGCGGGGCTGAACTTTGCGTTCATCGAACATCCGGAGTGGTATCA CCATTCGCAGGATGACCCCGAGCATCTTGACCTGCGCAGCGTGCAGGAGCAGGGCGATTATGCACTCTCGCTGGCGCGGC AGTTCGGCGGAGTGGATCTGCGGAGGGCCGCCAGCGGCGACGCGGTTTACTTCCCGACGCGGCTCACCGGGCTGATTGTA TATTCCGGTTGGTGGGTTCTACCGCTGGCGCTGATGACAGTGCTGGCACTGGTGCTGGCCGTGCTGGTTGGATGGCGGCG GGGGAAGCGTGGACTGTGGATGGGGCGGTTGCTGGCGATTCCCGCCGGGCTGCTGCTGTTTGTTGCCAGGGCTGCTCCGG GGGCGAGCTATGTGCTGCAGTGGCCGCTTCTGGGCGGCGTAGTGGCATTGGCGGTGTTGATGACGGCGCGGGAGGAGATC GGCACGGGCTGGCGGCTGGCGGTGCTGATGGTTGTTCCCGCGACGTCGTTTCTGCTGATTGTGCCGATGCTGCGGACGCT GGTCGTCGCGTTGGGAGCGTCGGCGGGCGGGATGATTGCGGCGCTCGCGGTGGGATTTGTTTTGGTTACGGTGATGCCGC AGTTGATGGTGATTGCGCGGAGTGGAGTGGGTGCTCCGCGGAAGGCCGGGTGA
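The GC content field can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the whole sequence; the function name is illustrative, and the `>1413_bases` header must be removed before the sequence is passed in.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, with the space that the record inserts.
print(gc_content("GTGAAAGCGG CGGCGGCCGG"))  # 80.0
```

Applying the same function to the full 1413-base sequence should give a value comparable to the reported 66.88, provided the database uses the same definition.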
Upstream 100 bases:
>100_bases TGGTCTTTGGCGATGACCGGCACCAGTTTGTGGGCGTGAACTATTGCCCGTTTTGCGGGCGCGTGCTCTCGCGTGAGCTA TGGAATCTGGAAAAGAAGAA
Downstream 100 bases:
>100_bases AGGCGCCCGGATAGGCTGCCTTGCATTGCGGCAAGATGGAAGCGATCAAGCGATCGGCTCGGTGTGAGGGTGATGGCGGA GAATCCCGGCTCGTTACTCG
Product: peptidase M28
Products: NA
Alternate protein names: Peptidase; Aminopeptidase; M28 Family Peptidase; Peptidase Family; Peptidase Family M
Number of amino acids: Translated: 470; Mature: 470
Protein sequence:
>470_residues MKAAAAGRARLLLTVALALWTAAGLRPPSRPEPAPAGAPPREFSAARAMAHVRAIAQRPHPLKSADHARVRTYIAGQFEE LGTPAGLQIMPVTFRGDTIVLQNLVARLAGSGSTRPIMLAAHYDSTRHGPGAGDDAHGVAVLLETLRALRAGPPLRNDVI FLVTDGEEAGLLGASAFAKEHPWRQEPGVVLNFEARGTGGQATMFETSAGNEWLIRNLQAAAPWANATSFAYEVYRRMPN DTDLTVFKRAGLAGLNFAFIEHPEWYHHSQDDPEHLDLRSVQEQGDYALSLARQFGGVDLRRAASGDAVYFPTRLTGLIV YSGWWVLPLALMTVLALVLAVLVGWRRGKRGLWMGRLLAIPAGLLLFVARAAPGASYVLQWPLLGGVVALAVLMTAREEI GTGWRLAVLMVVPATSFLLIVPMLRTLVVALGASAGGMIAALAVGFVLVTVMPQLMVIARSGVGAPRKAG
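The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and assumes the first codon is a start codon read as methionine (the gene here begins with GTG, which otherwise encodes valine). The table layout and function name are illustrative and are not taken from the database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    bases = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate("GTGAAAGCGGCGGCGGCCGGAAGG"))  # MKAAAAGR
```

The output matches the first eight residues of the protein sequence above.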
Sequences:
>Translated_470_residues MKAAAAGRARLLLTVALALWTAAGLRPPSRPEPAPAGAPPREFSAARAMAHVRAIAQRPHPLKSADHARVRTYIAGQFEE LGTPAGLQIMPVTFRGDTIVLQNLVARLAGSGSTRPIMLAAHYDSTRHGPGAGDDAHGVAVLLETLRALRAGPPLRNDVI FLVTDGEEAGLLGASAFAKEHPWRQEPGVVLNFEARGTGGQATMFETSAGNEWLIRNLQAAAPWANATSFAYEVYRRMPN DTDLTVFKRAGLAGLNFAFIEHPEWYHHSQDDPEHLDLRSVQEQGDYALSLARQFGGVDLRRAASGDAVYFPTRLTGLIV YSGWWVLPLALMTVLALVLAVLVGWRRGKRGLWMGRLLAIPAGLLLFVARAAPGASYVLQWPLLGGVVALAVLMTAREEI GTGWRLAVLMVVPATSFLLIVPMLRTLVVALGASAGGMIAALAVGFVLVTVMPQLMVIARSGVGAPRKAG >Mature_470_residues MKAAAAGRARLLLTVALALWTAAGLRPPSRPEPAPAGAPPREFSAARAMAHVRAIAQRPHPLKSADHARVRTYIAGQFEE LGTPAGLQIMPVTFRGDTIVLQNLVARLAGSGSTRPIMLAAHYDSTRHGPGAGDDAHGVAVLLETLRALRAGPPLRNDVI FLVTDGEEAGLLGASAFAKEHPWRQEPGVVLNFEARGTGGQATMFETSAGNEWLIRNLQAAAPWANATSFAYEVYRRMPN DTDLTVFKRAGLAGLNFAFIEHPEWYHHSQDDPEHLDLRSVQEQGDYALSLARQFGGVDLRRAASGDAVYFPTRLTGLIV YSGWWVLPLALMTVLALVLAVLVGWRRGKRGLWMGRLLAIPAGLLLFVARAAPGASYVLQWPLLGGVVALAVLMTAREEI GTGWRLAVLMVVPATSFLLIVPMLRTLVVALGASAGGMIAALAVGFVLVTVMPQLMVIARSGVGAPRKAG
Specific function: Unknown
COG id: COG2234
COG function: function code R; Predicted aminopeptidases
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI55749804, Length=240, Percent_Identity=32.0833333333333, Blast_Score=123, Evalue=4e-28
Organism=Caenorhabditis elegans, GI193204254, Length=265, Percent_Identity=30.188679245283, Blast_Score=120, Evalue=1e-27
Organism=Caenorhabditis elegans, GI17531383, Length=313, Percent_Identity=27.1565495207668, Blast_Score=116, Evalue=3e-26
Organism=Saccharomyces cerevisiae, GI6319548, Length=211, Percent_Identity=29.8578199052133, Blast_Score=84, Evalue=3e-17
Organism=Drosophila melanogaster, GI45550463, Length=232, Percent_Identity=31.4655172413793, Blast_Score=130, Evalue=2e-30
Organism=Drosophila melanogaster, GI24655610, Length=232, Percent_Identity=31.4655172413793, Blast_Score=130, Evalue=2e-30
Organism=Drosophila melanogaster, GI28573565, Length=332, Percent_Identity=29.5180722891566, Blast_Score=120, Evalue=2e-27
Organism=Drosophila melanogaster, GI28573381, Length=260, Percent_Identity=31.1538461538462, Blast_Score=119, Evalue=5e-27
Organism=Drosophila melanogaster, GI24652993, Length=251, Percent_Identity=30.2788844621514, Blast_Score=119, Evalue=7e-27
Organism=Drosophila melanogaster, GI281363746, Length=366, Percent_Identity=26.5027322404372, Blast_Score=117, Evalue=2e-26
Organism=Drosophila melanogaster, GI45550464, Length=366, Percent_Identity=26.5027322404372, Blast_Score=116, Evalue=3e-26
Organism=Drosophila melanogaster, GI24652989, Length=312, Percent_Identity=29.4871794871795, Blast_Score=115, Evalue=6e-26
Organism=Drosophila melanogaster, GI24655613, Length=197, Percent_Identity=31.9796954314721, Blast_Score=114, Evalue=1e-25
Organism=Drosophila melanogaster, GI24655630, Length=232, Percent_Identity=29.3103448275862, Blast_Score=108, Evalue=6e-24
Organism=Drosophila melanogaster, GI221330185, Length=221, Percent_Identity=28.9592760180996, Blast_Score=104, Evalue=2e-22
Organism=Drosophila melanogaster, GI221330187, Length=331, Percent_Identity=29.0030211480363, Blast_Score=103, Evalue=2e-22
Organism=Drosophila melanogaster, GI28573701, Length=405, Percent_Identity=23.9506172839506, Blast_Score=100, Evalue=3e-21
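The homologue entries follow a fixed `key=value` pattern, so they can be turned into structured records for sorting or filtering. The sketch below is one possible parser, not a tool supplied by the database. The field names in the returned dictionaries are chosen here for illustration, and the pattern assumes every entry lists the six fields in the order shown above.

```python
import re

HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            # Collapse stray whitespace or line breaks inside organism names.
            "organism": " ".join(match["organism"].split()),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI55749804, Length=240, "
    "Percent_Identity=32.0833333333333, Blast_Score=123, Evalue=4e-28, "
    "Organism=Drosophila melanogaster, GI45550463, Length=232, "
    "Percent_Identity=31.4655172413793, Blast_Score=130, Evalue=2e-30"
)
best = max(parse_homologues(sample), key=lambda hit: hit["score"])
print(best["organism"], best["gi"])  # Drosophila melanogaster 45550463
```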
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 50183; Mature: 50183
Theoretical pI: Translated: 10.52; Mature: 10.52
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.0 %Met (Mature Protein)
3.0 %Cys+Met (Mature Protein)
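Percentages of this kind can be derived directly from a protein sequence. The sketch below assumes each value is the count of a residue divided by the sequence length, rounded to one decimal place; the record does not state its rounding rule, and the function name is illustrative.

```python
def residue_percentages(protein: str, residues: str = "CM") -> dict:
    """Percentage of each requested residue, rounded to one decimal place."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return {r: round(100.0 * seq.count(r) / len(seq), 1) for r in residues}


# Toy four-residue input chosen so the arithmetic is easy to check by hand.
print(residue_percentages("MKCA"))  # {'C': 25.0, 'M': 25.0}
```

Passing the 470-residue protein sequence from this record (without its `>470_residues` header) gives values that can be compared against the fields above.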
Secondary structure:
>Translated Secondary Structure
MKAAAAGRARLLLTVALALWTAAGLRPPSRPEPAPAGAPPREFSAARAMAHVRAIAQRPH
CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC
PLKSADHARVRTYIAGQFEELGTPAGLQIMPVTFRGDTIVLQNLVARLAGSGSTRPIMLA
CCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCCCCCEEEEE
AHYDSTRHGPGAGDDAHGVAVLLETLRALRAGPPLRNDVIFLVTDGEEAGLLGASAFAKE
EECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCHHHHHHC
HPWRQEPGVVLNFEARGTGGQATMFETSAGNEWLIRNLQAAAPWANATSFAYEVYRRMPN
CCCCCCCCEEEEEEECCCCCCEEEEECCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
DTDLTVFKRAGLAGLNFAFIEHPEWYHHSQDDPEHLDLRSVQEQGDYALSLARQFGGVDL
CCCCHHHHHCCCCCCEEEEEECCHHHCCCCCCCCCCCHHHHHHCCHHHHHHHHHHCCCCE
RRAASGDAVYFPTRLTGLIVYSGWWVLPLALMTVLALVLAVLVGWRRGKRGLWMGRLLAI
ECCCCCCEEEECHHHEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH
PAGLLLFVARAAPGASYVLQWPLLGGVVALAVLMTAREEIGTGWRLAVLMVVPATSFLLI
HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHH
VPMLRTLVVALGASAGGMIAALAVGFVLVTVMPQLMVIARSGVGAPRKAG
HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
MKAAAAGRARLLLTVALALWTAAGLRPPSRPEPAPAGAPPREFSAARAMAHVRAIAQRPH
CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC
PLKSADHARVRTYIAGQFEELGTPAGLQIMPVTFRGDTIVLQNLVARLAGSGSTRPIMLA
CCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCCCCCEEEEE
AHYDSTRHGPGAGDDAHGVAVLLETLRALRAGPPLRNDVIFLVTDGEEAGLLGASAFAKE
EECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCHHHHHHC
HPWRQEPGVVLNFEARGTGGQATMFETSAGNEWLIRNLQAAAPWANATSFAYEVYRRMPN
CCCCCCCCEEEEEEECCCCCCEEEEECCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
DTDLTVFKRAGLAGLNFAFIEHPEWYHHSQDDPEHLDLRSVQEQGDYALSLARQFGGVDL
CCCCHHHHHCCCCCCEEEEEECCHHHCCCCCCCCCCCHHHHHHCCHHHHHHHHHHCCCCE
RRAASGDAVYFPTRLTGLIVYSGWWVLPLALMTVLALVLAVLVGWRRGKRGLWMGRLLAI
ECCCCCCEEEECHHHEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH
PAGLLLFVARAAPGASYVLQWPLLGGVVALAVLMTAREEIGTGWRLAVLMVVPATSFLLI
HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHH
VPMLRTLVVALGASAGGMIAALAVGFVLVTVMPQLMVIARSGVGAPRKAG
HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA