Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108

The map label for this gene is 113476499

Identifier: 113476499

GI number: 113476499

Start: 4535777

End: 4537462

Strand: Reverse

Name: 113476499

Synonym: Tery_2921

Alternate gene names: NA

Gene position: 4537462-4535777 (Counterclockwise)

Preceding gene: 113476500

Following gene: 113476494

Centisome position: 58.55
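
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the centisome convention (position of the gene's first base as a percentage of chromosome length, taken from the End coordinate for a reverse-strand gene) is an assumption inferred from the fact that it reproduces the reported value.

```python
# Values copied from the record; coordinates are 1-based and inclusive.
GENOME_LENGTH = 7_750_108
START, END = 4_535_777, 4_537_462
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Percent of the chromosome at which the gene begins.

    Assumption: the gene's first base is `end` on the reverse strand
    and `start` on the forward strand.
    """
    first_base = end if strand == "Reverse" else start
    return round(100.0 * first_base / genome_length, 2)


print(gene_length(START, END))                       # 1686
print(centisome(START, END, STRAND, GENOME_LENGTH))  # 58.55
```

Under this assumption both results agree with the record: 1686 bases and a centisome position of 58.55.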

GC content: 36.83
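
GC content is presumably the percentage of G and C bases in the 1,686-base gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a wrapped
    sequence block can be pasted in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0

# Consistency check on the reported figure: 621 G/C bases out of 1686
# is the only whole-number count that rounds to 36.83.
print(round(100.0 * 621 / 1686, 2))  # 36.83
```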

Gene sequence:

>1686_bases
ATGACAAAATCAGGGTTTGATTTTAGCCACTACTATAAATATCAAGAAATAGTAAATTTTCTACATCAAATGAGGGAAAA
AAATCCTCATTTAATAGAATTAAAAGTTATTGGAAAAAGCTACGGCGGACTAGATATCTTCTTAGCTACTCTTACCAATC
AAAATACAGGAAAAGCTCGGGAAAAACCTGGATATTGGATTGATGGTAACATTCATGCGGTAGAAGTTACAGGTTCAGCA
GTTGCCCTATATATTATTTATCATCTACTAAATAATTATAATAGCAATCCTCAAGTCACTTATCTATTGGACAACCACAC
AATTTATATACTACCTCGAATTGCTGTTGATGGGGCCGAAAAATATTTAACAACTCCTTATATTGTGCGCTCAAGTATTC
GTCATTATCCCTATCCTGAAGAAAAGGATGGTCTTCACTGGGAAGACATAAATGGAGATGGATTAATTTTGCAAATGCGA
CTCAAAGATAACTGTGGGGCTTGGAAAATCTCATCAGAAGACCCAAGAATTATGGTGCCTCGCGAACCCGATGAATTTGG
AGGTACCTACTACAGTATTTTACCAGAGGGAATGATTAAAAACTATGATGGTTACAATATTAAAGTCGCCCCTAGTAAGG
GAGGAATAGATTTTAATCGTAACTATCCCCACGAATGGCAGCCAGAGGGAAAACAAAAAGGTGCTGGGGATTTTCCTTTT
TCTGAACCTGAAACTCTTGCTGTAGCAGAATTTTGGCGAGAACATCCTAATATTAATGGTTTTATTAACTATCATACGTT
TTCTGGAGTAATTTTACGTTCCTACTGTACTTATCCAGACGAACATTTTCCTGTAAAAGATTTGGAAATTTACAAATTGA
TAGGGGAGAAAGGTACTGCAATTACCGGTTATGAATGCATTTCAATCTATCATAACTTTTTGTACCCTAACAGTGAAAAA
ATTTATGGTGGAATGGTTGACTATTGTTATGATCGCTTTGGTTGGTTTGGTTTTTCTATAGAGCTTTGGGATGCACCCAC
AGAAGCAGGTGTTAAGAAGAATGATTATATTAAGTGGTTACAACGCCATCCTGTTACAGACGATCTAAAAATGATGCAGT
GGAATGATGAAAAATTGGGTGGCACAGGTTTTATTGACTGGGAAACATTTGAACATCCCCAACTGGGTACGGTTGAAATT
GGTGGGTGGAATTGGAAAATTTGGTACAACCCACCTGTTGAATATTTACCAAAGTTGTGTAAGGAACAATGTCAGTTTGC
TATTTCTCATGCTTTGATGTCACCTCGTTTAGCAGTAAGTCAGGTAGTTGTTGAGCATCAGGGTGGTGATATGTATCATC
TGGTAGTACAGTTAGAAAATCAAGGGTTTTTGCCTACCTATACTAGTGAAAAAGCTTTGGAACAAAAAAGTGTGAAACCA
ATAGAGGTAATATTAAATTTACCTGGTAATGTAACGTTGGTGAGTGGAAAACAAAAGCAGGAAATTGAACATTTAGAGGG
GCGAGCAAGTGAAGCTTTTAATTTTTTCAGTTTTTTTACTAAACTTGGAGGTTATCGTTGTCATTTAGAGTGGGTAGTCA
AAGGTGTTGCTAATAGTGAGATTAAGGTGACAGCAAAAGCGGAACGAGCAGGTGTAGTGAAAACTGTGATTACTTTGCCA
GAGTAA
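
The gene is annotated on the reverse strand, yet the sequence above begins with ATG and ends with TAA, so it appears to be shown 5' to 3' on the coding strand. Recovering it from a forward-strand chromosome sequence would therefore involve a reverse complement. The sketch below assumes 1-based inclusive coordinates; `extract_gene` and `reverse_complement` are hypothetical helper names.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Return the coding-strand sequence for 1-based inclusive coordinates."""
    region = genome[start - 1:end]
    return reverse_complement(region) if strand == "Reverse" else region


# Toy chromosome: the reverse-strand gene "ATG" occupies positions 3-5.
toy_genome = "AACATGG"
print(extract_gene(toy_genome, 3, 5, "Reverse"))  # ATG
```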

Upstream 100 bases:

>100_bases
GGGCTAATAATTATACTCAAAGCTCATAATTTTAACTTCACACTTAGTTTAATAATGCTGAAAAATACAAACTTAATTTT
TAAGGATTAAATAACTTAAT

Downstream 100 bases:

>100_bases
ATTAACATAAAACCCTTGGCTGCATCTTCATCCCGTGACCGAATATAAACACCAATAATTTCATAAGTTTCTACTTCCAT
GGCGAACCAAACCTACTGCT

Product: peptidase M14, carboxypeptidase A

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 561; Mature: 560
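
The two counts follow from the gene length: 1,686 bases give 562 codons, the last of which is a stop codon, leaving 561 translated residues; the mature form listed below lacks the initial methionine, giving 560. The sketch below illustrates this with the standard genetic code, which is an assumption here (the record does not state which translation table was used).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initiator methionine, as the record's mature form does."""
    return protein[1:] if protein.startswith("M") else protein


print(translate("ATGACAAAATCAGGGTTT"))  # MTKSGF (first six codons of the gene)
print(translate("TTGCCAGAGTAA"))        # LPE    (last three codons plus stop)
print(1686 // 3 - 1)                    # 561
```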

Protein sequence:

>561_residues
MTKSGFDFSHYYKYQEIVNFLHQMREKNPHLIELKVIGKSYGGLDIFLATLTNQNTGKAREKPGYWIDGNIHAVEVTGSA
VALYIIYHLLNNYNSNPQVTYLLDNHTIYILPRIAVDGAEKYLTTPYIVRSSIRHYPYPEEKDGLHWEDINGDGLILQMR
LKDNCGAWKISSEDPRIMVPREPDEFGGTYYSILPEGMIKNYDGYNIKVAPSKGGIDFNRNYPHEWQPEGKQKGAGDFPF
SEPETLAVAEFWREHPNINGFINYHTFSGVILRSYCTYPDEHFPVKDLEIYKLIGEKGTAITGYECISIYHNFLYPNSEK
IYGGMVDYCYDRFGWFGFSIELWDAPTEAGVKKNDYIKWLQRHPVTDDLKMMQWNDEKLGGTGFIDWETFEHPQLGTVEI
GGWNWKIWYNPPVEYLPKLCKEQCQFAISHALMSPRLAVSQVVVEHQGGDMYHLVVQLENQGFLPTYTSEKALEQKSVKP
IEVILNLPGNVTLVSGKQKQEIEHLEGRASEAFNFFSFFTKLGGYRCHLEWVVKGVANSEIKVTAKAERAGVVKTVITLP
E

Sequences:

>Translated_561_residues
MTKSGFDFSHYYKYQEIVNFLHQMREKNPHLIELKVIGKSYGGLDIFLATLTNQNTGKAREKPGYWIDGNIHAVEVTGSA
VALYIIYHLLNNYNSNPQVTYLLDNHTIYILPRIAVDGAEKYLTTPYIVRSSIRHYPYPEEKDGLHWEDINGDGLILQMR
LKDNCGAWKISSEDPRIMVPREPDEFGGTYYSILPEGMIKNYDGYNIKVAPSKGGIDFNRNYPHEWQPEGKQKGAGDFPF
SEPETLAVAEFWREHPNINGFINYHTFSGVILRSYCTYPDEHFPVKDLEIYKLIGEKGTAITGYECISIYHNFLYPNSEK
IYGGMVDYCYDRFGWFGFSIELWDAPTEAGVKKNDYIKWLQRHPVTDDLKMMQWNDEKLGGTGFIDWETFEHPQLGTVEI
GGWNWKIWYNPPVEYLPKLCKEQCQFAISHALMSPRLAVSQVVVEHQGGDMYHLVVQLENQGFLPTYTSEKALEQKSVKP
IEVILNLPGNVTLVSGKQKQEIEHLEGRASEAFNFFSFFTKLGGYRCHLEWVVKGVANSEIKVTAKAERAGVVKTVITLP
E
>Mature_560_residues
TKSGFDFSHYYKYQEIVNFLHQMREKNPHLIELKVIGKSYGGLDIFLATLTNQNTGKAREKPGYWIDGNIHAVEVTGSAV
ALYIIYHLLNNYNSNPQVTYLLDNHTIYILPRIAVDGAEKYLTTPYIVRSSIRHYPYPEEKDGLHWEDINGDGLILQMRL
KDNCGAWKISSEDPRIMVPREPDEFGGTYYSILPEGMIKNYDGYNIKVAPSKGGIDFNRNYPHEWQPEGKQKGAGDFPFS
EPETLAVAEFWREHPNINGFINYHTFSGVILRSYCTYPDEHFPVKDLEIYKLIGEKGTAITGYECISIYHNFLYPNSEKI
YGGMVDYCYDRFGWFGFSIELWDAPTEAGVKKNDYIKWLQRHPVTDDLKMMQWNDEKLGGTGFIDWETFEHPQLGTVEIG
GWNWKIWYNPPVEYLPKLCKEQCQFAISHALMSPRLAVSQVVVEHQGGDMYHLVVQLENQGFLPTYTSEKALEQKSVKPI
EVILNLPGNVTLVSGKQKQEIEHLEGRASEAFNFFSFFTKLGGYRCHLEWVVKGVANSEIKVTAKAERAGVVKTVITLPE

Specific function: Unknown

COG id: COG2866

COG function: function code E; Predicted carboxypeptidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI188536067, Length=115, Percent_Identity=33.0434782608696, Blast_Score=71, Evalue=2e-12
Organism=Caenorhabditis elegans, GI25143424, Length=322, Percent_Identity=23.6024844720497, Blast_Score=88, Evalue=1e-17
Organism=Caenorhabditis elegans, GI32563693, Length=126, Percent_Identity=30.1587301587302, Blast_Score=67, Evalue=4e-11
Organism=Drosophila melanogaster, GI221330951, Length=133, Percent_Identity=35.3383458646617, Blast_Score=68, Evalue=2e-11
Organism=Drosophila melanogaster, GI221372162, Length=108, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=4e-11
Organism=Drosophila melanogaster, GI221372169, Length=108, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=4e-11
Organism=Drosophila melanogaster, GI221372165, Length=108, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=4e-11
Organism=Drosophila melanogaster, GI221372158, Length=108, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=4e-11
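
Each homologue line is a comma-separated list of key=value fields, with the GI number given as a bare `GI…` token. One possible way to read such a line into a typed record is sketched below; `parse_homologue` is a hypothetical helper, and the parser tolerates an optional trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare token such as GI188536067
    return {
        "organism": fields["Organism"],
        "gi": fields["GI"],
        "length": int(fields["Length"]),
        "percent_identity": float(fields["Percent_Identity"]),
        "blast_score": int(fields["Blast_Score"]),
        "evalue": float(fields["Evalue"]),
    }


hit = parse_homologue(
    "Organism=Homo sapiens, GI188536067, Length=115, "
    "Percent_Identity=33.0434782608696, Blast_Score=71, Evalue=2e-12"
)
print(hit["organism"], hit["length"], round(hit["percent_identity"], 2))
```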

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 64359; Mature: 64228
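
The 131 Da difference between the two weights is consistent with removal of the N-terminal methionine, as the mature sequence above suggests. The check below is a rough sketch that assumes an average methionine residue mass of about 131.19 Da; the record does not say how the weights were calculated.

```python
MET_RESIDUE_MASS = 131.19  # average mass of a methionine residue, in Da (assumed)

translated_mw = 64359  # reported weight of the translated protein
mature_estimate = round(translated_mw - MET_RESIDUE_MASS)

print(mature_estimate)  # 64228
```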

Theoretical pI: Translated: 6.09; Mature: 6.09

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
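
These percentages are presumably residue counts divided by protein length. A minimal sketch of that calculation follows, using a hypothetical helper name and a toy peptide rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


toy = "MCAA"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```

Pasting the Translated or Mature sequence from this record in place of `toy` should give figures comparable to the ones listed, subject to rounding.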

Secondary structure:

>Translated Secondary Structure
MTKSGFDFSHYYKYQEIVNFLHQMREKNPHLIELKVIGKSYGGLDIFLATLTNQNTGKAR
CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCEEEEEEEECCCCCCCCC
EKPGYWIDGNIHAVEVTGSAVALYIIYHLLNNYNSNPQVTYLLDNHTIYILPRIAVDGAE
CCCCEEECCEEEEEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCEEEEEEEEEECCCH
KYLTTPYIVRSSIRHYPYPEEKDGLHWEDINGDGLILQMRLKDNCGAWKISSEDPRIMVP
HHCCCHHHHHHHHHCCCCCCCCCCCEEEECCCCEEEEEEEECCCCCEEEECCCCCEEECC
REPDEFGGTYYSILPEGMIKNYDGYNIKVAPSKGGIDFNRNYPHEWQPEGKQKGAGDFPF
CCCHHHCCEEEEECCCHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
SEPETLAVAEFWREHPNINGFINYHTFSGVILRSYCTYPDEHFPVKDLEIYKLIGEKGTA
CCCCHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHCCCCCCCCCCCCEEEEEEECCCCCE
ITGYECISIYHNFLYPNSEKIYGGMVDYCYDRFGWFGFSIELWDAPTEAGVKKNDYIKWL
EECHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCEEEEEEEEECCCCCCCCCCHHHHHHH
QRHPVTDDLKMMQWNDEKLGGTGFIDWETFEHPQLGTVEIGGWNWKIWYNPPVEYLPKLC
HHCCCCCCCEEEEECCCCCCCCEEECCCCCCCCCCCEEEECCEEEEEEECCCHHHHHHHH
KEQCQFAISHALMSPRLAVSQVVVEHQGGDMYHLVVQLENQGFLPTYTSEKALEQKSVKP
HHHHHHHHHHHHHCCCHHHHHHHEECCCCCEEEEEEEECCCCCCCCCCCHHHHHHHCCCE
IEVILNLPGNVTLVSGKQKQEIEHLEGRASEAFNFFSFFTKLGGYRCHLEWVVKGVANSE
EEEEEECCCCEEEECCCHHHHHHHHCCCHHHHHHHHHHHHHHCCEEEEEEHHHHCCCCCE
IKVTAKAERAGVVKTVITLPE
EEEEEECCCCCEEEEEEECCC
>Mature Secondary Structure
TKSGFDFSHYYKYQEIVNFLHQMREKNPHLIELKVIGKSYGGLDIFLATLTNQNTGKAR
CCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCEEEEEEEECCCCCCCCC
EKPGYWIDGNIHAVEVTGSAVALYIIYHLLNNYNSNPQVTYLLDNHTIYILPRIAVDGAE
CCCCEEECCEEEEEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCEEEEEEEEEECCCH
KYLTTPYIVRSSIRHYPYPEEKDGLHWEDINGDGLILQMRLKDNCGAWKISSEDPRIMVP
HHCCCHHHHHHHHHCCCCCCCCCCCEEEECCCCEEEEEEEECCCCCEEEECCCCCEEECC
REPDEFGGTYYSILPEGMIKNYDGYNIKVAPSKGGIDFNRNYPHEWQPEGKQKGAGDFPF
CCCHHHCCEEEEECCCHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
SEPETLAVAEFWREHPNINGFINYHTFSGVILRSYCTYPDEHFPVKDLEIYKLIGEKGTA
CCCCHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHCCCCCCCCCCCCEEEEEEECCCCCE
ITGYECISIYHNFLYPNSEKIYGGMVDYCYDRFGWFGFSIELWDAPTEAGVKKNDYIKWL
EECHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCEEEEEEEEECCCCCCCCCCHHHHHHH
QRHPVTDDLKMMQWNDEKLGGTGFIDWETFEHPQLGTVEIGGWNWKIWYNPPVEYLPKLC
HHCCCCCCCEEEEECCCCCCCCEEECCCCCCCCCCCEEEECCEEEEEEECCCHHHHHHHH
KEQCQFAISHALMSPRLAVSQVVVEHQGGDMYHLVVQLENQGFLPTYTSEKALEQKSVKP
HHHHHHHHHHHHHCCCHHHHHHHEECCCCCEEEEEEEECCCCCCCCCCCHHHHHHHCCCE
IEVILNLPGNVTLVSGKQKQEIEHLEGRASEAFNFFSFFTKLGGYRCHLEWVVKGVANSE
EEEEEECCCCEEEECCCHHHHHHHHCCCHHHHHHHHHHHHHHCCEEEEEEHHHHCCCCCE
IKVTAKAERAGVVKTVITLPE
EEEEEECCCCCEEEEEEECCC
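
The state lines above can be summarised as a composition. The sketch below assumes the conventional reading of the three codes (H = helix, E = strand, C = coil), which the record itself does not define; `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a predicted-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

To apply it to this record, concatenate only the state lines (every second line of a block) and pass the result in; a mix of H and E states would be in line with the "Alpha Beta" structure class.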

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA