Definition: Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession: NC_008312
Length: 7,750,108

The map label for this gene is ycgO [H]

Identifier: 113476544

GI number: 113476544

Start: 4625247

End: 4626683

Strand: Reverse

Name: ycgO [H]

Synonym: Tery_2974

Alternate gene names: 113476544

Gene position: 4626683-4625247 (Counterclockwise)
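The Start, End and Strand fields together determine the gene's length and reading direction. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the 1437-base gene sequence listed in this record); the helper name `gene_span` is hypothetical.

```python
def gene_span(start: int, end: int, strand: str) -> dict:
    """Summarise a gene location given 1-based inclusive coordinates."""
    length = end - start + 1
    # A reverse-strand gene is read from `end` down to `start`,
    # so its 5' end sits at the larger coordinate.
    if strand.lower() == "reverse":
        five_prime, three_prime = end, start
    else:
        five_prime, three_prime = start, end
    return {"length": length, "five_prime": five_prime, "three_prime": three_prime}


span = gene_span(4625247, 4626683, "Reverse")
print(span)
```

Under that assumption the computed length matches the `>1437_bases` header, and the 5'-to-3' order matches the "4626683-4625247" notation used for the gene position.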

Preceding gene: 113476545

Following gene: 113476543

Centisome position: 59.7
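The centisome position can be read as the gene's location expressed as a percentage of the chromosome length. A short sketch, assuming the value is derived from the Start coordinate and rounded to one decimal place (the record does not state the exact convention); `centisome` is a hypothetical helper name.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return round(100.0 * position / genome_length, 1)


# Start coordinate and chromosome length as given in this record.
print(centisome(4625247, 7750108))
```

Using the End coordinate (4626683) instead gives the same rounded value, so the choice of endpoint does not matter for this gene.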

GC content: 38.41
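GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, with a hypothetical function name `gc_content` and rounding to two decimals assumed to match the figure above:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene sequence in this record.
print(gc_content("ATGGAAAAAC"))
```

Applied to the full 1437-base sequence, this should reproduce a value close to the reported 38.41 if the record is internally consistent.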

Gene sequence:

>1437_bases
ATGGAAAAACAAATTTGGATTGGGATAACATTTATAGCCTTTTTGCTATCATTTACAGTAGTAGGCATTTACTCCGCAAC
ACAAAAGCAAAATACAACAACTGATTACTTACTTGCCAGTAGAAATGTTAATCCCTGGTTGACAGCACTATCCGCAATGG
CAACAGGTCAGAGTGGGTTTCTATTTATTGGTTCGATAGGTTTTATCTATAAAGTTGGATTTGCTGCTATTTGGATACCC
CTTGCTTGGACAATAGGAGACTATATTGCTTGGTTGTTAATATTTAAAAGGTTGAGGTTAGTTTCTCAGGAAACAGACTC
AGATACAATCTCCTCATTCTTAGGTCAAGAAAATCTAAGTCCAAAAAATCAAGGGCGCTCGATTACAATAATTTCAGCAC
TAATTACCATAGGAATTCTGGGTACTTATGCTGCAGCTCAACTGGTAGCAGCAAGCAAAGGACTGAATGCTATATTTGGT
TGGAACTATGAACTGGGTATTATTGCTGGGGCTGTAATTGTGGTTGTCTACTGTTTTTCAGGAGGTATCCGTGCTTCTAT
ATGGACTGACTCTGTGCAGGGAATTTTAATGATATTATCTCTGTTGATTTTGTGTATAGTAAGTTTACTGGCTTGTGGAG
GGTTGACAGAACTTTGGGTCAAGCTTAATGCCATTGACCCCACTCTAACAAATTGGATGCCTACTAATTTACCTTGGGGG
TTTTTTCCTTACTTTTTGGGGTGGTTGGTGTCAGGCTTGGGTGTTGTCGGTCAACCTCATGTATTAGTAAGAGCAATGGC
AATTGACTCTGCAGATAATATAGCGTTAGCTCGTAACATAAAATTAGTCTGCGGTCTAATGAATTCGGCTACAGCTTTTG
GTATAGGATTAACTGCCAGAGTTTTGTTACCTGAATTAATGACATCTGGTGACCCAGAGTTAGCATTACCGAATCTATCT
ATAGAATTATTGCCAGCAGTTTTAGTAGGGTTGATGTTAGCAGGACTTTTTTCTGCAGCTATTTCTACAGCAGATTCTCA
AATATTATCATGTTCTGCTGCACTAAGTCAAGATTTAGTTCCCAGTGGATCTAACTCTTATCGAAAAGCTAAAATTGCTA
CCTTAGCTGTTACTGCTTTTGTATTAGCGATCGCTCTCATAACAAACAATAGTGTATTTGCTTTGGTCATTTTCTCTTGG
TCAGTTTTAGCCTGCGCTTTAGGTCCGTTGTTAGTATTGCGAGTGTGGCAAAAACCTGTAAGGGTTCCAGTCGCAATAAC
AATGATGATTACTGGTATAGTAGTTGCGATTATATGGAATAAAGGCTTTAACCTATCAAGCGCTATTTATGAAGTCTTGC
CTGGTATGGCAGCAGGCTTTATTGTTTATGGAATTGCTAATTTGCAAATTTGGCCTAAAGATTTGAGTAAACAATAA

Upstream 100 bases:

>100_bases
GCATCGTCGCAGGTTTTATTTTTTTGGAATTGCTAATTTTACGGTTTTGCCTAAAAGTTTGAGTAAACAATAAAATAATA
CTACAATTTAATAGTGAAAA

Downstream 100 bases:

>100_bases
AATCATACTACTACTTTTTCAATTTCAGCAATTAGAAAAAAGATGAACAATCAAGCTAGATTATTACCATTAAACCATTT
CTCATCAATACCTATAGACT

Product: SSS family solute/sodium (Na+) symporter

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 478; Mature: 478
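The residue count follows from the gene length: 1437 bases make 479 codons, and the last codon (TAA) is a stop, leaving 478 amino acids. The sketch below translates a coding sequence with the standard codon assignments; it is an illustration, not the pipeline that produced this record, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGGAAAAACAAATTTGGATTGGGATAACA"))
print(translate("GATTTGAGTAAACAATAA"))
print(1437 // 3 - 1)  # codons minus the stop codon
```

The two fragments translate to the first ten and last five residues of the protein sequence listed below.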

Protein sequence:

>478_residues
MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGFLFIGSIGFIYKVGFAAIWIP
LAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLSPKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFG
WNYELGIIAGAVIVVVYCFSGGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG
FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTARVLLPELMTSGDPELALPNLS
IELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLVPSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSW
SVLACALGPLLVLRVWQKPVRVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ

Sequences:

>Translated_478_residues
MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGFLFIGSIGFIYKVGFAAIWIP
LAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLSPKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFG
WNYELGIIAGAVIVVVYCFSGGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG
FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTARVLLPELMTSGDPELALPNLS
IELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLVPSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSW
SVLACALGPLLVLRVWQKPVRVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ
>Mature_478_residues
MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGFLFIGSIGFIYKVGFAAIWIP
LAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLSPKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFG
WNYELGIIAGAVIVVVYCFSGGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG
FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTARVLLPELMTSGDPELALPNLS
IELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLVPSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSW
SVLACALGPLLVLRVWQKPVRVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ

Specific function: Catalyzes the sodium-dependent uptake of extracellular amino acids [H]

COG id: COG0591

COG function: function code ER; Na+/proline symporter

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sodium:solute symporter (SSF) (TC 2.A.21) family [H]

Homologues:

Organism=Homo sapiens, GI310128183, Length=487, Percent_Identity=28.1314168377823, Blast_Score=184, Evalue=1e-46,
Organism=Escherichia coli, GI1787251, Length=471, Percent_Identity=29.723991507431, Blast_Score=169, Evalue=3e-43,
Organism=Escherichia coli, GI87082237, Length=370, Percent_Identity=27.2972972972973, Blast_Score=113, Evalue=3e-26,
Organism=Escherichia coli, GI1790503, Length=445, Percent_Identity=22.9213483146067, Blast_Score=83, Evalue=3e-17,
Organism=Drosophila melanogaster, GI28573698, Length=379, Percent_Identity=24.802110817942, Blast_Score=66, Evalue=6e-11,
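Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch, assuming that layout holds for every line and that organism names contain no commas; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from the record into a typed dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as text
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = float(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI310128183, Length=487, "
    "Percent_Identity=28.1314168377823, Blast_Score=184, Evalue=1e-46,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed this way, the hits can be sorted or filtered by E-value or percent identity.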

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011851
- InterPro:   IPR001734
- InterPro:   IPR018212
- InterPro:   IPR019900 [H]

Pfam domain/function: PF00474 SSF [H]

EC number: NA

Molecular weight: Translated: 51144; Mature: 51144

Theoretical pI: Translated: 8.55; Mature: 8.55

Prosite motif: PS50283 NA_SOLUT_SYMP_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
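The percentages above are residue counts divided by the protein length. A short sketch that recomputes them from the 478-residue sequence in this record, assuming rounding to one decimal place; `residue_percent` is a hypothetical helper.

```python
# Protein sequence copied from the record (478 residues).
PROTEIN = (
    "MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGFLFIGSIGFIYKVGFAAIWIP"
    "LAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLSPKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFG"
    "WNYELGIIAGAVIVVVYCFSGGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG"
    "FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTARVLLPELMTSGDPELALPNLS"
    "IELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLVPSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSW"
    "SVLACALGPLLVLRVWQKPVRVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in `seq` occupied by any of `residues`."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


print(len(PROTEIN))
print(residue_percent(PROTEIN, "C"))
print(residue_percent(PROTEIN, "M"))
print(residue_percent(PROTEIN, "CM"))
```

Because the translated and mature sequences are identical in this record, the two sets of percentages coincide.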

Secondary structure:

>Translated Secondary Structure
MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGF
CCCEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHEEECCCCCHHHHHHHHHHCCCCCE
LFIGSIGFIYKVGFAAIWIPLAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLS
EEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCC
PKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFGWNYELGIIAGAVIVVVYCFS
CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHC
GGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG
CCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHEEEECCCCHHHCCCCCCCCCH
FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTAR
HHHHHHHHHHHCCCCCCCCHHHHEEHHHCCCCCEEHHHHHHHHHHHHHCCHHHHHHHHHH
VLLPELMTSGDPELALPNLSIELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLV
HHHHHHHCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSWSVLACALGPLLVLRVWQKPV
CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHCCC
RVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ
CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHCCEECCHHCCCC
>Mature Secondary Structure
MEKQIWIGITFIAFLLSFTVVGIYSATQKQNTTTDYLLASRNVNPWLTALSAMATGQSGF
CCCEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHEEECCCCCHHHHHHHHHHCCCCCE
LFIGSIGFIYKVGFAAIWIPLAWTIGDYIAWLLIFKRLRLVSQETDSDTISSFLGQENLS
EEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCC
PKNQGRSITIISALITIGILGTYAAAQLVAASKGLNAIFGWNYELGIIAGAVIVVVYCFS
CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHC
GGIRASIWTDSVQGILMILSLLILCIVSLLACGGLTELWVKLNAIDPTLTNWMPTNLPWG
CCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHEEEECCCCHHHCCCCCCCCCH
FFPYFLGWLVSGLGVVGQPHVLVRAMAIDSADNIALARNIKLVCGLMNSATAFGIGLTAR
HHHHHHHHHHHCCCCCCCCHHHHEEHHHCCCCCEEHHHHHHHHHHHHHCCHHHHHHHHHH
VLLPELMTSGDPELALPNLSIELLPAVLVGLMLAGLFSAAISTADSQILSCSAALSQDLV
HHHHHHHCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PSGSNSYRKAKIATLAVTAFVLAIALITNNSVFALVIFSWSVLACALGPLLVLRVWQKPV
CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHCCC
RVPVAITMMITGIVVAIIWNKGFNLSSAIYEVLPGMAAGFIVYGIANLQIWPKDLSKQ
CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHCCEECCHHCCCC
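The prediction lines pair each residue with a one-letter state. A minimal sketch for summarising such a string, assuming the usual convention that H, E and C denote helix, strand and coil (the record does not define the letters); `state_composition` is a hypothetical helper.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in sorted(counts.items())}


# First ten predicted states from the record.
print(state_composition("CCCEEEEHHH"))
```

Concatenating the prediction lines of one block and passing them to the function gives the overall composition for the protein.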

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8969502; 9384377 [H]