| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is appC [H]
Identifier: 113476530
GI number: 113476530
Start: 4593896
End: 4594963
Strand: Reverse
Name: appC [H]
Synonym: Tery_2957
Alternate gene names: 113476530
Gene position: 4594963-4593896 (Counterclockwise)
Preceding gene: 113476531
Following gene: 113476529
Centisome position: 59.29
GC content: 36.99
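As a rough illustration of how the two values above can be derived, the sketch below computes a centisome position (a coordinate expressed as a percentage of chromosome length) and a GC percentage. It assumes the centisome is taken from the coordinate where transcription begins, which for this reverse-strand gene is 4594963; under that assumption the record's 59.29 is reproduced. The function names `centisome` and `gc_content` are hypothetical and not part of the database.

```python
GENOME_LENGTH = 7_750_108   # chromosome length from the record header
GENE_START = 4_594_963      # assumed reference point: higher coordinate of a reverse-strand gene


def centisome(position: int, genome_length: int) -> float:
    """Return a coordinate as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 59.29
print(gc_content("ATGC"))                              # 50.0 (toy input, not the gene)
```

Passing the 1068-base gene sequence (with the spaces removed) to `gc_content` would be the way to check the reported GC content.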
Gene sequence:
>1068_bases ATGAAATGGTGGAAAAATCTTAAAAAAAATCCTTTAGCAAAATTTGGTGGATTATTGCTATTAATTTTCTATTTGGTAGT AATTGCTGCTGACTTTATTGCTCCCTATGACGCCTATACATCTCAACCCAACGGTTCCCTATTACCTCCTACTCAAATTT ACTGGCAAAACCAAGCAGGTGAGTTTATTGGACCTCATGTTTACCCCACGACTCAAGGACCGGTAGATTTAGAAACTGGA CTCCGGGAGTTAAAGGTAGATTTGAGTAAACCATCCCCTCTAGGTTTATTTGTCCAAGGACCATCCTATAAACTCTTGGG GATTTTGCCCTGGAATCGACATTTGTTTGGAACGATAGGTCAAGCAAAATTTAATTTATTAGGTACTGACGAACAAGCAA GAGACCAATTTAGTCGTTTAATTTTTGGTGGTAGGATTAGTTTATTTATTGGTTTAGTAGGAATTACAATCTATTTTCCT TTGGGAATGATTATTGGTGGAATTTCTGGTTACTTTGGTGGTTGGATAGATAGTATTTTAATGCGTTTTGCTGAAGTATT AATGACTATTCCGGGTATTTATTTATTAGTGGCACTCGCTGCTATATTACCTCCTGGTTTAACTAGTGCCCAAAGATTTC TCTTAATTGTGGTGATTACTTCATTTATTAGTTGGGCAGGGTTAGCTAGAGTTATTCGTGGAGAAGTCTTATCTATCAAA GAACGAGAATTTGTTCAAGCAGTTAGGGCGATGGGGGCAGGTTCTTTTTATATCATTGTTCGCCATGTATTACCCCAAAC AGCAACTTATGTGATTATTTCAGCAACTTTAGCTATTCCTAGCTTTATTATCTCAGAATCAGTTTTGAGTTTGATTGGTT TAGGTATTCAACAGCCAGACCCTTCTTGGGGGAATATGTTATCTTTGGCTACAAACGCTTCTATCTTAGTATTACAACCT TGGTTAGTATGGCCTCCAGCACTATTGATTATTCTGACGGTATTAGCATTTAATTTATTAGGAGATGGGTTGCGAGATGC TTTAGATCCCAGAAATTTACAACAATAA
Upstream 100 bases:
>100_bases TGATGGTTTGACGATTAAAACCTAAATACAATTATGGATAGTTGTTTTTTTACTTTTCAAACAGGTTCTAACTATTCTTA TTCTCTATTAGATAAAATCA
Downstream 100 bases:
>100_bases TAAATATTAGTTAATAGTTTAAACGGAGAAATAATTATCTTAGTATTATTAATAACTCAATAGGGTTAATTATATTTATC AGTTATTTTTACATAACAAA
Product: binding-protein-dependent transport systems inner membrane component
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 355; Mature: 355
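The residue count follows from the gene length: 1068 bases form 356 codons, and the final codon (TAA) is a stop codon that is not translated, leaving 355 residues. A minimal sketch of that arithmetic, with a hypothetical helper name:

```python
def protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_length // 3
    # The stop codon terminates translation and adds no residue.
    return codons - 1 if includes_stop else codons


print(protein_length(1068))  # 355
```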
Protein sequence:
>355_residues MKWWKNLKKNPLAKFGGLLLLIFYLVVIAADFIAPYDAYTSQPNGSLLPPTQIYWQNQAGEFIGPHVYPTTQGPVDLETG LRELKVDLSKPSPLGLFVQGPSYKLLGILPWNRHLFGTIGQAKFNLLGTDEQARDQFSRLIFGGRISLFIGLVGITIYFP LGMIIGGISGYFGGWIDSILMRFAEVLMTIPGIYLLVALAAILPPGLTSAQRFLLIVVITSFISWAGLARVIRGEVLSIK EREFVQAVRAMGAGSFYIIVRHVLPQTATYVIISATLAIPSFIISESVLSLIGLGIQQPDPSWGNMLSLATNASILVLQP WLVWPPALLIILTVLAFNLLGDGLRDALDPRNLQQ
Sequences:
>Translated_355_residues MKWWKNLKKNPLAKFGGLLLLIFYLVVIAADFIAPYDAYTSQPNGSLLPPTQIYWQNQAGEFIGPHVYPTTQGPVDLETG LRELKVDLSKPSPLGLFVQGPSYKLLGILPWNRHLFGTIGQAKFNLLGTDEQARDQFSRLIFGGRISLFIGLVGITIYFP LGMIIGGISGYFGGWIDSILMRFAEVLMTIPGIYLLVALAAILPPGLTSAQRFLLIVVITSFISWAGLARVIRGEVLSIK EREFVQAVRAMGAGSFYIIVRHVLPQTATYVIISATLAIPSFIISESVLSLIGLGIQQPDPSWGNMLSLATNASILVLQP WLVWPPALLIILTVLAFNLLGDGLRDALDPRNLQQ
>Mature_355_residues MKWWKNLKKNPLAKFGGLLLLIFYLVVIAADFIAPYDAYTSQPNGSLLPPTQIYWQNQAGEFIGPHVYPTTQGPVDLETG LRELKVDLSKPSPLGLFVQGPSYKLLGILPWNRHLFGTIGQAKFNLLGTDEQARDQFSRLIFGGRISLFIGLVGITIYFP LGMIIGGISGYFGGWIDSILMRFAEVLMTIPGIYLLVALAAILPPGLTSAQRFLLIVVITSFISWAGLARVIRGEVLSIK EREFVQAVRAMGAGSFYIIVRHVLPQTATYVIISATLAIPSFIISESVLSLIGLGIQQPDPSWGNMLSLATNASILVLQP WLVWPPALLIILTVLAFNLLGDGLRDALDPRNLQQ
Specific function: This protein is a component of an oligopeptide permease, a binding protein-dependent transport system. This APP system can completely substitute for the OPP system in both sporulation and genetic competence, though, unlike OPP, is incapable of transportin
COG id: COG1173
COG function: function code EP; ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 ABC transmembrane type-1 domain [H]
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Escherichia coli | GI1787054 | 225 | 40.4444444444444 | 171 | 5e-44 |
| Escherichia coli | GI1787498 | 228 | 39.0350877192982 | 164 | 8e-42 |
| Escherichia coli | GI1787760 | 224 | 38.3928571428571 | 148 | 4e-37 |
| Escherichia coli | GI1789964 | 236 | 38.135593220339 | 135 | 5e-33 |
| Escherichia coli | GI1789889 | 225 | 36 | 128 | 4e-31 |
| Escherichia coli | GI1788505 | 233 | 29.6137339055794 | 108 | 6e-25 |
| Escherichia coli | GI1787549 | 260 | 30 | 93 | 2e-20 |
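The long decimal tails in the percent identity values of the homologues listed above are consistent with each value being a simple ratio of identical positions to the reported length. The sketch below recovers those whole-number counts, assuming that Length is the aligned length used as the denominator (the record does not state this explicitly). `HITS` and `identical_positions` are illustrative names only.

```python
# (GI, length, percent identity) copied from the homologue list above.
HITS = [
    ("GI1787054", 225, 40.4444444444444),
    ("GI1787498", 228, 39.0350877192982),
    ("GI1787760", 224, 38.3928571428571),
    ("GI1789964", 236, 38.135593220339),
    ("GI1789889", 225, 36.0),
    ("GI1788505", 233, 29.6137339055794),
    ("GI1787549", 260, 30.0),
]


def identical_positions(length: int, percent_identity: float) -> int:
    """Recover the count of identical positions from a percentage and a length."""
    return round(length * percent_identity / 100.0)


for gi, length, pct in HITS:
    print(f"{gi}: {identical_positions(length, pct)} identical of {length}")
```

For the top hit this gives 91 identical positions out of 225.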
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000515 [H]
Pfam domain/function: PF00528 BPD_transp_1 [H]
EC number: NA
Molecular weight: Translated: 38944; Mature: 38944
Theoretical pI: Translated: 9.57; Mature: 9.57
Prosite motif: PS50928 ABC_TM1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- Translated protein: 0.0 % Cys, 1.7 % Met, 1.7 % Cys+Met
- Mature protein: 0.0 % Cys, 1.7 % Met, 1.7 % Cys+Met
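These percentages are plain residue counts divided by the protein length. A minimal sketch, using a hypothetical helper `residue_percent` and a toy sequence rather than the full protein:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`, to one decimal."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * hits / len(seq), 1)


toy = "MKCAAAAAAA"  # toy 10-residue example, not the real protein
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the 355-residue sequence in this record, a hand count of six Met and no Cys residues gives 6 / 355, which rounds to the reported 1.7 %.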
Secondary structure:
>Translated Secondary Structure
MKWWKNLKKNPLAKFGGLLLLIFYLVVIAADFIAPYDAYTSQPNGSLLPPTQIYWQNQAG
CCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCHHEEECCCCC
EFIGPHVYPTTQGPVDLETGLRELKVDLSKPSPLGLFVQGPSYKLLGILPWNRHLFGTIG
CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEEECCCCHHHHCCC
QAKFNLLGTDEQARDQFSRLIFGGRISLFIGLVGITIYFPLGMIIGGISGYFGGWIDSIL
CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MRFAEVLMTIPGIYLLVALAAILPPGLTSAQRFLLIVVITSFISWAGLARVIRGEVLSIK
HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EREFVQAVRAMGAGSFYIIVRHVLPQTATYVIISATLAIPSFIISESVLSLIGLGIQQPD
HHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
PSWGNMLSLATNASILVLQPWLVWPPALLIILTVLAFNLLGDGLRDALDPRNLQQ
CCHHCEEHHHCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
MKWWKNLKKNPLAKFGGLLLLIFYLVVIAADFIAPYDAYTSQPNGSLLPPTQIYWQNQAG
CCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCHHEEECCCCC
EFIGPHVYPTTQGPVDLETGLRELKVDLSKPSPLGLFVQGPSYKLLGILPWNRHLFGTIG
CCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEEECCCCHHHHCCC
QAKFNLLGTDEQARDQFSRLIFGGRISLFIGLVGITIYFPLGMIIGGISGYFGGWIDSIL
CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MRFAEVLMTIPGIYLLVALAAILPPGLTSAQRFLLIVVITSFISWAGLARVIRGEVLSIK
HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EREFVQAVRAMGAGSFYIIVRHVLPQTATYVIISATLAIPSFIISESVLSLIGLGIQQPD
HHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
PSWGNMLSLATNASILVLQPWLVWPPALLIILTVLAFNLLGDGLRDALDPRNLQQ
CCHHCEEHHHCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 7997159; 9384377 [H]