Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is csgC
Identifier: 209397004
GI number: 209397004
Start: 1407689
End: 1408021
Strand: Direct
Name: csgC
Synonym: ECH74115_1423
Alternate gene names: 209397004
Gene position: 1407689-1408021 (Clockwise)
Preceding gene: 209399619
Following gene: 209398141
Centisome position: 25.26
GC content: 42.94
Gene sequence:
>333_bases ATGAATGCGTTATTACTCCTTGCGGCACTTTCCAGTCAGATAACCTTTAATACGACCCAGCAAGGGGATATGTACACCAT TATTCCTGAAGTCACTCTTACTCAATCTTGTCTGTGCAGAGTACAAATATTGTCCCTGCGCGAAGGCAGTTCAGGGCAAA GTCAGACGAAGCAAGAAAAGACCCTTTCATTGCCTGCTAATCAACCCATTGCTTTGACGAAGTTGAGTTTAAATATTTCC CCGGACGATCGGGTGAAAATAGTTGTTACTGTTTCTGATGGACAGTCACTTCATTTATCACAACAATGGCCGCCCTCTTC AGAAAAGTCTTAA
Upstream 100 bases:
>100_bases CAGGTTGGCTTTGGTAACAACGCGACCGCTCATCAGTACTAATACATCATCTGTATTAAAGAAACAGGGCGCAAGCCCTG TTTTTTTTCGGGAGAAGAAT
Downstream 100 bases:
>100_bases TTTGTTGAAATATCGAGCATAAGATGAATCTGGAGAGAATGGTCTGCTGCGAATCAGCCAACCTGAAAGTATGGATAACA CAACCCTCAAGGATGACTAA
Product: putative autoagglutination protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 110; Mature: 110
Protein sequence:
>110_residues MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQPIALTKLSLNIS PDDRVKIVVTVSDGQSLHLSQQWPPSSEKS
Sequences:
>Translated_110_residues MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQPIALTKLSLNIS PDDRVKIVVTVSDGQSLHLSQQWPPSSEKS >Mature_110_residues MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQPIALTKLSLNIS PDDRVKIVVTVSDGQSLHLSQQWPPSSEKS
Specific function: Plays a role in the extracellular assembly of CsgA into thin aggregative fimbriae (Tafi) fibers. Assembly may also require CsgE. Tafi are thought to be assembled via an extracellular nucleation-precipitation (ENP) pathway, and possibly also via an intrace
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the CsgC/AgfC family
Homologues:
Organism=Escherichia coli, GI1787280, Length=110, Percent_Identity=98.1818181818182, Blast_Score=215, Evalue=4e-58,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): CSGC_ECO45 (B7MII3)
Other databases:
- EMBL: CU928161 - RefSeq: YP_002390837.1 - EnsemblBacteria: EBESCT00000138236 - GeneID: 7131970 - GenomeReviews: CU928161_GR - GeneTree: EBGT00050000009280 - HOGENOM: HBG416348 - ProtClustDB: PRK10102 - BioCyc: ECOL585035:ECS88_1055-MONOMER - InterPro: IPR014491 - PIRSF: PIRSF018100
Pfam domain/function: PF10610 Tafi-CsgC
EC number: NA
Molecular weight: Translated: 11969; Mature: 11969
Theoretical pI: Translated: 7.25; Mature: 7.25
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 1.8 %Met (Translated Protein) 3.6 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 1.8 %Met (Mature Protein) 3.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK CCCEEEEEECCCCEEEECCCCCCEEEEECCCCCCHHHHEEEEEEEECCCCCCCCCCCCCC TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS EEECCCCCCEEEEEEEEEECCCCCEEEEEEECCCCEEEECCCCCCCCCCC >Mature Secondary Structure MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK CCCEEEEEECCCCEEEECCCCCCEEEEECCCCCCHHHHEEEEEEEECCCCCCCCCCCCCC TLSLPANQPIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPPSSEKS EEECCCCCCEEEEEEEEEECCCCCEEEEEEECCCCEEEECCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA