Definition | Escherichia coli ED1a chromosome, complete genome. |
---|---|
Accession | NC_011745 |
Length | 5,209,548 |
The map label for this gene is csgC [H]
Identifier: 218688996
GI number: 218688996
Start: 1190945
End: 1191277
Strand: Direct
Name: csgC [H]
Synonym: ECED1_1187
Alternate gene names: 218688996
Gene position: 1190945-1191277 (Clockwise)
Preceding gene: 218688995
Following gene: 218688997
Centisome position: 22.86
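The centisome position appears to be the gene start expressed as a percentage of the chromosome length. The record does not state the formula, so the following is a minimal sketch under that assumption; the function name `centisome_position` is illustrative only.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return a start coordinate as a percentage of the genome length.

    Assumption: the centisome value is 100 * start / genome_length.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Values taken from this record: Start 1190945, Length 5,209,548.
print(round(centisome_position(1190945, 5209548), 2))  # 22.86
```

With the start and length listed in this record, the sketch reproduces the value 22.86.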
GC content: 42.34
Gene sequence:
>333_bases
ATGAATGCGTTATTACTCCTTGCGGCACTTTCCAGTCAGATAACCTTTAATACGACCCAGCAAGGGGATATGTACACCAT
TATTCCTGAAGTCACTCTTACTCAATCTTGTCTGTGCAGAGTACAAATATTGTCCCTGCGCGAAGGCAGTTCAGGGCAAA
GTCAGACGAAGCAAGAAAAGACCCTTTCATTGCCTGCTAATCAACTCATTGCTTTGACGAAGTTGAGTTTAAATATTTCC
CCGGACGATCGGGTGAAAATAGTTGTTACTGTTTCTGATGGACAGTCACTTCATTTATCACAACAATGGCCGTCCTCTTC
AGAAAAGTCTTAA
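The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, assuming only A, C, G and T are counted and whitespace is ignored (the function name `gc_content` is illustrative):

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces/newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 30 bases of the gene sequence in this record.
print(round(gc_content("ATGAATGCGTTATTACTCCTTGCGGCACTT"), 2))
```

Applied to the full 333-base sequence, the result can be compared with the listed GC content of 42.34.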
Upstream 100 bases:
>100_bases
CAGGTTGGCTTTGGTAACAACGCGACCGCTCATCAGTACTAATACATCATCAGTATGAAAGAAACAGGGCGCAAGCCCTG
TTTTTTTTCGGGAGAAGAAT
Downstream 100 bases:
>100_bases
TTTGTTGAAATATCGAGCATAAGATGAACCTGGAGAGAATGGTCTGCTGCGAATCAGCCAACCTGAAAGTATGGATAACA
CAACCCTCAAGGATGACTAA
Product: putative autoagglutination protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 110; Mature: 110
Protein sequence:
>110_residues
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQLIALTKLSLNIS
PDDRVKIVVTVSDGQSLHLSQQWPSSSEKS
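The gene coordinates, base count and residue count in this record are mutually consistent: 1191277 - 1190945 + 1 = 333 bases, which is 111 codons, or 110 residues plus a stop codon. The sketch below illustrates that check and a simple translation, assuming the standard codon assignments and reading from the first base; `gene_length` and `translate` are illustrative names, not part of any database tooling.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_bases = gene_length(1190945, 1191277)
print(n_bases, n_bases // 3 - 1)                    # 333 110
print(translate("ATGAATGCGTTATTACTCCTTGCGGCACTT"))  # MNALLLLAAL
```

The second call translates only the first ten codons of the gene sequence, which match the first ten residues of the listed protein.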
Sequences:
>Translated_110_residues
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQLIALTKLSLNIS
PDDRVKIVVTVSDGQSLHLSQQWPSSSEKS
>Mature_110_residues
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQLIALTKLSLNIS
PDDRVKIVVTVSDGQSLHLSQQWPSSSEKS
Specific function: Plays a role in the extracellular assembly of CsgA into thin aggregative fimbriae (Tafi) fibers. Assembly may also require CsgE. Tafi are thought to be assembled via an extracellular nucleation-precipitation (ENP) pathway, and possibly also via an intrace
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the CsgC/AgfC family [H]
Homologues:
Organism=Escherichia coli, GI1787280, Length=110, Percent_Identity=96.3636363636364, Blast_Score=192, Evalue=5e-51,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014491 [H]
Pfam domain/function: PF10610 Tafi-CsgC [H]
EC number: NA
Molecular weight: Translated: 11975; Mature: 11975
Theoretical pI: Translated: 7.25; Mature: 7.25
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 1.8 % Cys; 1.8 % Met; 3.6 % Cys+Met
Mature protein: 1.8 % Cys; 1.8 % Met; 3.6 % Cys+Met
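The Cys/Met percentages can be reproduced by counting residues in the 110-residue protein sequence from this record. A minimal sketch, assuming the percentage is the residue count divided by the sequence length and rounded to one decimal (the helper name `residue_percent` is illustrative):

```python
PROTEIN = (
    "MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK"
    "TLSLPANQLIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPSSSEKS"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq whose residue is one of `residues`."""
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq.upper() if aa in residues.upper())
    return 100.0 * hits / len(seq)


print(round(residue_percent(PROTEIN, "C"), 1))   # 1.8
print(round(residue_percent(PROTEIN, "M"), 1))   # 1.8
print(round(residue_percent(PROTEIN, "CM"), 1))  # 3.6
```

The sequence contains two cysteines and two methionines, so each accounts for 2/110 of the residues.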
Secondary structure:
>Translated Secondary Structure
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK
CCCEEEEEECCCCEEEECCCCCCEEEEECCCHHCHHHHHEEEEEEECCCCCCCCCCCCHH
TLSLPANQLIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPSSSEKS
EECCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCEEEEECCCCCCCCCC
>Mature Secondary Structure
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEK
CCCEEEEEECCCCEEEECCCCCCEEEEECCCHHCHHHHHEEEEEEECCCCCCCCCCCCHH
TLSLPANQLIALTKLSLNISPDDRVKIVVTVSDGQSLHLSQQWPSSSEKS
EECCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCEEEEECCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA