Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is csgB

Identifier: 157160564

GI number: 157160564

Start: 1166847

End: 1167302

Strand: Direct

Name: csgB

Synonym: EcHS_A1160

Alternate gene names: 157160564

Gene position: 1166847-1167302 (Clockwise)

Preceding gene: 157160558

Following gene: 157160565

Centisome position: 25.13
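
The position fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the start coordinate expressed as a percentage of the genome length (the record does not state its formula; the function names are illustrative only):

```python
GENOME_LENGTH = 4_643_538          # "Length" field of NC_009800
START, END = 1_166_847, 1_167_302  # "Start" and "End" fields for csgB


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 456
print(centisome(START, GENOME_LENGTH))  # 25.13
```

Under these assumptions both values agree with the record: 456 bases and a centisome position of 25.13.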

GC content: 41.89

Gene sequence:

>456_bases
ATGAAAAACAAATTGTTATTTATGATGTTAACAATACTGGGTGCGCCTGGGATTGCAGCCGCAGCAGGTTATGATTTAGC
TAATTCAGAATATAACTTCGCGGTAAATGAATTGAGTAAGTCTTCATTTAATCAGGCAGCCATAATTGGTCAAGCTGGGA
CTAATAATAGTGCTCAGTTACGGCAGGGAGGCTCAAAACTTTTGGCGGTTGTTGCGCAAGAAGGTAGTAGCAACCGGGCA
AAGATTGACCAGACAGGAGATTATAACCTTGCATATATTGATCAGGCGGGCAGTGCCAACGATGCCAGTATTTCGCAAGG
TGCTTATGGTAATACTGCGATGATTATCCAGAAAGGTTCTGGTAATAAAGCAAATATTACACAGTATGGTACTCAAAAAA
CGGCAATTGTAGTGCAGAGACAGTCGCAAATGGCTATTCGCGTGACACAACGTTAA
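
A value like the "GC content" field can be recomputed from the gene sequence. This is a small sketch, assuming GC content is the percentage of G and C bases over the full sequence length; the function name `gc_content` is hypothetical and the demonstration uses only the first 15 bases of the gene to keep the example short:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks from wrapped FASTA text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# First 15 bases of the csgB gene sequence shown above.
print(gc_content("ATGAAAAACAAATTG"))  # 20.0
```

Passing the full 456-base sequence instead of the fragment would give the gene-level figure to compare with the record.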

Upstream 100 bases:

>100_bases
TTTCCATCGTAACGCAGCGTTAACAAAATACAGGTTGCGTTAACAACCAAGTTGAAATGATTTAATTTCTTAAATGTACG
ACCAGGTCCAGGGTGACAAC

Downstream 100 bases:

>100_bases
TTTCCATTCGACTTTTAAATCAATCCGATGGGGGTTTTACATGAAACTTTTAAAAGTAGCAGCAATTGCAGCAATCGTAT
TCTCCGGTAGCGCTCTGGCA

Product: curlin minor subunit

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 151; Mature: 151

Protein sequence:

>151_residues
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSNRA
KIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQR
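
The protein sequence corresponds to a direct-strand translation of the gene sequence. The sketch below assumes the standard genetic code and a reading frame starting at the first base; the names `CODON_TABLE` and `translate` are illustrative and not part of the source database:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the csgB gene sequence.
print(translate("ATGAAAAACAAATTGTTATTTATGATGTTA"))  # MKNKLLFMML
print(translate("GTGACACAACGTTAA"))                 # VTQR
```

The 456-base gene holds 152 codons; the final TAA stop codon is not translated, which is consistent with the 151 residues reported.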

Sequences:

>Translated_151_residues
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSNRA
KIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQR
>Mature_151_residues
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSNRA
KIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQR

Specific function: Curlin is the structural subunit of curli. Curli are coiled surface structures that assemble preferentially at growth temperatures below 37 degrees Celsius and can bind to fibronectin. The minor subunit is the nucleation component of curlin monomers.

COG id: NA

COG function: NA

Gene ontology:

Cell location: Fimbrium. Note=Part of the curli surface structure

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the CsgA/CsgB family

Homologues:

Organism=Escherichia coli, GI1787278, Length=151, Percent_Identity=100, Blast_Score=298, Evalue=1e-82

Paralogues:

None

Copy number: NA

Swissprot (ID and AC): CSGB_ECO57 (P0ABK8)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   C90806
- PIR:   G85665
- RefSeq:   NP_287175.1
- RefSeq:   NP_309446.1
- ProteinModelPortal:   P0ABK8
- EnsemblBacteria:   EBESCT00000028326
- EnsemblBacteria:   EBESCT00000056293
- GeneID:   912479
- GeneID:   959370
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1675
- KEGG:   ecs:ECs1419
- GeneTree:   EBGT00050000011898
- HOGENOM:   HBG678327
- OMA:   TNDASIS
- ProtClustDB:   PRK10101
- BioCyc:   ECOL83334:ECS1419-MONOMER
- GO:   GO:0009289
- InterPro:   IPR009742

Pfam domain/function: PF07012 Curlin_rpt

EC number: NA

Molecular weight: Translated: 15882; Mature: 15882

Theoretical pI: Translated: 10.05; Mature: 10.05

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
3.3 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
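
The Cys/Met percentages can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal place; `residue_percent` is a hypothetical helper name:

```python
# "Protein sequence" field of this record.
PROTEIN = (
    "MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSNRA"
    "KIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQR"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 0.0
print(residue_percent(PROTEIN, "M"))   # 3.3
print(residue_percent(PROTEIN, "CM"))  # 3.3
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.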

Secondary structure:

>Translated Secondary Structure
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQL
CCCHHHHHHHHHHCCCCCHHHCCCCCCCCCCCEEHHHHCCCCCCCEEEEECCCCCCCHHH
RQGGSKLLAVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGS
HCCCCEEEEEEECCCCCCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEECCC
GNKANITQYGTQKTAIVVQRQSQMAIRVTQR
CCCCEEEECCCCEEEEEEEECCCEEEEEECC
>Mature Secondary Structure
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQL
CCCHHHHHHHHHHCCCCCHHHCCCCCCCCCCCEEHHHHCCCCCCCEEEEECCCCCCCHHH
RQGGSKLLAVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGS
HCCCCEEEEEEECCCCCCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEECCC
GNKANITQYGTQKTAIVVQRQSQMAIRVTQR
CCCCEEEECCCCEEEEEEEECCCEEEEEECC
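
The predicted-state strings above can be summarised as per-state percentages. The sketch below assumes the conventional three-state alphabet (H = helix, E = strand, C = coil), which the record itself does not define; `state_percentages` is an illustrative name, and the demonstration uses only the first 60-state line of the prediction:

```python
from collections import Counter


def state_percentages(states: str) -> dict[str, float]:
    """Percentage of positions assigned to each secondary-structure state."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100 * n / len(states), 1) for s, n in sorted(counts.items())}


# First line of the predicted states for the translated protein.
first_line = "CCCHHHHHHHHHHCCCCCHHHCCCCCCCCCCCEEHHHHCCCCCCCEEEEECCCCCCCHHH"
print(state_percentages(first_line))
```

Concatenating the three state lines before calling the function would give the whole-protein breakdown.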

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796