Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862


The map label for this gene is cirA

Identifier: 218695768

GI number: 218695768

Start: 2460231

End: 2462222

Strand: Reverse

Name: cirA

Synonym: EC55989_2407

Alternate gene names: 218695768

Gene position: 2462222-2460231 (Counterclockwise)

Preceding gene: 218695769

Following gene: 218695765

Centisome position: 47.77
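
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the centisome is assumed to be the position of the gene's first base (the higher coordinate, since the gene is on the reverse strand) as a percentage of the genome length.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_154_862
START, END = 2_460_231, 2_462_222  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
print(length)                          # 1992 bases
print(length // 3 - 1)                 # 663 codons, excluding the stop codon
print(centisome(END, GENOME_LENGTH))   # 47.77
```

Under these assumptions the numbers agree with the Gene sequence header (1992 bases), the Number of amino acids field (663), and the Centisome position field (47.77). Using the lower coordinate instead gives 47.73, which suggests the record measures from the gene's first base on the reverse strand.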

GC content: 52.71

Gene sequence:

>1992_bases
ATGTTTAGGTTGAACCCTTTCGTACGGGTCGGGCTGTGTTTGTCCGCTATTTCTTGTGCATGGCCTGTGTTAGCGGTCGA
TGATGATGGCGAAACGATGGTTGTCACTGCATCTTCCGTTGAACAAAACCTCAAAGATGCACCCGCCAGTATCAGCGTCA
TTACCCAGGAAGACCTGCAGCGAAAACCGGTACAGAATCTGAAGGATGTCCTCAAAGAAGTGCCTGGCGTACAACTGACG
AACGAAGGGGATAACCGTAAGGGCGTTAGTATTCGTGGTCTGGACAGCAGCTACACCCTGATTCTCGTCGACGGTAAACG
CGTTAACTCCCGCAATGCCGTCTTCCGCCACAATGATTTCGATCTGAACTGGATCCCGGTCGATTCCATCGAACGTATTG
AAGTGGTTCGTGGCCCGATGTCGTCGCTATACGGCTCCGATGCGCTCGGCGGTGTAGTGAATATCATCACCAAAAAAATC
GGTCAGAAATGGTCGGGTACCGTTACCGTCGATACCACCATTCAGGAACATCGCGATCGCGGTGATACCTATAACGGTCA
GTTCTTTACCAGTGGACCATTAATTGATGGTGTGCTGGGAATGAAAGCTTACGGCAGCCTGGCAAAACGTGAAAAGGATG
ACCCGCAAAACTCAACGACCACCGATACCGGAGAAACGCCGCGTATTGAAGGATTCTCCAGCCGCGACGGCAATGTCGAA
TTTGCCTGGACACCGAATCAAAATCACGATTTTACTGCCGGATACGGTTTCGACCGTCAGGATCGTGATTCCGACTCGCT
GGACAAAAACCGCCTGGAACGCCAGAACTACTCCGTCAGCCATAATGGGCGTTGGGATTACGGCACCAGCGAACTGAAAT
ACTACGGTGAGAAAGTCGAGAACAAAAACCCTGGCAACAGCAGCCCGATAACTTCCGAAAGCAATACGGTCGACGGCAAA
TACACGTTGCCGCTGACGGCGATTAATCAGTTTCTCACGGTTGGCGGTGAATGGCGTCACGACAAACTTAGCGATGCGGT
GAACCTGACCGGGGGAACCAGCTCCAAAACGTCTGCCAGCCAGTACGCGCTGTTTGTGGAAGATGAATGGCGGATCTTCG
AGCCGCTGGCGCTGACGACCGGCGTGCGTATGGACGATCACGAAACCTACGGTGAACACTGGAGTCCGCGTGCCTACCTG
GTTTATAACGCCACCGACACCGTAACGGTGAAAGGGGGCTGGGCGACGGCATTTAAAGCACCTTCTCTGTTGCAACTTAG
CCCTGACTGGACGAGCAATTCCTGCCGTGGCGCATGTAAGATTGTGGGTAGCCCGGATCTGAAACCAGAAACCAGCGAAA
GTTGGGAGCTGGGGCTTTACTACATGGGTGAAGAAGGCTGGCTGGAAGGGGTTGAATCCAGCGTTACCGTTTTCCGTAAC
GATGTGAAAGATCGTATCAGCATCAGCCGTACGTCTGACGTCAACGCTGCACCGGGCTACCAAAACTTTGTTGGTTTTGA
GACGGGCGCTAACGGACGGCGCATACCGGTATTTAGCTACTACAACGTTAACAAAGCTCGTATTCAGGGCGTGGAAACCG
AACTGAAAATTCCGTTCAACGATGAATGGAAACTGTCGATCAACTACACCTACAACGATGGTCGTGATGTCAGCAACGGC
GAAAACAAACCGCTATCCGATCTGCCGTTCCATACTGCTAACGGTACGCTGGACTGGAAACCGCTGGCGCTGGAAGACTG
GTCATTCTATGTTTCTGGTCACTATACCGGGCAGAAACGCGCCGACAGCGCGACGGCTAAAACACCGGGCGGTTATACCA
TCTGGAATACCGGCGCGGCCTGGCAGGTGACTAAAGACGTCAAACTGCGCGCAGGCGTGCTGAACCTTGGCGACAAGGAT
CTCAGTCGTGACGACTACAGCTATAACGAAGACGGACGTCGTTACTTTATGGCAGTGGATTATCGCTTCTGA
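
The GC content field is a percentage of G and C bases in the gene sequence. As a hedged illustration of how such a figure is usually computed (the function name is hypothetical, and the record does not state its exact method), a minimal sketch:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a multi-line
    FASTA body can be pasted in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# Toy input only; paste the 1992-base sequence above to check the record.
print(gc_content("ATGC"))  # 50.0
```

If the record is internally consistent, applying this function to the full gene sequence should give a value close to the GC content field.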

Upstream 100 bases:

>100_bases
ACAAATAAGTCCACCGCGATGCTGCCGTACGCAAGGGGACGTGAAGAAGATGTGAGCGATAACCCATTTTATTTTCGTAG
TTACCTCATGGAGATATGGA

Downstream 100 bases:

>100_bases
TGAGAAGATGCCCGGCGAACCGGGCGGACTTTCACTTCAGTAAATACTGCGCATGGAAGCGCAGGTGATCCTCTATAAAA
GAGGCGATGAAGTAGTAACT

Product: colicin I receptor

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 663; Mature: 663

Protein sequence:

>663_residues
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQRKPVQNLKDVLKEVPGVQLT
NEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDFDLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKI
GQKWSGTVTVDTTIQEHRDRGDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVENKNPGNSSPITSESNTVDGK
YTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSASQYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYL
VYNATDTVTVKGGWATAFKAPSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFNDEWKLSINYTYNDGRDVSNG
ENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKRADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKD
LSRDDYSYNEDGRRYFMAVDYRF
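
The protein sequence is the conceptual translation of the gene sequence. The sketch below assumes the standard genetic code, whose amino-acid assignments are written in the usual compact TCAG ordering; the names are illustrative and not taken from the record.

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGTTTAGGTTGAACCCTTTC"))  # MFRLNPF
print(translate("CGCTTCTGA"))              # RF
```

Both fragments match the start (MFRLNPF) and end (RF) of the protein sequence, and the final TGA codon is the stop codon that accounts for the difference between 664 codons and 663 residues.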

Sequences:

>Translated_663_residues
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQRKPVQNLKDVLKEVPGVQLT
NEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDFDLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKI
GQKWSGTVTVDTTIQEHRDRGDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVENKNPGNSSPITSESNTVDGK
YTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSASQYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYL
VYNATDTVTVKGGWATAFKAPSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFNDEWKLSINYTYNDGRDVSNG
ENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKRADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKD
LSRDDYSYNEDGRRYFMAVDYRF
>Mature_663_residues
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQRKPVQNLKDVLKEVPGVQLT
NEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDFDLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKI
GQKWSGTVTVDTTIQEHRDRGDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVENKNPGNSSPITSESNTVDGK
YTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSASQYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYL
VYNATDTVTVKGGWATAFKAPSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFNDEWKLSINYTYNDGRDVSNG
ENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKRADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKD
LSRDDYSYNEDGRRYFMAVDYRF

Specific function: Not yet known. Postulated to participate in iron transport. Outer membrane receptor for colicins IA and IB.

COG id: COG4771

COG function: function code P; Outer membrane receptor for ferrienterochelin and colicins

Gene ontology:

Cell location: Cell outer membrane

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the tonB-dependent receptor family

Homologues:

Organism=Escherichia coli, GI1788478, Length=663, Percent_Identity=100, Blast_Score=1365, Evalue=0.0
Organism=Escherichia coli, GI1786798, Length=746, Percent_Identity=32.44, Blast_Score=268, Evalue=1e-72
Organism=Escherichia coli, GI1790405, Length=701, Percent_Identity=25.96, Blast_Score=163, Evalue=3e-41
Organism=Escherichia coli, GI1787723, Length=155, Percent_Identity=32.90, Blast_Score=69, Evalue=7e-13

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): CIRA_ECOLI (P17315)

Other databases:

- EMBL:   J04229
- EMBL:   U00007
- EMBL:   U00096
- EMBL:   AP009048
- EMBL:   M19295
- EMBL:   M89774
- PIR:   B64984
- RefSeq:   AP_002752.1
- RefSeq:   NP_416660.1
- PDB:   2HDF
- PDB:   2HDI
- PDBsum:   2HDF
- PDBsum:   2HDI
- ProteinModelPortal:   P17315
- SMR:   P17315
- DIP:   DIP-9282N
- MINT:   MINT-4794244
- STRING:   P17315
- SWISS-2DPAGE:   P17315
- 2DBase-Ecoli:   P17315
- PRIDE:   P17315
- EnsemblBacteria:   EBESCT00000001612
- EnsemblBacteria:   EBESCT00000001613
- EnsemblBacteria:   EBESCT00000014536
- GeneID:   949042
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW2142
- KEGG:   eco:b2155
- EchoBASE:   EB0153
- EcoGene:   EG10155
- eggNOG:   COG4771
- GeneTree:   EBGT00070000031689
- HOGENOM:   HBG464298
- OMA:   HERIDKR
- ProtClustDB:   PRK10064
- BioCyc:   EcoCyc:EG10155-MONOMER
- Genevestigator:   P17315
- InterPro:   IPR012910
- InterPro:   IPR000531
- InterPro:   IPR010916
- InterPro:   IPR010917
- Gene3D:   G3DSA:2.170.130.10
- Gene3D:   G3DSA:2.40.170.20

Pfam domain/function: PF07715 Plug; PF00593 TonB_dep_Rec

EC number: NA

Molecular weight: Translated: 73896; Mature: 73896

Theoretical pI: Translated: 4.89; Mature: 4.89

Prosite motif: PS00430 TONB_DEPENDENT_REC_1; PS01156 TONB_DEPENDENT_REC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
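
The percentages above follow from residue counts over the 663-residue sequence. A hand count of the protein sequence gives 4 Cys and 7 Met. The sketch below (hypothetical function names, rounding to one decimal place assumed) shows the arithmetic and a helper for counting residues in a pasted sequence.

```python
def percent(count: int, total: int) -> float:
    """Share of `count` in `total`, as a percentage with one decimal."""
    return round(100 * count / total, 1)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues)
    return percent(count, len(protein))


# Counts taken by hand from the 663-residue sequence above.
print(percent(4, 663))       # 0.6  (%Cys)
print(percent(7, 663))       # 1.1  (%Met)
print(percent(4 + 7, 663))   # 1.7  (%Cys+Met)
```

Passing the full protein sequence to residue_percent with "C", "M", or "CM" should give the same three values.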

Secondary structure:

>Translated Secondary Structure
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQ
CCCCCHHHHHHHHHHHHHHCCEEEEECCCCCEEEEEHHHHHHHHHCCCCEEEEEEHHHHH
RKPVQNLKDVLKEVPGVQLTNEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDF
CCHHHHHHHHHHHCCCEEEECCCCCCCCCEEEECCCCEEEEEEECCEECCCCCEEEECCC
DLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKIGQKWSGTVTVDTTIQEHRDR
CEEEEECCCCHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHCC
GDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
CCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVE
EEECCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC
NKNPGNSSPITSESNTVDGKYTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSAS
CCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCHHCCEEEECCCCCCCCCCC
QYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYLVYNATDTVTVKGGWATAFKA
EEEEEEECCCEEECCEEHHCCCEECCCCCCCCCCCCCEEEEECCCCEEEEECCEEEEECC
PSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
CCEEEECCCCCCCCCCCEEEEECCCCCCCCCCCCEEEEEEEECCCHHHHCHHHHHHHHHH
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFN
CHHHHEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEEEECCCHHHEECCCEEEECCCC
DEWKLSINYTYNDGRDVSNGENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKR
CCEEEEEEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCEEEECEEEEEEEEECCCCC
ADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKDLSRDDYSYNEDGRRYFMAVD
CCCCCCCCCCCEEEECCCCEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE
YRF
ECC
>Mature Secondary Structure
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQ
CCCCCHHHHHHHHHHHHHHCCEEEEECCCCCEEEEEHHHHHHHHHCCCCEEEEEEHHHHH
RKPVQNLKDVLKEVPGVQLTNEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDF
CCHHHHHHHHHHHCCCEEEECCCCCCCCCEEEECCCCEEEEEEECCEECCCCCEEEECCC
DLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKIGQKWSGTVTVDTTIQEHRDR
CEEEEECCCCHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHCC
GDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
CCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVE
EEECCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHC
NKNPGNSSPITSESNTVDGKYTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSAS
CCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCHHCCEEEECCCCCCCCCCC
QYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYLVYNATDTVTVKGGWATAFKA
EEEEEEECCCEEECCEEHHCCCEECCCCCCCCCCCCCEEEEECCCCEEEEECCEEEEECC
PSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
CCEEEECCCCCCCCCCCEEEEECCCCCCCCCCCCEEEEEEEECCCHHHHCHHHHHHHHHH
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFN
CHHHHEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEEEECCCHHHEECCCEEEECCCC
DEWKLSINYTYNDGRDVSNGENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKR
CCEEEEEEEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCEEEECEEEEEEEEECCCCC
ADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKDLSRDDYSYNEDGRRYFMAVD
CCCCCCCCCCCEEEECCCCEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE
YRF
ECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 2644220; 9278503; 3316180; 1315732; 2160948