Definition | Escherichia coli IAI39 chromosome, complete genome. |
---|---|
Accession | NC_011750 |
Length | 5,132,068 |
The map label for this gene is spr
Identifier: 218700647
GI number: 218700647
Start: 2384134
End: 2384700
Strand: Direct
Name: spr
Synonym: ECIAI39_2316
Alternate gene names: 218700647
Gene position: 2384134-2384700 (Clockwise)
Preceding gene: 218700646
Following gene: 218700648
Centisome position: 46.46
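The coordinates above are enough to re-derive two of the listed values. The sketch below assumes 1-based inclusive coordinates, and assumes the centisome position is the start coordinate expressed as a percentage of the genome length (5,132,068). The record does not state that definition, but it reproduces the listed figure. The function names are illustrative only.

```python
GENOME_LENGTH = 5_132_068          # Length field of NC_011750
START, END = 2_384_134, 2_384_700  # Start and End fields for spr


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 567
print(round(centisome(START, GENOME_LENGTH), 2))  # 46.46
```

The computed length of 567 agrees with the `>567_bases` header of the gene sequence.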
GC content: 47.27
Gene sequence:
>567_bases ATGGTCAAATCTCAACCGATTTTGAGATATATCTTGCGCGGGATCCCCGCGATTGCAGTAGCGGTTCTGCTTTCTGCATG TAGTGCAAATAACACCGCAAAGAATATGCATCCTGAGACACGTGCAGTGGGTAGTGAAACATCATCACTGCAAGCTTCTC AGGATGAATTTGAAAACCTGGTTCGTAATGTCGACGTAAAATCGCGGATTATGGATCAGTATGCTGACTGGAAAGGCGTA CGCTATCGTCTGGGCGGCAGCACTAAAAAAGGTATCGATTGTTCTGGTTTCGTACAGCGTACATTCCGTGAGCAATTTGG CTTAGAACTTCCGCGTTCGACTTATGAACAGCAGGAAATGGGTAAATCTGTTTCCCGCAGTAATTTGCGTACGGGTGATT TAGTTCTGTTCCGTGCCGGTTCAACGGGACGCCATGTCGGTATTTATATCGGCAACAACCAGTTTGTCCATGCTTCCACC AGCAGTGGCGTTATTATTTCCAGCATGAATGAACCATACTGGAAGAAGCGTTACAACGAAGCACGCCGGGTTCTCAGCCG CAGCTAA
Upstream 100 bases:
>100_bases AAGGAATTGTTTCAACATGCCCAGGTAATTAGTCTCGTGTCGCTTGGCATTTTTTTATAACGATATTTGTCGTTAAGGAC TTCAAGGGAAAACAAACAAC
Downstream 100 bases:
>100_bases TAAACCGTTTGGATGCAATCCCTTGGCTATCCTGACGAGTTAACTGAAAGCACTGCTTAGGCAGTGCTTTTTTGTTTTCA TTCATCAGAGAAAATGATGT
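The GC content value listed above (47.27) is presumably the percentage of G and C bases in the 567-base gene sequence; if so, it would correspond to 268 G or C bases. A minimal sketch of that calculation follows. The function name is illustrative, and the input shown is only the first 12 bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between the 80-base chunks in this
    record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 12 bases of the gene sequence, used here only as a small input.
print(round(gc_content("ATGGTCAAATCT"), 2))  # 33.33
```

Passing the full gene sequence (with or without the embedded spaces) to `gc_content` allows the listed value to be checked directly.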
Product: putative outer membrane lipoprotein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 188; Mature: 188
Protein sequence:
>188_residues MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENLVRNVDVKSRIMDQYADWKGV RYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEMGKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHAST SSGVIISSMNEPYWKKRYNEARRVLSRS
Sequences:
>Translated_188_residues MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENLVRNVDVKSRIMDQYADWKGV RYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEMGKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHAST SSGVIISSMNEPYWKKRYNEARRVLSRS
>Mature_188_residues MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENLVRNVDVKSRIMDQYADWKGV RYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEMGKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHAST SSGVIISSMNEPYWKKRYNEARRVLSRS
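The relationship between the 567-base gene sequence and the 188-residue protein can be sketched as a codon-by-codon translation: 567 bases give 189 codons, the last of which (TAA) is a stop codon. The example below uses the standard genetic code and is an illustration, not the pipeline that produced this record. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between sequence chunks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence.
print(translate("ATGGTCAAATCTCAACCGATTTTG"))  # MVKSQPIL
```

The output matches the first eight residues of the protein sequence listed above.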
Specific function: Unknown
COG id: COG0791
COG function: function code M; Cell wall-associated hydrolases (invasion-associated proteins)
Gene ontology:
Cell location: Cell membrane; Lipid-anchor (Potential)
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the nlpC/p60 family
Homologues:
Organism=Escherichia coli, GI1788501, Length=188, Percent_Identity=100, Blast_Score=390, Evalue=1e-110
Organism=Escherichia coli, GI1788001, Length=127, Percent_Identity=48.0314960629921, Blast_Score=129, Evalue=1e-31
Organism=Escherichia coli, GI1787944, Length=122, Percent_Identity=36.0655737704918, Blast_Score=85, Evalue=4e-18
Organism=Escherichia coli, GI1786421, Length=114, Percent_Identity=34.2105263157895, Blast_Score=70, Evalue=9e-14
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): SPR_ECO57 (P0AFV6)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: C91012
- PIR: E85856
- RefSeq: NP_288758.1
- RefSeq: NP_311094.1
- ProteinModelPortal: P0AFV6
- SMR: P0AFV6
- EnsemblBacteria: EBESCT00000026997
- EnsemblBacteria: EBESCT00000059080
- GeneID: 916771
- GeneID: 957128
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z3434
- KEGG: ecs:ECs3067
- GeneTree: EBGT00050000009578
- HOGENOM: HBG362075
- OMA: GRHIGIY
- ProtClustDB: PRK10838
- BioCyc: ECOL83334:ECS3067-MONOMER
- InterPro: IPR000064
Pfam domain/function: PF00877 NLPC_P60
EC number: NA
Molecular weight: Translated: 21040; Mature: 21040
Theoretical pI: Translated: 10.48; Mature: 10.48
Prosite motif: PS51257 PROKAR_LIPOPROTEIN; PS00013 PROKAR_LIPOPROTEIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.7 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
3.7 %Cys+Met (Mature Protein)
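The Cys/Met percentages above can be reproduced by counting residues in the 188-residue protein sequence given in this record. The sketch below assumes the values are simple residue counts divided by sequence length and rounded to one decimal place; `residue_percent` is an illustrative helper name.

```python
# Protein sequence from this record (188 residues), split for readability.
PROTEIN = (
    "MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENL"
    "VRNVDVKSRIMDQYADWKGVRYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEM"
    "GKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHASTSSGVIISSMNEPYWKKRYNE"
    "ARRVLSRS"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(PROTEIN, residues):.1f} %{label}")
# 1.1 %Cys
# 2.7 %Met
# 3.7 %Cys+Met
```

These correspond to 2 cysteines and 5 methionines out of 188 residues.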
Secondary structure:
>Translated Secondary Structure
MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENL
CCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHCCCHHHHHCCCCHHCCCCHHHHHHH
VRNVDVKSRIMDQYADWKGVRYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEM
HHCCCHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH
GKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHASTSSGVIISSMNEPYWKKRYNE
HHHHHHCCCCCCCEEEEECCCCCCEEEEEECCCEEEEECCCCCEEEECCCCCHHHHHHHH
ARRVLSRS
HHHHHCCC
>Mature Secondary Structure
MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENL
CCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHCCCHHHHHCCCCHHCCCCHHHHHHH
VRNVDVKSRIMDQYADWKGVRYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEM
HHCCCHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH
GKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHASTSSGVIISSMNEPYWKKRYNE
HHHHHHCCCCCCCEEEEECCCCCCEEEEEECCCEEEEECCCCCEEEECCCCCHHHHHHHH
ARRVLSRS
HHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796