Definition | Escherichia coli IAI39 chromosome, complete genome. |
---|---|
Accession | NC_011750 |
Length | 5,132,068 |
The map label for this gene is ipaB [H]
Identifier: 218702498
GI number: 218702498
Start: 4420346
End: 4422127
Strand: Direct
Name: ipaB [H]
Synonym: ECIAI39_4254
Alternate gene names: 218702498
Gene position: 4420346-4422127 (Clockwise)
Preceding gene: 218702497
Following gene: 218702499
Centisome position: 86.13
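The centisome value appears to be the gene start coordinate expressed as a percentage of the chromosome length. The record does not state this formula, so treat it as an assumption; it does reproduce the reported 86.13 from the Start and Length fields. A minimal sketch (the function name `centisome_position` is illustrative only):

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the start coordinate as a percentage of the genome length.

    Assumption: centisome = 100 * start / genome_length. The record does
    not document its formula; this one matches the reported value.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start and Length taken from this record.
value = centisome_position(4420346, 5132068)
print(round(value, 2))  # 86.13
```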
GC content: 40.85
Gene sequence:
>1782_bases ATGTCGGCACCGATTACGGGACAAACGATCACGTTTGAACAAATCAGTGAAACATTGCGTACACAGTATAGCGATGCGGA AAAAAGACTGCAGGACAGCAGCAAAACGCAAGTTGATCCCATGCGGCTTAATAAAAACCCGAAATCCCTTGATAACGATA TTCGCGCTCGTCTTGAAAACAAACCGATGCTTGCTCCGCCTGAAATACAGGTTTCAGATTCAGACAACGCCACTACAGCA AAGACGAATGATGCCCGCCTGACAATGATTTTGGGTAATTTAACAGGCATTGCCGATCAGGATATTACAAAACGATTGCA TAATAACCTTGACAGCACCCTGCTTCGACACGAAATGGCACATAATAAGTTTCGTGAATTAAGTGACGCTTACTCCTCAT CATTAGATGATGCGCAAAAAGCCGACGATATCATGCATCAGGCCAATAATAATTACAATGCGGTCGATAAAAAGGTGCAA TCGCTGGAGAAAAAAGTCAACACATTGAATCAAGAGCTGTCACAACTGCAACCTAGCGATCCGCAATATAATAAAGTACT GACGCAAAAAAATGCGGCTGAAAAAACACTCACGCTGTCATTACAAAAAAAATCGTTAGCTGAGCAATCGTTAAATACAG CCATTATGGATGCTGATGCTGCGATCGGCCAAAGTATGGAAATTTTTGACGAAATTCAACAGCAAGAACAGATTAATAAC TTCACCACCAATATTTGCCTGACACAGGAAAACCAGAAAAATAGAAACGCGACAGCCACATTTATCCTTTTGATTACTTC AGTAATGGAAGTTATTGGTGATACCAATTGCGACTCTATTAAAAATCAATCTGAGGTAATGAAAGAGATTAACCATGTCA GGGAAAATAAACTCAATGAAACAGCGCGTAAATATACTACCACGACTAAAGTTTTAAAAATCGTTAATGAGTGTGTAACT GTAGTTACCTTTGCCGTAAGCGCAGTCTTAATTGTTGTTGGTCTTCTTGCTGCAGTTCCAAGTGGTGGTTCAAGTATTGC GGGTGCATTGGCACTTATTGGAGGAATAGCGGGCGCAGTAGTATTAGGTGTAGATATCACTTGTCAAATCGCTCTGGGCA CCACGGCTACCGGCTGGATTTTAGGGAAAGTTGTTGAAGGTCTTTCGGCTGCGATTAAAACAGTTGATCCCACTCTGCTC GCAATCACCGCGCTTCTGGACGTTATTGGTGTCGATCAAAATACGATTGAACTGGTTAAAAGTATTTACGCAAGTGCAGC CGCTTCTATTGTCATGGCGACAGTAATGATTGGTGCAGCTGTAATATGTTCTGTAGCAATAGGGGCCGTTGTTTCTGCAT TATCAAAAACAGCCGCCGAAGAAGTCACCAAAGAAATAACAAGCACTATAAAATCAACCATTGAATCAATTATTAATTCA GTTTCAAAAAATATTATAAAGGTATTAGACAGCGTATGCAGCGTGCTACAAACATCAGCCGTAGTGTTGAAGTTGATTGC CAAAATAAGTAATGGCCTGGAGAAAATAGGCTTACTCATCTGTGCAATAGCAACATCGACGATGAATTGTTTTGTTGCTG GAAACTCTGCCGACATGGCAATTTTACAACAGGACATGAGTAATCTATCAAAAACGCGTGAACAAATGCTTTCAGTATTG CAAAGGGTGGATAAAACCGTCGAACAAGAGGTAAGCCAGATGGTAAGAGTATTACAACACCGAACTGAAGCCTTAAAATT TGCTTCCCATTCTATCGTATAA
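The GC content field can in principle be recomputed from the gene sequence above as the percentage of G and C bases. The sketch below assumes that plain definition (the record does not say how ambiguous bases would be handled) and strips the spaces that break the sequence into blocks. The function name `gc_content` is illustrative only.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the block separators in this record) is ignored.
    Assumption: every remaining character counts toward the total length.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Toy input, not taken from the record: 2 of 6 bases are G or C.
print(round(gc_content("ATGC AT"), 2))  # 33.33
```

Passing the full 1782-base sequence (without the `>1782_bases` header) should give a value to compare against the reported 40.85.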
Upstream 100 bases:
>100_bases TTTCCCGTTGATGCCATTTTAGATGGGAATAAACATCTGCCATGAGTTAAATAAAAAAGAATTAATTACACCCGATGATA GCTTAAAAAGGAGGTGAATT
Downstream 100 bases:
>100_bases GAGGGTTTTAACATGGCCACCTATGAAATTAAAAATGCGACCAGCACATCTAATATTAACAGTCTTGATAAAACAATCTT TGATAGAGATTATAGTAAAA
Product: hypothetical protein
Products: NA
Alternate protein names: 62 kDa antigen [H]
Number of amino acids: Translated: 593; Mature: 592
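The residue counts follow from the gene coordinates: End minus Start plus one gives 1782 bases, or 594 codons, of which the last is the stop codon. The mature count is one lower because the initiator methionine is absent from the mature sequence listed in this record. A minimal sketch of that arithmetic (the function name `protein_lengths` is illustrative only, and the rule assumes a single uninterrupted coding region):

```python
def protein_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (bases, translated_residues, mature_residues) for a gene.

    Assumptions: inclusive 1-based coordinates, one terminal stop codon,
    and a mature protein that lacks only the initiator methionine.
    """
    bases = end - start + 1
    if bases % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    codons = bases // 3
    translated = codons - 1   # drop the stop codon
    mature = translated - 1   # drop the initiator methionine
    return bases, translated, mature


# Start and End taken from this record.
print(protein_lengths(4420346, 4422127))  # (1782, 593, 592)
```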
Protein sequence:
>593_residues MSAPITGQTITFEQISETLRTQYSDAEKRLQDSSKTQVDPMRLNKNPKSLDNDIRARLENKPMLAPPEIQVSDSDNATTA KTNDARLTMILGNLTGIADQDITKRLHNNLDSTLLRHEMAHNKFRELSDAYSSSLDDAQKADDIMHQANNNYNAVDKKVQ SLEKKVNTLNQELSQLQPSDPQYNKVLTQKNAAEKTLTLSLQKKSLAEQSLNTAIMDADAAIGQSMEIFDEIQQQEQINN FTTNICLTQENQKNRNATATFILLITSVMEVIGDTNCDSIKNQSEVMKEINHVRENKLNETARKYTTTTKVLKIVNECVT VVTFAVSAVLIVVGLLAAVPSGGSSIAGALALIGGIAGAVVLGVDITCQIALGTTATGWILGKVVEGLSAAIKTVDPTLL AITALLDVIGVDQNTIELVKSIYASAAASIVMATVMIGAAVICSVAIGAVVSALSKTAAEEVTKEITSTIKSTIESIINS VSKNIIKVLDSVCSVLQTSAVVLKLIAKISNGLEKIGLLICAIATSTMNCFVAGNSADMAILQQDMSNLSKTREQMLSVL QRVDKTVEQEVSQMVRVLQHRTEALKFASHSIV
Sequences:
>Translated_593_residues MSAPITGQTITFEQISETLRTQYSDAEKRLQDSSKTQVDPMRLNKNPKSLDNDIRARLENKPMLAPPEIQVSDSDNATTA KTNDARLTMILGNLTGIADQDITKRLHNNLDSTLLRHEMAHNKFRELSDAYSSSLDDAQKADDIMHQANNNYNAVDKKVQ SLEKKVNTLNQELSQLQPSDPQYNKVLTQKNAAEKTLTLSLQKKSLAEQSLNTAIMDADAAIGQSMEIFDEIQQQEQINN FTTNICLTQENQKNRNATATFILLITSVMEVIGDTNCDSIKNQSEVMKEINHVRENKLNETARKYTTTTKVLKIVNECVT VVTFAVSAVLIVVGLLAAVPSGGSSIAGALALIGGIAGAVVLGVDITCQIALGTTATGWILGKVVEGLSAAIKTVDPTLL AITALLDVIGVDQNTIELVKSIYASAAASIVMATVMIGAAVICSVAIGAVVSALSKTAAEEVTKEITSTIKSTIESIINS VSKNIIKVLDSVCSVLQTSAVVLKLIAKISNGLEKIGLLICAIATSTMNCFVAGNSADMAILQQDMSNLSKTREQMLSVL QRVDKTVEQEVSQMVRVLQHRTEALKFASHSIV >Mature_592_residues SAPITGQTITFEQISETLRTQYSDAEKRLQDSSKTQVDPMRLNKNPKSLDNDIRARLENKPMLAPPEIQVSDSDNATTAK TNDARLTMILGNLTGIADQDITKRLHNNLDSTLLRHEMAHNKFRELSDAYSSSLDDAQKADDIMHQANNNYNAVDKKVQS LEKKVNTLNQELSQLQPSDPQYNKVLTQKNAAEKTLTLSLQKKSLAEQSLNTAIMDADAAIGQSMEIFDEIQQQEQINNF TTNICLTQENQKNRNATATFILLITSVMEVIGDTNCDSIKNQSEVMKEINHVRENKLNETARKYTTTTKVLKIVNECVTV VTFAVSAVLIVVGLLAAVPSGGSSIAGALALIGGIAGAVVLGVDITCQIALGTTATGWILGKVVEGLSAAIKTVDPTLLA ITALLDVIGVDQNTIELVKSIYASAAASIVMATVMIGAAVICSVAIGAVVSALSKTAAEEVTKEITSTIKSTIESIINSV SKNIIKVLDSVCSVLQTSAVVLKLIAKISNGLEKIGLLICAIATSTMNCFVAGNSADMAILQQDMSNLSKTREQMLSVLQ RVDKTVEQEVSQMVRVLQHRTEALKFASHSIV
Specific function: Associated with the entry of the bacteria into colonic epithelial cells [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Secreted. Host cell membrane; Multi-pass membrane protein (Potential). Note=Secreted through a type-III system [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006972
- InterPro: IPR003895 [H]
Pfam domain/function: PF04888 SseC [H]
EC number: NA
Molecular weight: Translated: 63932; Mature: 63801
Theoretical pI: Translated: 5.56; Mature: 5.56
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.3% Cys (Translated Protein)
- 2.9% Met (Translated Protein)
- 4.2% Cys+Met (Translated Protein)
- 1.4% Cys (Mature Protein)
- 2.7% Met (Mature Protein)
- 4.1% Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSAPITGQTITFEQISETLRTQYSDAEKRLQDSSKTQVDPMRLNKNPKSLDNDIRARLEN CCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCHHHHHHHHHHHCC KPMLAPPEIQVSDSDNATTAKTNDARLTMILGNLTGIADQDITKRLHNNLDSTLLRHEMA CCCCCCCCEEECCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHH HNKFRELSDAYSSSLDDAQKADDIMHQANNNYNAVDKKVQSLEKKVNTLNQELSQLQPSD HHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC PQYNKVLTQKNAAEKTLTLSLQKKSLAEQSLNTAIMDADAAIGQSMEIFDEIQQQEQINN CHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHHHH FTTNICLTQENQKNRNATATFILLITSVMEVIGDTNCDSIKNQSEVMKEINHVRENKLNE HHHHEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHHHH TARKYTTTTKVLKIVNECVTVVTFAVSAVLIVVGLLAAVPSGGSSIAGALALIGGIAGAV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH VLGVDITCQIALGTTATGWILGKVVEGLSAAIKTVDPTLLAITALLDVIGVDQNTIELVK HHCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHH SIYASAAASIVMATVMIGAAVICSVAIGAVVSALSKTAAEEVTKEITSTIKSTIESIINS HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VSKNIIKVLDSVCSVLQTSAVVLKLIAKISNGLEKIGLLICAIATSTMNCFVAGNSADMA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEECCCHHHH ILQQDMSNLSKTREQMLSVLQRVDKTVEQEVSQMVRVLQHRTEALKFASHSIV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure SAPITGQTITFEQISETLRTQYSDAEKRLQDSSKTQVDPMRLNKNPKSLDNDIRARLEN CCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCHHHHHHHHHHHCC KPMLAPPEIQVSDSDNATTAKTNDARLTMILGNLTGIADQDITKRLHNNLDSTLLRHEMA CCCCCCCCEEECCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHH HNKFRELSDAYSSSLDDAQKADDIMHQANNNYNAVDKKVQSLEKKVNTLNQELSQLQPSD HHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC PQYNKVLTQKNAAEKTLTLSLQKKSLAEQSLNTAIMDADAAIGQSMEIFDEIQQQEQINN CHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHHHH FTTNICLTQENQKNRNATATFILLITSVMEVIGDTNCDSIKNQSEVMKEINHVRENKLNE HHHHEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHHHH TARKYTTTTKVLKIVNECVTVVTFAVSAVLIVVGLLAAVPSGGSSIAGALALIGGIAGAV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH 
VLGVDITCQIALGTTATGWILGKVVEGLSAAIKTVDPTLLAITALLDVIGVDQNTIELVK HHCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHH SIYASAAASIVMATVMIGAAVICSVAIGAVVSALSKTAAEEVTKEITSTIKSTIESIINS HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VSKNIIKVLDSVCSVLQTSAVVLKLIAKISNGLEKIGLLICAIATSTMNCFVAGNSADMA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEECCCHHHH ILQQDMSNLSKTREQMLSVLQRVDKTVEQEVSQMVRVLQHRTEALKFASHSIV HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1766387 [H]