Definition | Chromobacterium violaceum ATCC 12472 chromosome, complete genome. |
---|---|
Accession | NC_005085 |
Length | 4,751,080 |
The map label for this gene is sipA [H]
Identifier: 34498071
GI number: 34498071
Start: 2822273
End: 2824330
Strand: Reverse
Name: sipA [H]
Synonym: CV_2616
Alternate gene names: 34498071
Gene position: 2824330-2822273 (Counterclockwise)
Preceding gene: 34498072
Following gene: 34498070
Centisome position: 59.45
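The coordinates above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene's 5' end (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The record does not state either convention, but both reproduce the reported figures. The function names are illustrative only.

```python
GENOME_LENGTH = 4_751_080          # chromosome length from the record header
START, END = 2_822_273, 2_824_330  # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100 * position / genome_length, 2)


# Assumption: on the reverse strand the 5' end is the larger coordinate.
five_prime = END

print(gene_length(START, END))               # 2058
print(centisome(five_prime, GENOME_LENGTH))  # 59.45
```

Using the Start coordinate instead would give 59.40, which is why the 5' end convention is assumed here.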
GC content: 71.72
Gene sequence:
>2058_bases ATGCCCATCACAGGAGCTTCCCACGCCGCGATGCGCCCCTTGTCCGCGCCGCCGGACGGAACGGCCGGCCCCGCCAGTCC GCTGGGACGGCAATTGCGCGATGTGCGCGCCGACAGCGTCGAACCCGCGCGCCGCCATTCGCTGGCCGATCTGGCCAGCT ACCAGGCCGACTACGCCCGCCGCAGCGTGGCGACGCTGTTCGCCGCCTCGCCGCATGCCGATAAGCTGGGCGAGCTGTAC CAGGCCTCGCCCAATCGCTACGCCAAGCTGGAGATCGCCGAATTCGCCAAGGTCTACAGCCAGCTCAAGCGGCAGCCGGA TCTCGACCCCGCCGCAGGCAAGACGCTGGACGACCTGGCGCAGCAGTACGCCGCCCGAATCCTGAAGGATGGCTTGGGGG AGAAGTCCGCTTTCGGCCCCTGGACCCAGAGAACCGACAAGCACTATCAGCTGCGCAGCGGCCTGGAGCGCAAGCTGGCG GAGATCGCCAGCCGGCACTGCCAGGGCGATGCGCAAAAGCTGGGCAACGATTTCATGCGCGCGGAGGTGACCACCTTCAT CCTGTCCTGCGTAGAAACCCATCTGGGGCGACAGCTGGACGAGGCCACCAGCCGGCAGATCACCGGGCTGGTGGACAGCG CGGCGATGCAGGCCTTCGAAGATCTGCGCCAGCGCCGCGGCGACCTGATCGAGCAGCGCGGTTTCAGCGTGGGCACGCTG GCGCGCGACCTCGATACCGTGGCGGTGCTGCCGCAATTGCTGCGCAGCCTGCTGGAGGCCTTGCCGCCCGGTCCGGGCCA ACGGGCGCCGGAGGAGCCTGCGCGCGACGGTCCGGCCAGGCCGACGCCCAGCCCGGACCCCGGCCCGGCGGGGCCGGGAG AGGCGCAGCGGCCGCAGGAAATCCATTACCACATCGACAACAGCATCCACTGGAACGACAACAGCCAGGACAACCGGCGC TGGAACCGGCGCGGCGACACCTACCTGGGCGGCGCGCGGCAGGGCGACCGCCATCTCCGTTCGTCCGGTTTCCCCGCGTC GGGGCTGGCGGCGTCGCTGCGGACCGCGGCCAGCCAGGATCTGCCCAAGCCGCAGCAGGCGGTGTTGTCCGGGCCGGCCT CGCCTGCCGCTGGGCGCCATCCGCTGCTGAATGCCGTCGACCAGGTGGGACAGAGCCTGTCCGGGCTGGTGGACGCGGCC ATCGGAACCGCCGGCACCCGGCCATTGTCGCCGGGCGGGGCAGCCGAATCATCGGCTGTCAGGCTGGCGGGCCTCGGGAG CGACGGCCTGAGCGCGCTGCCGCCGCGGGACGATCCGGGCGCAAGCCTGCATGGCGCGGCCCGCGCCGTGGCAGACAGCC TGTCCGCCGTGCTGACGGCGGCGGACGGCGCGAAAATCACGAAAATCGCGACGCCGGCGCGGCTGGATGCCGCCGCGAGC GGCGAGGACGCGCAGCAGCCTGCTGCCGGCGAGGCGATCCCGGCCGACGCCGGCCAGCCGGGCGCAGGCGGCTTCGACGG CATCAGGTTCCGCGATGGCAATCTGTACATGCTGCCGACCCAGGCCTATCTGCGCGGCCTCGCGTCTCCGCGCAGGGCCG AGAACGAGCTGTTGCGGGCGGTGCGCGGCGCGCTGGAGCCGGCCGCCGCTCAGCCGATGGCGCAGCGGCGGGAATTCGAG GCGTTGCGCAACCGTATCCTGCCCAGCGACCGTTTCGACCAGGATAAGGTGCTGGACCGTTTCGCTTCCGGCGGCGAAGA CCGCTTGGCCGACGACGCGGCGCGGCTGCGGCAAGCGCTGGCCGGGCATCCCGGACTGGAGCGGCATCGCCAGGCCCTGC GCAGTTTCGCGCGCACCCTGATACGCGAGGCCAACCTGTTGCCCAGCGCCAAGCCGGCCAATCCGCTGGTGGCCGGGCTG 
CTGGACGCGCTGGGCATCCAGGCGGACGAGGAGGCGCGCCGCGCGCCGCTGCCGAGCAAACCATGCGGCGTGGTTCTGAC CACGGATGGCCTGCATGTCGATTCCAGTCGCGCCGGAGGCCGTCGCGGTGCATCGTGA
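A GC content figure such as the one reported above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied without its FASTA header line; `gc_content` is an illustrative name, and the example input is a toy string rather than the gene itself.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100 * gc / len(bases), 2)


# Toy input (not the gene above): 6 of 8 bases are G or C.
print(gc_content("ATGC CCGG"))  # 75.0
```

Splitting on whitespace first matters for this record, because the sequence is stored in space-separated blocks.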
Upstream 100 bases:
>100_bases CTGTTCGACAACCTGGTCAAGGTGCTCAGCAGCACCATCAGCAGCTGCCTGGAAACCGCCAAGTCCTTCCTGCAAATCTG AACGCACAGGAGCGCCACAC
Downstream 100 bases:
>100_bases TATCGAATCCATGGTCAGGGAGGTGATGGCGTACAGCCTTTCCGCCGATGCCGGGCGCTTGCAGCCTCAGGCCAGGCTGG TGGACGATTTGTACGCCGAC
Product: invasion protein
Products: NA
Alternate protein names: 70 kDa antigen [H]
Number of amino acids: Translated: 685; Mature: 684
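The two residue counts follow from the gene length. As a rough check, and assuming the final codon of the gene is a stop codon and that the mature form is the translated sequence without its initiator methionine (as the sequences in this record suggest), the arithmetic is:

```python
GENE_BASES = 2058  # from the ">2058_bases" header of the gene sequence


def residue_counts(gene_bases: int) -> tuple:
    """Return (translated, mature) residue counts for a coding sequence."""
    if gene_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = gene_bases // 3
    translated = codons - 1  # the last codon is assumed to be a stop codon
    mature = translated - 1  # the initiator Met is assumed to be removed
    return translated, mature


print(residue_counts(GENE_BASES))  # (685, 684)
```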
Protein sequence:
>685_residues MPITGASHAAMRPLSAPPDGTAGPASPLGRQLRDVRADSVEPARRHSLADLASYQADYARRSVATLFAASPHADKLGELY QASPNRYAKLEIAEFAKVYSQLKRQPDLDPAAGKTLDDLAQQYAARILKDGLGEKSAFGPWTQRTDKHYQLRSGLERKLA EIASRHCQGDAQKLGNDFMRAEVTTFILSCVETHLGRQLDEATSRQITGLVDSAAMQAFEDLRQRRGDLIEQRGFSVGTL ARDLDTVAVLPQLLRSLLEALPPGPGQRAPEEPARDGPARPTPSPDPGPAGPGEAQRPQEIHYHIDNSIHWNDNSQDNRR WNRRGDTYLGGARQGDRHLRSSGFPASGLAASLRTAASQDLPKPQQAVLSGPASPAAGRHPLLNAVDQVGQSLSGLVDAA IGTAGTRPLSPGGAAESSAVRLAGLGSDGLSALPPRDDPGASLHGAARAVADSLSAVLTAADGAKITKIATPARLDAAAS GEDAQQPAAGEAIPADAGQPGAGGFDGIRFRDGNLYMLPTQAYLRGLASPRRAENELLRAVRGALEPAAAQPMAQRREFE ALRNRILPSDRFDQDKVLDRFASGGEDRLADDAARLRQALAGHPGLERHRQALRSFARTLIREANLLPSAKPANPLVAGL LDALGIQADEEARRAPLPSKPCGVVLTTDGLHVDSSRAGGRRGAS
Sequences:
>Translated_685_residues MPITGASHAAMRPLSAPPDGTAGPASPLGRQLRDVRADSVEPARRHSLADLASYQADYARRSVATLFAASPHADKLGELY QASPNRYAKLEIAEFAKVYSQLKRQPDLDPAAGKTLDDLAQQYAARILKDGLGEKSAFGPWTQRTDKHYQLRSGLERKLA EIASRHCQGDAQKLGNDFMRAEVTTFILSCVETHLGRQLDEATSRQITGLVDSAAMQAFEDLRQRRGDLIEQRGFSVGTL ARDLDTVAVLPQLLRSLLEALPPGPGQRAPEEPARDGPARPTPSPDPGPAGPGEAQRPQEIHYHIDNSIHWNDNSQDNRR WNRRGDTYLGGARQGDRHLRSSGFPASGLAASLRTAASQDLPKPQQAVLSGPASPAAGRHPLLNAVDQVGQSLSGLVDAA IGTAGTRPLSPGGAAESSAVRLAGLGSDGLSALPPRDDPGASLHGAARAVADSLSAVLTAADGAKITKIATPARLDAAAS GEDAQQPAAGEAIPADAGQPGAGGFDGIRFRDGNLYMLPTQAYLRGLASPRRAENELLRAVRGALEPAAAQPMAQRREFE ALRNRILPSDRFDQDKVLDRFASGGEDRLADDAARLRQALAGHPGLERHRQALRSFARTLIREANLLPSAKPANPLVAGL LDALGIQADEEARRAPLPSKPCGVVLTTDGLHVDSSRAGGRRGAS >Mature_684_residues PITGASHAAMRPLSAPPDGTAGPASPLGRQLRDVRADSVEPARRHSLADLASYQADYARRSVATLFAASPHADKLGELYQ ASPNRYAKLEIAEFAKVYSQLKRQPDLDPAAGKTLDDLAQQYAARILKDGLGEKSAFGPWTQRTDKHYQLRSGLERKLAE IASRHCQGDAQKLGNDFMRAEVTTFILSCVETHLGRQLDEATSRQITGLVDSAAMQAFEDLRQRRGDLIEQRGFSVGTLA RDLDTVAVLPQLLRSLLEALPPGPGQRAPEEPARDGPARPTPSPDPGPAGPGEAQRPQEIHYHIDNSIHWNDNSQDNRRW NRRGDTYLGGARQGDRHLRSSGFPASGLAASLRTAASQDLPKPQQAVLSGPASPAAGRHPLLNAVDQVGQSLSGLVDAAI GTAGTRPLSPGGAAESSAVRLAGLGSDGLSALPPRDDPGASLHGAARAVADSLSAVLTAADGAKITKIATPARLDAAASG EDAQQPAAGEAIPADAGQPGAGGFDGIRFRDGNLYMLPTQAYLRGLASPRRAENELLRAVRGALEPAAAQPMAQRREFEA LRNRILPSDRFDQDKVLDRFASGGEDRLADDAARLRQALAGHPGLERHRQALRSFARTLIREANLLPSAKPANPLVAGLL DALGIQADEEARRAPLPSKPCGVVLTTDGLHVDSSRAGGRRGAS
Specific function: Rapidly associates with the first 265 amino acids of vinculin after bacteria-cell contact. This interaction is critical for efficient Shigella uptake. IpaA acts as a potent activator of vinculin and increases its ability to interact with F-actin. The compl
COG id: NA
COG function: NA
Gene ontology:
Cell location: Secreted. Note=Secreted through the specialized type-III secretion system Mxi/Spa from the bacterium through the cell cytosol [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sipA/ipaA family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR015138
- InterPro: IPR023225 [H]
Pfam domain/function: PF09052 SipA [H]
EC number: NA
Molecular weight: Translated: 72580; Mature: 72449
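The gap between the two molecular weights is consistent with the loss of a single methionine residue. The check below is a sketch that uses the standard average mass of a Met residue (about 131.19 Da), a value that does not come from this record.

```python
MW_TRANSLATED = 72580  # from the record
MW_MATURE = 72449      # from the record

# Average mass of a methionine residue in Da (standard value, not from the record).
MET_RESIDUE_MASS = 131.19

difference = MW_TRANSLATED - MW_MATURE
print(difference)                                # 131
print(abs(difference - MET_RESIDUE_MASS) < 0.5)  # True
```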
Theoretical pI: Translated: 8.74; Mature: 8.74
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
0.7 %Met (Mature Protein)
1.2 %Cys+Met (Mature Protein)
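These percentages can be reproduced by counting residues and rounding to one decimal place. The sketch below is illustrative: the helper names are hypothetical, and the counts of 3 Cys and 6 Met in the translated protein are an assumption taken from a manual count of the sequence above, not a value stated in the record.

```python
def percent(count: int, length: int) -> float:
    """Share of `count` residues in a protein of `length`, to one decimal."""
    return round(100 * count / length, 1)


def cys_met_content(protein: str) -> dict:
    """Cys, Met and combined percentages for a one-letter protein sequence."""
    residues = "".join(protein.split()).upper()
    cys = residues.count("C")
    met = residues.count("M")
    n = len(residues)
    return {
        "Cys": percent(cys, n),
        "Met": percent(met, n),
        "Cys+Met": percent(cys + met, n),
    }


# Toy sequence (not the protein above): 1 Cys and 1 Met in 10 residues.
print(cys_met_content("MKCAAAAAAA"))

# Assumed counts: 3 Cys and 6 Met in 685 residues; the mature form loses one Met.
print(percent(3, 685), percent(6, 685), percent(9, 685))  # 0.4 0.9 1.3
print(percent(3, 684), percent(5, 684), percent(8, 684))  # 0.4 0.7 1.2
```

Under these assumed counts, the combined mature value is 1.2 rather than 1.1 because it is rounded from the combined count (8 of 684), not summed from the two rounded values.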
Secondary structure:
>Translated Secondary Structure MPITGASHAAMRPLSAPPDGTAGPASPLGRQLRDVRADSVEPARRHSLADLASYQADYAR CCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH RSVATLFAASPHADKLGELYQASPNRYAKLEIAEFAKVYSQLKRQPDLDPAAGKTLDDLA HHHHHHHHCCCCHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH QQYAARILKDGLGEKSAFGPWTQRTDKHYQLRSGLERKLAEIASRHCQGDAQKLGNDFMR HHHHHHHHHHCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH AEVTTFILSCVETHLGRQLDEATSRQITGLVDSAAMQAFEDLRQRRGDLIEQRGFSVGTL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH ARDLDTVAVLPQLLRSLLEALPPGPGQRAPEEPARDGPARPTPSPDPGPAGPGEAQRPQE HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH IHYHIDNSIHWNDNSQDNRRWNRRGDTYLGGARQGDRHLRSSGFPASGLAASLRTAASQD HHEEECCEEECCCCCCHHHHHHCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHC LPKPQQAVLSGPASPAAGRHPLLNAVDQVGQSLSGLVDAAIGTAGTRPLSPGGAAESSAV CCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE RLAGLGSDGLSALPPRDDPGASLHGAARAVADSLSAVLTAADGAKITKIATPARLDAAAS EEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHCCCCC GEDAQQPAAGEAIPADAGQPGAGGFDGIRFRDGNLYMLPTQAYLRGLASPRRAENELLRA CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEECHHHHHHHHCCCHHHHHHHHHH VRGALEPAAAQPMAQRREFEALRNRILPSDRFDQDKVLDRFASGGEDRLADDAARLRQAL HHHHCCHHHCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHH AGHPGLERHRQALRSFARTLIREANLLPSAKPANPLVAGLLDALGIQADEEARRAPLPSK CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCC PCGVVLTTDGLHVDSSRAGGRRGAS CCCEEEECCCCEECCCCCCCCCCCC >Mature Secondary Structure PITGASHAAMRPLSAPPDGTAGPASPLGRQLRDVRADSVEPARRHSLADLASYQADYAR CCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH RSVATLFAASPHADKLGELYQASPNRYAKLEIAEFAKVYSQLKRQPDLDPAAGKTLDDLA HHHHHHHHCCCCHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH QQYAARILKDGLGEKSAFGPWTQRTDKHYQLRSGLERKLAEIASRHCQGDAQKLGNDFMR HHHHHHHHHHCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH AEVTTFILSCVETHLGRQLDEATSRQITGLVDSAAMQAFEDLRQRRGDLIEQRGFSVGTL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH 
ARDLDTVAVLPQLLRSLLEALPPGPGQRAPEEPARDGPARPTPSPDPGPAGPGEAQRPQE HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH IHYHIDNSIHWNDNSQDNRRWNRRGDTYLGGARQGDRHLRSSGFPASGLAASLRTAASQD HHEEECCEEECCCCCCHHHHHHCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHC LPKPQQAVLSGPASPAAGRHPLLNAVDQVGQSLSGLVDAAIGTAGTRPLSPGGAAESSAV CCCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE RLAGLGSDGLSALPPRDDPGASLHGAARAVADSLSAVLTAADGAKITKIATPARLDAAAS EEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHCCCCC GEDAQQPAAGEAIPADAGQPGAGGFDGIRFRDGNLYMLPTQAYLRGLASPRRAENELLRA CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEECHHHHHHHHCCCHHHHHHHHHH VRGALEPAAAQPMAQRREFEALRNRILPSDRFDQDKVLDRFASGGEDRLADDAARLRQAL HHHHCCHHHCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHH AGHPGLERHRQALRSFARTLIREANLLPSAKPANPLVAGLLDALGIQADEEARRAPLPSK CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCC PCGVVLTTDGLHVDSSRAGGRRGAS CCCEEEECCCCEECCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2183200; 11115111; 11292750; 12384590; 3057506; 9184218; 10545097 [H]