| Definition | Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 chromosome, complete genome. |
|---|---|
| Accession | NC_003197 |
| Length | 4,857,432 |
Click here to switch to the map view.
The map label for this gene is invF
Identifier: 16766205
GI number: 16766205
Start: 3043282
End: 3043932
Strand: Reverse
Name: invF
Synonym: STM2899
Alternate gene names: 16766205
Gene position: 3043932-3043282 (Counterclockwise)
Preceding gene: 16766209
Following gene: 16766204
Centisome position: 62.67
GC content: 45.93
Gene sequence:
>651_bases ATGTCATTTTCTGAAAGCCGACACAATGAAAATTGCCTGATTCAGGAAGGCGCGCTGCTTTTTTGCGAGCAGGCCGTTGT CGCACCAGTATCAGGAGACCTGGTTTTTCGACCGTTAAAAATTGAAGTACTCAGCAAATTACTGGCATTTATCGATGGCG CAGGATTAGTGGACACGACATATGCTGAATCCGATAAATGGGTTTTGCTGAGTCCTGAGTTTCGCGCTATTTGGCAAGAT CGTAAACGCTGCGAGTACTGGTTTTTGCAGCAAATTATTACGCCTTCTCCGGCCTTCAATAAGGTACTGGCGCTGTTACG AAAAAGCGAGAGTTACTGGTTGGTTGGCTATTTACTCGCTCAGTCAACCAGCGGCAACACGATGAGAATGCTGGGAGAAG ACTATGGCGTTTCTTATACCCATTTTCGTCGTTTGTGCAGCAGAGCGTTGGGCGGAAAAGCGAAGAGTGAATTACGAAAC TGGCGTATGGCGCAATCGCTGCTGAATAGTGTAGAAGGCCACGAGAACATCACCCAATTAGCCGTTAATCATGGTTACTC ATCGCCTTCACATTTTTCTAGTGAGATCAAAGAGCTGATCGGCGTTTCGCCGCGGAAATTATCAAATATTATTCAATTGG CAGACAAATGA
Upstream 100 bases:
>100_bases TATGCTAAATACGCAGGAAGTACTTAAAGAAGGAGAGAAGCGGAAAATCCGCAGCCCGGAAGCATGGTTTATACAGACGT GTTCCGCGCAAAAGCTGCAT
Downstream 100 bases:
>100_bases AGACACATATTCTTTTGGCCAGAGTGCTGGCATGTGCCGCGCTTGTTCTGGTTACACCTGGTTATTCTAGTGAAAAAATA CCTGTAACGGGAAGTGGGTT
Product: invasion regulatory protein
Products: NA
Alternate protein names: Transcriptional regulator invF
Number of amino acids: Translated: 216; Mature: 215
Protein sequence:
>216_residues MSFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQD RKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRN WRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK
Sequences:
>Translated_216_residues MSFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQD RKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRN WRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK >Mature_215_residues SFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDR KRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNW RMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK
Specific function: Transcriptional regulator required for the expression of several genes encoding type III secretion system SPI1 effector proteins. The interaction with sicA is necessary for the activation of sigDE (sopB pipC), sicAsipBCDA, and sopE
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): INVF_SALTI (P69342)
Other databases:
- EMBL: AL627276 - EMBL: AE014613 - RefSeq: NP_457293.1 - RefSeq: NP_806502.1 - ProteinModelPortal: P69342 - SMR: P69342 - PRIDE: P69342 - GeneID: 1068843 - GeneID: 1249328 - GenomeReviews: AE014613_GR - GenomeReviews: AL513382_GR - KEGG: stt:t2801 - KEGG: sty:STY3022 - HOGENOM: HBG391031 - OMA: SEGRHNE - ProtClustDB: PRK15340 - BioCyc: SENT209261:T2801-MONOMER - BioCyc: SENT220341:STY3022-MONOMER - GO: GO:0009405 - GO: GO:0006350 - InterPro: IPR009057 - InterPro: IPR012287 - InterPro: IPR018062 - InterPro: IPR018060 - Gene3D: G3DSA:1.10.10.60 - SMART: SM00342
Pfam domain/function: SSF46689 Homeodomain_like
EC number: NA
Molecular weight: Translated: 24389; Mature: 24258
Theoretical pI: Translated: 8.32; Mature: 8.32
Prosite motif: PS00041 HTH_ARAC_FAMILY_1; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.7 %Cys+Met (Translated Protein) 1.9 %Cys (Mature Protein) 1.4 %Met (Mature Protein) 3.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTT CCCCCCCCCCCEEEECCHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHHHHCCCCCEECC YAESDKWVLLSPEFRAIWQDRKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLA CCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEHHHHH QSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNWRMAQSLLNSVEGHENITQL HCCCCCCCEECCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCHHHHHHH AVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK HHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHCC >Mature Secondary Structure SFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTT CCCCCCCCCCEEEECCHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHHHHCCCCCEECC YAESDKWVLLSPEFRAIWQDRKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLA CCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEEHHHHH QSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNWRMAQSLLNSVEGHENITQL HCCCCCCCEECCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCHHHHHHH AVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK HHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11677608; 12644504