Definition | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 chromosome, complete genome. |
---|---|
Accession | NC_011094 |
Length | 4,709,075 |
The map label for this gene is invF [H]
Identifier: 194737921
GI number: 194737921
Start: 2950478
End: 2951227
Strand: Reverse
Name: invF [H]
Synonym: SeSA_A3052
Alternate gene names: 194737921
Gene position: 2951227-2950478 (Counterclockwise)
Preceding gene: 194737363
Following gene: 194736395
Centisome position: 62.67
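The coordinate fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the database's documented method: the function names are hypothetical, and it assumes the centisome value is the gene's first transcribed coordinate (the higher coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. That assumption reproduces the reported 62.67.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


# Values copied from this record.
GENOME_LENGTH = 4_709_075
START, END = 2_950_478, 2_951_227

length = gene_length(START, END)
# Assumption: for a reverse-strand gene the higher coordinate is used.
position = centisome(END, GENOME_LENGTH)

print(length, round(position, 2))
```

Under these assumptions the length agrees with the 750 bases listed for the gene sequence.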
GC content: 46.27
Gene sequence:
>750_bases ATGCTAAATACGCAGGAAGTACTTAAAGAAGGAGAGAAGCGGAAAATCCGCAGCCCGGAAGCATGGTTTATACAGACGTG TTCCGCGCAAAAGCTGCATATGTCATTTTCTGAAAGCCGACGCAATGAAAACTGCCTGATTCAGGAAGGCGCGCTGCTTT TTTGCGAGCAGGCCGTTGTCGCACCAGTATCAGGAGACCTGGTTTTTCGACCGTTAAAAATTGAAGTACTCAGCAAATTA CTGGCATTTATCGATGGCGCAGGATTAGTGGACACGACATATGCTGAATCCGATAAATGGGTTTTGCTGAGTCCTGAGTT TCGCGCTATTTGGCAAGATCGTAAACGCTGCGAGTACTGGTTTTTGCAGCAAATTATTACGCCTTCTCCGGCCTTCAATA AGGTACTGGCGCTGTTACGAAAAAGCGAGAGTTACTGGTTGGTTGGCTATTTACTCGCTCAGTCAACCAGCGGCAACACG ATGAGAATGCTGGGAGAAGACTATGGCGTTTCTTATACCCATTTTCGTCGTTTGTGCAGCAGAGCGTTGGGCGGAAAAGC GAAGAGTGAATTACGAAACTGGCGTATGGCGCAATCGCTGCTGAATAGTGTAGAAGGCCACGAGAACATCACCCAATTAG CCGTTAATCATGGTTACTCATCGCCTTCACATTTTTCTAGTGAGATCAAAGAGCTGATCGGCGTTTCGCCTCGGAAATTA TCAAATATTATTCAATTGGCAGACAAATGA
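The GC content field can in principle be recomputed from the gene sequence. The sketch below is an illustration only: `gc_content` is a hypothetical helper, and it assumes GC content means the percentage of G and C bases among all bases, with the spaces that break the sequence into blocks ignored.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, because sequences in this record are
    broken into space-separated blocks.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not taken from the record.
print(round(gc_content("ATGC ATGC"), 2))
```

Pasting the 750-base gene sequence into `gc_content` gives a value to compare with the 46.27 reported in this record.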
Upstream 100 bases:
>100_bases AGCGGGTGGCATCAGTTTCATAATGATTGCATCAGGATTTTGCCACTCGCTCCCGGTATTGTTTACATATTAAAATGATT TTTAACTGGTGCTGACAACT
Downstream 100 bases:
>100_bases AGACACATATTCTTTTGGCCAGAGTGCTGGCATGTGCCGCGCTTGTTCTGGTTGCACCTGGTTATTCTAGTGAAAAAATA CCTGTAACGGGAAGTGGGTT
Product: invasion protein
Products: NA
Alternate protein names: Transcriptional regulator invF [H]
Number of amino acids: Translated: 249; Mature: 249
Protein sequence:
>249_residues MLNTQEVLKEGEKRKIRSPEAWFIQTCSAQKLHMSFSESRRNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKL LAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDRKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNT MRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNWRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKL SNIIQLADK
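The relationship between the 750-base gene sequence and the 249-residue protein (250 codons, the last being a stop codon) can be sketched with a small translator. This is an illustrative sketch assuming the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino = CODON_TABLE[bases[i:i + 3]]
        if amino == "*":  # stop codon: end of the protein
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCTAAATACGCAGGAAGTACTTAAAGAA"))
print(translate("GCAGACAAATGA"))
```

The two fragments translate to the start (`MLNTQEVLKE`) and end (`ADK`) of the protein sequence listed above.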
Sequences:
>Translated_249_residues MLNTQEVLKEGEKRKIRSPEAWFIQTCSAQKLHMSFSESRRNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKL LAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDRKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNT MRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNWRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKL SNIIQLADK
>Mature_249_residues MLNTQEVLKEGEKRKIRSPEAWFIQTCSAQKLHMSFSESRRNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKL LAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDRKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNT MRMLGEDYGVSYTHFRRLCSRALGGKAKSELRNWRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKL SNIIQLADK
Specific function: Transcriptional regulator required for the expression of several genes encoding type III secretion system SPI1 effector proteins. The interaction with sicA is necessary for the activation of sigDE (sopB pipC), sicAsipBCDA, and sopE [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR018060 [H]
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 28291; Mature: 28291
Theoretical pI: Translated: 8.94; Mature: 8.94
Prosite motif: PS00041 HTH_ARAC_FAMILY_1; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
4.0 %Cys+Met (Translated Protein)
2.0 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
4.0 %Cys+Met (Mature Protein)
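The Cys/Met percentages are simple residue frequencies. As a rough sketch (the helper name `cys_met_content` is hypothetical, and rounding to one decimal place is an assumption based on how the values are displayed here), they could be recomputed from a protein sequence like this:

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Percentage of Cys (C), Met (M) and Cys+Met residues in a protein."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    cys = residues.count("C")
    met = residues.count("M")
    total = len(residues)
    return {
        "Cys": round(100.0 * cys / total, 1),
        "Met": round(100.0 * met / total, 1),
        "Cys+Met": round(100.0 * (cys + met) / total, 1),
    }


# Toy input, not taken from the record: 1 Cys and 1 Met in 8 residues.
print(cys_met_content("MKLCAAGG"))
```

For a 249-residue protein, a value of 2.0% corresponds to about 5 residues.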
Secondary structure:
>Translated Secondary Structure
MLNTQEVLKEGEKRKIRSPEAWFIQTCSAQKLHMSFSESRRNENCLIQEGALLFCEQAVV
CCCHHHHHHCCHHHCCCCCCHHEEECCCCHHHHHHHHHHCCCCCCEEECCHHHHHHHHHH
APVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDRKRCEYW
CCCCCCEEECHHHHHHHHHHHHHHCCCCCEECCCCCCCCEEEECCHHHHHHHHHHHHHHH
FLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCS
HHHHHHCCCHHHHHHHHHHHCCCCEEEEEHHHHHCCCCCCCEECCCCCCCCHHHHHHHHH
RALGGKAKSELRNWRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKL
HHHCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHH
SNIIQLADK
HHHHHHHCC
>Mature Secondary Structure
MLNTQEVLKEGEKRKIRSPEAWFIQTCSAQKLHMSFSESRRNENCLIQEGALLFCEQAVV
CCCHHHHHHCCHHHCCCCCCHHEEECCCCHHHHHHHHHHCCCCCCEEECCHHHHHHHHHH
APVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQDRKRCEYW
CCCCCCEEECHHHHHHHHHHHHHHCCCCCEECCCCCCCCEEEECCHHHHHHHHHHHHHHH
FLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCS
HHHHHHCCCHHHHHHHHHHHCCCCEEEEEHHHHHCCCCCCCEECCCCCCCCHHHHHHHHH
RALGGKAKSELRNWRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKL
HHHCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHH
SNIIQLADK
HHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11677608; 12644504 [H]