Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 bp |
The map label for this gene is yhiX
Identifier: 30065159
GI number: 30065159
Start: 4077096
End: 4077920
Strand: Reverse
Name: yhiX
Synonym: S4172
Alternate gene names: 30065159
Gene position: 4077920-4077096 (Counterclockwise)
Preceding gene: 30065160
Following gene: 30065158
Centisome position: 88.66
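The gene length and centisome position can be cross-checked from the coordinates listed above. The following is a minimal sketch, not the database's own method: it assumes that the centisome position is the gene's 5' coordinate (the higher coordinate, 4077920, for this reverse-strand gene) expressed as a percentage of the genome length. The record does not state its formula, and the function and variable names are illustrative.

```python
# Values copied from the record; names are illustrative.
GENOME_LENGTH = 4_599_354            # "Length" of NC_004741
START, END = 4_077_096, 4_077_920    # "Start" and "End" of yhiX


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature spanning start..end (1-based coordinates)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length_bp = gene_length(START, END)
# Assumption: the reverse-strand gene is anchored at its higher coordinate.
centisome_pos = round(centisome(END, GENOME_LENGTH), 2)
print(length_bp, centisome_pos)
```

Under these assumptions the sketch gives 825 bases and 88.66, in agreement with the record. An 825-base coding sequence corresponds to 274 codons plus one stop codon.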
GC content: 41.21
Gene sequence:
>825_bases ATGCAATCATTACATGGGAATTGTCTAATTGCGTACGCAAGACATAAATATATTCTCACCATGGTTAATGGTGAATATCG CTATTTTAATGGCGGTGACCTGGTTTTTGCGGATGCAAGCCAAATTCGAGTAGATAAGTGTGTTGAAAATTTTGTATTAG TATCAAGGGATACGCTTTCATTATTTCTGCCGATGCTCAAGGAGGAGGCATTAAATCTTCATGCACATAAAAAAATTTCT TCATTACTCGTTCATCACTGTAGCAGAGATATTCCTGTTTTTCAGGAAGTTGCGCAACTATCGCAGAATAAGAATCTTCG CTATGCAGAAATGCTACGTAAAAGAGCATTAATCTTTGCGTTGTTATCTGTTTTTCTTGAGGATGAGCACTTTATACCGC TGCTTTTGAACGTTTTGCAACCGAACATGCGAACACGAGTTTGTACGGTTATCAATAATAATATCGCCCATGAGTGGACA CTAGCCCGAATCGCCAGCGAGCTGTTGATGAGTCCAAGCCTGTTAAAGAAAAAATTGCGCGAAGAAGAGACATCATATTC ACAGTTGCTTACTGAGTGTAGAATGCAACGTGCTTTGCAACTTATTGTTATACATGGCTTTTCAATTAAGCGAGTCGCAG TGTCCTGTGGATATCACAGCGTGTCGTATTTCATTTACGTCTTTCGAAATTATTATGGGATGACGCCCACAGAGTATCAG GAGCGATCGGCGCAGGGATTGCCGAACCGTGACTCGGCGGCAAGTATTGTTGCGCAAGGGAATTTTTACGGCACTAACCG TTCTGCGGAAGGAATAAGATTATAG
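The GC content listed above is a simple base-composition statistic. As a hedged sketch (the record does not say how its value was derived), the hypothetical helper below computes the percentage of G and C over the A/C/G/T characters of a sequence and ignores the spaces that separate the 80-base blocks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so spaces and line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, with a space to mimic block separators.
example = gc_content("ATGCAATCAT TACATGGGAA")
print(example)
```

If the listed 41.21 was computed over the 825-base gene sequence, it corresponds to 340 G or C bases.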
Upstream 100 bases:
>100_bases CTCCCGCTTCGTTTAAATTTATTTATCAATCAATTTGACTTAAGAGGGCGGCGTGCTACATTAATTAACAGTAATATGTT TATGTAATATTAAGTCAACT
Downstream 100 bases:
>100_bases AGTTTTACTCAGACATAAAAAAAACCCGGCATAGGGGACCGGGAAGAGGATAGTCTGCCGTCTCCAGACTAATAAACCGT TATAACACTCCCTGTTGGCA
Product: DNA-binding transcriptional regulator GadX
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 274; Mature: 274
Protein sequence:
>274_residues MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLSLFLPMLKEEALNLHAHKKIS SLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFALLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWT LARIASELLMSPSLLKKKLREEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL
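The protein sequence follows from the gene sequence, which is given on the coding strand (it begins with ATG and ends with the stop codon TAG). The sketch below is a minimal illustration that assumes the standard codon assignments, which bacterial translation table 11 shares; `translate` and the table names are hypothetical helpers, not part of the record.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove block separators
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first = translate("ATGCAATCATTACATGGGAATTGTCTAATTGCG")  # first 33 bases of the gene
last = translate("GGAATAAGATTATAG")                      # last 15 bases of the gene
print(first, last)
```

The two fragments translate to MQSLHGNCLIA and GIRL, which match the start and end of the 274-residue sequence above.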
Sequences:
>Translated_274_residues MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLSLFLPMLKEEALNLHAHKKIS SLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFALLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWT LARIASELLMSPSLLKKKLREEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL
>Mature_274_residues MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLSLFLPMLKEEALNLHAHKKIS SLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFALLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWT LARIASELLMSPSLLKKKLREEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL
Specific function: Positively regulates the expression of about fifteen genes involved in acid resistance such as gadA, gadB and gadC. Depending on the conditions (growth phase and medium), can repress gadW. Negatively regulates perA expression in acidic conditions and posi
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789933, Length=274, Percent_Identity=98.1751824817518, Blast_Score=559, Evalue=1e-161
Organism=Escherichia coli, GI1790557, Length=147, Percent_Identity=44.8979591836735, Blast_Score=131, Evalue=4e-32
Organism=Escherichia coli, GI1786778, Length=194, Percent_Identity=39.6907216494845, Blast_Score=129, Evalue=2e-31
Organism=Escherichia coli, GI1787776, Length=136, Percent_Identity=43.3823529411765, Blast_Score=103, Evalue=9e-24
Organism=Escherichia coli, GI1786776, Length=132, Percent_Identity=40.1515151515151, Blast_Score=97, Evalue=2e-21
Organism=Escherichia coli, GI1789932, Length=128, Percent_Identity=38.28125, Blast_Score=91, Evalue=7e-20
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 31453; Mature: 31453
Theoretical pI: Translated: 9.11; Mature: 9.11
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.2 %Cys (Translated Protein)
2.9 %Met (Translated Protein)
5.1 %Cys+Met (Translated Protein)
2.2 %Cys (Mature Protein)
2.9 %Met (Mature Protein)
5.1 %Cys+Met (Mature Protein)
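The Cys/Met percentages can be reproduced by counting residues in the 274-residue protein sequence given earlier in this record. This is a small sketch under the assumption that each value is the residue count divided by the total length, rounded to one decimal place; `percent` is a hypothetical helper name.

```python
# Protein sequence copied from the record, split into its 80-residue blocks.
PROTEIN = (
    "MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLSLFLPMLKEEALNLHAHKKIS"
    "SLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFALLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWT"
    "LARIASELLMSPSLLKKKLREEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ"
    "ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL"
)


def percent(residues: str, seq: str) -> float:
    """Percentage of seq made up of any of the one-letter codes in residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


cys = percent("C", PROTEIN)
met = percent("M", PROTEIN)
cys_met = percent("CM", PROTEIN)
print(cys, met, cys_met)
```

The sequence contains 6 Cys and 8 Met residues out of 274, giving 2.2, 2.9 and 5.1 percent, consistent with the listed values. The translated and mature sequences are identical, so both sets of figures coincide.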
Secondary structure:
>Translated Secondary Structure
MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLS
CCCCCCCEEEEEECCEEEEEEECCCEEEECCCCEEEECHHHHHHHHHHHHHHEECCHHHH
LFLPMLKEEALNLHAHKKISSLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFA
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
LLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWTLARIASELLMSPSLLKKKLR
HHHHHHCCCCHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHH
EEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ
HHHHHHHHHHHHHHHHHHHHHHHEECCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHH
ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL
HHHHCCCCCCCCHHHEEECCCEECCCCCCCCCCC
>Mature Secondary Structure
MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVLVSRDTLS
CCCCCCCEEEEEECCEEEEEEECCCEEEECCCCEEEECHHHHHHHHHHHHHHEECCHHHH
LFLPMLKEEALNLHAHKKISSLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFA
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
LLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWTLARIASELLMSPSLLKKKLR
HHHHHHCCCCHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHH
EEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ
HHHHHHHHHHHHHHHHHHHHHHHEECCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHH
ERSAQGLPNRDSAASIVAQGNFYGTNRSAEGIRL
HHHHCCCCCCCCHHHEEECCCEECCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA