Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

Click here to switch to the map view.

The map label for this gene is celD [H]

Identifier: 29141659

GI number: 29141659

Start: 1261853

End: 1262695

Strand: Direct

Name: celD [H]

Synonym: t1194

Alternate gene names: 29141659

Gene position: 1261853-1262695 (Clockwise)

Preceding gene: 29141658

Following gene: 29141660

Centisome position: 26.33

GC content: 43.18

Gene sequence:

>843_bases
ATGATGCAGCTACAGGTTAACGCAACGGAAATCAAAACGGTTTATGAGCAGCAGCTCTTCAATGGCAAAAATTTCCATGT
GTTTATCTATAACAAGACGGAAAGCGTCACCGGGCTGCATCAGCACGACTATTATGAATTTACCCTGGTATTAACCGGAC
GTTATTACCAGGAGATTAACGGGAAGCGTGTGCTGCTGGAACGTGGAGATTTTGTTTTTATCCCGGTGGGGTCGAATCAC
CAAAGTTTCTATGAGTTTGGCGCAACGCGTATTCTGAACGTAGGAATTAGTAAACGTTTTTTTGAACAGCATTATCTTCC
GTTACTGCCGTTTTGTTTTGTGGCATCGCAGGTATACAGGGCGAATAGCACCTTCCTTACCTATATTGAAACGGTGATTG
CGTCGTTGAATTTTCGCGGCAATGGGCTCGACGAATTTATCGAAGTGGTGACATTTTATATTATAAACCGCTTACGTCAT
TACCGCGAAGAGCAGGTTTATGATGATATTCCACAATGGCTGAAAGCAACCGTGGAGATAATGCATGATAAAACGCAGTT
TGGCGAACATGCGCTGGAAAATATGGTGCGCCTGTCGGCAAAATCCCAGGAGTATCTGACGCGCGCCACCCGACGTTATT
ACAGCAAAACGCCGATGCAAATCATTAATGAGATTCGCATTAATTTTGCCAAGAAACAGTTGGAAATGACCAACTATTCG
GTTACGGATATTGCTTATGAAGCCGGGTACAGTAGTCCAAGTCTGTTTATTAAAACGTTTAAGAAGATGACCTCATTCAC
GCCGAATAGTTATCGGAAGCGATTGACGGAAATTAATGAGTAA

Upstream 100 bases:

>100_bases
CTGGTTCACGCTCAGGATCATTTGATGGCCTCAATGCTGGCTCGTGAACTGATCGCTGAGTTGATTGAGCTCCATGAGAA
ACTGAAATAACAGGGGGCGG

Downstream 100 bases:

>100_bases
CGGTTTACATTGCTCGATTAGTTACATTTCACGCTTGCCCCAATAGCGCGGGGAACGTCACGCTATTTCTTACGAGCGGG
CATAGCGTAATACATATCAG

Product: DNA-binding transcriptional regulator ChbR

Products: NA

Alternate protein names: Chb operon repressor [H]

Number of amino acids: Translated: 280; Mature: 280

Protein sequence:

>280_residues
MMQLQVNATEIKTVYEQQLFNGKNFHVFIYNKTESVTGLHQHDYYEFTLVLTGRYYQEINGKRVLLERGDFVFIPVGSNH
QSFYEFGATRILNVGISKRFFEQHYLPLLPFCFVASQVYRANSTFLTYIETVIASLNFRGNGLDEFIEVVTFYIINRLRH
YREEQVYDDIPQWLKATVEIMHDKTQFGEHALENMVRLSAKSQEYLTRATRRYYSKTPMQIINEIRINFAKKQLEMTNYS
VTDIAYEAGYSSPSLFIKTFKKMTSFTPNSYRKRLTEINE

Sequences:

>Translated_280_residues
MMQLQVNATEIKTVYEQQLFNGKNFHVFIYNKTESVTGLHQHDYYEFTLVLTGRYYQEINGKRVLLERGDFVFIPVGSNH
QSFYEFGATRILNVGISKRFFEQHYLPLLPFCFVASQVYRANSTFLTYIETVIASLNFRGNGLDEFIEVVTFYIINRLRH
YREEQVYDDIPQWLKATVEIMHDKTQFGEHALENMVRLSAKSQEYLTRATRRYYSKTPMQIINEIRINFAKKQLEMTNYS
VTDIAYEAGYSSPSLFIKTFKKMTSFTPNSYRKRLTEINE
>Mature_280_residues
MMQLQVNATEIKTVYEQQLFNGKNFHVFIYNKTESVTGLHQHDYYEFTLVLTGRYYQEINGKRVLLERGDFVFIPVGSNH
QSFYEFGATRILNVGISKRFFEQHYLPLLPFCFVASQVYRANSTFLTYIETVIASLNFRGNGLDEFIEVVTFYIINRLRH
YREEQVYDDIPQWLKATVEIMHDKTQFGEHALENMVRLSAKSQEYLTRATRRYYSKTPMQIINEIRINFAKKQLEMTNYS
VTDIAYEAGYSSPSLFIKTFKKMTSFTPNSYRKRLTEINE

Specific function: Dual-function repressor/activator of the chbBCARFG operon. In the absence of the inducing sugar chitobiose, together with NagC, represses the chbBCARFG operon for the uptake and metabolism of chitobiose. In association with Crp, and probably in the presen

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1788030, Length=280, Percent_Identity=85.7142857142857, Blast_Score=509, Evalue=1e-146,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013096
- InterPro:   IPR011051
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR018062
- InterPro:   IPR020449
- InterPro:   IPR018060
- InterPro:   IPR014710 [H]

Pfam domain/function: PF07883 Cupin_2; PF00165 HTH_AraC [H]

EC number: NA

Molecular weight: Translated: 33135; Mature: 33135

Theoretical pI: Translated: 8.96; Mature: 8.96

Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MMQLQVNATEIKTVYEQQLFNGKNFHVFIYNKTESVTGLHQHDYYEFTLVLTGRYYQEIN
CEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEEEEEECHHHHHCC
GKRVLLERGDFVFIPVGSNHQSFYEFGATRILNVGISKRFFEQHYLPLLPFCFVASQVYR
CCEEEEECCCEEEEEECCCCHHHHHHHHHHEECCCHHHHHHHHHCHHHHHHHHHHHHHHH
ANSTFLTYIETVIASLNFRGNGLDEFIEVVTFYIINRLRHYREEQVYDDIPQWLKATVEI
CCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MHDKTQFGEHALENMVRLSAKSQEYLTRATRRYYSKTPMQIINEIRINFAKKQLEMTNYS
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
VTDIAYEAGYSSPSLFIKTFKKMTSFTPNSYRKRLTEINE
HHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCC
>Mature Secondary Structure
MMQLQVNATEIKTVYEQQLFNGKNFHVFIYNKTESVTGLHQHDYYEFTLVLTGRYYQEIN
CEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEEEEEECHHHHHCC
GKRVLLERGDFVFIPVGSNHQSFYEFGATRILNVGISKRFFEQHYLPLLPFCFVASQVYR
CCEEEEECCCEEEEEECCCCHHHHHHHHHHEECCCHHHHHHHHHCHHHHHHHHHHHHHHH
ANSTFLTYIETVIASLNFRGNGLDEFIEVVTFYIINRLRHYREEQVYDDIPQWLKATVEI
CCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
MHDKTQFGEHALENMVRLSAKSQEYLTRATRRYYSKTPMQIINEIRINFAKKQLEMTNYS
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCC
VTDIAYEAGYSSPSLFIKTFKKMTSFTPNSYRKRLTEINE
HHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2179047; 9097039; 9278503; 9405618 [H]