Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is hisB
Identifier: 159184137
GI number: 159184137
Start: 43805
End: 44401
Strand: Reverse
Name: hisB
Synonym: Atu0043
Alternate gene names: 159184137
Gene position: 44401-43805 (Counterclockwise)
Preceding gene: 159184139
Following gene: 15887400
Centisome position: 1.56
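The coordinate fields above can be cross-checked against each other. The following is a minimal Python sketch, not the database's own method: it assumes inclusive 1-based coordinates, and it assumes that the centisome value is a position expressed as a percentage of the chromosome length. The record does not state the formula; the reported 1.56 is consistent with using the first listed coordinate of this reverse-strand gene (44401). The function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 2_841_580    # "Length" field of NC_003062
START, END = 43_805, 44_401  # "Start" and "End" fields of this record

print(gene_length(START, END))                  # 597
print(round(centisome(END, GENOME_LENGTH), 2))  # 1.56
```

The computed length of 597 bases matches the gene sequence header, and 597 / 3 = 199 codons corresponds to 198 residues plus a stop codon.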
GC content: 58.63
Gene sequence:
>597_bases
ATGGCAGAACGTAAAGGCGAGATTATCCGCAAGACCAACGAAACCTCGGTTTCGGTGCGCGTGGATATCGACGGCACCGG
CAAATCGAAAATTTCCACCGGCGTGGGATTTTTCGACCATATGCTGGACCAGCTCAGCCGCCATTCGCTGATCGACATGG
ACATCGAAGTTCAGGGCGATCTGCATATCGATGACCACCACACCGTGGAAGACACCGGCATCGCCATCGGCCAGGCAATT
GCCAAGGCGCTGGGCGACCGGCGTGGCATTACCCGTTACGCCTCGCTCGATCTGGCCATGGACGAAACGATGACCAAGGC
TGCCGTCGATATTTCCGGCAGGCCGTTCCTTGTGTGGAACGTCAATTTCTCCGCGCCGAAGATCGGCACCTTCGATACCG
AACTGGTGCGCGAGTTCTTTCAGGCACTCGCCCAGAATGCCGGCATTACGCTGCATATCCTGAACCATTACGGCGCCAAC
AACCACCATATCGCCGAAACGTGCTTCAAGGCCGTTGCCCGCGTTCTTCGCACGGCAACCGAGATCGACCCCCGGCAGGC
AGGCCGCGTTCCCTCGACCAAGGGTATGCTGGCCTGA
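The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name `gc_content` is illustrative, and whether the database counts ambiguous bases the same way is an assumption.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (for example FASTA line breaks) is ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence, used only as a small illustration.
print(gc_content("ATGGCAGAACGT"))  # 50.0
```

Passing the full 597-base sequence to `gc_content` gives a value that can be compared with the GC content field of this record.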
Upstream 100 bases:
>100_bases
CGACGCACCGTTTTCGGCGCGTCACGCAAGACGATGGAACAACTGGATATTTGTGTCCCGTGCTGATAAGGAACGGCAAC
ATTTTCACGGAGCGGCCAAA
Downstream 100 bases:
>100_bases
CGCCAAAGCTGTCAGGAAGCCCATGACAAGCTATCTCGTTCTGGAAGCCCCCAACGGGCCGGACAGGGACCACAAGACGA
CCCGCTTCATTGCAGACCGT
Product: imidazoleglycerol-phosphate dehydratase
Products: NA
Alternate protein names: IGPD
Number of amino acids: Translated: 198; Mature: 197
Protein sequence:
>198_residues
MAERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGDLHIDDHHTVEDTGIAIGQAI
AKALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWNVNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGAN
NHHIAETCFKAVARVLRTATEIDPRQAGRVPSTKGMLA
Sequences:
>Translated_198_residues
MAERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGDLHIDDHHTVEDTGIAIGQAI
AKALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWNVNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGAN
NHHIAETCFKAVARVLRTATEIDPRQAGRVPSTKGMLA
>Mature_197_residues
AERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGDLHIDDHHTVEDTGIAIGQAIA
KALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWNVNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGANN
HHIAETCFKAVARVLRTATEIDPRQAGRVPSTKGMLA
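The relationship between the gene sequence, the translated protein, and the mature protein can be sketched in Python. This is an illustration under stated assumptions, not the pipeline that produced this record: it uses the standard genetic code, it assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the reverse strand), and it assumes that "mature" here means removal of the initiator methionine, which matches the 198 versus 197 residue counts. The names `translate` and `mature_form` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError on codons containing ambiguous bases such as N.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature_form(protein: str) -> str:
    """Drop a leading initiator methionine (assumed meaning of 'mature')."""
    return protein[1:] if protein.startswith("M") else protein


# First six and last four codons of the gene sequence in this record.
print(translate("ATGGCAGAACGTAAAGGC"))  # MAERKG
print(translate("ATGCTGGCCTGA"))        # MLA
print(mature_form("MAERKG"))            # AERKG
```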
Specific function: Histidine biosynthesis; sixth step. Histidine biosynthesis; eighth step. [C]
COG id: COG0131
COG function: function code E; Imidazoleglycerol-phosphate dehydratase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the imidazoleglycerol-phosphate dehydratase family
Homologues:
Organism=Escherichia coli, GI87082027, Length=195, Percent_Identity=48.2051282051282, Blast_Score=193, Evalue=7e-51
Organism=Saccharomyces cerevisiae, GI6324776, Length=221, Percent_Identity=37.5565610859729, Blast_Score=157, Evalue=1e-39
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): HIS7_AGRT5 (Q8UJ89)
Other databases:
- EMBL: AE007869
- PIR: AD2582
- PIR: B97364
- RefSeq: NP_353082.2
- ProteinModelPortal: Q8UJ89
- SMR: Q8UJ89
- STRING: Q8UJ89
- GeneID: 1132081
- GenomeReviews: AE007869_GR
- KEGG: atu:Atu0043
- eggNOG: COG0131
- HOGENOM: HBG289010
- OMA: TLHVETL
- PhylomeDB: Q8UJ89
- ProtClustDB: PRK00951
- BioCyc: ATUM176299-1:ATU0043-MONOMER
- GO: GO:0005737
- HAMAP: MF_00076
- InterPro: IPR000807
- InterPro: IPR020565
- InterPro: IPR020568
Pfam domain/function: PF00475 IGPD; SSF54211 Ribosomal_S5_D2-typ_fold
EC number: 4.2.1.19
Molecular weight: Translated: 21725; Mature: 21594
Theoretical pI: Translated: 6.60; Mature: 6.60
Prosite motif: PS00954 IGP_DEHYDRATASE_1; PS00955 IGP_DEHYDRATASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.5 %Met (Mature Protein)
3.0 %Cys+Met (Mature Protein)
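The Cys/Met percentages can be reproduced by counting residues in the translated and mature sequences given in this record. A minimal sketch, assuming the values are simple residue counts divided by sequence length and rounded to one decimal place; the function name `residue_percent` is illustrative.

```python
# Translated protein sequence as listed in this record (198 residues).
TRANSLATED = (
    "MAERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGDLHIDDHHTVEDTGIAIGQAI"
    "AKALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWNVNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGAN"
    "NHHIAETCFKAVARVLRTATEIDPRQAGRVPSTKGMLA"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed (197 residues)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues, rounded to 0.1."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
# Translated 0.5 3.0 3.5
# Mature 0.5 2.5 3.0
```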
Secondary structure:
>Translated Secondary Structure
MAERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGD
CCCCCCCHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEEEECCC
LHIDDHHTVEDTGIAIGQAIAKALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWN
EEECCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHEECCCCEEEEEE
VNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGANNHHIAETCFKAVARVLRTAT
ECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHH
EIDPRQAGRVPSTKGMLA
HCCHHHCCCCCCCCCCCC
>Mature Secondary Structure
AERKGEIIRKTNETSVSVRVDIDGTGKSKISTGVGFFDHMLDQLSRHSLIDMDIEVQGD
CCCCCCHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEEEECCC
LHIDDHHTVEDTGIAIGQAIAKALGDRRGITRYASLDLAMDETMTKAAVDISGRPFLVWN
EEECCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHEECCCCEEEEEE
VNFSAPKIGTFDTELVREFFQALAQNAGITLHILNHYGANNHHIAETCFKAVARVLRTAT
ECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHH
EIDPRQAGRVPSTKGMLA
HCCHHHCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11743193; 11743194