Definition | Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome. |
---|---|
Accession | NC_009972 |
Length | 6,346,587 |
The map label for this gene is frzE [H]
Identifier: 159897254
GI number: 159897254
Start: 833061
End: 835349
Strand: Direct
Name: frzE [H]
Synonym: Haur_0725
Alternate gene names: 159897254
Gene position: 833061-835349 (Clockwise)
Preceding gene: 159897253
Following gene: 159897255
Centisome position: 13.13
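The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either convention, and the function names are illustrative.

```python
GENOME_LENGTH = 6_346_587      # Length field of NC_009972
START, END = 833_061, 835_349  # Start and End fields (assumed 1-based, inclusive)

def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                    # 2289
print(round(centisome(START, GENOME_LENGTH), 2))  # 13.13
```

Under these assumptions both results agree with the record: 2289 bases and a centisome position of 13.13.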
GC content: 54.7
Gene sequence:
>2289_bases ATGGGCGGTTTTGATTTATCAGCATTTTTTGGCCAGTTTCGCGAAGAAACTGAAGAGAATGTGCGGGCCTTGACCACAGG CTTATTGGCCTTGGAGTCAAACCCAGGCGATCGCGAAGCGATTGATACGATTTTTCGGGCGGCGCATACGATCAAAGGTT CGGCGCGTATGCTGGGTCAAGTCGATATGGGGCGGTTGGCGCATACCATGGAAAGTTTGCTTTCGGCCTTGCGCAGTGGC ATGCTAGCCATGAATTCGAGCATTAACGATGTGCTACTGGCCAGTGTTGATGTATTGCTGGTGTTGAATTCCCAAGTCAA CGAGCCACCGCCAACCGACCCCAACGTTGATCGTTTGGTTGAGCAACTGAATGCCTTGGCTGCTGGCGAAAGCTTGCCTG CTGCGCCGATTGTCGCGCCAGTAACCGAACCAGAACCAGAGCCTGAGCCAGTAGTGCTTGAGCAACCCAAGCCCGAACCA GCGGTTGCTAAGCCAGCCGCGCCTGCCAAACCCAAAAAATCAGCCTCAGCCGAAGCCCCCAAGTCGGTGAGTAGCACCCG TTCAACCGTGCGTGTGCCAATTTCGCGCTTAGATCGTTTGTTGAATACCGCTGGCGAGTTGGTCGTAACCCGCCAATTGC ACCTTGAGCATGTCGCTGATCTTGAGGCTTTGGATAAATTGCTGACCAAAAGTGAGCGCCTGAGCCAACAATTGAGCGAA CGCTTGACGGGTCAACGGGTGACCTTTCAGCAACGGCGCGAGGCCAGCGAATTAGCCAGCCAATTGCAAAATCTGGCCCA ATCGACCCGCAATCAGTTGCGTTTGCTAACCGAGCGTTGGAGCAGCCATAGCGCCGCCAGTGAGGCCTTGGTCGATGAAC TTGAGGCTGAGGTGATGGCGACCCGTTTGCAACCAGTCGCTGGTTTGTTTGCACCAATTCCTCGGGCCGTGCGCGAGCTG GCTCGTTCGTTGGGCAAAGAAGTTAACTTAATCACCGAAGGCGAAACCACCGAGGCCGATCGCAAAGTGATTGAGTTAAT GGCTGATCCGTTGGTGCATTTGGTGCGCAACGCGCTTGATCATGGCATCGAAAGCCCCGATGAGCGGGTGAAAGCCCACA AGCCTGCCGAAGCAAGCTTGCGTTTAGAAGCTCGCTCGTTGGGCGGCACGATTGAAATTATTATTAGCGACGATGGCCGT GGCATCGATCCAGCGGTGATTCGGGCAACTGCAATTAAACGCGGAATTATCGAGGCTGATACAGCGGCTCGCTTGCGTGA TGAAGAAGCTTTGGAGTTGATCTGGCAGCCTGGTTTTTCCACCAGCGCAATCATCACCGATGTTTCAGGCCGTGGCGTTG GCATGGACGTGGTACGGGCAGCAGTGACCGAGGTTGGTGGGCGGGTCGATGTGCATTCGGTGCTTGGCCAAGGCACGACC TTCACGCTGATTTTGCCAATTACCTTGCTAACCACCCGCGTGTTGTTGTTTGATGTGGCTGGCACAACCTATGCCTTGCC TTCGACTGCTTGTCTAGGTGGGCGGCGGGTTGCTGGCGGGCAAATTCAGACCGTCGAAGGGCGACCAACCGTGCGGGTTG ATGAGCGCAGCGTGAGCATTGTAGCGCTTGCGCCCTTGCTTGAGCAGCGTGGCCCCTTGCCGCAACCATCGGATATTTCC AATTTGGTGATTTTGGGGCCAGCTAATCGCCCATTGGCCTTGTTGGTCGATAAATTGGTCGATGAACGTGAGGTGGTGGT TAAATCGTTGGGCGCATTGTTGCATGAACAACGTTTGTGTACTGGCGCGATTGCCCTGCCTGATGGGCGTTTGGTGTTAG TGCTCAATCCCTTGGCGATTGCGGCGCGGGCACGTGAATGGGGCAAACCAGTTGCCTTGCCAGCGCCAACCAAGCTCCAG 
CCTGCCAAATTATTGGTCGCGGAAGATTCATTTACCACCCGCGAACTGCTCCGATCCATGCTGCAATCGGCGGGCTATGT GGTTGAAACGGCGATTAACGGCCAAGATGCGCTTGACAAGCTCAATCACAATTCCTACGATCTGCTGGTAAGCGATGTTG AAATGCCGTTGCTAACTGGCTTTGAGCTAACCCGCCGTGTGCGTGCCCATGACCGTTTGCGCCAACTGCCAATTATCATT ATCACCAGCTTGGCCCGCGATAGCGATCGGCGTGAAGGCTTGTTGGCTGGTGCGCAAGCCTATATCGTCAAAAGCCAGTT TGATCAAAGCAACTTGCTCGAAACGATTCATCAATTACTTGGCCGCTAA
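The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence; `gc_content` is an illustrative helper, not part of any tool used to build this record. Whitespace is stripped first because the sequence is stored in space-separated chunks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Toy input (not taken from the record) with an embedded space,
# mimicking the chunked layout of the gene sequence.
print(gc_content("ATGC GGCC"))  # 75.0
```

Passing the full 2289-base sequence to `gc_content` and rounding to one decimal place would give the value to compare against the reported 54.7.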
Upstream 100 bases:
>100_bases CCAAACTGCCGATGCCGCATCCAAACTCACCGCGATTGCAAATCGCTTGCATTCGTTGGTCAACATCGAGGCACATACAC GCTAAGCGAGGAGCTATCGC
Downstream 100 bases:
>100_bases AATAGCGCAGCCCTGAAGGCCTATATTCCCCTGCGAGGAGTTTTTCAATGAGTGAACGCATCTTAGTTGTTGACGATAGC AAACTGGTTACCGATATTGT
Product: CheA signal transduction histidine kinase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 762; Mature: 761
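The two residue counts follow from the gene length. The sketch below assumes the 2289-base sequence includes one stop codon and that the mature form differs from the translated form only by removal of the initiator methionine, which matches the Mature sequence listed in this record starting at GGFDL rather than MGGFDL. The function name is illustrative.

```python
def residue_counts(n_bases: int) -> tuple:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends with one stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1  # the stop codon encodes no residue
    mature = translated - 1  # initiator Met removed (assumption)
    return translated, mature

print(residue_counts(2289))  # (762, 761)
```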
Protein sequence:
>762_residues MGGFDLSAFFGQFREETEENVRALTTGLLALESNPGDREAIDTIFRAAHTIKGSARMLGQVDMGRLAHTMESLLSALRSG MLAMNSSINDVLLASVDVLLVLNSQVNEPPPTDPNVDRLVEQLNALAAGESLPAAPIVAPVTEPEPEPEPVVLEQPKPEP AVAKPAAPAKPKKSASAEAPKSVSSTRSTVRVPISRLDRLLNTAGELVVTRQLHLEHVADLEALDKLLTKSERLSQQLSE RLTGQRVTFQQRREASELASQLQNLAQSTRNQLRLLTERWSSHSAASEALVDELEAEVMATRLQPVAGLFAPIPRAVREL ARSLGKEVNLITEGETTEADRKVIELMADPLVHLVRNALDHGIESPDERVKAHKPAEASLRLEARSLGGTIEIIISDDGR GIDPAVIRATAIKRGIIEADTAARLRDEEALELIWQPGFSTSAIITDVSGRGVGMDVVRAAVTEVGGRVDVHSVLGQGTT FTLILPITLLTTRVLLFDVAGTTYALPSTACLGGRRVAGGQIQTVEGRPTVRVDERSVSIVALAPLLEQRGPLPQPSDIS NLVILGPANRPLALLVDKLVDEREVVVKSLGALLHEQRLCTGAIALPDGRLVLVLNPLAIAARAREWGKPVALPAPTKLQ PAKLLVAEDSFTTRELLRSMLQSAGYVVETAINGQDALDKLNHNSYDLLVSDVEMPLLTGFELTRRVRAHDRLRQLPIII ITSLARDSDRREGLLAGAQAYIVKSQFDQSNLLETIHQLLGR
Sequences:
>Translated_762_residues MGGFDLSAFFGQFREETEENVRALTTGLLALESNPGDREAIDTIFRAAHTIKGSARMLGQVDMGRLAHTMESLLSALRSG MLAMNSSINDVLLASVDVLLVLNSQVNEPPPTDPNVDRLVEQLNALAAGESLPAAPIVAPVTEPEPEPEPVVLEQPKPEP AVAKPAAPAKPKKSASAEAPKSVSSTRSTVRVPISRLDRLLNTAGELVVTRQLHLEHVADLEALDKLLTKSERLSQQLSE RLTGQRVTFQQRREASELASQLQNLAQSTRNQLRLLTERWSSHSAASEALVDELEAEVMATRLQPVAGLFAPIPRAVREL ARSLGKEVNLITEGETTEADRKVIELMADPLVHLVRNALDHGIESPDERVKAHKPAEASLRLEARSLGGTIEIIISDDGR GIDPAVIRATAIKRGIIEADTAARLRDEEALELIWQPGFSTSAIITDVSGRGVGMDVVRAAVTEVGGRVDVHSVLGQGTT FTLILPITLLTTRVLLFDVAGTTYALPSTACLGGRRVAGGQIQTVEGRPTVRVDERSVSIVALAPLLEQRGPLPQPSDIS NLVILGPANRPLALLVDKLVDEREVVVKSLGALLHEQRLCTGAIALPDGRLVLVLNPLAIAARAREWGKPVALPAPTKLQ PAKLLVAEDSFTTRELLRSMLQSAGYVVETAINGQDALDKLNHNSYDLLVSDVEMPLLTGFELTRRVRAHDRLRQLPIII ITSLARDSDRREGLLAGAQAYIVKSQFDQSNLLETIHQLLGR >Mature_761_residues GGFDLSAFFGQFREETEENVRALTTGLLALESNPGDREAIDTIFRAAHTIKGSARMLGQVDMGRLAHTMESLLSALRSGM LAMNSSINDVLLASVDVLLVLNSQVNEPPPTDPNVDRLVEQLNALAAGESLPAAPIVAPVTEPEPEPEPVVLEQPKPEPA VAKPAAPAKPKKSASAEAPKSVSSTRSTVRVPISRLDRLLNTAGELVVTRQLHLEHVADLEALDKLLTKSERLSQQLSER LTGQRVTFQQRREASELASQLQNLAQSTRNQLRLLTERWSSHSAASEALVDELEAEVMATRLQPVAGLFAPIPRAVRELA RSLGKEVNLITEGETTEADRKVIELMADPLVHLVRNALDHGIESPDERVKAHKPAEASLRLEARSLGGTIEIIISDDGRG IDPAVIRATAIKRGIIEADTAARLRDEEALELIWQPGFSTSAIITDVSGRGVGMDVVRAAVTEVGGRVDVHSVLGQGTTF TLILPITLLTTRVLLFDVAGTTYALPSTACLGGRRVAGGQIQTVEGRPTVRVDERSVSIVALAPLLEQRGPLPQPSDISN LVILGPANRPLALLVDKLVDEREVVVKSLGALLHEQRLCTGAIALPDGRLVLVLNPLAIAARAREWGKPVALPAPTKLQP AKLLVAEDSFTTRELLRSMLQSAGYVVETAINGQDALDKLNHNSYDLLVSDVEMPLLTGFELTRRVRAHDRLRQLPIIII TSLARDSDRREGLLAGAQAYIVKSQFDQSNLLETIHQLLGR
Specific function: FrzE is involved in a sensory transduction pathway that controls the frequency at which cells reverse their gliding direction. FrzE appears to autophosphorylate on a histidine residue and then to transfer that group to an aspartate r
COG id: COG0643
COG function: function code NT; Chemotaxis protein histidine kinase and related kinases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 response regulatory domain [H]
Homologues:
Organism=Escherichia coli, GI1788197, Length=469, Percent_Identity=32.1961620469083, Blast_Score=206, Evalue=4e-54
Organism=Escherichia coli, GI1788713, Length=116, Percent_Identity=37.9310344827586, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1788191, Length=119, Percent_Identity=34.453781512605, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1786784, Length=120, Percent_Identity=35, Blast_Score=66, Evalue=9e-12
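The homologue hits are stored as comma-separated `key=value` fields, with each hit starting at an `Organism=` token. A minimal parsing sketch follows; it assumes that layout holds for every hit and that the bare `GI...` token is the only field without an `=`. `parse_homologues` is a hypothetical helper, and the sample string reproduces the first two hits of this record.

```python
import re

def parse_homologues(text: str) -> list:
    """Split a flattened 'Organism=..., key=value, ...' string into dicts."""
    records = []
    # Each hit begins at an 'Organism=' token; split just before each one.
    for chunk in re.split(r"(?=Organism=)", text):
        chunk = chunk.strip().rstrip(",").strip()
        if not chunk:
            continue
        record = {}
        for field in chunk.split(","):
            field = field.strip()
            if not field:
                continue
            if "=" in field:
                key, value = field.split("=", 1)
                record[key.strip()] = value.strip()
            elif field.startswith("GI"):
                record["GI"] = field[2:]  # bare GI token, e.g. GI1788197
        records.append(record)
    return records

sample = (
    "Organism=Escherichia coli, GI1788197, Length=469, "
    "Percent_Identity=32.1961620469083, Blast_Score=206, Evalue=4e-54, "
    "Organism=Escherichia coli, GI1788713, Length=116, "
    "Percent_Identity=37.9310344827586, Blast_Score=72, Evalue=2e-13,"
)
hits = parse_homologues(sample)
for hit in hits:
    print(hit["GI"], round(float(hit["Percent_Identity"]), 1), hit["Evalue"])
```

Values are kept as strings so nothing is lost in parsing; convert with `int` or `float` where a numeric comparison is needed.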
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR002545
- InterPro: IPR011006
- InterPro: IPR004358
- InterPro: IPR008207
- InterPro: IPR005467
- InterPro: IPR001789 [H]
Pfam domain/function: PF01584 CheW; PF02518 HATPase_c; PF01627 Hpt; PF00072 Response_reg [H]
EC number: 2.7.13.3 [H]
Molecular weight: Translated: 82367; Mature: 82235
Theoretical pI: Translated: 5.50; Mature: 5.50
Prosite motif: PS50851 CHEW; PS50894 HPT; PS50110 RESPONSE_REGULATORY; PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.4 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.3 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
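Percentages like these are residue counts divided by protein length. The sketch below shows one way to compute them, assuming each value is rounded to one decimal place; `residue_percent` is an illustrative helper and the input is a toy sequence, not the FrzE protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)

toy = "MACM"  # toy sequence: 2 Met, 1 Cys, 1 Ala
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

As a scale check, two cysteines in a 762-residue protein would round to 0.3 %, the figure reported for the translated protein.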
Secondary structure:
>Translated Secondary Structure MGGFDLSAFFGQFREETEENVRALTTGLLALESNPGDREAIDTIFRAAHTIKGSARMLGQ CCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCHHHHHHHHHHHHHHCCHHHHHHH VDMGRLAHTMESLLSALRSGMLAMNSSINDVLLASVDVLLVLNSQVNEPPPTDPNVDRLV HHHHHHHHHHHHHHHHHHHCHHHHCCCCHHHHHHHHHHHEEECCCCCCCCCCCCCHHHHH EQLNALAAGESLPAAPIVAPVTEPEPEPEPVVLEQPKPEPAVAKPAAPAKPKKSASAEAP HHHHHHHCCCCCCCCCEEECCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCC KSVSSTRSTVRVPISRLDRLLNTAGELVVTRQLHLEHVADLEALDKLLTKSERLSQQLSE CHHHHHHHHEECCHHHHHHHHHHHCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH RLTGQRVTFQQRREASELASQLQNLAQSTRNQLRLLTERWSSHSAASEALVDELEAEVMA HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH TRLQPVAGLFAPIPRAVRELARSLGKEVNLITEGETTEADRKVIELMADPLVHLVRNALD HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH HGIESPDERVKAHKPAEASLRLEARSLGGTIEIIISDDGRGIDPAVIRATAIKRGIIEAD CCCCCCHHHHHCCCCCCHHEEEEHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHCCCCCC TAARLRDEEALELIWQPGFSTSAIITDVSGRGVGMDVVRAAVTEVGGRVDVHSVLGQGTT HHHHHCCHHHHHHHCCCCCCCCEEEEECCCCCCCHHHHHHHHHHCCCCEEHHHHHCCCCE FTLILPITLLTTRVLLFDVAGTTYALPSTACLGGRRVAGGQIQTVEGRPTVRVDERSVSI EEEHHHHHHHHHHHHHHHCCCCCEECCCHHHCCCCEECCCEEEEECCCCCEEECCCCEEE VALAPLLEQRGPLPQPSDISNLVILGPANRPLALLVDKLVDEREVVVKSLGALLHEQRLC HHHHHHHHHCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH TGAIALPDGRLVLVLNPLAIAARAREWGKPVALPAPTKLQPAKLLVAEDSFTTRELLRSM HCEEECCCCCEEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH LQSAGYVVETAINGQDALDKLNHNSYDLLVSDVEMPLLTGFELTRRVRAHDRLRQLPIII HHHCCEEEEEECCCHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCHHE ITSLARDSDRREGLLAGAQAYIVKSQFDQSNLLETIHQLLGR EEHHHCCCCHHHHHHHCHHHHEEHHHCCHHHHHHHHHHHHCC >Mature Secondary Structure GGFDLSAFFGQFREETEENVRALTTGLLALESNPGDREAIDTIFRAAHTIKGSARMLGQ CCCCHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCHHHHHHHHHHHHHHCCHHHHHHH VDMGRLAHTMESLLSALRSGMLAMNSSINDVLLASVDVLLVLNSQVNEPPPTDPNVDRLV HHHHHHHHHHHHHHHHHHHCHHHHCCCCHHHHHHHHHHHEEECCCCCCCCCCCCCHHHHH EQLNALAAGESLPAAPIVAPVTEPEPEPEPVVLEQPKPEPAVAKPAAPAKPKKSASAEAP HHHHHHHCCCCCCCCCEEECCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCC 
KSVSSTRSTVRVPISRLDRLLNTAGELVVTRQLHLEHVADLEALDKLLTKSERLSQQLSE CHHHHHHHHEECCHHHHHHHHHHHCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH RLTGQRVTFQQRREASELASQLQNLAQSTRNQLRLLTERWSSHSAASEALVDELEAEVMA HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH TRLQPVAGLFAPIPRAVRELARSLGKEVNLITEGETTEADRKVIELMADPLVHLVRNALD HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH HGIESPDERVKAHKPAEASLRLEARSLGGTIEIIISDDGRGIDPAVIRATAIKRGIIEAD CCCCCCHHHHHCCCCCCHHEEEEHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHCCCCCC TAARLRDEEALELIWQPGFSTSAIITDVSGRGVGMDVVRAAVTEVGGRVDVHSVLGQGTT HHHHHCCHHHHHHHCCCCCCCCEEEEECCCCCCCHHHHHHHHHHCCCCEEHHHHHCCCCE FTLILPITLLTTRVLLFDVAGTTYALPSTACLGGRRVAGGQIQTVEGRPTVRVDERSVSI EEEHHHHHHHHHHHHHHHCCCCCEECCCHHHCCCCEECCCEEEEECCCCCEEECCCCEEE VALAPLLEQRGPLPQPSDISNLVILGPANRPLALLVDKLVDEREVVVKSLGALLHEQRLC HHHHHHHHHCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH TGAIALPDGRLVLVLNPLAIAARAREWGKPVALPAPTKLQPAKLLVAEDSFTTRELLRSM HCEEECCCCCEEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH LQSAGYVVETAINGQDALDKLNHNSYDLLVSDVEMPLLTGFELTRRVRAHDRLRQLPIII HHHCCEEEEEECCCHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCHHE ITSLARDSDRREGLLAGAQAYIVKSQFDQSNLLETIHQLLGR EEHHHCCCCHHHHHHHCHHHHEEHHHCCHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2165608; 2123853 [H]