| Definition | Klebsiella pneumoniae NTUH-K2044 chromosome, complete genome. |
|---|---|
| Accession | NC_012731 |
| Length | 5,248,520 |
The map label for this gene is arcB [H]
Identifier: 238896716
GI number: 238896716
Start: 4689784
End: 4692123
Strand: Reverse
Name: arcB [H]
Synonym: KP1_4931
Alternate gene names: 238896716
Gene position: 4692123-4689784 (Counterclockwise)
Preceding gene: 238896717
Following gene: 238896715
Centisome position: 89.4
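As a quick consistency check on the fields above, the gene length and centisome position can be recomputed from the Start, End and Length values. This is a minimal sketch. It assumes the centisome is the gene position expressed as a percentage of chromosome length, and the function names are illustrative, not part of the database.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_248_520          # Length of NC_012731
START, END = 4_689_784, 4_692_123  # Start and End of the gene

def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * position / genome_length

length = gene_length(START, END)
print(length)                                      # 2340
print(length // 3 - 1)                             # 779 codons, excluding the stop codon
print(round(centisome(END, GENOME_LENGTH), 1))     # 89.4
```

Both the Start and the End coordinate round to 89.4, so the record does not reveal which one the database uses.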
GC content: 55.47
Gene sequence:
>2340_bases ATGAAGCAAATTCGTATGCTGGCGCAGTATTATGTCGACCTGATGATGAAGCTGGGTCTGGTCCGCTTTTCGATGCTGTT GGCTCTTGCGCTGGTGGTGCTGGCCATCGTGGTACAGATGGCGGTTACCATGGTGCTGCATGGCCAGGTTGAAAGTATCG ACGTTATCCGCTCTATCTTCTTTGGCCTGCTGATCACGCCTTGGGCGGTCTACTTTCTCTCGGTGGTGGTGGAGCAGCTC GAAGAGTCCCGTCAGCGTCTTTCGCGGCTGGTGGACAAGCTGGAGGAGATGCGTGAGCGCGATCTAAAGCTGAACGTCCA ACTGAAAGATAACATCGCGCAGTTGAATCAGGAGATTGGCGAACGTGAAAAAGCGGAAGCCGAGCGCGAGACCACCCTCG AGCAGTTGAAGATTGAGATGAAAGAGCGTGAAGAGACGCAGATCCAGCTCGAACAGCAATCCTCCTTTCTGCGCTCTTTC CTCGATGCTTCCCCGGACCTGGTCTTCTATCGCAATGAGGATAAGGAGTTTTCAGGCTGTAACCGGGCGATGGAGCTGCT GACCGGCAAAAGCGAGAAACAGCTGATCCACCTCAAACCGCAGGACGTCTATAGCGAAGAGGCGGCGGAAAAGGTACTGG AGACCGACGAGAAGGTGTTCCGCCATAATGTATCGCTGACCTATGAGCAGTGGCTGGACTATCCCGATGGCCGCAAAGCG TGCTTTGAAATCCGCAAGGTCCCCTACTACGATCGCGTCGGTAAGCGTCGCGGCCTGATGGGCTTCGGCCGCGATATCAC CGAACGTAAACGCTATCAGGACGCTCTTGAGCGCGCCAGCCGGGATAAAACCACCTTTATTTCCACCATCAGTCATGAGC TGCGTACCCCGCTAAACGGGATCGTCGGGCTGAGCCGAATTTTGCTGGATACCGAACTGACCAGCGAACAGGAAAAATAC CTGAAAACTATCCATGTATCGGCGGTGACGCTGGGGAATATCTTCAATGATATTATCGACATGGACAAAATGGAGCGCCG CAAGGTGCAGCTCGATAACCAGCCGGTGGACTTCACCAGCTTCCTTGCCGACCTTGAAAACCTATCCGGCCTGCAGGCAC AGCAGAAGGGGCTGCGCTTTGTGCTGGAGCCGAGCCTGCCGCTGCCGCATAAGGTAATCACGGATGGGACCCGCCTGCGG CAGATCCTGTGGAATCTTATCAGTAACGCCGTGAAGTTTACCCCGCAGGGCGGAGGGGTGAACGTCCGCGTGCGCTATGA CGAAGGCGATATCCTGCATTTCGAAGTGGAAGACTCCGGAATTGGTATTCCTGAAGCGGAACAGGACAAAATTTTCGCCA TGTATTACCAGGTCAAAGACAGCCACGGCGGCAAACCGGCCACCGGAACCGGGATTGGTCTCGCCGTCTCGCGTCGCCTG GCGCGTAATATGGGCGGTGATATCAGCGTTACCAGCCAGCCGGGTAAAGGGGCGACCTTTACGCTTACCGTCCATGCGCC TGCCATTGCGGAAGAAGTGGAAGATACGCTGGCGGAAGACGACATGCCATTACCGGCGCTCAACGTGCTGCTGGTTGAGG ACATTGAGCTCAACGTTATTGTGGCCCGTTCAGTGCTGGAAAAACTGGGTAACAGCGTTGATGTGGCGATGACCGGGAAA GCCGCGCTGGAGATGTTTGAGCCAGGCGAATATGACCTTGTGCTGTTGGATATTCAGTTACCGGACATGACCGGACTGGA TATTTCCCGGGAACTGAAACAGCGCTTTGCCGCTGACGAGCTACCGCCGCTGGTCGCCCTCACCGCCAACGTGCTGAAGA ATAAAAAAGAGTATCTCGACGCCGGGATGGACGATGTGTTGAGTAAGCCGCTATCGGTGCCAGCGCTAACGGCCATGATC 
AAGAAATTCTGGGATGCACCGGATGAGGAAGCGCAGGAAGCGCCGGCGGCCGATCTGCATAAAGCCGACGCGGTGCTGGA TACCGATATGCTGGAGCAATATATCGAGCTGGTGGGACCGAAGCTTATCAACGATGGTCTCGCGGTGTTTGAGAAGATGA TGCCGGGCTACATGTCCGTTCTGGAGTCTAATCTGACCGCCCGCGACCAAAAAGGGATTGTCGAAGAGGGGCATAAGATC AAAGGGGCGGCCGGTTCTATCGGACTGCGCCACATCCAGCAGCTGGGCCAGCAGATCCAGACTCCGGATCTGCCTGCCTG GTCTGATAATGTCGCTGAATGGGTTGAAGAGATGAAATCCGAGTGGCAAAACGATGTAGCGGTACTGAAGGCGTGGGTGG CGAAAGCCAGCAAAAAATGA
Upstream 100 bases:
>100_bases CGGCCGCCGTATCGACTCCTGCCCGCATTTTTTGCACAACTTACAGCGCATTGCTCAGAATTGAGTATTATTGTGCGGAG TTGTCGTGAAGGAATCCCCT
Downstream 100 bases:
>100_bases CCCCGGACAGACCGGGGTGCGCGAATACTGCGCCAACACCAGGGAACTGGTGGTTGAGCCAGTGTTGTTTGAGTGATGTT GTTACGTGGCGCAACCGAAG
Product: aerobic respiration control sensor protein ArcB
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 779; Mature: 779
Protein sequence:
>779_residues MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIFFGLLITPWAVYFLSVVVEQL EESRQRLSRLVDKLEEMRERDLKLNVQLKDNIAQLNQEIGEREKAEAERETTLEQLKIEMKEREETQIQLEQQSSFLRSF LDASPDLVFYRNEDKEFSGCNRAMELLTGKSEKQLIHLKPQDVYSEEAAEKVLETDEKVFRHNVSLTYEQWLDYPDGRKA CFEIRKVPYYDRVGKRRGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNGIVGLSRILLDTELTSEQEKY LKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTSFLADLENLSGLQAQQKGLRFVLEPSLPLPHKVITDGTRLR QILWNLISNAVKFTPQGGGVNVRVRYDEGDILHFEVEDSGIGIPEAEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRL ARNMGGDISVTSQPGKGATFTLTVHAPAIAEEVEDTLAEDDMPLPALNVLLVEDIELNVIVARSVLEKLGNSVDVAMTGK AALEMFEPGEYDLVLLDIQLPDMTGLDISRELKQRFAADELPPLVALTANVLKNKKEYLDAGMDDVLSKPLSVPALTAMI KKFWDAPDEEAQEAPAADLHKADAVLDTDMLEQYIELVGPKLINDGLAVFEKMMPGYMSVLESNLTARDQKGIVEEGHKI KGAAGSIGLRHIQQLGQQIQTPDLPAWSDNVAEWVEEMKSEWQNDVAVLKAWVAKASKK
Sequences:
>Translated_779_residues MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIFFGLLITPWAVYFLSVVVEQL EESRQRLSRLVDKLEEMRERDLKLNVQLKDNIAQLNQEIGEREKAEAERETTLEQLKIEMKEREETQIQLEQQSSFLRSF LDASPDLVFYRNEDKEFSGCNRAMELLTGKSEKQLIHLKPQDVYSEEAAEKVLETDEKVFRHNVSLTYEQWLDYPDGRKA CFEIRKVPYYDRVGKRRGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNGIVGLSRILLDTELTSEQEKY LKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTSFLADLENLSGLQAQQKGLRFVLEPSLPLPHKVITDGTRLR QILWNLISNAVKFTPQGGGVNVRVRYDEGDILHFEVEDSGIGIPEAEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRL ARNMGGDISVTSQPGKGATFTLTVHAPAIAEEVEDTLAEDDMPLPALNVLLVEDIELNVIVARSVLEKLGNSVDVAMTGK AALEMFEPGEYDLVLLDIQLPDMTGLDISRELKQRFAADELPPLVALTANVLKNKKEYLDAGMDDVLSKPLSVPALTAMI KKFWDAPDEEAQEAPAADLHKADAVLDTDMLEQYIELVGPKLINDGLAVFEKMMPGYMSVLESNLTARDQKGIVEEGHKI KGAAGSIGLRHIQQLGQQIQTPDLPAWSDNVAEWVEEMKSEWQNDVAVLKAWVAKASKK >Mature_779_residues MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIFFGLLITPWAVYFLSVVVEQL EESRQRLSRLVDKLEEMRERDLKLNVQLKDNIAQLNQEIGEREKAEAERETTLEQLKIEMKEREETQIQLEQQSSFLRSF LDASPDLVFYRNEDKEFSGCNRAMELLTGKSEKQLIHLKPQDVYSEEAAEKVLETDEKVFRHNVSLTYEQWLDYPDGRKA CFEIRKVPYYDRVGKRRGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNGIVGLSRILLDTELTSEQEKY LKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTSFLADLENLSGLQAQQKGLRFVLEPSLPLPHKVITDGTRLR QILWNLISNAVKFTPQGGGVNVRVRYDEGDILHFEVEDSGIGIPEAEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRL ARNMGGDISVTSQPGKGATFTLTVHAPAIAEEVEDTLAEDDMPLPALNVLLVEDIELNVIVARSVLEKLGNSVDVAMTGK AALEMFEPGEYDLVLLDIQLPDMTGLDISRELKQRFAADELPPLVALTANVLKNKKEYLDAGMDDVLSKPLSVPALTAMI KKFWDAPDEEAQEAPAADLHKADAVLDTDMLEQYIELVGPKLINDGLAVFEKMMPGYMSVLESNLTARDQKGIVEEGHKI KGAAGSIGLRHIQQLGQQIQTPDLPAWSDNVAEWVEEMKSEWQNDVAVLKAWVAKASKK
Specific function: Member of the two-component regulatory system ArcB/ArcA. Sensor-regulator protein for anaerobic repression of the arc modulon. Activates ArcA via a four-step phosphorelay. ArcB can also dephosphorylate ArcA by a reverse phosphorelay involving His-717 and
COG id: COG0642
COG function: function code T; Signal transduction histidine kinase
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 response regulatory domain [H]
Homologues:
Organism=Escherichia coli, GI48994928, Length=779, Percent_Identity=88.8318356867779, Blast_Score=1394, Evalue=0.0
Organism=Escherichia coli, GI1788713, Length=638, Percent_Identity=28.2131661442006, Blast_Score=194, Evalue=1e-50
Organism=Escherichia coli, GI87081816, Length=516, Percent_Identity=27.7131782945736, Blast_Score=182, Evalue=9e-47
Organism=Escherichia coli, GI1789149, Length=516, Percent_Identity=29.2635658914729, Blast_Score=174, Evalue=1e-44
Organism=Escherichia coli, GI145693157, Length=255, Percent_Identity=35.6862745098039, Blast_Score=137, Evalue=3e-33
Organism=Escherichia coli, GI1788549, Length=417, Percent_Identity=25.6594724220624, Blast_Score=91, Evalue=2e-19
Organism=Escherichia coli, GI1790436, Length=246, Percent_Identity=29.6747967479675, Blast_Score=91, Evalue=2e-19
Organism=Escherichia coli, GI1786600, Length=273, Percent_Identity=28.5714285714286, Blast_Score=88, Evalue=2e-18
Organism=Escherichia coli, GI1786912, Length=242, Percent_Identity=28.099173553719, Blast_Score=87, Evalue=3e-18
Organism=Escherichia coli, GI1790861, Length=220, Percent_Identity=23.1818181818182, Blast_Score=73, Evalue=9e-14
Organism=Escherichia coli, GI87082128, Length=225, Percent_Identity=25.3333333333333, Blast_Score=72, Evalue=2e-13
Organism=Escherichia coli, GI1786783, Length=243, Percent_Identity=25.5144032921811, Blast_Score=70, Evalue=5e-13
Organism=Escherichia coli, GI1790300, Length=240, Percent_Identity=27.5, Blast_Score=69, Evalue=8e-13
Organism=Escherichia coli, GI1788393, Length=230, Percent_Identity=24.7826086956522, Blast_Score=69, Evalue=1e-12
Organism=Escherichia coli, GI1790346, Length=206, Percent_Identity=27.1844660194175, Blast_Score=67, Evalue=4e-12
Organism=Escherichia coli, GI1788279, Length=270, Percent_Identity=23.7037037037037, Blast_Score=64, Evalue=5e-11
Organism=Saccharomyces cerevisiae, GI6322044, Length=145, Percent_Identity=33.7931034482759, Blast_Score=72, Evalue=4e-13
Organism=Saccharomyces cerevisiae, GI6322000, Length=119, Percent_Identity=31.0924369747899, Blast_Score=66, Evalue=3e-11
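The homologue entries follow a regular `key=value` pattern, so they can be read into structured records for filtering or sorting. The following is a rough sketch only. The regular expression, the `parse_hits` helper and the 80% identity cutoff are illustrative assumptions rather than anything defined by the database, and the sample string reproduces just two of the hits listed above.

```python
import re
from typing import Dict, List, Union

# Hypothetical pattern matching one homologue entry as written in this record.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

Hit = Dict[str, Union[str, int, float]]

def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in text as a dictionary."""
    hits: List[Hit] = []
    for m in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits

# Two entries copied from the Homologues field.
sample = (
    "Organism=Escherichia coli, GI48994928, Length=779, "
    "Percent_Identity=88.8318356867779, Blast_Score=1394, Evalue=0.0, "
    "Organism=Saccharomyces cerevisiae, GI6322000, Length=119, "
    "Percent_Identity=31.0924369747899, Blast_Score=66, Evalue=3e-11,"
)

hits = parse_hits(sample)
close = [h["gi"] for h in hits if h["identity"] >= 80.0]  # arbitrary cutoff
print(close)  # ['GI48994928']
```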
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003594
- InterPro: IPR011006
- InterPro: IPR000014
- InterPro: IPR000700
- InterPro: IPR013767
- InterPro: IPR004358
- InterPro: IPR008207
- InterPro: IPR014409
- InterPro: IPR003661
- InterPro: IPR005467
- InterPro: IPR009082
- InterPro: IPR001789 [H]
Pfam domain/function: PF02518 HATPase_c; PF00512 HisKA; PF01627 Hpt; PF00989 PAS; PF00072 Response_reg [H]
EC number: 2.7.13.3 [H]
Molecular weight: Translated: 87745; Mature: 87745
Theoretical pI: Translated: 4.73; Mature: 4.73
Prosite motif: PS50894 HPT; PS50112 PAS; PS50113 PAC; PS50110 RESPONSE_REGULATORY; PS50109 HIS_KIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
3.3 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
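Percentages of this kind are simple residue counts divided by the protein length. The sketch below shows one way to compute them. It assumes one-letter amino acid codes and rounding to one decimal place, and the `residue_percent` helper and the toy sequence are illustrative only. Running it on the 779-residue sequence from this record should give values comparable to those listed, although that has not been verified here.

```python
def residue_percent(protein: str, residue: str) -> float:
    """Percentage of positions in protein occupied by the given one-letter residue."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    return round(100.0 * protein.count(residue.upper()) / len(protein), 1)

# Toy 10-residue sequence, not the ArcB protein.
toy = "MCMAAAAAAA"
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
print(cys, met)  # 10.0 20.0
```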
Secondary structure:
>Translated Secondary Structure MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIF CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH FGLLITPWAVYFLSVVVEQLEESRQRLSRLVDKLEEMRERDLKLNVQLKDNIAQLNQEIG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHC EREKAEAERETTLEQLKIEMKEREETQIQLEQQSSFLRSFLDASPDLVFYRNEDKEFSGC CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHH NRAMELLTGKSEKQLIHLKPQDVYSEEAAEKVLETDEKVFRHNVSLTYEQWLDYPDGRKA HHHHHHHHCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCEEHHHHCCCCCCHHH CFEIRKVPYYDRVGKRRGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNG HHHHHCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHH IVGLSRILLDTELTSEQEKYLKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTS HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH FLADLENLSGLQAQQKGLRFVLEPSLPLPHKVITDGTRLRQILWNLISNAVKFTPQGGGV HHHHHHHCCCCHHHHCCCEEEECCCCCCCHHHHCCCHHHHHHHHHHHHHHHEECCCCCCE NVRVRYDEGDILHFEVEDSGIGIPEAEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRL EEEEEECCCCEEEEEECCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCHHHHHHHHH ARNMGGDISVTSQPGKGATFTLTVHAPAIAEEVEDTLAEDDMPLPALNVLLVEDIELNVI HHCCCCCEEEECCCCCCCEEEEEEECHHHHHHHHHHHHCCCCCCCHHHEEEEECCCCHHH VARSVLEKLGNSVDVAMTGKAALEMFEPGEYDLVLLDIQLPDMTGLDISRELKQRFAADE HHHHHHHHCCCCEEEEEECHHHHHHCCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHCC LPPLVALTANVLKNKKEYLDAGMDDVLSKPLSVPALTAMIKKFWDAPDEEAQEAPAADLH CCHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHCCCHHHH KADAVLDTDMLEQYIELVGPKLINDGLAVFEKMMPGYMSVLESNLTARDQKGIVEEGHKI HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCHHHCCCCC KGAAGSIGLRHIQQLGQQIQTPDLPAWSDNVAEWVEEMKSEWQNDVAVLKAWVAKASKK CCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIF CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH FGLLITPWAVYFLSVVVEQLEESRQRLSRLVDKLEEMRERDLKLNVQLKDNIAQLNQEIG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHC EREKAEAERETTLEQLKIEMKEREETQIQLEQQSSFLRSFLDASPDLVFYRNEDKEFSGC 
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHH NRAMELLTGKSEKQLIHLKPQDVYSEEAAEKVLETDEKVFRHNVSLTYEQWLDYPDGRKA HHHHHHHHCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCEEHHHHCCCCCCHHH CFEIRKVPYYDRVGKRRGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNG HHHHHCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHH IVGLSRILLDTELTSEQEKYLKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTS HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH FLADLENLSGLQAQQKGLRFVLEPSLPLPHKVITDGTRLRQILWNLISNAVKFTPQGGGV HHHHHHHCCCCHHHHCCCEEEECCCCCCCHHHHCCCHHHHHHHHHHHHHHHEECCCCCCE NVRVRYDEGDILHFEVEDSGIGIPEAEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRL EEEEEECCCCEEEEEECCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCHHHHHHHHH ARNMGGDISVTSQPGKGATFTLTVHAPAIAEEVEDTLAEDDMPLPALNVLLVEDIELNVI HHCCCCCEEEECCCCCCCEEEEEEECHHHHHHHHHHHHCCCCCCCHHHEEEEECCCCHHH VARSVLEKLGNSVDVAMTGKAALEMFEPGEYDLVLLDIQLPDMTGLDISRELKQRFAADE HHHHHHHHCCCCEEEEEECHHHHHHCCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHCC LPPLVALTANVLKNKKEYLDAGMDDVLSKPLSVPALTAMIKKFWDAPDEEAQEAPAADLH CCHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHCCCHHHH KADAVLDTDMLEQYIELVGPKLINDGLAVFEKMMPGYMSVLESNLTARDQKGIVEEGHKI HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCHHHCCCCC KGAAGSIGLRHIQQLGQQIQTPDLPAWSDNVAEWVEEMKSEWQNDVAVLKAWVAKASKK CCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]