Definition Escherichia coli str. K-12 substr. MG1655 chromosome, complete genome.
Accession NC_000913
Length 4,639,675

Click here to switch to the map view.

The map label for this gene is rhsD

Identifier: 16128481

GI number:

Start: 522485

End: 526765

Strand: Direct

Name: rhsD

Synonym: b0497

Alternate gene names: NA

Gene position: NA

Preceding gene: 16128480

Following gene: 16128482

Centisome position: NA

GC content: NA

Gene sequence:

>4281_bases
ATGAGCGGAAAACCAGCGGCGCGTCAGGGAGATATGACTCAGTATGGCGGTCCCATTGTCCAGGGTTCGGCAGGTGTAAG
AATTGGCGCGCCCACCGGCGTGGCGTGCTCGGTGTGTCCGGGCGGGATGACTTCGGGCAACCCGGTAAATCCGCTGCTGG
GGGCGAAGGTGCTGCCCGGCGAGACGGACCTTGCGCTGCCCGGCCCGCTGCCGTTCATTCTCTCCCGCACCTACAGCAGC
TACCGGACGAAGACGCCTGCACCGGTGGGCGTTTTCGGCCCCGGCTGGAAAGCGCCTTCTGATATCCGCTTACAGCTACG
TGATGACGGACTGATACTCAACGACAACGGCGGGCGGAGCATTCACTTTGAGCCGCTGCTGCCGGGGGAGGCGGTGTACA
GCCGCAGTGAGTCAATGTGGCTGGTGCGCGGTGGTAAGGCAGCACAGCCGGACGGCCATACGCTGGCGCGGCTGTGGGGG
GCGCTGCCGCCGGATATCCGGTTAAGCCCGCATCTTTACCTGGCGACCAACAGCGCACAGGGGCCGTGGTGGATACTGGG
GTGGTCTGAGCGGGTGCCGGGTGCTGAGGACGTACTGCCAGCGCCGCTGCCGCCGTACCGGGTGCTTACCGGGATGGCGG
ACCGCTTCGGGCGGACGCTGACGTACCGGCGTGAGGCCGCCGGTGACCTGGCCGGGGAAATCACCGGCGTGACGGACGGT
GCCGGGCGGGAGTTCCGTCTGGTGCTGACCACGCAGGCGCAGCGTGCGGAAGAGGCCCGCACCTCTTCGCTATCTTCTTC
TGACAGTTCCCGCCCTCTCTCAGCCTCAGCGTTCCCCGACACACTGCCCGGTACCGAATACGGCCCCGACAGGGGTATCC
GCCTTTCGGCGGTGTGGCTGATGCACGACCCGGCATACCCGGAGAGCCTGCCCGCTGCGCCACTGGTGCGGTACACGTAT
ACGGAAGCCGGTGAACTGCTGGCGGTATATGACCGCAGCAATACGCAGGTGCGCGCTTTCACGTATGACGCGCAGCACCC
GGGCCGGATGGTGGCGCACCGTTACGCGGGAAGGCCGGAGATGCGCTACCGCTACGACGATACGGGGCGGGTGGTGGAGC
AACTGAACCCGGCAGGGTTAAGCTACCGCTATCTTTATGAGCAGGACCGCATCACCGTCACCGACAGCCTGAACCGGCGT
GAGGTGCTGCATACAGAAGGCGGGGCCGGGCTGAAACGGGTGGTGAAAAAAGAACTGGCGGACGGCAGCGTCACGCGCAG
CGGGTATGACGCGGCAGGAAGGCTCACGGCGCAGACGGACGCGGCGGGACGGAGGACAGAGTACGGTCTGAATGTGGTGT
CCGGCGATATCACGGACATCACCACACCGGACGGGCGGGAGACGAAATTTTACTATAACGACGGGAACCAGCTGACGGCG
GTGGTGTCCCCGGACGGGCTGGAGAGCCGCCGGGAATATGATGAACCGGGCAGGCTGGTATCGGAGACATCGCGCAGCGG
GGAGACAGTACGCTACCGCTACGATGACGCGCACAGTGAGTTACCGGCGACGACAACGGATGCGACGGGCAGCACCCGGC
AGATGACCTGGAGCCGCTACGGGCAGTTGCTGGCGTTCACCGACTGCTCGGGCTACCAGACCCGTTATGAATACGACCGC
TTCGGCCAGATGACGGCGGTCCACCGCGAGGAAGGCATCAGCCTTTACCGCCGCTATGACAACCGTGGCCGGTTAACCTC
GGTGAAAGACGCACAGGGCCGTGAAACGCGGTATGAATACAACGCCGCAGGCGACCTGACTGCCGTTATCACCCCGGACG
GCAACCGGAGCGAGACACAGTACGATGCGTGGGGAAAGGCGGTCAGCACCACGCAGGGCGGGCTGACGCGCAGTATGGAG
TACGATGCTGCCGGACGTGTCATCAGCCTGACCAACGAGAACGGCAGCCACAGCGTCTTCAGTTACGATGCGCTGGACCG
GCTGGTACAGCAGGGCGGCTTTGACGGGCGGACGCAACGTTATCATTATGACCTGACCGGAAAACTCACACAGAGTGAGG
ATGAGGGACTTGTCATCCTCTGGTACTACGATGAATCGGACCGTATCACTCACCGCACGGTGAACGGCGAACCGGCAGAG
CAGTGGCAGTATGATGGCCACGGCTGGCTGACAGACATCAGCCACCTGAGCGAAGGCCACCGTGTTGCCGTCCACTATGG
CTATGACGATAAAGGCCGCCTGACCGGCGAATGCCAGACGGTGGAGAACCCGGAGACGGGGGAACTGCTGTGGCAGCATG
AGACGAAACACGCATACAACGAGCAGGGGCTGGCAAACCGCGTCACGCCGGACAGCCTGCCGCCGGTGGAGTGGCTGACG
TATGGCAGCGGTTACCTGGCGGGAATGAAGCTGGGCGGGACGCCGCTGGTCGAGTATACGCGGGACAGGCTGCACCGTGA
GACGGTGCGCAGCTTCGGCAGCATGGCAGGCAGTAATGCCGCATACGAACTGACCAGCACATACACCCCCGCAGGCCAGT
TACAGAGCCAGCACCTGAACAGCCTGGTATATGACCGTGACTACGGGTGGAGTGACAACGGCGACCTGGTGCGCATCAGC
GGCCCGCGACAGACGCGGGAATACGGCTACAGCGCCACGGGCAGGCTGGAGAGTGTGCGCACCCTCGCACCAGACCTGGA
CATCCGCATCCCGTATGCCACGGACCCGGCGGGCAACCGGCTGCCGGACCCGGAGCTGCACCCGGACAGTACACTCACAG
TGTGGCCGGATAACCGCATCGCGGAGGATGCGCACTATGTCTACCGCCACGATGAATACGGCAGGCTGACGGAGAAGACG
GACCGCATCCCGGCGGGTGTGATACGGACGGACGACGAGCGGACCCACCACTACCACTACGACAGCCAGCACCGCCTGGT
GTTCTACACGCGGATACAGCATGGCGAGCCACTGGTCGAGAGCCGCTACCTCTACGACCCGCTGGGACGGCGAATGGCAA
AACGGGTCTGGCGGCGGGAGCGTGACCTGACGGGGTGGATGTCGCTGTCGCGTAAACCGGAGGTGACGTGGTATGGCTGG
GACGGAGACAGGCTGACGACGGTGCAGACTGACACCACACGTATCCAGACGGTATACGAGCCGGGAAGCTTCACGCCGCT
CATCCGGGTCGAGACAGAGAACGGCGAGCGGGAAAAAGCGCAGCGGCGCAGCCTGGCAGAGACGCTCCAGCAGGAAGGGA
GTGAGAACGGCCACGGCGTGGTGTTCCCGGCTGAACTGGTGCGGCTGCTGGACAGGCTGGAGGAAGAAATCCGGGCAGAC
CGCGTGAGCAGTGAAAGCCGGGCGTGGCTTGCGCAGTGCGGGCTGACGGTGGAGCAACTGGCCAGACAGGTGGAGCCGGA
ATACACACCGGCGCGAAAAGCTCATCTTTATCACTGCGACCACCGGGGACTGCCGCTGGCGCTTATCAGCGAAGACGGCA
ATACGGCGTGGAGCGCGGAATATGATGAATGGGGCAACCAGCTTAATGAGGAGAACCCGCATCATGTGTATCAGCCGTAC
CGTCTGCCAGGGCAGCAGCATGATGAGGAATCAGGGCTGTACTATAACCGTCACCGGTACTACGATCCGTTGCAGGGGCG
GTATATTACTCAGGACCCGATGGGGTTGAAAGGGGGATGGAATTTATATCAGTATCCTTTAAATCCACTACAACAAATTG
ACCCTATGGGATTATTGCAGACTTGGGATGATGCCAGATCTGGAGCATGTACGGGGGGAGTTTGTGGTGTTCTTTCACGT
ATAATAGGACCAAGTAAATTTGATAGTACTGCAGATGCTGCGTTAGATGCTTTGAAAGAAACGCAGAATAGATCTCTATG
TAATGATATGGAATACTCTGGTATTGTCTGTAAAGATACTAATGGAAAATATTTTGCATCTAAGGCAGAAACTGATAATT
TAAGAAAGGAGTCATATCCTCTGAAAAGAAAATGTCCCACAGGTACAGATAGAGTTGCTGCTTATCATACTCACGGTGCA
GATAGTCATGGCGATTATGTTGATGAATTTTTTTCAAGTAGCGATAAAAATCTTGTAAGAAGTAAAGATAATAATCTTGA
AGCATTTTATCTCGCAACACCTGATGGACGATTTGAGGCGCTTAATAATAAAGGAGAATATATTTTTATCAGAAATAGTG
TCCCGGGATTGAGTTCAGTATGCATACCGTATCATGATTAA

Upstream 100 bases:

>100_bases
TGTGAAAAATATATAAATACATTAGCTGGTCTTGTGTGTCATTTTATTTTTTTTTGTTGCTAACACAGGGATATGAACAA
TAACTAAAAGGGCACTTTAT

Downstream 100 bases:

>100_bases
TTTTAGTGCTTTTATTAGTGGGGCCTATAAGGAGATTCAATGAAATATAGTTCAATATTTTCGATGCTTTCATTTTTTAT
ACTATTTGCCTGTAATGAGA

Product: rhsD element protein

Products: NA

Alternate protein names: NA

Number of amino acids: NA

Protein sequence:

>1426_residues
MSGKPAARQGDMTQYGGPIVQGSAGVRIGAPTGVACSVCPGGMTSGNPVNPLLGAKVLPGETDLALPGPLPFILSRTYSS
YRTKTPAPVGVFGPGWKAPSDIRLQLRDDGLILNDNGGRSIHFEPLLPGEAVYSRSESMWLVRGGKAAQPDGHTLARLWG
ALPPDIRLSPHLYLATNSAQGPWWILGWSERVPGAEDVLPAPLPPYRVLTGMADRFGRTLTYRREAAGDLAGEITGVTDG
AGREFRLVLTTQAQRAEEARTSSLSSSDSSRPLSASAFPDTLPGTEYGPDRGIRLSAVWLMHDPAYPESLPAAPLVRYTY
TEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEMRYRYDDTGRVVEQLNPAGLSYRYLYEQDRITVTDSLNRR
EVLHTEGGAGLKRVVKKELADGSVTRSGYDAAGRLTAQTDAAGRRTEYGLNVVSGDITDITTPDGRETKFYYNDGNQLTA
VVSPDGLESRREYDEPGRLVSETSRSGETVRYRYDDAHSELPATTTDATGSTRQMTWSRYGQLLAFTDCSGYQTRYEYDR
FGQMTAVHREEGISLYRRYDNRGRLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQGGLTRSME
YDAAGRVISLTNENGSHSVFSYDALDRLVQQGGFDGRTQRYHYDLTGKLTQSEDEGLVILWYYDESDRITHRTVNGEPAE
QWQYDGHGWLTDISHLSEGHRVAVHYGYDDKGRLTGECQTVENPETGELLWQHETKHAYNEQGLANRVTPDSLPPVEWLT
YGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGSMAGSNAAYELTSTYTPAGQLQSQHLNSLVYDRDYGWSDNGDLVRIS
GPRQTREYGYSATGRLESVRTLAPDLDIRIPYATDPAGNRLPDPELHPDSTLTVWPDNRIAEDAHYVYRHDEYGRLTEKT
DRIPAGVIRTDDERTHHYHYDSQHRLVFYTRIQHGEPLVESRYLYDPLGRRMAKRVWRRERDLTGWMSLSRKPEVTWYGW
DGDRLTTVQTDTTRIQTVYEPGSFTPLIRVETENGEREKAQRRSLAETLQQEGSENGHGVVFPAELVRLLDRLEEEIRAD
RVSSESRAWLAQCGLTVEQLARQVEPEYTPARKAHLYHCDHRGLPLALISEDGNTAWSAEYDEWGNQLNEENPHHVYQPY
RLPGQQHDEESGLYYNRHRYYDPLQGRYITQDPMGLKGGWNLYQYPLNPLQQIDPMGLLQTWDDARSGACTGGVCGVLSR
IIGPSKFDSTADAALDALKETQNRSLCNDMEYSGIVCKDTNGKYFASKAETDNLRKESYPLKRKCPTGTDRVAAYHTHGA
DSHGDYVDEFFSSSDKNLVRSKDNNLEAFYLATPDGRFEALNNKGEYIFIRNSVPGLSSVCIPYHD

Sequences:
NA

Specific function: NA

COG id: COG3209

COG function: function code M; Rhs family protein

Gene ontology:

Cell location: NA

Metaboloic importance: NA

Operon status: NA

Operon components: NA

Similarity: NA

Homologues:

NA

Paralogues:

NA

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: NA

Theoretical pI: NA

Prosite motif: NA

Important sites: NA

Signals:

NA

Transmembrane regions:

NA

Cys/Met content:

NA

Secondary structure: NA

PDB accession: NA

Resolution: NA

Structure class: NA

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: NA

TargetDB status: NA

Availability: NA

References: NA