| Definition | Yersinia pestis CO92 chromosome, complete genome. |
|---|---|
| Accession | NC_003143 |
| Length | 4,653,728 |
The map label for this gene is yhdA [H]
Identifier: 218930675
GI number: 218930675
Start: 4085044
End: 4086960
Strand: Direct
Name: yhdA [H]
Synonym: YPO3664
Alternate gene names: 218930675
Gene position: 4085044-4086960 (Clockwise)
Preceding gene: 218930663
Following gene: 218930676
Centisome position: 87.78
GC content: 48.15
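The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length (an interpretation that reproduces the listed value, not a documented definition). The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


GENOME_LENGTH = 4_653_728          # Length field of NC_003143
START, END = 4_085_044, 4_086_960  # Start / End fields of this record

length = gene_length(START, END)
position = round(centisome(START, GENOME_LENGTH), 2)
residues = length // 3 - 1         # codons minus the terminal stop codon

print(length)    # 1917
print(position)  # 87.78
print(residues)  # 638
```

The three printed values agree with the 1917-base gene sequence, the centisome position, and the 638-residue protein reported in this record.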
Gene sequence:
>1917_bases ATGCGATTTACCACCAAACTTTCTGCGTTTATGACGCTGCTGGTCGTATTGGCTATGTGTTTAGTCTTATTGGGCAGCAC GATCAGCTTCTTCTATTTGTGCCAGAAGAAAATGGAACAACGGCTCCAAAGTATTGTGACCGCTTACGATCAGTCACTGC TGCTGCAACCCATCAATAAAAAACGTGAATGGCTGCCACTGATGATGCAGACCCTTGGTGTGGTCGATGTCAGTGTCAAA AATAGCACCAGTACCCTCTATCAACTCCACATTCCTGCAGTTTATACGCCTTGGAACAGCCACAGTCGCTACCGTCAGAT GGTATTGCCTCTGTTGCATCAGCCAGGGACGGAGATGCATTTCAACTATATTGACCCGTTGGGTAGCTATGCCCGTTCTA TTTATGCCGCGGCCATTTTATCACTGGTGGTGGTGGTCATTGCGTTAACGTTGTTGTTGAGTTTCCGCTGGCTACGTGAC CAGACCGTGGGGCAGGAAAAACTTGAACGGCGAGCAAGGCGCATTCTCAATGGTGAACGTGAACATGCGGTGCGCAGCGA GGATTACGAATGGCCACCTTGTGCCAGCAGGGCTATTGACCATTTATTATCTGAGTTGATGGAGGTGCGGGCAGAACGCA ATCGGGTAGATACGTTAATCCGCACGTTTGCGGCTCAAGATGCTCAGACTGGCCTCAGTAACCGTCAGTTTTTTGATAAC CAACTGACCACACAACTTGAAGAGACCGGTGCGCACGGTGTGGTCATGATGGTGCAACTGCCCGATTTTGAAGCCTTAAA TGAAACGCATGATCAGCAGCAAGTACAGGAATTGATGAGTTCACTGGTTAACCTGCTGTCAACTTTTGTGGCACGCTACC CCTCAGCACTCTTGGCCCGCTACCTTAATAGTGACATCGCGATATTACTGCCCCACAAAACCCTGAAAGATGCCGATGTG ATGGCCGCACAGTTGGTGAATGCGGTGAGAACCTTACCGGAGCCTCATATTATTGATCGCGAATCACTGTTGCATATTGG CATCGTGGCTTACCGGAGTGGTGAGTCAGTCGAACAAATTATGGACAATGCCGGGCAGGCGACCAAAAGTGCGGCGCTGT ATGGTGGCAACGGTTGGTATGTGTTCGACACTCAGGTACCGGAGCGCGGACGTGGCAGTGTTAAATGGCGTACATTACTG GAACAGACTCTGGCGAGCGGAGGCCCAAGGCTTTATCAAAAACCCGTTATTACTGTTGATGGCAAAATAAGCCATCGTGA AATAATTAGCCGCATATATGATGGTGAGCAGGAGCTACTGGCGGCCGAGTTTATGCCCTTGGTACAATTGCTCGGATTGG GTGAACGTTATGATCGGCAAAAGATTGATAAGATAATTCCTTTATTATCTTTATGGCCGGATGAAACGCTGGCATTTTCT ATCAGCGTTGATTCATTATTACATCGTCCTTTCCAGCGCTGGTTGCGTGACACGCTATTACAGTGCAAAAAATCTGACCG AATGCGAATTATCTTTGAACTTGCAGAGGCAGATGTGTGTCAACATATCGAAGAAATCCGCCAAATGGTCCGTTTATTGC GTGGCGTAGGCTGCAAAGTGATGGCATCACAGGCTGGATTGACGGTTGTCAGCACCTCATATATTAAGTCTTTGCAGGTT GAAATGATAAAGCTTCATCCCGGTGTGGTAAGAAGCATTAATTTTCGCTATGAAAACCAGTTATTTGTAGAAAGTCTGAC CGGTGCCTGTGCAGGGACGCAGACGAAAGTTTTTGCGGCAGAAGTGCGTACCCGTGAAGAATGGCAAACGCTGCAAGAGA AAGGCGTTTACGGCGGACAGGGCAATTTCTTTGCTCCACCGACCCCATTAAACTCCGGAAAGAAAAAATATTCGTAA
Upstream 100 bases:
>100_bases AGTAAAAAATCAGCGTTACGGGTTATTTTTGTCATATAGTACTTGCCCCCATGGATTTGAGGGACTCTATTCATTTAATT GCACTAGGCAAGGCTAAGTA
Downstream 100 bases:
>100_bases TAGGGCCATGTTAGCCTGATCGTTGAGCAAAAATTAACGTAGAATGTGTCAGTCATTACCGTGAATCGTTGGTGTGCACT TACTTTCAACGAGAATCAGG
Product: regulatory protein CsrD
Products: NA
Alternate protein names: Regulator of CsrB and CsrC decay CsrD [H]
Number of amino acids: Translated: 638; Mature: 638
Protein sequence:
>638_residues MRFTTKLSAFMTLLVVLAMCLVLLGSTISFFYLCQKKMEQRLQSIVTAYDQSLLLQPINKKREWLPLMMQTLGVVDVSVK NSTSTLYQLHIPAVYTPWNSHSRYRQMVLPLLHQPGTEMHFNYIDPLGSYARSIYAAAILSLVVVVIALTLLLSFRWLRD QTVGQEKLERRARRILNGEREHAVRSEDYEWPPCASRAIDHLLSELMEVRAERNRVDTLIRTFAAQDAQTGLSNRQFFDN QLTTQLEETGAHGVVMMVQLPDFEALNETHDQQQVQELMSSLVNLLSTFVARYPSALLARYLNSDIAILLPHKTLKDADV MAAQLVNAVRTLPEPHIIDRESLLHIGIVAYRSGESVEQIMDNAGQATKSAALYGGNGWYVFDTQVPERGRGSVKWRTLL EQTLASGGPRLYQKPVITVDGKISHREIISRIYDGEQELLAAEFMPLVQLLGLGERYDRQKIDKIIPLLSLWPDETLAFS ISVDSLLHRPFQRWLRDTLLQCKKSDRMRIIFELAEADVCQHIEEIRQMVRLLRGVGCKVMASQAGLTVVSTSYIKSLQV EMIKLHPGVVRSINFRYENQLFVESLTGACAGTQTKVFAAEVRTREEWQTLQEKGVYGGQGNFFAPPTPLNSGKKKYS
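The protein sequence corresponds to a conceptual translation of the gene sequence. The sketch below assumes the standard codon assignments and builds the codon table from the conventional TCAG ordering; `translate` is a hypothetical helper, not a tool used by the database. It is applied only to the first and last few codons of the gene sequence listed above.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGATTTACCACC"))  # first five codons of the gene: MRFTT
print(translate("AAATATTCGTAA"))     # last four codons, ending in TAA: KYS
```

Both fragments match the start (`MRFTT`) and end (`KYS`) of the 638-residue protein sequence.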
Sequences:
>Translated_638_residues MRFTTKLSAFMTLLVVLAMCLVLLGSTISFFYLCQKKMEQRLQSIVTAYDQSLLLQPINKKREWLPLMMQTLGVVDVSVK NSTSTLYQLHIPAVYTPWNSHSRYRQMVLPLLHQPGTEMHFNYIDPLGSYARSIYAAAILSLVVVVIALTLLLSFRWLRD QTVGQEKLERRARRILNGEREHAVRSEDYEWPPCASRAIDHLLSELMEVRAERNRVDTLIRTFAAQDAQTGLSNRQFFDN QLTTQLEETGAHGVVMMVQLPDFEALNETHDQQQVQELMSSLVNLLSTFVARYPSALLARYLNSDIAILLPHKTLKDADV MAAQLVNAVRTLPEPHIIDRESLLHIGIVAYRSGESVEQIMDNAGQATKSAALYGGNGWYVFDTQVPERGRGSVKWRTLL EQTLASGGPRLYQKPVITVDGKISHREIISRIYDGEQELLAAEFMPLVQLLGLGERYDRQKIDKIIPLLSLWPDETLAFS ISVDSLLHRPFQRWLRDTLLQCKKSDRMRIIFELAEADVCQHIEEIRQMVRLLRGVGCKVMASQAGLTVVSTSYIKSLQV EMIKLHPGVVRSINFRYENQLFVESLTGACAGTQTKVFAAEVRTREEWQTLQEKGVYGGQGNFFAPPTPLNSGKKKYS >Mature_638_residues MRFTTKLSAFMTLLVVLAMCLVLLGSTISFFYLCQKKMEQRLQSIVTAYDQSLLLQPINKKREWLPLMMQTLGVVDVSVK NSTSTLYQLHIPAVYTPWNSHSRYRQMVLPLLHQPGTEMHFNYIDPLGSYARSIYAAAILSLVVVVIALTLLLSFRWLRD QTVGQEKLERRARRILNGEREHAVRSEDYEWPPCASRAIDHLLSELMEVRAERNRVDTLIRTFAAQDAQTGLSNRQFFDN QLTTQLEETGAHGVVMMVQLPDFEALNETHDQQQVQELMSSLVNLLSTFVARYPSALLARYLNSDIAILLPHKTLKDADV MAAQLVNAVRTLPEPHIIDRESLLHIGIVAYRSGESVEQIMDNAGQATKSAALYGGNGWYVFDTQVPERGRGSVKWRTLL EQTLASGGPRLYQKPVITVDGKISHREIISRIYDGEQELLAAEFMPLVQLLGLGERYDRQKIDKIIPLLSLWPDETLAFS ISVDSLLHRPFQRWLRDTLLQCKKSDRMRIIFELAEADVCQHIEEIRQMVRLLRGVGCKVMASQAGLTVVSTSYIKSLQV EMIKLHPGVVRSINFRYENQLFVESLTGACAGTQTKVFAAEVRTREEWQTLQEKGVYGGQGNFFAPPTPLNSGKKKYS
Specific function: Serves as a specificity factor required for RNase E-mediated decay of the small global regulatory RNAs CsrB and CsrC; it is probably not a nuclease. Nor does its activity involve c-di-GMP, despite its domain composition. Positively modulates motility ge
COG id: COG2200
COG function: function code T; FOG: EAL domain
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 GGDEF domain [H]
Homologues:
Organism=Escherichia coli, GI1789650, Length=641, Percent_Identity=50.7020280811232, Blast_Score=675, Evalue=0.0
Organism=Escherichia coli, GI87081921, Length=440, Percent_Identity=22.5, Blast_Score=77, Evalue=3e-15
Organism=Escherichia coli, GI1787541, Length=433, Percent_Identity=21.2471131639723, Blast_Score=64, Evalue=2e-11
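The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records. This is a rough sketch under the assumption that organism names contain no commas and that every hit starts with `Organism=`; `parse_homologues` and the dictionary keys are hypothetical names chosen for illustration.

```python
def parse_homologues(text: str) -> list:
    """Split a homologue listing into one dict per hit (assumed field layout)."""
    hits = []
    for chunk in text.split("Organism=")[1:]:
        # Drop surrounding whitespace and the trailing comma, then split fields.
        fields = [f.strip() for f in chunk.strip().strip(",").split(",")]
        hit = {"organism": fields[0], "gi": fields[1].removeprefix("GI")}
        for field in fields[2:]:
            key, value = field.split("=", 1)
            # Length and Blast_Score are integers; the rest are floats.
            hit[key] = int(value) if key in ("Length", "Blast_Score") else float(value)
        hits.append(hit)
    return hits


record = (
    "Organism=Escherichia coli, GI1789650, Length=641, "
    "Percent_Identity=50.7020280811232, Blast_Score=675, Evalue=0.0, "
    "Organism=Escherichia coli, GI87081921, Length=440, "
    "Percent_Identity=22.5, Blast_Score=77, Evalue=3e-15, "
    "Organism=Escherichia coli, GI1787541, Length=433, "
    "Percent_Identity=21.2471131639723, Blast_Score=64, Evalue=2e-11,"
)

hits = parse_homologues(record)
best = max(hits, key=lambda h: h["Blast_Score"])
print(best["gi"], round(best["Percent_Identity"], 1))  # 1789650 50.7
```

The parser splits on `Organism=` rather than on line breaks, so it works whether the hits are on one line or on separate lines. `str.removeprefix` requires Python 3.9 or later.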
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054
- InterPro: IPR000160
- InterPro: IPR001633 [H]
Pfam domain/function: PF00563 EAL; PF00990 GGDEF [H]
EC number: NA
Molecular weight: Translated: 72417; Mature: 72417
Theoretical pI: Translated: 8.43; Mature: 8.43
Prosite motif: PS50883 EAL ; PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
3.0 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
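The Cys/Met percentages are residue counts divided by protein length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the record; `cys_met_content` is an illustrative name, and the short input string is a toy example rather than the real protein.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M), and their sum in a one-letter sequence."""
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy 10-residue sequence with one Cys and one Met.
print(cys_met_content("MCAAAAAAAA"))

# Approximate counts implied by the rounded percentages for a 638-residue protein.
implied_cys = round(1.1 / 100 * 638)
implied_met = round(3.0 / 100 * 638)
print(implied_cys, implied_met)  # 7 19
```

Because the listed percentages are rounded, the implied counts are approximate.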
Secondary structure:
>Translated Secondary Structure
MRFTTKLSAFMTLLVVLAMCLVLLGSTISFFYLCQKKMEQRLQSIVTAYDQSLLLQPINK
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
KREWLPLMMQTLGVVDVSVKNSTSTLYQLHIPAVYTPWNSHSRYRQMVLPLLHQPGTEMH
HHHHHHHHHHHHCCEEEEECCCCCEEEEEECCEEECCCCCHHHHHHHHHHHHHCCCCCCC
FNYIDPLGSYARSIYAAAILSLVVVVIALTLLLSFRWLRDQTVGQEKLERRARRILNGER
CCHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCH
EHAVRSEDYEWPPCASRAIDHLLSELMEVRAERNRVDTLIRTFAAQDAQTGLSNRQFFDN
HHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH
QLTTQLEETGAHGVVMMVQLPDFEALNETHDQQQVQELMSSLVNLLSTFVARYPSALLAR
HHHHHHHHCCCCCEEEEEECCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YLNSDIAILLPHKTLKDADVMAAQLVNAVRTLPEPHIIDRESLLHIGIVAYRSGESVEQI
HHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHEEEECCCCHHHHH
MDNAGQATKSAALYGGNGWYVFDTQVPERGRGSVKWRTLLEQTLASGGPRLYQKPVITVD
HHCCCCHHHCCEEECCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEC
GKISHREIISRIYDGEQELLAAEFMPLVQLLGLGERYDRQKIDKIIPLLSLWPDETLAFS
CCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCEEEE
ISVDSLLHRPFQRWLRDTLLQCKKSDRMRIIFELAEADVCQHIEEIRQMVRLLRGVGCKV
EEHHHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
MASQAGLTVVSTSYIKSLQVEMIKLHPGVVRSINFRYENQLFVESLTGACAGTQTKVFAA
ECCCCCCCEEHHHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHCCCCCHHHHHH
EVRTREEWQTLQEKGVYGGQGNFFAPPTPLNSGKKKYS
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MRFTTKLSAFMTLLVVLAMCLVLLGSTISFFYLCQKKMEQRLQSIVTAYDQSLLLQPINK
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
KREWLPLMMQTLGVVDVSVKNSTSTLYQLHIPAVYTPWNSHSRYRQMVLPLLHQPGTEMH
HHHHHHHHHHHHCCEEEEECCCCCEEEEEECCEEECCCCCHHHHHHHHHHHHHCCCCCCC
FNYIDPLGSYARSIYAAAILSLVVVVIALTLLLSFRWLRDQTVGQEKLERRARRILNGER
CCHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCH
EHAVRSEDYEWPPCASRAIDHLLSELMEVRAERNRVDTLIRTFAAQDAQTGLSNRQFFDN
HHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH
QLTTQLEETGAHGVVMMVQLPDFEALNETHDQQQVQELMSSLVNLLSTFVARYPSALLAR
HHHHHHHHCCCCCEEEEEECCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YLNSDIAILLPHKTLKDADVMAAQLVNAVRTLPEPHIIDRESLLHIGIVAYRSGESVEQI
HHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHEEEECCCCHHHHH
MDNAGQATKSAALYGGNGWYVFDTQVPERGRGSVKWRTLLEQTLASGGPRLYQKPVITVD
HHCCCCHHHCCEEECCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEC
GKISHREIISRIYDGEQELLAAEFMPLVQLLGLGERYDRQKIDKIIPLLSLWPDETLAFS
CCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCEEEE
ISVDSLLHRPFQRWLRDTLLQCKKSDRMRIIFELAEADVCQHIEEIRQMVRLLRGVGCKV
EEHHHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
MASQAGLTVVSTSYIKSLQVEMIKLHPGVVRSINFRYENQLFVESLTGACAGTQTKVFAA
ECCCCCCCEEHHHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHCCCCCHHHHHH
EVRTREEWQTLQEKGVYGGQGNFFAPPTPLNSGKKKYS
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC
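The per-residue state strings can be summarized as a composition. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out; `ss_composition` is an illustrative name, and the input is a toy string rather than the full prediction.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split())  # tolerate line breaks between blocks
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy example: 3 coil, 5 helix, 2 strand positions.
print(ss_composition("CCCHHHHHEE"))  # {'H': 50.0, 'E': 20.0, 'C': 30.0}
```

Passing the concatenated state lines of the translated prediction would give the overall helix, strand, and coil fractions for the protein.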
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9278503; 3049542 [H]