| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
The map label for this gene is yjiY [H]
Identifier: 157163790
GI number: 157163790
Start: 4581121
End: 4583286
Strand: Reverse
Name: yjiY [H]
Synonym: EcHS_A4574
Alternate gene names: 157163790
Gene position: 4583286-4581121 (Counterclockwise)
Preceding gene: 157163791
Following gene: 157163789
Centisome position: 98.7
GC content: 56.0
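The positional fields above can be cross-checked against one another. The following is a minimal sketch, not the database's own method: the function names are hypothetical, the inclusive 1-based coordinate convention is an assumption, and so is computing the centisome from the Start coordinate (the End coordinate gives the same rounded value here). The "mature" length is taken to be the translated length minus the initiator methionine, which matches the mature sequence in this record starting at the second residue.

```python
# Values copied from this record.
GENOME_LENGTH = 4_643_538
START, END = 4_581_121, 4_583_286


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_lengths(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends in one stop codon and that the mature
    form only loses the initiator methionine.
    """
    codons, remainder = divmod(n_bases, 3)
    if remainder:
        raise ValueError("coding sequence length is not a multiple of 3")
    translated = codons - 1  # drop the stop codon
    mature = translated - 1  # drop the initiator Met
    return translated, mature


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to one decimal."""
    return round(100 * position / genome_length, 1)


print(gene_length(START, END))          # 2166
print(protein_lengths(2166))            # (721, 720)
print(centisome(START, GENOME_LENGTH))  # 98.7
```

Under these assumptions the results agree with the 2166-base gene sequence, the 721/720 residue counts, and the 98.7 centisome position given in the record.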
Gene sequence:
>2166_bases ATGCCAGGTTTTACTATGGATACGAAAAAACTATTCAAGCACATACCCTGGGTGATTCTCGGAATCATCGGTGCATTCTG CCTCGCGGTAGTTGCATTACGTCGGGGGGAGCACGTCAGCGCCCTGTGGATCGTGGTCGCCTCTGTGTCGGTGTATCTGG TGGCGTATCGCTACTACAGTCTGTACATCGCCCAGAAGGTGATGAAACTCGACCCCACGCGCGCGACGCCTGCGGTTATT AACAACGACGGTCTGAACTACGTTCCGACCAACCGTTACGTGTTGTTTGGTCACCACTTCGCCGCTATCGCCGGTGCTGG TCCGCTGGTCGGTCCGGTTCTCGCCGCACAGATGGGCTACCTGCCTGGCACGCTGTGGCTGCTGGCGGGGGTAGTACTGG CCGGTGCGGTTCAGGACTTTATGGTGCTGTTTATCTCCTCTCGCCGTAACGGCGCATCTCTCGGTGAGATGATCAAAGAA GAGATGGGACCAGTACCGGGGACTATCGCGCTGTTTGGCTGTTTCTTAATCATGATCATCATCCTCGCCGTCCTGGCGCT GATTGTGGTTAAAGCCCTGGCCGAAAGTCCGTGGGGTGTCTTCACCGTTTGCTCAACCGTACCGATTGCGCTGTTTATGG GTATCTACATGCGCTTTATCCGTCCGGGGCGTGTGGGTGAAGTCTCTGTTATTGGTATCGTGCTGCTGGTTGCCTCTATC TACTTCGGTGGCGTGATTGCTCACGATCCGTACTGGGGTCCGGCACTGACCTTTAAAGACACCACCATTACCTTCGCGCT GATTGGCTATGCGTTTGTTTCCGCACTGCTGCCGGTGTGGCTGATCCTCGCACCGCGCGACTATCTGGCAACCTTCCTGA AAATCGGCGTTATCGTCGGCCTGGCGCTGGGTATCGTGGTGCTGAACCCGGAACTGAAAATGCCTGCGATGACCCAGTAC ATTGACGGTACTGGCCCGCTGTGGAAAGGCGCTCTGTTCCCGTTCCTGTTCATCACCATCGCCTGTGGTGCGGTATCTGG CTTCCACGCGCTGATCTCTTCCGGTACGACGCCGAAACTGCTGGCTAACGAAACCGATGCGCGTTTCATTGGCTACGGCG CAATGCTGATGGAGTCCTTCGTGGCGATTATGGCGCTGGTTGCAGCGTCCATCATCGAACCGGGTCTTTACTTCGCGATG AACACCCCGCCTGCGGGCCTTGGCATCACCATGCCTAACCTGCATGAAATGGGTGGCGAGAACGCGCCGATCATCATGGC GCAGCTGAAAGACGTTACCGCACACGCGGCAGCGACCGTCAGCTCCTGGGGCTTCGTGATTTCGCCAGAGCAGATCCTGC AAACCGCGAAAGACATTGGTGAGCCTTCTGTCCTGAACCGTGCAGGTGGTGCGCCAACGCTGGCGGTAGGTATCGCTCAT GTGTTCCACAAAGTGCTGCCGATGGCTGACATGGGCTTCTGGTATCACTTCGGTATTCTGTTCGAAGCCCTGTTCATCCT GACCGCGCTGGATGCGGGTACCCGTTCTGGCCGCTTTATGCTGCAAGACCTGCTGGGTAACTTCATCCCGTTCCTGAAAA AAACCGACTCTCTGGTTGCTGGTATCATCGGTACTGCGGGCTGTGTGGGTCTGTGGGGCTACCTGCTGTATCAGGGCGTG GTCGATCCGCTGGGCGGCGTTAAGAGCCTGTGGCCGCTGTTCGGTATCTCCAACCAGATGCTGGCAGCCGTAGCGCTGGT ACTGGGCACCGTTGTGCTGATTAAGATGAAGCGCACCCAATACATCTGGGTAACTGTTGTTCCGGCTGTATGGCTGCTTA TTTGCACCACCTGGGCGCTGGGTCTGAAACTGTTCAGCACCAACCCGCAGATGGAAGGCTTCTTCTACATGGCAAGCCAG 
TACAAAGAGAAGATTGCTAACGGTACTGACCTGACGGCGCAGCAGATTGCCAACATGAACCACATCGTTGTGAACAACTA CACCAACGCAGGCCTGAGTATTCTGTTCCTGATTGTGGTGTACAGCATCATCTTCTACGGTTTCAAAACCTGGCTTGCGG TGCGTAACAGCGACAAACGTACTGACAAAGAAACTCCGTACGTTCCAATCCCGGAAGGCGGCGTGAAGATCTCTTCGCAC CACTAA
Upstream 100 bases:
>100_bases AAGGCAGCAAGTGAGTGAATCCCCGGGAGCTTACAACAGTAAGTGACAGGGGTGAACGAACGCAACTGCCGCACCTGTAA GCCAAAAGACGACGAGTAAA
Downstream 100 bases:
>100_bases CCGTGTTTAGCCCCGCTTCGGCGGGGCTTTGTTCTATCAGAGTGAACTATGTTTGGTAACTTAGGACAGGCAAAAAAATA TCTCGGCCAGGCGGCGAAGA
Product: carbon starvation family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 721; Mature: 720
Protein sequence:
>721_residues MPGFTMDTKKLFKHIPWVILGIIGAFCLAVVALRRGEHVSALWIVVASVSVYLVAYRYYSLYIAQKVMKLDPTRATPAVI NNDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKE EMGPVPGTIALFGCFLIMIIILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASI YFGGVIAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVGLALGIVVLNPELKMPAMTQY IDGTGPLWKGALFPFLFITIACGAVSGFHALISSGTTPKLLANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAM NTPPAGLGITMPNLHEMGGENAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAH VFHKVLPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVAGIIGTAGCVGLWGYLLYQGV VDPLGGVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQYIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQ YKEKIANGTDLTAQQIANMNHIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSH H
Sequences:
>Translated_721_residues MPGFTMDTKKLFKHIPWVILGIIGAFCLAVVALRRGEHVSALWIVVASVSVYLVAYRYYSLYIAQKVMKLDPTRATPAVI NNDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKE EMGPVPGTIALFGCFLIMIIILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASI YFGGVIAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVGLALGIVVLNPELKMPAMTQY IDGTGPLWKGALFPFLFITIACGAVSGFHALISSGTTPKLLANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAM NTPPAGLGITMPNLHEMGGENAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAH VFHKVLPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVAGIIGTAGCVGLWGYLLYQGV VDPLGGVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQYIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQ YKEKIANGTDLTAQQIANMNHIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSH H >Mature_720_residues PGFTMDTKKLFKHIPWVILGIIGAFCLAVVALRRGEHVSALWIVVASVSVYLVAYRYYSLYIAQKVMKLDPTRATPAVIN NDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKEE MGPVPGTIALFGCFLIMIIILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASIY FGGVIAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVGLALGIVVLNPELKMPAMTQYI DGTGPLWKGALFPFLFITIACGAVSGFHALISSGTTPKLLANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAMN TPPAGLGITMPNLHEMGGENAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAHV FHKVLPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVAGIIGTAGCVGLWGYLLYQGVV DPLGGVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQYIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQY KEKIANGTDLTAQQIANMNHIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSHH
Specific function: Unknown
COG id: COG1966
COG function: function code T; Carbon starvation protein, predicted membrane protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the CstA family [H]
Homologues:
Organism=Escherichia coli, GI87082431, Length=716, Percent_Identity=99.7206703910614, Blast_Score=1436, Evalue=0.0
Organism=Escherichia coli, GI1786814, Length=713, Percent_Identity=61.4305750350631, Blast_Score=870, Evalue=0.0
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003706 [H]
Pfam domain/function: PF02554 CstA [H]
EC number: NA
Molecular weight: Translated: 77845; Mature: 77713
Theoretical pI: Translated: 8.93; Mature: 8.93
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.8 % Cys, 3.7 % Met, 4.6 % Cys+Met
Mature protein: 0.8 % Cys, 3.6 % Met, 4.4 % Cys+Met
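Percentages of this kind can be derived by counting residues in a one-letter protein string. The sketch below is illustrative only: `cys_met_percent` is a hypothetical helper, and rounding each figure to one decimal independently is an assumption about how the record was produced. Rounding the combined figure from the unrounded total would be one possible explanation for a combined value that differs slightly from the sum of the two rounded values.

```python
def cys_met_percent(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    seq = protein.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        # Rounded from the combined count, not from the rounded parts.
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# First ten residues of the translated protein in this record.
fragment = "MPGFTMDTKK"
print(cys_met_percent(fragment))      # translated form of the fragment
print(cys_met_percent(fragment[1:]))  # "mature" form, initiator Met removed
```

Running the helper on the full translated and mature sequences, rather than on this short fragment, is how the figures listed in the record could be checked.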
Secondary structure:
>Translated Secondary Structure MPGFTMDTKKLFKHIPWVILGIIGAFCLAVVALRRGEHVSALWIVVASVSVYLVAYRYYS CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH LYIAQKVMKLDPTRATPAVINNDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGY HHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCEEEEECHHHHHHCCCHHHHHHHHHHHCC LPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKEEMGPVPGTIALFGCFLIMII CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH ILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASI HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH YFGGVIAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVG HHCCEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH LALGIVVLNPELKMPAMTQYIDGTGPLWKGALFPFLFITIACGAVSGFHALISSGTTPKL HHHHHEEECCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC LANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAMNTPPAGLGITMPNLHEMGGE CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHCCCC NAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAH CCCEEEHHHHHHHHHHHHHHHCCCCEECHHHHHHHHHHCCCCHHHHCCCCCCHHHHHHHH VFHKVLPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH GIIGTAGCVGLWGYLLYQGVVDPLGGVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQ HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHEECCCC YIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQYKEKIANGTDLTAQQIANMN EEEEEHHHHHHHHHHHHHHHHHEEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHCCCC HIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSH EEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCEEECCC H C >Mature Secondary Structure PGFTMDTKKLFKHIPWVILGIIGAFCLAVVALRRGEHVSALWIVVASVSVYLVAYRYYS CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH LYIAQKVMKLDPTRATPAVINNDGLNYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGY HHHHHHHHHCCCCCCCCCEECCCCCCCCCCCCEEEEECHHHHHHCCCHHHHHHHHHHHCC LPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKEEMGPVPGTIALFGCFLIMII CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH ILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASI 
HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH YFGGVIAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVG HHCCEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH LALGIVVLNPELKMPAMTQYIDGTGPLWKGALFPFLFITIACGAVSGFHALISSGTTPKL HHHHHEEECCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC LANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAMNTPPAGLGITMPNLHEMGGE CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHCCCC NAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAH CCCEEEHHHHHHHHHHHHHHHCCCCEECHHHHHHHHHHCCCCHHHHCCCCCCHHHHHHHH VFHKVLPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH GIIGTAGCVGLWGYLLYQGVVDPLGGVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQ HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHEECCCC YIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQYKEKIANGTDLTAQQIANMN EEEEEHHHHHHHHHHHHHHHHHEEECCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHCCCC HIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSH EEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEECCCCCEEECCC H C
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7610040; 9278503 [H]