Definition | Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome. |
---|---|
Accession | NC_004631 |
Length | 4,791,961 |
The map label for this gene is yebT [H]
Identifier: 29141505
GI number: 29141505
Start: 1111037
End: 1113676
Strand: Reverse
Name: yebT [H]
Synonym: t1029
Alternate gene names: 29141505
Gene position: 1113676-1111037 (Counterclockwise)
Preceding gene: 29141506
Following gene: 29141504
Centisome position: 23.24
GC content: 54.62
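The positional fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's documented method: the function names are hypothetical, and the choice of the upper coordinate (1113676, where a reverse-strand gene begins) for the centisome calculation is an assumption that happens to reproduce the listed value of 23.24.

```python
GENOME_LENGTH = 4_791_961  # taken from the Length field of this record


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature with 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    print(gene_length(1111037, 1113676))   # 2640, matching the >2640_bases header
    print(round(centisome(1113676), 2))    # 23.24, matching Centisome position
```

Passing the full gene sequence from this record to `gc_content` should give a value comparable to the listed GC content, provided whitespace is stripped from the sequence first.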
Gene sequence:
>2640_bases ATGCACATGAGTCAGGAAACGCCCGCTTCGAAGACTGAAGCGCAAATTAAAACCAAACGCCGTATTTCACCTTTCTGGCT GCTACCGCTTATCGCGCTAATGATCGCGGGGTGGCTGGTATGGGATAGCTACCAGGATCGCGGCAATAGCGTGACTATCG ATTTTATGTCGGCGGACGGTATCGTACCGGGCCGTACTCCCGTGCGTTATCAGGGAGTAGAAGTCGGCACCGTGGAAGAT GTCAGTCTGAGCAAAGATCTGCGCAAAATTGAAGTTCGCGTCAGTATCAAATCAGATATGGAAGATGCGTTGCGCGAAGA GACGCAATTCTGGCTGGTGACGCCCAAAGCCTCGCTGGCGGGCGTTTCCGGCCTGGATGCGTTGGTCGGCGGGAATTACA TCGGTATGATGCCAGGTAAAGGCAAGCCCAGAGATCATTTCGTCGCCCTAGATACACAGCCTAAATACCGGCTTAGCAAC GGCGATCTGATGATTCATCTCAATGCGCCGGATCTCGGTTCGCTTAATAGCGGTTCACTGGTCTATTTCCGTAAAATCCC TGTCGGACGGGTGTATGACTATTCGATTAACCCTAACAAACAGGGCGTGACGATTGACGTTCTGATTGAGCGACGGTTTA CCGATCTGGTGAAAAAAGGCAGCCGTTTTTGGAATGTCTCCGGCATTGACGCCGATCTAAGCCTGAGCGGCGCGAAGGTG AAACTGGAGAGCCTCGCGGCCCTGGTCAATGGCGCGATTGCGTTTGACTCACCGGACAATTCCAAACCCGCCGCCCAGGA TGACACGTTCGGCTTATATAAAGATTTAGCCCACAGCCAACGAGGGGTAATCGTTAAACTTGAGCTGCCCAGCGGAGACG GTCTGAAAGCGGAATCTACGCCGCTAATGTACCAGGGACTGGAGGTGGGTGAGCTTTCTAAACTGACGCTCAACCCTGGC GGCAAAGTCACCGGAGAGATGACCGTCGATCCCAGCGTTGTTCCGCTGATGCGGGAAAATACGCGTATTGAGTTACGCAA TCCCAAACTGTCGCTAAGTGACGCGAATATCAGTTCGTTGTTAACCGGAAAAACCTTCGAGCTGGTGCCGGGCGACGGCG AACCACGCAGTGAATTTGTGGTGGTGCCGGGTGAAAAAGCCCTGCTGCATGAGGCGAATGCCTTAACCCTGACGCTGACG GCCCCGGAAAGTTACGGCATCGAACCGGGCCAGCCGTTAATTTTACATGGCGTAAAAATTGGCCAGGTCATTGAGCGCAA CTTATCCAGTAAAGGCGTGTCATTCATCGTCGCGATTGAACCGCAGCACCGGGATTTGGTACAGGGCGACAGTAAATTCG TGGTCAACAGCCGGGTGGATGTCAAAGTCGGCCTTGACGGCGTAGAGTTCCTCGGCGCCAGCGCCAGCGAGTGGATTGAC GGCGGAATTCGTATTTTACCCGGTACGAGCGGGAAGATGAAATCCACCTACCCGCTCTATGCTAACCTGGAAAAAGCGCT GGAAAATAGCCTCAGTGACTTACCGACTACCACCCTGACGCTGACGGCCGAAACGTTGCCGGATGTCCAGGCAGGTTCCG TCGTGCTGTATCGAAAATTTGAAGTAGGCGAAGTCATCACCGTTCGCCCACGCGCCAATACCTTTGACATCGACCTGCAT ATTAAGCCGGAATATCGCCACCTGTTAACCAGCAATAGCGTGTTCTGGGCGGAAGGCGGCGCGAAGGTGCAACTTAACGG CAGCGGCCTAACGGTACAGGCCTCGCCACTCTCCCGTGCGCTGAAAGGGGCCATTAGTTTTGATAACCTGAGCGGCGCCA GCGCCAGTCGGCGCAAAGGCGATAAACGCATTCTTTATGCTTCAGAAACTTCCGCCCGCGCGGTAGGCGGACAAATTACG 
CTACACGCGTTCGACGCCGGAAAATTGGCGGAGGGGATGCCCATTCGTTACCTCGGTATTGATATCGGCCAGATCCAGAC GCTGGAATTGATCACCGCACGTAATGAAGTGCAGGCAAAAGCCGTACTTTATCCGGAGTACGTGCAAACATTTGCCCGCG CCGGGACACGTTTTTCCGTTATTACGCCACAAATCTCCGCGGCGGGCGTCGAGCATCTGGATACGATTCTCCAGCCCTAT ATTAACGTTGAGCCAGGACGCGGCGCGGCGCGGCGCGACTTTGAACTGCAGGAAGCCACGATTACCGACTCACGCTATCT GGATGGGTTAAGCATCGTCGTGGAGGCGCCAGAAGCAGGCTCGCTTAATATTGGCACACCCGTCCTGTTTCGCGGTATCG AAGTGGGAACCGTCACCGGAATGTCCCTGGGGTCGCTCTCCGATCGCGTGATGATCACCTTGCGCATCAGTAAGCGTTAC CAATATCTGGTGCGTAATAACTCCGTATTCTGGCTTGCCTCCGGCTATAGTCTCGACTTTGGTCTGACAGGCGGCGTGGT GAAAACGGGGACATTTAATCAATTCATCCGTGGCGGTATCGCCTTCGCTACTCCACCAGGTACGCCGCTGGCGCCAAAAG CGCAAGCCGGTAAGCATTTCCTGTTACAAGAGAGCGAACCGAAAGAGTGGCGTGAATGGGGTACCGCTCTGCCACGTTAA
Upstream 100 bases:
>100_bases ATTCTGGCTTTTACTATGGGACCGGCTGCGTTTTATTTCGGCGCAGCGGTAATTTTGACTATTCTTGCAGTGGAATGGCT GGATAGCCGCTTACTTTGGG
Downstream 100 bases:
>100_bases ACACCAGGCTCCGGCGTACTCGCGCCGGAGCGTTTTATGCTACACTGCGCGCCTGTTTTTTTGCCGGCGATACACCTGTG GCTCAACACGCTGTCTATTT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 879; Mature: 879
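The residue count follows from the gene length: 2640 bases form 880 codons, and the final codon of the gene sequence in this record (TAA) is a stop codon, leaving 879 residues. A minimal sketch of that check, with a hypothetical function name:

```python
def residues_from_cds(n_bases: int, has_stop: bool = True) -> int:
    """Number of encoded residues for a coding sequence of n_bases bases.

    Assumes the sequence is a complete reading frame; the terminal stop
    codon, if present, does not contribute a residue.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(residues_from_cds(2640))  # 879
```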
Protein sequence:
>879_residues MHMSQETPASKTEAQIKTKRRISPFWLLPLIALMIAGWLVWDSYQDRGNSVTIDFMSADGIVPGRTPVRYQGVEVGTVED VSLSKDLRKIEVRVSIKSDMEDALREETQFWLVTPKASLAGVSGLDALVGGNYIGMMPGKGKPRDHFVALDTQPKYRLSN GDLMIHLNAPDLGSLNSGSLVYFRKIPVGRVYDYSINPNKQGVTIDVLIERRFTDLVKKGSRFWNVSGIDADLSLSGAKV KLESLAALVNGAIAFDSPDNSKPAAQDDTFGLYKDLAHSQRGVIVKLELPSGDGLKAESTPLMYQGLEVGELSKLTLNPG GKVTGEMTVDPSVVPLMRENTRIELRNPKLSLSDANISSLLTGKTFELVPGDGEPRSEFVVVPGEKALLHEANALTLTLT APESYGIEPGQPLILHGVKIGQVIERNLSSKGVSFIVAIEPQHRDLVQGDSKFVVNSRVDVKVGLDGVEFLGASASEWID GGIRILPGTSGKMKSTYPLYANLEKALENSLSDLPTTTLTLTAETLPDVQAGSVVLYRKFEVGEVITVRPRANTFDIDLH IKPEYRHLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRALKGAISFDNLSGASASRRKGDKRILYASETSARAVGGQIT LHAFDAGKLAEGMPIRYLGIDIGQIQTLELITARNEVQAKAVLYPEYVQTFARAGTRFSVITPQISAAGVEHLDTILQPY INVEPGRGAARRDFELQEATITDSRYLDGLSIVVEAPEAGSLNIGTPVLFRGIEVGTVTGMSLGSLSDRVMITLRISKRY QYLVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGIAFATPPGTPLAPKAQAGKHFLLQESEPKEWREWGTALPR
Sequences:
>Translated_879_residues MHMSQETPASKTEAQIKTKRRISPFWLLPLIALMIAGWLVWDSYQDRGNSVTIDFMSADGIVPGRTPVRYQGVEVGTVED VSLSKDLRKIEVRVSIKSDMEDALREETQFWLVTPKASLAGVSGLDALVGGNYIGMMPGKGKPRDHFVALDTQPKYRLSN GDLMIHLNAPDLGSLNSGSLVYFRKIPVGRVYDYSINPNKQGVTIDVLIERRFTDLVKKGSRFWNVSGIDADLSLSGAKV KLESLAALVNGAIAFDSPDNSKPAAQDDTFGLYKDLAHSQRGVIVKLELPSGDGLKAESTPLMYQGLEVGELSKLTLNPG GKVTGEMTVDPSVVPLMRENTRIELRNPKLSLSDANISSLLTGKTFELVPGDGEPRSEFVVVPGEKALLHEANALTLTLT APESYGIEPGQPLILHGVKIGQVIERNLSSKGVSFIVAIEPQHRDLVQGDSKFVVNSRVDVKVGLDGVEFLGASASEWID GGIRILPGTSGKMKSTYPLYANLEKALENSLSDLPTTTLTLTAETLPDVQAGSVVLYRKFEVGEVITVRPRANTFDIDLH IKPEYRHLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRALKGAISFDNLSGASASRRKGDKRILYASETSARAVGGQIT LHAFDAGKLAEGMPIRYLGIDIGQIQTLELITARNEVQAKAVLYPEYVQTFARAGTRFSVITPQISAAGVEHLDTILQPY INVEPGRGAARRDFELQEATITDSRYLDGLSIVVEAPEAGSLNIGTPVLFRGIEVGTVTGMSLGSLSDRVMITLRISKRY QYLVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGIAFATPPGTPLAPKAQAGKHFLLQESEPKEWREWGTALPR >Mature_879_residues MHMSQETPASKTEAQIKTKRRISPFWLLPLIALMIAGWLVWDSYQDRGNSVTIDFMSADGIVPGRTPVRYQGVEVGTVED VSLSKDLRKIEVRVSIKSDMEDALREETQFWLVTPKASLAGVSGLDALVGGNYIGMMPGKGKPRDHFVALDTQPKYRLSN GDLMIHLNAPDLGSLNSGSLVYFRKIPVGRVYDYSINPNKQGVTIDVLIERRFTDLVKKGSRFWNVSGIDADLSLSGAKV KLESLAALVNGAIAFDSPDNSKPAAQDDTFGLYKDLAHSQRGVIVKLELPSGDGLKAESTPLMYQGLEVGELSKLTLNPG GKVTGEMTVDPSVVPLMRENTRIELRNPKLSLSDANISSLLTGKTFELVPGDGEPRSEFVVVPGEKALLHEANALTLTLT APESYGIEPGQPLILHGVKIGQVIERNLSSKGVSFIVAIEPQHRDLVQGDSKFVVNSRVDVKVGLDGVEFLGASASEWID GGIRILPGTSGKMKSTYPLYANLEKALENSLSDLPTTTLTLTAETLPDVQAGSVVLYRKFEVGEVITVRPRANTFDIDLH IKPEYRHLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRALKGAISFDNLSGASASRRKGDKRILYASETSARAVGGQIT LHAFDAGKLAEGMPIRYLGIDIGQIQTLELITARNEVQAKAVLYPEYVQTFARAGTRFSVITPQISAAGVEHLDTILQPY INVEPGRGAARRDFELQEATITDSRYLDGLSIVVEAPEAGSLNIGTPVLFRGIEVGTVTGMSLGSLSDRVMITLRISKRY QYLVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGIAFATPPGTPLAPKAQAGKHFLLQESEPKEWREWGTALPR
Specific function: Unknown
COG id: COG3008
COG function: function code R; Paraquat-inducible protein B
Gene ontology:
Cell location: Membrane; Single-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the pqiB family [H]
Homologues:
Organism=Escherichia coli, GI87081984, Length=877, Percent_Identity=90.5359179019384, Blast_Score=1637, Evalue=0.0
Organism=Escherichia coli, GI1787184, Length=347, Percent_Identity=29.971181556196, Blast_Score=165, Evalue=1e-41
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003399 [H]
Pfam domain/function: PF02470 MCE [H]
EC number: NA
Molecular weight: Translated: 95317; Mature: 95317
Theoretical pI: Translated: 6.99; Mature: 6.99
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
1.7 %Met (Mature Protein)
1.7 %Cys+Met (Mature Protein)
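Composition percentages of this kind are simple residue counts divided by sequence length. The sketch below uses a hypothetical helper and, for brevity, only the first ten residues of the protein sequence in this record, so its output is not the record's 1.7 % figure; applying it to the full 879-residue sequence is left to the reader.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


if __name__ == "__main__":
    fragment = "MHMSQETPAS"  # first ten residues of the sequence above
    print(residue_percent(fragment, "C"))   # cysteine share of the fragment
    print(residue_percent(fragment, "M"))   # methionine share of the fragment
    print(residue_percent(fragment, "CM"))  # combined share
```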
Secondary structure:
>Translated Secondary Structure MHMSQETPASKTEAQIKTKRRISPFWLLPLIALMIAGWLVWDSYQDRGNSVTIDFMSADG CCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHEECCCCCCCCEEEEEEEECCC IVPGRTPVRYQGVEVGTVEDVSLSKDLRKIEVRVSIKSDMEDALREETQFWLVTPKASLA CCCCCCCCEECCEEECCCCCCHHHCCCEEEEEEEEEHHHHHHHHHCCCEEEEEECCHHHC GVSGLDALVGGNYIGMMPGKGKPRDHFVALDTQPKYRLSNGDLMIHLNAPDLGSLNSGSL CCCCCHHHHCCCEEEECCCCCCCCCCEEEEECCCCEEECCCCEEEEECCCCCCCCCCCCE VYFRKIPVGRVYDYSINPNKQGVTIDVLIERRFTDLVKKGSRFWNVSGIDADLSLSGAKV EEEEECCCCEEEEEECCCCCCCEEEEEEEHHHHHHHHHCCCCEEEEECCCCCEECCCCEE KLESLAALVNGAIAFDSPDNSKPAAQDDTFGLYKDLAHSQRGVIVKLELPSGDGLKAEST EHHHHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHCCCCCEEEEEECCCCCCCCCCCC PLMYQGLEVGELSKLTLNPGGKVTGEMTVDPSVVPLMRENTRIELRNPKLSLSDANISSL CEEEECCCCCCEEEEEECCCCCEEEEEEECCCEEEEECCCCEEEEECCEEEECCCCHHHH LTGKTFELVPGDGEPRSEFVVVPGEKALLHEANALTLTLTAPESYGIEPGQPLILHGVKI HCCCEEEEECCCCCCCCCEEEECCCHHEEECCCEEEEEEECCCCCCCCCCCEEEEECCHH GQVIERNLSSKGVSFIVAIEPQHRDLVQGDSKFVVNSRVDVKVGLDGVEFLGASASEWID HHHHHHCCCCCCCEEEEEECCCCCHHHCCCCEEEEECEEEEEECCCHHHHHCCCHHHHCC GGIRILPGTSGKMKSTYPLYANLEKALENSLSDLPTTTLTLTAETLPDVQAGSVVLYRKF CCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCEEEEEEE EVGEVITVRPRANTFDIDLHIKPEYRHLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRA CCCCEEEEECCCCEEEEEEEECCHHHHHHCCCCEEEECCCCEEEECCCCEEEECCHHHHH LKGAISFDNLSGASASRRKGDKRILYASETSARAVGGQITLHAFDAGKLAEGMPIRYLGI HHCCEEECCCCCCCCHHCCCCCEEEEECCCCCEECCCEEEEEEECCCCCCCCCCEEEEEC DIGQIQTLELITARNEVQAKAVLYPEYVQTFARAGTRFSVITPQISAAGVEHLDTILQPY CCCCEEEEEEEECCCCCEEEEEECHHHHHHHHHCCCEEEEECCCHHHCCHHHHHHHHHCC INVEPGRGAARRDFELQEATITDSRYLDGLSIVVEAPEAGSLNIGTPVLFRGIEVGTVTG EEECCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCHHHEECEEEEEECC MSLGSLSDRVMITLRISKRYQYLVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGI CCCCCCCCEEEEEEEECCCEEEEEECCCEEEEEECCEEEECCCCCEEECCCHHHHHHCCE AFATPPGTPLAPKAQAGKHFLLQESEPKEWREWGTALPR EEECCCCCCCCCCCCCCCEEEEECCCCHHHHHHCCCCCC >Mature Secondary Structure MHMSQETPASKTEAQIKTKRRISPFWLLPLIALMIAGWLVWDSYQDRGNSVTIDFMSADG CCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHEECCCCCCCCEEEEEEEECCC 
IVPGRTPVRYQGVEVGTVEDVSLSKDLRKIEVRVSIKSDMEDALREETQFWLVTPKASLA CCCCCCCCEECCEEECCCCCCHHHCCCEEEEEEEEEHHHHHHHHHCCCEEEEEECCHHHC GVSGLDALVGGNYIGMMPGKGKPRDHFVALDTQPKYRLSNGDLMIHLNAPDLGSLNSGSL CCCCCHHHHCCCEEEECCCCCCCCCCEEEEECCCCEEECCCCEEEEECCCCCCCCCCCCE VYFRKIPVGRVYDYSINPNKQGVTIDVLIERRFTDLVKKGSRFWNVSGIDADLSLSGAKV EEEEECCCCEEEEEECCCCCCCEEEEEEEHHHHHHHHHCCCCEEEEECCCCCEECCCCEE KLESLAALVNGAIAFDSPDNSKPAAQDDTFGLYKDLAHSQRGVIVKLELPSGDGLKAEST EHHHHHHHHCCEEEECCCCCCCCCCCCCCHHHHHHHHCCCCCEEEEEECCCCCCCCCCCC PLMYQGLEVGELSKLTLNPGGKVTGEMTVDPSVVPLMRENTRIELRNPKLSLSDANISSL CEEEECCCCCCEEEEEECCCCCEEEEEEECCCEEEEECCCCEEEEECCEEEECCCCHHHH LTGKTFELVPGDGEPRSEFVVVPGEKALLHEANALTLTLTAPESYGIEPGQPLILHGVKI HCCCEEEEECCCCCCCCCEEEECCCHHEEECCCEEEEEEECCCCCCCCCCCEEEEECCHH GQVIERNLSSKGVSFIVAIEPQHRDLVQGDSKFVVNSRVDVKVGLDGVEFLGASASEWID HHHHHHCCCCCCCEEEEEECCCCCHHHCCCCEEEEECEEEEEECCCHHHHHCCCHHHHCC GGIRILPGTSGKMKSTYPLYANLEKALENSLSDLPTTTLTLTAETLPDVQAGSVVLYRKF CCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCEEEEEEE EVGEVITVRPRANTFDIDLHIKPEYRHLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRA CCCCEEEEECCCCEEEEEEEECCHHHHHHCCCCEEEECCCCEEEECCCCEEEECCHHHHH LKGAISFDNLSGASASRRKGDKRILYASETSARAVGGQITLHAFDAGKLAEGMPIRYLGI HHCCEEECCCCCCCCHHCCCCCEEEEECCCCCEECCCEEEEEEECCCCCCCCCCEEEEEC DIGQIQTLELITARNEVQAKAVLYPEYVQTFARAGTRFSVITPQISAAGVEHLDTILQPY CCCCEEEEEEEECCCCCEEEEEECHHHHHHHHHCCCEEEEECCCHHHCCHHHHHHHHHCC INVEPGRGAARRDFELQEATITDSRYLDGLSIVVEAPEAGSLNIGTPVLFRGIEVGTVTG EEECCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCHHHEECEEEEEECC MSLGSLSDRVMITLRISKRYQYLVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGI CCCCCCCCEEEEEEEECCCEEEEEECCCEEEEEECCEEEECCCCCEEECCCHHHHHHCCE AFATPPGTPLAPKAQAGKHFLLQESEPKEWREWGTALPR EEECCCCCCCCCCCCCCCEEEEECCCCHHHHHHCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503 [H]