Definition | Exiguobacterium sp. AT1b, complete genome. |
---|---|
Accession | NC_012673 |
Length | 2,999,895 |
The map label for this gene is ydiU [C]
Identifier: 229918366
GI number: 229918366
Start: 2618893
End: 2620329
Strand: Reverse
Name: ydiU [C]
Synonym: EAT1b_2651
Alternate gene names: 229918366
Gene position: 2620329-2618893 (Counterclockwise)
Preceding gene: 229918370
Following gene: 229918341
Centisome position: 87.35
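The coordinate fields above are related by simple arithmetic. The sketch below is not the database's own code. It recomputes the gene length, the expected residue count, and the centisome position from the reported start, end, and genome length. The function names are hypothetical, and it assumes the centisome position is measured from the gene's 5' end (the higher coordinate on the reverse strand), which is the choice that reproduces the reported 87.35.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the total genome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 2_999_895          # from the "Length" field
START, END = 2_618_893, 2_620_329  # from the "Start" and "End" fields

length = gene_length(START, END)

# A coding sequence should be a whole number of codons; the last one is the stop.
codons, remainder = divmod(length, 3)
residues = codons - 1

# Assumption: on the reverse strand the gene begins at END, so that
# coordinate is used for the centisome position.
centi = round(centisome(END, GENOME_LENGTH), 2)

print(f"length={length} bases, residues={residues}, centisome={centi}")
```

With the values in this record, the length agrees with the `>1437_bases` header and the residue count with the translated protein length of 478.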
GC content: 53.17
Gene sequence:
>1437_bases ATGACAACTGACTGGAACTTCGAGGCGACGTACCTCGAACTACGTGACATCTTTTACGACCGTGGACCTATCCACCCGGT CGACAATCCGACACTCGTCTTGTTTAACGATGCATTAGCCGCTTCGCTCGGTCTTGATGCGCAAAGTCTCAAACAAGACA TCGATTTACTGGCCGGCAATCGTCAAACCGAAACGTCTTTCTCACAAGCATACGCCGGACACCAATTCGGCAACTTGACG ATGCTCGGGGACGGACGCGCCCTCATGCTCGGAGAACATGTGACTCCGAACGGGAAACGTGTCGATGTGCAACTAAAGGG TTCTGGACCGACAGAGTATTCTAGAGGGGGAGACGGACGGGCCGCACTCGGGCCGATGGTACGCGAGTTCATCATTAGCG AGGCGATGCACGCTCTCGGCATTCCGACGAACCGGGCGCTCGCCGTCATTCAGACAGGGGAAGCCATCATGCGTCAAGGT CCAAAGCATGGGGCCATCCTGACTCGGGTCGCCTCAAGTCATCTTCGTGTCGGAACATTTCAATTCGCAGCAGGCGCCGG CTCCATCGATGACGTCATCGCTCTTACTGAAGTAGCCATCAAACGACACGACCCAGACTTGATCGATGCACCGAATCGCT ATGAACAGTTCCTCGGACGCGTCGTCGAAAGACAGGCGAGACTGATTGCCAACTGGCAACTCGTCGGTTTTATACATGGG GTGATGAATACGGACAATATGTTCATCAGCGGAGAAGGTCTCGATTACGGACCATGCGCGTTCATGGATACGTATCATCC GGAGACCGTCTTCAGCTCGATTGACCGAGAAGGACGCTACGCTTATGCGAACCAACCGTATATCGGGTCTTGGAACCTCG CCCGACTTGCAGAGACATTGCTGCCGCTCCTTGGCGAGACGAAAGAAGAAGCGGTCGATGTGGCGAACAAACAGCTCACC CGTTATACCGAGTTGTATAAAGAAGCGTACTTTACCGGACTCGCACATAAGATTGGCCTATTCGTCCGAAAAGATGGTGA CGATGAGCTAACGGATGAACTGCTTCGCTTGATGATGGAGACTGAGGCAGATTATACGAATACGTTCCGCTCGCTGACGC TTGGTGAGATTGAATCGCTCCCATTTGCAACTCGGAAAGACGGGAAGGTCTGGCTTGGTGCTTGGCGTAAACGCCTGAAT GGGCAAGGTCTCCCGGACGAGGACGTCAGTCGAATCATGCGCCAGTATAATCCGGCCGTTATTCCTCGAAATCATCATGT CGAAAAAGCGATTCAAGCAGCAGAACGCGGTGACTTCGGTCCGACCGAGGCAATACTCACCATTTTGCGTGACCCCTATA ACTACGATCAATCGTCTGAATACGTATCGGCAGGTCCACCACGAACGTATCCGTATCAAACGTTTTGTGGCACGTGA
Upstream 100 bases:
>100_bases TCCTACTGTTTACACTTCTGTTTCATCTTATCAAAATTCACGACAGCCGTTATGGAAAACAGTCTGTGCTATACTCGAAG CGAAGAAGGAGGCATGCGCC
Downstream 100 bases:
>100_bases ACCATCTCGCTTTTTCGAGGTGGTTTTTTTGAAACCTATTTTTTATTTGTAGGATGCTTCATTAATTAAATTGCGAGGAC TTGACTCCTTTAACAATTAC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 478; Mature: 477
Protein sequence:
>478_residues MTTDWNFEATYLELRDIFYDRGPIHPVDNPTLVLFNDALAASLGLDAQSLKQDIDLLAGNRQTETSFSQAYAGHQFGNLT MLGDGRALMLGEHVTPNGKRVDVQLKGSGPTEYSRGGDGRAALGPMVREFIISEAMHALGIPTNRALAVIQTGEAIMRQG PKHGAILTRVASSHLRVGTFQFAAGAGSIDDVIALTEVAIKRHDPDLIDAPNRYEQFLGRVVERQARLIANWQLVGFIHG VMNTDNMFISGEGLDYGPCAFMDTYHPETVFSSIDREGRYAYANQPYIGSWNLARLAETLLPLLGETKEEAVDVANKQLT RYTELYKEAYFTGLAHKIGLFVRKDGDDELTDELLRLMMETEADYTNTFRSLTLGEIESLPFATRKDGKVWLGAWRKRLN GQGLPDEDVSRIMRQYNPAVIPRNHHVEKAIQAAERGDFGPTEAILTILRDPYNYDQSSEYVSAGPPRTYPYQTFCGT
Sequences:
>Translated_478_residues MTTDWNFEATYLELRDIFYDRGPIHPVDNPTLVLFNDALAASLGLDAQSLKQDIDLLAGNRQTETSFSQAYAGHQFGNLT MLGDGRALMLGEHVTPNGKRVDVQLKGSGPTEYSRGGDGRAALGPMVREFIISEAMHALGIPTNRALAVIQTGEAIMRQG PKHGAILTRVASSHLRVGTFQFAAGAGSIDDVIALTEVAIKRHDPDLIDAPNRYEQFLGRVVERQARLIANWQLVGFIHG VMNTDNMFISGEGLDYGPCAFMDTYHPETVFSSIDREGRYAYANQPYIGSWNLARLAETLLPLLGETKEEAVDVANKQLT RYTELYKEAYFTGLAHKIGLFVRKDGDDELTDELLRLMMETEADYTNTFRSLTLGEIESLPFATRKDGKVWLGAWRKRLN GQGLPDEDVSRIMRQYNPAVIPRNHHVEKAIQAAERGDFGPTEAILTILRDPYNYDQSSEYVSAGPPRTYPYQTFCGT >Mature_477_residues TTDWNFEATYLELRDIFYDRGPIHPVDNPTLVLFNDALAASLGLDAQSLKQDIDLLAGNRQTETSFSQAYAGHQFGNLTM LGDGRALMLGEHVTPNGKRVDVQLKGSGPTEYSRGGDGRAALGPMVREFIISEAMHALGIPTNRALAVIQTGEAIMRQGP KHGAILTRVASSHLRVGTFQFAAGAGSIDDVIALTEVAIKRHDPDLIDAPNRYEQFLGRVVERQARLIANWQLVGFIHGV MNTDNMFISGEGLDYGPCAFMDTYHPETVFSSIDREGRYAYANQPYIGSWNLARLAETLLPLLGETKEEAVDVANKQLTR YTELYKEAYFTGLAHKIGLFVRKDGDDELTDELLRLMMETEADYTNTFRSLTLGEIESLPFATRKDGKVWLGAWRKRLNG QGLPDEDVSRIMRQYNPAVIPRNHHVEKAIQAAERGDFGPTEAILTILRDPYNYDQSSEYVSAGPPRTYPYQTFCGT
Specific function: Unknown
COG id: COG0397
COG function: function code S; Uncharacterized conserved protein
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the UPF0061 (SELO) family
Homologues:
Organism=Homo sapiens, GI32880229, Length=384, Percent_Identity=36.71875, Blast_Score=218, Evalue=8e-57
Organism=Escherichia coli, GI1787999, Length=462, Percent_Identity=38.0952380952381, Blast_Score=308, Evalue=5e-85
Organism=Saccharomyces cerevisiae, GI6325034, Length=319, Percent_Identity=33.5423197492163, Blast_Score=160, Evalue=3e-40
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y2651_EXISA (C4L4K4)
Other databases:
- EMBL: CP001615
- RefSeq: YP_002887012.1
- GeneID: 7869664
- GenomeReviews: CP001615_GR
- KEGG: eat:EAT1b_2651
- OMA: RRDIQLK
- HAMAP: MF_00692
- InterPro: IPR003846
Pfam domain/function: PF02696 UPF0061
EC number: NA
Molecular weight: Translated: 53205; Mature: 53073
Theoretical pI: Translated: 5.13; Mature: 5.13
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
2.7 %Cys+Met (Mature Protein)
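The percentages above are residue counts divided by sequence length. The following is a minimal sketch of that calculation, not the database's own method. The function name `residue_percentages` is hypothetical, and the demonstration uses a toy sequence rather than the full protein. The mature form is taken to be the translated sequence without its initiator Met, as the two sequences listed in this record show.

```python
from collections import Counter


def residue_percentages(protein: str, residues: str = "CM") -> dict:
    """Percentage of each requested residue, rounded to one decimal place."""
    if not protein:
        raise ValueError("protein sequence is empty")
    counts = Counter(protein.upper())
    return {r: round(100.0 * counts[r] / len(protein), 1) for r in residues}


# Toy sequence for illustration only (not the ydiU protein).
translated = "MACM"
mature = translated[1:]  # drop the initiator Met

translated_pct = residue_percentages(translated)
mature_pct = residue_percentages(mature)

print("translated:", translated_pct)
print("mature:    ", mature_pct)
```

Passing the 478-residue translated sequence and the 477-residue mature sequence from this record to the same function would give the Cys and Met figures for each form.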
Secondary structure:
>Translated Secondary Structure
MTTDWNFEATYLELRDIFYDRGPIHPVDNPTLVLFNDALAASLGLDAQSLKQDIDLLAGN
CCCCCCCEEHHHHHHHHHHCCCCCCCCCCCEEEEECCHHHHHHCCCHHHHHHHHHHHCCC
RQTETSFSQAYAGHQFGNLTMLGDGRALMLGEHVTPNGKRVDVQLKGSGPTEYSRGGDGR
CCCHHHHHHHHCCCCCCCEEEEECCCEEEEECCCCCCCCEEEEEEECCCCCCCCCCCCCC
AALGPMVREFIISEAMHALGIPTNRALAVIQTGEAIMRQGPKHGAILTRVASSHLRVGTF
HHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHCCCCCCHHHHHHHHCCEEEEEE
QFAAGAGSIDDVIALTEVAIKRHDPDLIDAPNRYEQFLGRVVERQARLIANWQLVGFIHG
EEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHH
VMNTDNMFISGEGLDYGPCAFMDTYHPETVFSSIDREGRYAYANQPYIGSWNLARLAETL
HCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHH
LPLLGETKEEAVDVANKQLTRYTELYKEAYFTGLAHKIGLFVRKDGDDELTDELLRLMME
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCHHHHHHHHHHHHH
TEADYTNTFRSLTLGEIESLPFATRKDGKVWLGAWRKRLNGQGLPDEDVSRIMRQYNPAV
HCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC
IPRNHHVEKAIQAAERGDFGPTEAILTILRDPYNYDQSSEYVSAGPPRTYPYQTFCGT
CCCCCHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHCCCC
>Mature Secondary Structure
TTDWNFEATYLELRDIFYDRGPIHPVDNPTLVLFNDALAASLGLDAQSLKQDIDLLAGN
CCCCCCEEHHHHHHHHHHCCCCCCCCCCCEEEEECCHHHHHHCCCHHHHHHHHHHHCCC
RQTETSFSQAYAGHQFGNLTMLGDGRALMLGEHVTPNGKRVDVQLKGSGPTEYSRGGDGR
CCCHHHHHHHHCCCCCCCEEEEECCCEEEEECCCCCCCCEEEEEEECCCCCCCCCCCCCC
AALGPMVREFIISEAMHALGIPTNRALAVIQTGEAIMRQGPKHGAILTRVASSHLRVGTF
HHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHHCCCCCCHHHHHHHHCCEEEEEE
QFAAGAGSIDDVIALTEVAIKRHDPDLIDAPNRYEQFLGRVVERQARLIANWQLVGFIHG
EEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHH
VMNTDNMFISGEGLDYGPCAFMDTYHPETVFSSIDREGRYAYANQPYIGSWNLARLAETL
HCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHH
LPLLGETKEEAVDVANKQLTRYTELYKEAYFTGLAHKIGLFVRKDGDDELTDELLRLMME
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCHHHHHHHHHHHHH
TEADYTNTFRSLTLGEIESLPFATRKDGKVWLGAWRKRLNGQGLPDEDVSRIMRQYNPAV
HCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC
IPRNHHVEKAIQAAERGDFGPTEAILTILRDPYNYDQSSEYVSAGPPRTYPYQTFCGT
CCCCCHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA