Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

Click here to switch to the map view.

The map label for this gene is nupX [H]

Identifier: 209396884

GI number: 209396884

Start: 3029185

End: 3030435

Strand: Reverse

Name: nupX [H]

Synonym: ECH74115_3297

Alternate gene names: 209396884

Gene position: 3030435-3029185 (Counterclockwise)

Preceding gene: 209396314

Following gene: 209399117

Centisome position: 54.39

GC content: 50.76

Gene sequence:

>1251_bases
ATGGATGTCATGAGAAGTGTTCTGGGAATGGTGGTATTGCTGACGATTGCGTTTTTACTGTCAGTAAACAAGAAGAAGAT
CAGCCTGCGTACCGTTGGCGCGGCGTTAGTGTTACAGGTCGTGATTGGCGGCATTATGCTTTGGTTACCGCCAGGGCGTT
GGGTCGCTGAAAAAGTCGCTTTTGGTGTGCATAAAGTGATGGCGTACAGCGACGCGGGTAGCGCATTTATCTTCGGTTCT
CTGGTCGGACCGAAAATGGACACCTTATTTGATGGCGCAGGATTTATCTTTGGTTTCAGGGTATTACCGGCAATTATCTT
CGTCACTGCACTGGTGAGTATTCTCTACTACATCGGTGTGATGGGGATTTTAATTCGCATTCTCGGCGGTATATTCCAGA
AAGCATTAAATATCAGCAAGATTGAGTCATTCGTCGCGGTCACTACCATTTTCCTCGGGCAAAACGAAATTCCGGCGATT
GTGAAGCCCTTTATCGATCGGCTGAATCGCAATGAATTATTTACAGCGATTTGTAGCGGCATGGCCTCGATTGCTGGTTC
GACAATGATTGGTTATGCCGCACTGGGCGTGCCTGTGGAATATTTGCTGGCGGCATCGTTAATGGCGATCCCCGGGGGGA
TCTTGTTTGCCCGCCTGCTAAGCCCGGCTACGGAATCTTCGCAGGTTTCTTTTAATAACCTCTCTTTCACCGAAACACCG
CCAAAAAGCATTATTGAAGCCGCCGCGACAGGGGCAATGACCGGGCTGAAAATCGCCGCAGGTGTGGCGACAGTAGTGAT
GGCATTCGTCGCCATCATTGCGTTAATTAACGGTATTATCGGCGGCGTTGGCGGCTGGTTTGGTTTTGAACATGCCTCGC
TGGAGTCCATTGTAGGTTATCTGCTGGCCCCACTGGCGTGGGTAATGGGTGTTGACTGGAGTGATGCGAATCTTGCCGGG
AGTTTGATTGGACAGAAACTGGCAATAAATGAATTTGTCGCTTATCTCAATTTCTCACCCTATCTGCAAACGGCTGGCAC
TCTGGATGCTAAAACCGTGGCGATTATTTCCTTCGCGTTGTGCGGTTTCGCTAACTTTGGTTCTATCGGGGTGGTGGTGG
GGGCGTTTTCTGCGGTTGCGCCACACCGTGCGCCGGAAATCGCCCAGCTTGGTTTACGCGCGCTGGCGGCGGCGACACTT
TCTAACCTGATGAGTGCCACCATTGCCGGGTTCTTTATTGGTTTAGCGTAG

Upstream 100 bases:

>100_bases
AGTGTTGATAACAAGCCGGGCCGGTATCGCAAGATGCAGGCCTGGCTATTTCTGATGGGCAGAAAGCCGTCGCCCAATAT
GACATGAAGAGATAATCACT

Downstream 100 bases:

>100_bases
TGTGCGATTTATGAGCCGGATAAGATGCGGTCAAGCGTCGAATCCGGCAACATTGTTACACCATTGGCACTAATGAAAGC
GCGTTATCGGCAGACAGGGT

Product: nucleoside transporter, NupC family

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 416; Mature: 416

Protein sequence:

>416_residues
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVAFGVHKVMAYSDAGSAFIFGS
LVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGVMGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAI
VKPFIDRLNRNELFTAICSGMASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESIVGYLLAPLAWVMGVDWSDANLAG
SLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFALCGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATL
SNLMSATIAGFFIGLA

Sequences:

>Translated_416_residues
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVAFGVHKVMAYSDAGSAFIFGS
LVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGVMGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAI
VKPFIDRLNRNELFTAICSGMASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESIVGYLLAPLAWVMGVDWSDANLAG
SLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFALCGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATL
SNLMSATIAGFFIGLA
>Mature_416_residues
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVAFGVHKVMAYSDAGSAFIFGS
LVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGVMGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAI
VKPFIDRLNRNELFTAICSGMASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESIVGYLLAPLAWVMGVDWSDANLAG
SLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFALCGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATL
SNLMSATIAGFFIGLA

Specific function: Nucleoside transporter [H]

COG id: COG1972

COG function: function code F; Nucleoside permease

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the concentrative nucleoside transporter (CNT) (TC 2.A.41) family [H]

Homologues:

Organism=Homo sapiens, GI11545853, Length=415, Percent_Identity=33.2530120481928, Blast_Score=229, Evalue=3e-60,
Organism=Homo sapiens, GI42542381, Length=422, Percent_Identity=31.2796208530806, Blast_Score=221, Evalue=7e-58,
Organism=Homo sapiens, GI227116277, Length=419, Percent_Identity=30.3102625298329, Blast_Score=209, Evalue=4e-54,
Organism=Escherichia coli, GI1788485, Length=416, Percent_Identity=99.7596153846154, Blast_Score=811, Evalue=0.0,
Organism=Escherichia coli, GI1788488, Length=416, Percent_Identity=88.2211538461538, Blast_Score=697, Evalue=0.0,
Organism=Escherichia coli, GI1788737, Length=414, Percent_Identity=31.8840579710145, Blast_Score=191, Evalue=1e-49,
Organism=Caenorhabditis elegans, GI17560276, Length=396, Percent_Identity=29.2929292929293, Blast_Score=201, Evalue=4e-52,
Organism=Caenorhabditis elegans, GI71991794, Length=399, Percent_Identity=32.0802005012531, Blast_Score=195, Evalue=3e-50,
Organism=Caenorhabditis elegans, GI25146537, Length=202, Percent_Identity=30.6930693069307, Blast_Score=105, Evalue=4e-23,
Organism=Drosophila melanogaster, GI45552517, Length=408, Percent_Identity=32.1078431372549, Blast_Score=229, Evalue=3e-60,
Organism=Drosophila melanogaster, GI19921868, Length=408, Percent_Identity=32.1078431372549, Blast_Score=229, Evalue=3e-60,
Organism=Drosophila melanogaster, GI45552519, Length=408, Percent_Identity=32.1078431372549, Blast_Score=229, Evalue=3e-60,
Organism=Drosophila melanogaster, GI281360430, Length=407, Percent_Identity=32.9238329238329, Blast_Score=206, Evalue=2e-53,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008276
- InterPro:   IPR018270
- InterPro:   IPR011642
- InterPro:   IPR011657
- InterPro:   IPR002668 [H]

Pfam domain/function: PF07670 Gate; PF07662 Nucleos_tra2_C; PF01773 Nucleos_tra2_N [H]

EC number: NA

Molecular weight: Translated: 43396; Mature: 43396

Theoretical pI: Translated: 8.99; Mature: 8.99

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVA
CHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHEECCCCHHHHHHHH
FGVHKVMAYSDAGSAFIFGSLVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGV
HHHHHHHHCCCCCCEEEEHHHCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
MGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAIVKPFIDRLNRNELFTAICSG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHHHH
MASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
HHHHHCHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEEECCCCCCCCC
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESIVGY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
LLAPLAWVMGVDWSDANLAGSLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFAL
HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCHHHHHHHHHHH
CGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATLSNLMSATIAGFFIGLA
HCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVA
CHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHEECCCCHHHHHHHH
FGVHKVMAYSDAGSAFIFGSLVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGV
HHHHHHHHCCCCCCEEEEHHHCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
MGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAIVKPFIDRLNRNELFTAICSG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHHHH
MASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
HHHHHCHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEEECCCCCCCCC
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESIVGY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
LLAPLAWVMGVDWSDANLAGSLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFAL
HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCHHHHHHHHHHH
CGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATLSNLMSATIAGFFIGLA
HCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]