Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

Click here to switch to the map view.

The map label for this gene is nupC [H]

Identifier: 49187911

GI number: 49187911

Start: 4802357

End: 4803538

Strand: Reverse

Name: nupC [H]

Synonym: BAS4923

Alternate gene names: 49187911

Gene position: 4803538-4802357 (Counterclockwise)

Preceding gene: 49187912

Following gene: 49187910

Centisome position: 91.87

GC content: 34.94

Gene sequence:

>1182_bases
ATGAAGTTTGTTATGTTTCTTGTAGGATTACTCGTTGTATTTGTACTCGGTTTTCTTATAAGTGCCGATCGAAAGAAGAT
TAAGTATAAACCAATCGCAATTATGCTTGTTATTCAGTTAGCGTTATCTTATTTCTTATTAAATACGCAAGTTGGTTATA
TTTTAGTAAAAGGAATTTCAGATGGATTTGGCGCGCTTCTTGGATATGCAGAAGCTGGAATCGTTTTCGTATTTGGTGGC
CTTGTTAATAAAGGAGAGGTTTCATTCTTCTTAACAGCGTTATTACCAATCGTATTCTTTGCCGTTTTAATCGGAATTCT
GCAACACTTTAAAATTTTACCGATATTTATTCGTGCTATTGGTACTTTGTTAAGTAAAGTAAATGGTCTAGGAAAACTAG
AATCATATAACGCAGTAGCAGCTGCTATTGTTGGGCAAGCGGAAGTATTTATTACAGTAAAAGATCAATTAAGTAAAATC
CCAAAACATCGTTTATATACATTATGTGCATCTTCCATGTCGACAGTATCGATGTCAATCGTCGGTTCTTACATGAAAAT
GATCGAACCAAAATATGTAGTAACAGCACTTGTATTAAATTTATTTAGTGGTTTCATTATTATTCATATTATTAACCCGT
ACGATATTACAGAAGAAGAAGATACACTGAAATTAGAAAATAAGAAAAAACAGTCATTCTTTGAAATGTTAAGTGAATAT
ATTATGCTTGGTTTCACAATCGCGATTACAGTAGCAGCGATGTTACTTGGTTTCGTAGCGTTAATTACAGCAATCAATAG
CTTGTTTGATTCCATGTTCGGTATTACATTCCAAGCGATTTTAGGATATATTTTCTCCCCATTAGCATTCGTAATGGGTA
TCCCGCAAGCAGAGATGGTAACAGCGGGACAAATTATGGCAACGAAATTAGTATCAAACGAATTTGTTGCGATGCTTGAT
CTTGGAAAAGTAGCTGGTGATTTATCAGCTCGTACAGTTGGTATCCTTTCTGTATTCCTTGTATCATTTGCGAACTTCTC
ATCAATCGGAATTATCGCAGGTGCAACGAAAGGTATCGATGAGAACCAATCAAATGTAGTATCATCATTCGGTCTACGCC
TTGTGTACGGTGCGACATTAGTAAGTATTCTATCAGCGATTATCGTTGGTGTTATGTTATAG

Upstream 100 bases:

>100_bases
ATTGAAGGTTTTGAAGGAGAAATGGAAAGTTCGGATTATTTCTCGTTTTGAGTGGTCAGACGTTCAACGTTAATGAAGAG
TTAAATGGAGGATTTTATTA

Downstream 100 bases:

>100_bases
AAAAAGATGGCCCTAGCTCATAGCTAGGGCTATTTTTTATGGTAAATTTGCATGAAAAAACCAATTAATTAGGAATATAT
TAACATATAAAAATCTTTTA

Product: NupC family nucleoside transporter

Products: uridine [Cytoplasm]; thymidine [Cytoplasm]; inosine [Cytoplasm]; Deoxyuridine [Cytoplasm]; deoxyinosine [Cytoplasm]; deoxycytidine [Cytoplasm]; deoxyadenosine [Cytoplasm]; cytidine [Cytoplasm]; Proton [Cytoplasm]; adenosine [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 393; Mature: 393

Protein sequence:

>393_residues
MKFVMFLVGLLVVFVLGFLISADRKKIKYKPIAIMLVIQLALSYFLLNTQVGYILVKGISDGFGALLGYAEAGIVFVFGG
LVNKGEVSFFLTALLPIVFFAVLIGILQHFKILPIFIRAIGTLLSKVNGLGKLESYNAVAAAIVGQAEVFITVKDQLSKI
PKHRLYTLCASSMSTVSMSIVGSYMKMIEPKYVVTALVLNLFSGFIIIHIINPYDITEEEDTLKLENKKKQSFFEMLSEY
IMLGFTIAITVAAMLLGFVALITAINSLFDSMFGITFQAILGYIFSPLAFVMGIPQAEMVTAGQIMATKLVSNEFVAMLD
LGKVAGDLSARTVGILSVFLVSFANFSSIGIIAGATKGIDENQSNVVSSFGLRLVYGATLVSILSAIIVGVML

Sequences:

>Translated_393_residues
MKFVMFLVGLLVVFVLGFLISADRKKIKYKPIAIMLVIQLALSYFLLNTQVGYILVKGISDGFGALLGYAEAGIVFVFGG
LVNKGEVSFFLTALLPIVFFAVLIGILQHFKILPIFIRAIGTLLSKVNGLGKLESYNAVAAAIVGQAEVFITVKDQLSKI
PKHRLYTLCASSMSTVSMSIVGSYMKMIEPKYVVTALVLNLFSGFIIIHIINPYDITEEEDTLKLENKKKQSFFEMLSEY
IMLGFTIAITVAAMLLGFVALITAINSLFDSMFGITFQAILGYIFSPLAFVMGIPQAEMVTAGQIMATKLVSNEFVAMLD
LGKVAGDLSARTVGILSVFLVSFANFSSIGIIAGATKGIDENQSNVVSSFGLRLVYGATLVSILSAIIVGVML
>Mature_393_residues
MKFVMFLVGLLVVFVLGFLISADRKKIKYKPIAIMLVIQLALSYFLLNTQVGYILVKGISDGFGALLGYAEAGIVFVFGG
LVNKGEVSFFLTALLPIVFFAVLIGILQHFKILPIFIRAIGTLLSKVNGLGKLESYNAVAAAIVGQAEVFITVKDQLSKI
PKHRLYTLCASSMSTVSMSIVGSYMKMIEPKYVVTALVLNLFSGFIIIHIINPYDITEEEDTLKLENKKKQSFFEMLSEY
IMLGFTIAITVAAMLLGFVALITAINSLFDSMFGITFQAILGYIFSPLAFVMGIPQAEMVTAGQIMATKLVSNEFVAMLD
LGKVAGDLSARTVGILSVFLVSFANFSSIGIIAGATKGIDENQSNVVSSFGLRLVYGATLVSILSAIIVGVML

Specific function: Transports Nucleosides With A High Affinity Except Guanosine And Deoxyguanosine. Driven By A Proton Motive Force. [C]

COG id: COG1972

COG function: function code F; Nucleoside permease

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the concentrative nucleoside transporter (CNT) (TC 2.A.41) family [H]

Homologues:

Organism=Homo sapiens, GI11545853, Length=412, Percent_Identity=26.2135922330097, Blast_Score=122, Evalue=7e-28,
Organism=Homo sapiens, GI42542381, Length=411, Percent_Identity=25.3041362530414, Blast_Score=121, Evalue=1e-27,
Organism=Homo sapiens, GI227116277, Length=410, Percent_Identity=22.6829268292683, Blast_Score=102, Evalue=7e-22,
Organism=Escherichia coli, GI1788737, Length=398, Percent_Identity=58.7939698492462, Blast_Score=458, Evalue=1e-130,
Organism=Escherichia coli, GI1788485, Length=410, Percent_Identity=29.7560975609756, Blast_Score=177, Evalue=1e-45,
Organism=Escherichia coli, GI1788488, Length=412, Percent_Identity=30.0970873786408, Blast_Score=167, Evalue=1e-42,
Organism=Caenorhabditis elegans, GI17560276, Length=396, Percent_Identity=25.2525252525253, Blast_Score=119, Evalue=3e-27,
Organism=Caenorhabditis elegans, GI71991794, Length=389, Percent_Identity=24.6786632390745, Blast_Score=107, Evalue=8e-24,
Organism=Drosophila melanogaster, GI45552517, Length=403, Percent_Identity=26.302729528536, Blast_Score=130, Evalue=2e-30,
Organism=Drosophila melanogaster, GI45552519, Length=403, Percent_Identity=26.302729528536, Blast_Score=130, Evalue=2e-30,
Organism=Drosophila melanogaster, GI19921868, Length=403, Percent_Identity=26.302729528536, Blast_Score=130, Evalue=2e-30,
Organism=Drosophila melanogaster, GI281360430, Length=403, Percent_Identity=25.3101736972705, Blast_Score=108, Evalue=8e-24,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008276
- InterPro:   IPR018270
- InterPro:   IPR011657
- InterPro:   IPR002668 [H]

Pfam domain/function: PF07662 Nucleos_tra2_C; PF01773 Nucleos_tra2_N [H]

EC number: NA

Molecular weight: Translated: 42472; Mature: 42472

Theoretical pI: Translated: 9.30; Mature: 9.30

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
4.1 %Met     (Translated Protein)
4.3 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
4.1 %Met     (Mature Protein)
4.3 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKFVMFLVGLLVVFVLGFLISADRKKIKYKPIAIMLVIQLALSYFLLNTQVGYILVKGIS
CHHHHHHHHHHHHHHHHHHHHCCCCCCEECHHHHHHHHHHHHHHHHHHCCCCEEEEECCC
DGFGALLGYAEAGIVFVFGGLVNKGEVSFFLTALLPIVFFAVLIGILQHFKILPIFIRAI
CHHHHHHHHHHCCHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GTLLSKVNGLGKLESYNAVAAAIVGQAEVFITVKDQLSKIPKHRLYTLCASSMSTVSMSI
HHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEHHHHHHCHHHHHHHHHHHHHHHHHHHH
VGSYMKMIEPKYVVTALVLNLFSGFIIIHIINPYDITEEEDTLKLENKKKQSFFEMLSEY
HHHHHHHHCHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH
IMLGFTIAITVAAMLLGFVALITAINSLFDSMFGITFQAILGYIFSPLAFVMGIPQAEMV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
TAGQIMATKLVSNEFVAMLDLGKVAGDLSARTVGILSVFLVSFANFSSIGIIAGATKGID
HHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCC
ENQSNVVSSFGLRLVYGATLVSILSAIIVGVML
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MKFVMFLVGLLVVFVLGFLISADRKKIKYKPIAIMLVIQLALSYFLLNTQVGYILVKGIS
CHHHHHHHHHHHHHHHHHHHHCCCCCCEECHHHHHHHHHHHHHHHHHHCCCCEEEEECCC
DGFGALLGYAEAGIVFVFGGLVNKGEVSFFLTALLPIVFFAVLIGILQHFKILPIFIRAI
CHHHHHHHHHHCCHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GTLLSKVNGLGKLESYNAVAAAIVGQAEVFITVKDQLSKIPKHRLYTLCASSMSTVSMSI
HHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEHHHHHHCHHHHHHHHHHHHHHHHHHHH
VGSYMKMIEPKYVVTALVLNLFSGFIIIHIINPYDITEEEDTLKLENKKKQSFFEMLSEY
HHHHHHHHCHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH
IMLGFTIAITVAAMLLGFVALITAINSLFDSMFGITFQAILGYIFSPLAFVMGIPQAEMV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
TAGQIMATKLVSNEFVAMLDLGKVAGDLSARTVGILSVFLVSFANFSSIGIIAGATKGID
HHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCC
ENQSNVVSSFGLRLVYGATLVSILSAIIVGVML
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: uridine [Periplasm]; thymidine [Periplasm]; inosine [Periplasm]; Deoxyuridine [Periplasm]; deoxyinosine [Periplasm]; deoxycytidine [Periplasm]; deoxyadenosine [Periplasm]; cytidine [Periplasm]; Proton [Periplasm]; adenosine [Periplasm] [C]

Specific reaction: Proton [Periplasm] + uridine [Periplasm] = Proton [Cytoplasm] + uridine [Cytoplasm] Proton [Periplasm] + thymidine [Periplasm] = Proton [Cytoplasm] + thymidine [Cytoplasm] Proton [Periplasm] + inosine [Periplasm] = Proton [Cytoplasm] + inosine [Cytoplasm]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8550462; 8867804; 9384377 [H]