Definition: Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 chromosome, complete genome.
Accession: NC_003197
Length: 4,857,432

The map label for this gene is rfaI

Identifier: 16767003

GI number: 16767003

Start: 3913515

End: 3914528

Strand: Reverse

Name: rfaI

Synonym: STM3718

Alternate gene names: 16767003

Gene position: 3914528-3913515 (Counterclockwise)

Preceding gene: 16767004

Following gene: 16767002

Centisome position: 80.59
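
The coordinate fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and that the centisome position is the gene's 5' coordinate (the End value for a reverse-strand gene) expressed as a percentage of the chromosome length. That reading is an assumption, although it reproduces the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_857_432
START, END = 3_913_515, 3_914_528
STRAND = "Reverse"

# 1-based inclusive coordinates: length = end - start + 1.
gene_length = END - START + 1

# Assumption: centisome = 100 * (5' coordinate) / genome length.
# For a reverse-strand gene the 5' end is the larger coordinate.
five_prime = END if STRAND == "Reverse" else START
centisome = 100 * five_prime / GENOME_LENGTH

print(gene_length)          # 1014
print(round(centisome, 2))  # 80.59
```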

GC content: 37.38

Gene sequence:

>1014_bases
ATGAGCAGAAAATATTTTGAAGAAGAAGTCATTCAACAGACTTTAGATTATAACTATGCACAACATAGTGATGCTGATAA
ATTTAATATAGCTTATGGGATTGATAAAAACTTTCTTTTTGGCTGTGGTGTCTCTATTGCATCGGTTCTCCTCGCTAATC
CAGAGAAGGCGTTAGCTTTCCATGTTTTTACCGATTTCTTTGACTCTGAAGACCAGCAGCGATTTGAGGCATTAGCAAAA
CAGTACGCTACGCAGATTGTTGTTTACCTAATCGACTGTGAGCGCTTAAAATCGCTACCCAGTACCAAAAACTGGACCTA
TGCAACATACTTTAGATTCATTATCGCCGATTATTTTTCAGATAAAACAGATAGAGTACTTTATCTGGATGCAGATATTG
CATGTAAGGGGAGTATTCAGGAACTTATTGATCTTAATTTTGCTGAAAATGAGATTGCGGCTGTCGTTGCTGAAGGCGAG
TTGGAATGGTGGACTAAGCGCTCGGTTAGCCTGGCAACGCCTGGGCTGGTTTCTGGCTATTTTAATGCCGGTTTTATTTT
AATTAACATACCTCTTTGGACTGCAGAAAATATCTCTAAGAAAGCGATTGAAATGCTAAAAGATCCAGAGGTAGTACAGC
GCATAACGCACCTTGATCAGGATGTATTAAATATATTTTTAGTGAATAAAGCGCGTTTTGTTGATAAAAAATTTAATACA
CAATTTAGTCTTAACTATGAATTAAAAGATTCAGTTATTAATCCAGTTGATGCTGAGACTGTATTTGTTCATTATATCGG
ACCAACGAAGCCCTGGCATAGTTGGGGGGCTTACCCCGTGTCACAATATTTTTTACAGGCTAAGAGCAACTCACCATGGT
CTCATTGTGCACTTTTAAATCCAGTGACTAGCCATCAGTTACGTTATGCGGCAAAGCATATGTTTAATCAGAAGCATTAT
ACTTCGGGTATAAATTATTATATAGCCTACTTTAAACGTAAACTTCTTGAATAA
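
The GC content field can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length; the function name `gc_content` is hypothetical and not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks, normalise case
    if not seq:
        raise ValueError("empty sequence")
    return 100 * sum(base in "GC" for base in seq) / len(seq)

# First ten bases of the gene sequence above: 3 G + 1 C out of 10.
print(gc_content("ATGAGCAGAA"))  # 40.0
```

Passing the complete 1014-base sequence should give a figure comparable to the listed GC content, provided the database uses the same definition.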

Upstream 100 bases:

>100_bases
AGCATGATATTATTCCCGGCACGATTGAGAGATTTTACGATGTGCTATATTTTAAAAATTTTAATAATGCAATATTCTCG
AAATTACAAAAGTGATCACT

Downstream 100 bases:

>100_bases
AACCCATAGGTGATGTAATGGATTCATTTCCTGAGATAGAAATAGCTGAATATAAAGTTTTTGATGAAAGTAATAATAAT
GATGATAACGTATTAAACAT

Product: lipopolysaccharide-alpha-1,3-D-galactosyltransferase

Products: NA

Alternate protein names: Lipopolysaccharide 3-alpha-galactosyltransferase

Number of amino acids: Translated: 337; Mature: 336

Protein sequence:

>337_residues
MSRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAFHVFTDFFDSEDQQRFEALAK
QYATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFSDKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGE
LEWWTKRSVSLATPGLVSGYFNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNT
QFSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLNPVTSHQLRYAAKHMFNQKHY
TSGINYYIAYFKRKLLE
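
The protein sequence follows from the gene sequence by codon-wise translation: 1014 bases give 338 codons, that is 337 residues plus the stop codon. The sketch below uses the standard codon assignments, which bacterial translation table 11 shares apart from its alternative start codons. The helper name `translate` is an assumption for illustration, not the database's own tool.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 bases of the gene sequence in this record.
print(translate("ATGAGCAGAAAATATTTTGAAGAAGAAGTC"))  # MSRKYFEEEV
print(translate("CTTCTTGAATAA"))                    # LLE
```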

Sequences:

>Translated_337_residues
MSRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAFHVFTDFFDSEDQQRFEALAK
QYATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFSDKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGE
LEWWTKRSVSLATPGLVSGYFNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNT
QFSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLNPVTSHQLRYAAKHMFNQKHY
TSGINYYIAYFKRKLLE
>Mature_336_residues
SRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAFHVFTDFFDSEDQQRFEALAKQ
YATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFSDKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGEL
EWWTKRSVSLATPGLVSGYFNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNTQ
FSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLNPVTSHQLRYAAKHMFNQKHYT
SGINYYIAYFKRKLLE

Specific function: Adds the galactose(I) group on the glucose(I) group of LPS

COG id: COG1442

COG function: function code M; Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 8 family

Homologues:

Organism=Escherichia coli, GI1790057, Length=337, Percent_Identity=54.0059347181009, Blast_Score=397, Evalue=1e-112,
Organism=Escherichia coli, GI1790056, Length=324, Percent_Identity=34.2592592592593, Blast_Score=196, Evalue=2e-51,
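
Each homologue line is a comma-separated list of `key=value` fields plus a bare `GI` token. A minimal parsing sketch follows; the function name `parse_homologue` and the choice of numeric types are assumptions, and the layout is inferred only from the two lines shown here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty field left by the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1790057

    # Convert the numeric fields where present.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record

line = ("Organism=Escherichia coli, GI1790057, Length=337, "
        "Percent_Identity=54.0059347181009, Blast_Score=397, Evalue=1e-112,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1790057 54.0
```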

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): RFAI_SALTY (P19816)

Other databases:

- EMBL:   X53847
- EMBL:   AF026386
- EMBL:   AE006468
- PIR:   S12097
- RefSeq:   NP_462618.1
- ProteinModelPortal:   P19816
- PRIDE:   P19816
- GeneID:   1255242
- GenomeReviews:   AE006468_GR
- KEGG:   stm:STM3718
- NMPDR:   fig|99287.1.peg.3594
- HOGENOM:   HBG417202
- OMA:   MLADKAV
- ProtClustDB:   PRK15171
- BioCyc:   STYP99287:STM3718-MONOMER
- BRENDA:   2.4.1.44
- InterPro:   IPR002495
- InterPro:   IPR013645

Pfam domain/function: PF01501 Glyco_transf_8; PF08437 Glyco_transf_8N

EC number: 2.4.1.44

Molecular weight: Translated: 38905; Mature: 38774
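
The translated and mature weights differ by 131, which is consistent with removal of the initiator methionine. The sketch below shows one common way to estimate a protein's molecular weight: sum the average residue masses and add one water. The mass table and the function name `molecular_weight` are assumptions; the database may use a different table or rounding, so results need not match the listed figures exactly.

```python
# Average residue masses in daltons (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water for the free N- and C-termini

def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER

# Sanity check on glycylglycine.
print(round(molecular_weight("GG"), 2))  # 132.12

# Removing an N-terminal Met lowers the weight by one Met residue mass.
print(round(molecular_weight("MSRK") - molecular_weight("SRK"), 2))  # 131.19
```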

Theoretical pI: Translated: 6.24; Mature: 6.24

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
0.9 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
0.6 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
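
These percentages can be reproduced by counting residues in the protein sequence given earlier in this record. The sketch assumes each value is the residue count divided by the sequence length, rounded to one decimal place, and that the mature protein is the translated protein without its initiator Met. The helper name `percent` is hypothetical.

```python
# Translated protein sequence copied from this record.
TRANSLATED = (
    "MSRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAFHVFTDFFDSEDQQRFEALAK"
    "QYATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFSDKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGE"
    "LEWWTKRSVSLATPGLVSGYFNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNT"
    "QFSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLNPVTSHQLRYAAKHMFNQKHY"
    "TSGINYYIAYFKRKLLE"
)
MATURE = TRANSLATED[1:]  # assumption: mature form lacks the initiator Met

def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 1.2 0.9 2.1
# Mature 1.2 0.6 1.8
```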

Secondary structure:

>Translated Secondary Structure
MSRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAF
CCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCEEECCHHHHHHHHCCCCHHHHH
HVFTDFFDSEDQQRFEALAKQYATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHEEEHHHHHCCCCCCCCHHHHHHHHHHHHHHC
DKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGELEWWTKRSVSLATPGLVSGY
CCCCEEEEEECCCCCCCCHHHHHCCCCCCCCEEEEEECCCCHHHHCCCCCCCCCCHHHHC
FNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNT
CCCCEEEEEECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
QFSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLN
EEEEEEEHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCHHHEEECC
PVTSHQLRYAAKHMFNQKHYTSGINYYIAYFKRKLLE
CHHHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHCC
>Mature Secondary Structure 
SRKYFEEEVIQQTLDYNYAQHSDADKFNIAYGIDKNFLFGCGVSIASVLLANPEKALAF
CCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCEEECCHHHHHHHHCCCCHHHHH
HVFTDFFDSEDQQRFEALAKQYATQIVVYLIDCERLKSLPSTKNWTYATYFRFIIADYFS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHEEEHHHHHCCCCCCCCHHHHHHHHHHHHHHC
DKTDRVLYLDADIACKGSIQELIDLNFAENEIAAVVAEGELEWWTKRSVSLATPGLVSGY
CCCCEEEEEECCCCCCCCHHHHHCCCCCCCCEEEEEECCCCHHHHCCCCCCCCCCHHHHC
FNAGFILINIPLWTAENISKKAIEMLKDPEVVQRITHLDQDVLNIFLVNKARFVDKKFNT
CCCCEEEEEECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
QFSLNYELKDSVINPVDAETVFVHYIGPTKPWHSWGAYPVSQYFLQAKSNSPWSHCALLN
EEEEEEEHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCHHHEEECC
PVTSHQLRYAAKHMFNQKHYTSGINYYIAYFKRKLLE
CHHHHHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHCC
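
The per-residue state strings above can be condensed into segments and an overall composition. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which this record does not define explicitly. The function names and the toy input string are illustrative only.

```python
from itertools import groupby

def segments(states: str) -> list:
    """Collapse a state string into (state, start, end) runs, 1-based inclusive."""
    runs, pos = [], 1
    for state, group in groupby(states):
        length = len(list(group))
        runs.append((state, pos, pos + length - 1))
        pos += length
    return runs

def composition(states: str) -> dict:
    """Percentage of residues in each of the three assumed states."""
    return {s: round(100 * states.count(s) / len(states), 1) for s in "HEC"}

toy = "CCHHHEEC"  # toy example, not taken from the record
print(segments(toy))
# [('C', 1, 2), ('H', 3, 5), ('E', 6, 7), ('C', 8, 8)]
print(composition(toy))
# {'H': 37.5, 'E': 25.0, 'C': 37.5}
```

To summarise the prediction in this record, join the state lines (the lines made up of H, E and C under each sequence line) into one string and pass it to either function.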

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2235496; 9535865; 11677609