Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108


The map label for this gene is rfbC [H]

Identifier: 113476435

GI number: 113476435

Start: 4430305

End: 4432779

Strand: Reverse

Name: rfbC [H]

Synonym: Tery_2849

Alternate gene names: 113476435

Gene position: 4432779-4430305 (Counterclockwise)
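
Note: the Start/End values and the reverse (counterclockwise) orientation can be cross-checked against the ">2475_bases" header below. The following is a minimal sketch, assuming 1-based inclusive coordinates; the helper name reverse_complement and the short test fragment are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START, END = 4430305, 4432779


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string (A/C/G/T/N)."""
    complement = str.maketrans("ACGTN", "TGCAN")
    return seq.translate(complement)[::-1]


# Inclusive 1-based coordinates are assumed, hence the +1.
gene_length = END - START + 1
print(gene_length)  # 2475

# A reverse-strand gene is read as the reverse complement of the
# forward-strand bases over the same interval. Illustrative fragment only:
print(reverse_complement("CTATATTGAATT"))  # AATTCAATATAG
```

The printed length agrees with the size given in the gene sequence header.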

Preceding gene: 113476436

Following gene: 113476431

Centisome position: 57.2
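
Note: the record does not state how the centisome position is derived. A common reading, assumed here, is the gene coordinate expressed as a percentage of the chromosome length. The function name centisome is hypothetical.

```python
GENOME_LENGTH = 7750108  # "Length" field of the record
GENE_START = 4430305     # "Start" field
GENE_END = 4432779       # "End" field


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to one decimal."""
    return round(100.0 * position / genome_length, 1)


# Either coordinate rounds to the value listed in the record.
print(centisome(GENE_START, GENOME_LENGTH))  # 57.2
print(centisome(GENE_END, GENOME_LENGTH))    # 57.2
```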

GC content: 35.88
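
Note: GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that definition and two-decimal rounding; gc_content is a hypothetical helper, and the short strings are toy inputs rather than the gene itself.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


print(gc_content("ATGC"))    # 50.0
print(gc_content("AT\nGG"))  # 50.0 (line breaks are stripped first)
```

Passing the full gene sequence (with its line breaks) to the same function should give a figure comparable to the one listed above.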

Gene sequence:

>2475_bases
ATGAAAACTTGTGAAAATTACCTAGAAGAAGCCAAAGATTTCTACAAGAATCAGAGCTGGTCAGAAGCGATCGCTGCCTA
CCAACGCGCCCTCGAACTCAACCCCAACCTCCCAGGAATACACAAAAAAATCGGGGATGCTCTACAACAACAAGCAAAAG
CAGAAAAAACCAATCTGCTGAACTATTACAAACAGAAAATTCAACAAAACCCAGACGATCTACAAACCTACTATCAAACA
TTAGAAATTTCTCCCAATGACGCCGAGGTTTACCTGGGCTTAGGAAAAGCGTTGACCAAAAAAGGATTCTTCGACAAAGC
TCAACTCGCCTATCAAAAAGTTCTCCAACTGCAACCCCACCATCCCTTAGCAGAGAGTTTTCAATGTGCTCCAAATTCCA
CCATTAGATCCCCAGACCCTTCTCTCCCCCAAACCCCCCAACTAGACCAAGCAAAACAAACTCTTGATACTCTTAACCAA
ATCACTCTTGATAGCTTCCTCAATACCAACTCTCAACTAAACTTTCCCCTGGTTGAAAATCCAGAAATTAGCATCATTAT
TATTCTCTATAACCGAGCAGAAATAACTTTAAGCTGCTTATATTCCCTTTTGAGAAACCCCTTTCAATCTTTTGAACTGA
TATTAGTCGATAACAACTCCACTGATACCACCCGTCAACTGTTACAACAGATAAACGGGGCCAAAATTACCCTGAATCAT
CAAAATCTTCATTATCTTCTGGGATGTAACCAAGGTAGCAAAATTGCCCAAGGAGACTATCTCCTATTTTTAAATAATGA
TGCTCAAGTTTTAGGTAATAGTATCCCCTCCGCTCTCGAAACTATTAAATCTTCAGATGATATTGGTGCAGTGGGTGGAA
AACTAATCCTTCCCAATGGCACTCTCCAAGAAGCAGGAAGTATTATTTGGCAAGATGGAACTTGCTTAGGATATGGACGT
GGAAACTTACCCACTGCACCAGAATATCAATTTCAACGTGCTGTTGATTATTGTTCGGGAGCTTTTCTCTTAACTCGACG
AGATTTGTTTTTACAATTAGAAGGGTTTGATAAAGATTATCAACCAGCTTATTTTGAGGAAACCGACTATTGTGTGCGAC
TGCAAAAATTAGGAAAAAAAATTATTTACGACCCCAATGTAAATATTCTCCATTATGAGTTTGCTAGTTCCAGTCATACA
GGTTCAAGCGAGCAGGCGAGCGCTCTGATGGAAAAAAATCAAAAGATATTTCAACAAAAACACAAAGACTGGTTTTCATC
TCAATATCTTACCGAACTCAAGAATTTAATATTTGCTCGAACTCAAGCTAGAGAAAGACCAAAACCCATATTATTTATTG
ATGATAGAATTCCTCATCCTTGGTTAGGTCCAGGTTATACTCGCAGTCATTCTATTCTCTGTAATCTTGTTAAATTAGGC
TATTTTGTTACCCTTTACCCAGGGGATCTGCGTCATCTCGAAGATTGGTCAACTGTTTATGCCGATATTCCCCAAACTGT
TGAAGTTATGCGAGGATATGGGTTAATTATGTTAGAAGATTTTTTAAGAGAACGCAAAGGATACTATGATTTAGTTTTTA
TTAGTCGCCCTCACAATATCAAACATTTAAACTCTATTTTAGTTAAAGAAAATTTACTCAAATCAGCAAAAATTATTTAT
GATGCTGAAGCAATTTTTAGCATTCGAGACTATGAATATAAACGTTTAAATCAAACTTATTTTACTGAAATAGAACGTCA
AACAGCCATTAAAGAAGAAATTAAACTGGCAAAAAATAGCCATCATATTATCACCGTTTCCTCCCCAGAAAAACAACAAT
TTATTGAGCAAAACTATGCCAATGTCAGCTTGTTAGGTCATTCTCTGTCTGTTTATCCCACACCTCAATCTTTTTTGTCA
AGACAAAATTTCTTATTTGTGGGTTCAGTTTATGAAAAAGAATCTCCCAATGCTGATTCTATTTTATGGCTAACTTCTGA
AATATTTCCTCTAATTCAGAAACAACTAACTCAAGATGTTGAATTAATAATTATTGGGAATAATACTGTAGAAGAAATTC
AACAAAAAGTAAATAGTTTAAACAATACATCAATCAAAATTTTAGGAAAAGTAGATGATATCAGACCATTTTATAATCAG
GCGAGATTATTTCTAGCTCCCACTCGATATGCAGCAGGAATTCCTCATAAAGTACATGAAGCAGCAGCTTACGGTTTACC
TATTGTTACCACATCCTTAATTGCTCAACAATTAGGATGGAAACATGAAACAGAATTATTAGTTAGCGATGATCCAATAA
ATTTTACTCAACAATGTGTCAAGCTTTATCAAGATTCAAGCTTATGGAATAAACTGAGAAAAAATGCCATAAAGCGAGTT
CAAACTGAATGTTCTCCTGAATTTTTTTCAGAAACTTTAAAGTCGATCTTAAAATCTTTAGAAAATTCAATATAG

Upstream 100 bases:

>100_bases
ATCAACCTAGTTTATGAAGTAGTTTTATTTTTTTATTCGCACTTATATTTCCTCCTTGTAATAAAAGGGATGTGGAACTT
TTTGACTAAAATTTAAGAAT

Downstream 100 bases:

>100_bases
ATAGCTTTTAGTAAATCATTTTCAATATACTAATCTTTAGGGAACTTATTAGTAAAAAAACATGATCTTCACCTCTAAAT
AAATTTAGCATATATTCTGC

Product: glycosyl transferase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 824; Mature: 824

Protein sequence:

>824_residues
MKTCENYLEEAKDFYKNQSWSEAIAAYQRALELNPNLPGIHKKIGDALQQQAKAEKTNLLNYYKQKIQQNPDDLQTYYQT
LEISPNDAEVYLGLGKALTKKGFFDKAQLAYQKVLQLQPHHPLAESFQCAPNSTIRSPDPSLPQTPQLDQAKQTLDTLNQ
ITLDSFLNTNSQLNFPLVENPEISIIIILYNRAEITLSCLYSLLRNPFQSFELILVDNNSTDTTRQLLQQINGAKITLNH
QNLHYLLGCNQGSKIAQGDYLLFLNNDAQVLGNSIPSALETIKSSDDIGAVGGKLILPNGTLQEAGSIIWQDGTCLGYGR
GNLPTAPEYQFQRAVDYCSGAFLLTRRDLFLQLEGFDKDYQPAYFEETDYCVRLQKLGKKIIYDPNVNILHYEFASSSHT
GSSEQASALMEKNQKIFQQKHKDWFSSQYLTELKNLIFARTQARERPKPILFIDDRIPHPWLGPGYTRSHSILCNLVKLG
YFVTLYPGDLRHLEDWSTVYADIPQTVEVMRGYGLIMLEDFLRERKGYYDLVFISRPHNIKHLNSILVKENLLKSAKIIY
DAEAIFSIRDYEYKRLNQTYFTEIERQTAIKEEIKLAKNSHHIITVSSPEKQQFIEQNYANVSLLGHSLSVYPTPQSFLS
RQNFLFVGSVYEKESPNADSILWLTSEIFPLIQKQLTQDVELIIIGNNTVEEIQQKVNSLNNTSIKILGKVDDIRPFYNQ
ARLFLAPTRYAAGIPHKVHEAAAYGLPIVTTSLIAQQLGWKHETELLVSDDPINFTQQCVKLYQDSSLWNKLRKNAIKRV
QTECSPEFFSETLKSILKSLENSI
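
Note: the residue count follows from the gene length: 2475 bases give 825 codons, of which the last is a stop codon, leaving 824 amino acids. The sketch below assumes the standard genetic code and a hypothetical translate helper; it is checked only against the first and last codons of the gene sequence above.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAACTTGTGAAAATTAC"))  # MKTCENY (first 7 codons)
print(translate("GAAAATTCAATATAG"))        # ENSI (last 4 codons + stop)
print(2475 // 3 - 1)                       # 824
```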

Sequences:

>Translated_824_residues
MKTCENYLEEAKDFYKNQSWSEAIAAYQRALELNPNLPGIHKKIGDALQQQAKAEKTNLLNYYKQKIQQNPDDLQTYYQT
LEISPNDAEVYLGLGKALTKKGFFDKAQLAYQKVLQLQPHHPLAESFQCAPNSTIRSPDPSLPQTPQLDQAKQTLDTLNQ
ITLDSFLNTNSQLNFPLVENPEISIIIILYNRAEITLSCLYSLLRNPFQSFELILVDNNSTDTTRQLLQQINGAKITLNH
QNLHYLLGCNQGSKIAQGDYLLFLNNDAQVLGNSIPSALETIKSSDDIGAVGGKLILPNGTLQEAGSIIWQDGTCLGYGR
GNLPTAPEYQFQRAVDYCSGAFLLTRRDLFLQLEGFDKDYQPAYFEETDYCVRLQKLGKKIIYDPNVNILHYEFASSSHT
GSSEQASALMEKNQKIFQQKHKDWFSSQYLTELKNLIFARTQARERPKPILFIDDRIPHPWLGPGYTRSHSILCNLVKLG
YFVTLYPGDLRHLEDWSTVYADIPQTVEVMRGYGLIMLEDFLRERKGYYDLVFISRPHNIKHLNSILVKENLLKSAKIIY
DAEAIFSIRDYEYKRLNQTYFTEIERQTAIKEEIKLAKNSHHIITVSSPEKQQFIEQNYANVSLLGHSLSVYPTPQSFLS
RQNFLFVGSVYEKESPNADSILWLTSEIFPLIQKQLTQDVELIIIGNNTVEEIQQKVNSLNNTSIKILGKVDDIRPFYNQ
ARLFLAPTRYAAGIPHKVHEAAAYGLPIVTTSLIAQQLGWKHETELLVSDDPINFTQQCVKLYQDSSLWNKLRKNAIKRV
QTECSPEFFSETLKSILKSLENSI
>Mature_824_residues
MKTCENYLEEAKDFYKNQSWSEAIAAYQRALELNPNLPGIHKKIGDALQQQAKAEKTNLLNYYKQKIQQNPDDLQTYYQT
LEISPNDAEVYLGLGKALTKKGFFDKAQLAYQKVLQLQPHHPLAESFQCAPNSTIRSPDPSLPQTPQLDQAKQTLDTLNQ
ITLDSFLNTNSQLNFPLVENPEISIIIILYNRAEITLSCLYSLLRNPFQSFELILVDNNSTDTTRQLLQQINGAKITLNH
QNLHYLLGCNQGSKIAQGDYLLFLNNDAQVLGNSIPSALETIKSSDDIGAVGGKLILPNGTLQEAGSIIWQDGTCLGYGR
GNLPTAPEYQFQRAVDYCSGAFLLTRRDLFLQLEGFDKDYQPAYFEETDYCVRLQKLGKKIIYDPNVNILHYEFASSSHT
GSSEQASALMEKNQKIFQQKHKDWFSSQYLTELKNLIFARTQARERPKPILFIDDRIPHPWLGPGYTRSHSILCNLVKLG
YFVTLYPGDLRHLEDWSTVYADIPQTVEVMRGYGLIMLEDFLRERKGYYDLVFISRPHNIKHLNSILVKENLLKSAKIIY
DAEAIFSIRDYEYKRLNQTYFTEIERQTAIKEEIKLAKNSHHIITVSSPEKQQFIEQNYANVSLLGHSLSVYPTPQSFLS
RQNFLFVGSVYEKESPNADSILWLTSEIFPLIQKQLTQDVELIIIGNNTVEEIQQKVNSLNNTSIKILGKVDDIRPFYNQ
ARLFLAPTRYAAGIPHKVHEAAAYGLPIVTTSLIAQQLGWKHETELLVSDDPINFTQQCVKLYQDSSLWNKLRKNAIKRV
QTECSPEFFSETLKSILKSLENSI

Specific function: Involved in O-antigen biosynthesis [H]

COG id: COG0438

COG function: function code M; Glycosyltransferase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001296
- InterPro:   IPR001173 [H]

Pfam domain/function: PF00534 Glycos_transf_1; PF00535 Glycos_transf_2 [H]

EC number: NA

Molecular weight: Translated: 94103; Mature: 94103

Theoretical pI: Translated: 6.64; Mature: 6.64

Prosite motif: PS50005 TPR; PS50293 TPR_REGION

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
0.5 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
0.5 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
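
Note: these percentages are presumably residue counts divided by the protein length (824). A minimal sketch under that assumption follows; residue_percent is a hypothetical helper and the input is a toy string, not the protein above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues, to one decimal place."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


toy = "MCAAAAAAAA"  # 10 residues: one Met, one Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applying the function to the 824-residue sequence with "C", "M", and "CM" should reproduce the three figures listed for each form of the protein.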

Secondary structure:

>Translated Secondary Structure
MKTCENYLEEAKDFYKNQSWSEAIAAYQRALELNPNLPGIHKKIGDALQQQAKAEKTNLL
CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH
NYYKQKIQQNPDDLQTYYQTLEISPNDAEVYLGLGKALTKKGFFDKAQLAYQKVLQLQPH
HHHHHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCHHHHHCCCCHHHHHHHHHHHHCCCC
HPLAESFQCAPNSTIRSPDPSLPQTPQLDQAKQTLDTLNQITLDSFLNTNSQLNFPLVEN
CCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECC
PEISIIIILYNRAEITLSCLYSLLRNPFQSFELILVDNNSTDTTRQLLQQINGAKITLNH
CCEEEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEEEEC
QNLHYLLGCNQGSKIAQGDYLLFLNNDAQVLGNSIPSALETIKSSDDIGAVGGKLILPNG
CCEEEEEECCCCCCCCCCCEEEEECCCHHHHHCHHHHHHHHHHCCCCCCCCCCEEECCCC
TLQEAGSIIWQDGTCLGYGRGNLPTAPEYQFQRAVDYCSGAFLLTRRDLFLQLEGFDKDY
CHHHCCCEEEECCCEEECCCCCCCCCCCHHHHHHHHHHCCEEEEEECEEEEEEECCCCCC
QPAYFEETDYCVRLQKLGKKIIYDPNVNILHYEFASSSHTGSSEQASALMEKNQKIFQQK
CCCCCCCCHHHHHHHHHCCEEEECCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH
HKDWFSSQYLTELKNLIFARTQARERPKPILFIDDRIPHPWLGPGYTRSHSILCNLVKLG
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHC
YFVTLYPGDLRHLEDWSTVYADIPQTVEVMRGYGLIMLEDFLRERKGYYDLVFISRPHNI
EEEEECCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHCCCCEEEEEEECCCCH
KHLNSILVKENLLKSAKIIYDAEAIFSIRDYEYKRLNQTYFTEIERQTAIKEEIKLAKNS
HHHHHHHHHHHHHHHHHHEEEHHHHHHHHCCHHHHHCHHHHHHHHHHHHHHHHHHHHCCC
HHIITVSSPEKQQFIEQNYANVSLLGHSLSVYPTPQSFLSRQNFLFVGSVYEKESPNADS
CEEEEECCCHHHHHHHHCCCEEEEECCCEEECCCCHHHHCCCCEEEEEEHHCCCCCCCCE
ILWLTSEIFPLIQKQLTQDVELIIIGNNTVEEIQQKVNSLNNTSIKILGKVDDIRPFYNQ
EEEEHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHC
ARLFLAPTRYAAGIPHKVHEAAAYGLPIVTTSLIAQQLGWKHETELLVSDDPINFTQQCV
CEEEEECCHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHH
KLYQDSSLWNKLRKNAIKRVQTECSPEFFSETLKSILKSLENSI
HHHHCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MKTCENYLEEAKDFYKNQSWSEAIAAYQRALELNPNLPGIHKKIGDALQQQAKAEKTNLL
CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH
NYYKQKIQQNPDDLQTYYQTLEISPNDAEVYLGLGKALTKKGFFDKAQLAYQKVLQLQPH
HHHHHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCHHHHHCCCCHHHHHHHHHHHHCCCC
HPLAESFQCAPNSTIRSPDPSLPQTPQLDQAKQTLDTLNQITLDSFLNTNSQLNFPLVEN
CCHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECC
PEISIIIILYNRAEITLSCLYSLLRNPFQSFELILVDNNSTDTTRQLLQQINGAKITLNH
CCEEEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEEEEC
QNLHYLLGCNQGSKIAQGDYLLFLNNDAQVLGNSIPSALETIKSSDDIGAVGGKLILPNG
CCEEEEEECCCCCCCCCCCEEEEECCCHHHHHCHHHHHHHHHHCCCCCCCCCCEEECCCC
TLQEAGSIIWQDGTCLGYGRGNLPTAPEYQFQRAVDYCSGAFLLTRRDLFLQLEGFDKDY
CHHHCCCEEEECCCEEECCCCCCCCCCCHHHHHHHHHHCCEEEEEECEEEEEEECCCCCC
QPAYFEETDYCVRLQKLGKKIIYDPNVNILHYEFASSSHTGSSEQASALMEKNQKIFQQK
CCCCCCCCHHHHHHHHHCCEEEECCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH
HKDWFSSQYLTELKNLIFARTQARERPKPILFIDDRIPHPWLGPGYTRSHSILCNLVKLG
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHC
YFVTLYPGDLRHLEDWSTVYADIPQTVEVMRGYGLIMLEDFLRERKGYYDLVFISRPHNI
EEEEECCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHCCCCEEEEEEECCCCH
KHLNSILVKENLLKSAKIIYDAEAIFSIRDYEYKRLNQTYFTEIERQTAIKEEIKLAKNS
HHHHHHHHHHHHHHHHHHEEEHHHHHHHHCCHHHHHCHHHHHHHHHHHHHHHHHHHHCCC
HHIITVSSPEKQQFIEQNYANVSLLGHSLSVYPTPQSFLSRQNFLFVGSVYEKESPNADS
CEEEEECCCHHHHHHHHCCCEEEEECCCEEECCCCHHHHCCCCEEEEEEHHCCCCCCCCE
ILWLTSEIFPLIQKQLTQDVELIIIGNNTVEEIQQKVNSLNNTSIKILGKVDDIRPFYNQ
EEEEHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHC
ARLFLAPTRYAAGIPHKVHEAAAYGLPIVTTSLIAQQLGWKHETELLVSDDPINFTQQCV
CEEEEECCHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHH
KLYQDSSLWNKLRKNAIKRVQTECSPEFFSETLKSILKSLENSI
HHHHCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC
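
Note: the secondary structure blocks interleave a residue line with a state line of equal length. The sketch below separates the two and summarises the states, assuming the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out. Both function names are hypothetical.

```python
from collections import Counter


def split_interleaved(lines):
    """Join alternating residue/state lines into two full-length strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state lines differ in length")
    return residues, states


def ss_composition(states: str) -> dict:
    """Percentage of each state (H, E, C), to one decimal place."""
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy input in the same layout as the block above.
lines = ["MKTCE", "CCHHH", "NYLEE", "HHEEC"]
residues, states = split_interleaved(lines)
print(residues)                # MKTCENYLEE
print(ss_composition(states))  # {'H': 50.0, 'E': 20.0, 'C': 30.0}
```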

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8626291 [H]