Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is ndvB [H]

Identifier: 222527375

GI number: 222527375

Start: 5157041

End: 5159422

Strand: Direct

Name: ndvB [H]

Synonym: Chy400_4165

Alternate gene names: 222527375

Gene position: 5157041-5159422 (Clockwise)

Preceding gene: 222527374

Following gene: 222527376

Centisome position: 97.88
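
The gene length and centisome value can be cross-checked from the coordinates in this record. The sketch below assumes (the record does not state this) that coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the chromosome length. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values copied from this record.
print(gene_length(5157041, 5159422))  # 2382
print(centisome(5157041, 5268950))    # 97.88
```

Both results agree with the ">2382_bases" header and the centisome position listed in the record.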

GC content: 54.87
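
A minimal sketch of how a GC percentage like the one above could be computed, assuming it is simply the share of G and C bases in the gene sequence, rounded to two decimals. The helper name gc_percent is hypothetical, and the example runs on only the first ten bases of the gene, not the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


# First ten bases of the gene sequence in this record.
print(gc_percent("ATGGAACACC"))  # 50.0
```

Passing the full 2382-base sequence, with line breaks removed, would give the figure to compare against the value reported here.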

Gene sequence:

>2382_bases
ATGGAACACCTGTTTGCCAGCACCTATGGCTACTTTCGAGCCGATGGACGGGAATACGTGATCACGACTCCCTTCACCCC
CCGCCCCTGGGGGAACGTGATCAGTAATGGCGACTACGCGATGATGGTGTCGCAAACCGGATCGGGTTATAGCTGGCGCA
ACAATGCCGGTCAAAACCGGATTACCCGTTCATTTCAAGACCTGATTCAGGACAACTGGGGAAAATATCTGTACCTGCGC
GATCTCGACAGCGGGCAGTATTGGGCGGCCACGTACAAACCAACCTGTCATCAGTACGATCACTATCAGGTCAGGCACGG
CCTGGGCTATTCATCGTTTGAACAGATCGTACAAGGCATTCACAGCATCCTGACCGTCTTTGTCGCCCCAGATGATCCGG
TCGAAATCTTTCAGCTTACCCTCACCAATCAGAGTGATCGTCAGCGGCGTCTTGACATTACCTCATACGTCGAATGGCTG
CTGGGTTTTGCCCCTGATGAACACCGTGAGTTTCACAAATTATTTATTGAGACGACGCCTGAGCCGGAGTTTCATGCGTT
GCTGGCGCGCAAATATCTGTGGGGGTTTGCCGATGAACTGGGGCGCCACAATAATATTGACTGGCCCTATGTAGCATTTA
TGGCCGTCAGTGAACCGCTGAAGAGCTTCGATGGCGATAAAGAGTCGTTTATCGGTCTGTATTCCAGTCTTGAACACCCC
CAGGCAATGCAGCAACCAACGCTAGCTGGGCGTAGTGGTCGCTTCGGCGATGCGATTGCCGCTCTTCAGGTTGAAATCAC
GCTAGCACCGGGCGAACGTCGCACAGTGGTCTTTACGCTGGGGGCTGCACTGCAGGGAAGTGAAGACCCTCTCGCGCTCA
TTCAACGCTATACGTCAGTTACTGCAAGTGATCAGACGTTACAGGCCGTGCATGCTTTTTGGTCGCGACTGGTTGATGCC
GAGCACGTTGAGACACCTGATCCGGCCCTCAATCTGATGACCAATTACTGGCTGAAGTACCAGGCGATCTCTGGGCGATT
GTGGGGAAAATCGGCCTTTTACCAGGTTTCAGCCGGCTACGGGTTCCGTGATCAACTACAAGACTGCCAGATATTTCTGG
TCTGCGATCCGGCATTAGCCAAACGACAAATCCTCCTGCACGCTGCCCAGCAATTTGTCGAAGGCGATGTGTTGCACTGG
TGGTTCTCGATTCGCGGTGGTGGACCACGCACCAATTGTTCAGACGATCTACTCTGGCTACCTTTTGTGGTTGACAGCTA
CCTGCAAGAGACCGGTGATGTCGCTATTCTCGACGAAATGGTACCGTATTTGAACGGCCCGGCGGAGCCGCTCTACCTGC
ACTGCAAACGAGCGATTGAACGCGCGTTCAGTCGTTTCTCGCCACGAGGGATTCCGCTGATGGGTGATCACGACTGGAAC
GATGGCCTCAACGCAGTCGGCACCCAGTTGCGCGGCGAGAGTTTCTGGGTGGCCGAATTTCTCTACACTATTCTCGAACG
CTGGATTCCACTGGCGCAACAACGGTCAGATGAAGCATTTGCCGAACGCTGCACTGCCGTGCGCACCACCTTACGGTGGG
CAATGAATCGCTATGGCTGGGATGGAGAGTGGTTTTTGCAAGCCACCACCGATGCCGGCCTGCCACTAGGCTCACAGCAA
AATGAAGAGGGGCGCATCTTCTTGATGCCCAATATCTGGGCCGTTATCAGTGGCATCACCGATCAGCAACGGGCATGGCA
GGCAATGCAGGCAGTGAGTCGTTACCTGCTGTGCGACTACGGAACCCTGCTCAATTATCCTGCCTTTACGCGCCCCCGTT
CCGACATCGGTTATGTAACCCGTTATGCGCCAGGATTACGTGAAAACGGTGGGGTCTATACCCACGCCGCTACCTGGTCG
GTATGGGCATATGCCCTGCTTGGTGATGTCGACCATGCCTACGAAGCGTACCGTCGCATTTGTCCACCCAATCGCAGTGC
CGACATTGAACGCTACAAAGCCGAACCCTACGTGACGCCGGGCAATATTGATGGGCCGCAATCTCCATACTTTGGTCGTG
GTGGATGGACGTGGTATACCGGTTCAGCGCAGTGGCTCCACCGCGTTGCAACACACTGGATTTTGGGCATCCGGCCCCAG
ATCGAGGGGTTACTGATCGATCCACTCATCCCGGCCACCTGGGAACGCTTTACCGTGCGGCGAACGTTTCGCGGTGCAAT
CTACGAGATCGAGGTACTCAATCCAAACCACGTCAATCGTGGGGTCATTTCACTCGAAGTTGATGGTCAACCACTCGCCG
GAACGGTCATCCCGGCATTCAATGATGGGCAAACGCATTCCGTGCGGGTGGTGTTGGGCTAA

Upstream 100 bases:

>100_bases
TTTTTATTCGTTGTAATGAAACGTTTCATACCATCGTGTGCGCATACCGGATGCGCCTTGTTCGCTCCTTAGCTGTACAT
TCATGCAGTGGTGAAACATC

Downstream 100 bases:

>100_bases
CAATCACGAGAAGGAGGTGAGCAACCTGTAGCACGACATAGCGATGTTGCGTGTTTCAGATTCGCAGAAGATTCTCGCGT
TTCAAGGAGGAGAAGCATGA

Product: glycosyl transferase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 793; Mature: 793
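
The residue count follows from the gene length: 2382 bases are 794 codons, and the last codon (TAA) is a stop, leaving 793 residues. The sketch below illustrates this with the standard genetic code, whose codon-to-amino-acid assignments are the same as in the bacterial table. The names CODON_TABLE and translate are illustrative, and only the first 30 bases of the gene are translated here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAACACCTGTTTGCCAGCACCTATGGC"))  # MEHLFASTYG
# 794 codons minus the terminal stop codon.
print(2382 // 3 - 1)  # 793
```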

Protein sequence:

>793_residues
MEHLFASTYGYFRADGREYVITTPFTPRPWGNVISNGDYAMMVSQTGSGYSWRNNAGQNRITRSFQDLIQDNWGKYLYLR
DLDSGQYWAATYKPTCHQYDHYQVRHGLGYSSFEQIVQGIHSILTVFVAPDDPVEIFQLTLTNQSDRQRRLDITSYVEWL
LGFAPDEHREFHKLFIETTPEPEFHALLARKYLWGFADELGRHNNIDWPYVAFMAVSEPLKSFDGDKESFIGLYSSLEHP
QAMQQPTLAGRSGRFGDAIAALQVEITLAPGERRTVVFTLGAALQGSEDPLALIQRYTSVTASDQTLQAVHAFWSRLVDA
EHVETPDPALNLMTNYWLKYQAISGRLWGKSAFYQVSAGYGFRDQLQDCQIFLVCDPALAKRQILLHAAQQFVEGDVLHW
WFSIRGGGPRTNCSDDLLWLPFVVDSYLQETGDVAILDEMVPYLNGPAEPLYLHCKRAIERAFSRFSPRGIPLMGDHDWN
DGLNAVGTQLRGESFWVAEFLYTILERWIPLAQQRSDEAFAERCTAVRTTLRWAMNRYGWDGEWFLQATTDAGLPLGSQQ
NEEGRIFLMPNIWAVISGITDQQRAWQAMQAVSRYLLCDYGTLLNYPAFTRPRSDIGYVTRYAPGLRENGGVYTHAATWS
VWAYALLGDVDHAYEAYRRICPPNRSADIERYKAEPYVTPGNIDGPQSPYFGRGGWTWYTGSAQWLHRVATHWILGIRPQ
IEGLLIDPLIPATWERFTVRRTFRGAIYEIEVLNPNHVNRGVISLEVDGQPLAGTVIPAFNDGQTHSVRVVLG

Sequences:

>Translated_793_residues
MEHLFASTYGYFRADGREYVITTPFTPRPWGNVISNGDYAMMVSQTGSGYSWRNNAGQNRITRSFQDLIQDNWGKYLYLR
DLDSGQYWAATYKPTCHQYDHYQVRHGLGYSSFEQIVQGIHSILTVFVAPDDPVEIFQLTLTNQSDRQRRLDITSYVEWL
LGFAPDEHREFHKLFIETTPEPEFHALLARKYLWGFADELGRHNNIDWPYVAFMAVSEPLKSFDGDKESFIGLYSSLEHP
QAMQQPTLAGRSGRFGDAIAALQVEITLAPGERRTVVFTLGAALQGSEDPLALIQRYTSVTASDQTLQAVHAFWSRLVDA
EHVETPDPALNLMTNYWLKYQAISGRLWGKSAFYQVSAGYGFRDQLQDCQIFLVCDPALAKRQILLHAAQQFVEGDVLHW
WFSIRGGGPRTNCSDDLLWLPFVVDSYLQETGDVAILDEMVPYLNGPAEPLYLHCKRAIERAFSRFSPRGIPLMGDHDWN
DGLNAVGTQLRGESFWVAEFLYTILERWIPLAQQRSDEAFAERCTAVRTTLRWAMNRYGWDGEWFLQATTDAGLPLGSQQ
NEEGRIFLMPNIWAVISGITDQQRAWQAMQAVSRYLLCDYGTLLNYPAFTRPRSDIGYVTRYAPGLRENGGVYTHAATWS
VWAYALLGDVDHAYEAYRRICPPNRSADIERYKAEPYVTPGNIDGPQSPYFGRGGWTWYTGSAQWLHRVATHWILGIRPQ
IEGLLIDPLIPATWERFTVRRTFRGAIYEIEVLNPNHVNRGVISLEVDGQPLAGTVIPAFNDGQTHSVRVVLG
>Mature_793_residues
MEHLFASTYGYFRADGREYVITTPFTPRPWGNVISNGDYAMMVSQTGSGYSWRNNAGQNRITRSFQDLIQDNWGKYLYLR
DLDSGQYWAATYKPTCHQYDHYQVRHGLGYSSFEQIVQGIHSILTVFVAPDDPVEIFQLTLTNQSDRQRRLDITSYVEWL
LGFAPDEHREFHKLFIETTPEPEFHALLARKYLWGFADELGRHNNIDWPYVAFMAVSEPLKSFDGDKESFIGLYSSLEHP
QAMQQPTLAGRSGRFGDAIAALQVEITLAPGERRTVVFTLGAALQGSEDPLALIQRYTSVTASDQTLQAVHAFWSRLVDA
EHVETPDPALNLMTNYWLKYQAISGRLWGKSAFYQVSAGYGFRDQLQDCQIFLVCDPALAKRQILLHAAQQFVEGDVLHW
WFSIRGGGPRTNCSDDLLWLPFVVDSYLQETGDVAILDEMVPYLNGPAEPLYLHCKRAIERAFSRFSPRGIPLMGDHDWN
DGLNAVGTQLRGESFWVAEFLYTILERWIPLAQQRSDEAFAERCTAVRTTLRWAMNRYGWDGEWFLQATTDAGLPLGSQQ
NEEGRIFLMPNIWAVISGITDQQRAWQAMQAVSRYLLCDYGTLLNYPAFTRPRSDIGYVTRYAPGLRENGGVYTHAATWS
VWAYALLGDVDHAYEAYRRICPPNRSADIERYKAEPYVTPGNIDGPQSPYFGRGGWTWYTGSAQWLHRVATHWILGIRPQ
IEGLLIDPLIPATWERFTVRRTFRGAIYEIEVLNPNHVNRGVISLEVDGQPLAGTVIPAFNDGQTHSVRVVLG

Specific function: Involved in the production of beta-(1,2)-glucan. It plays a role in both invasion and bacteroid development [H]

COG id: COG3459

COG function: function code G; Cellobiose phosphorylase

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: To A. tumefaciens ChvB [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008928
- InterPro:   IPR009342
- InterPro:   IPR019282
- InterPro:   IPR021478
- InterPro:   IPR011013
- InterPro:   IPR010383
- InterPro:   IPR010403 [H]

Pfam domain/function: PF06204 CBM_X; PF10091 DUF2329; PF11329 DUF3131; PF06165 Glyco_transf_36; PF06205 GT36_AF [H]

EC number: NA

Molecular weight: Translated: 90264; Mature: 90264

Theoretical pI: Translated: 5.77; Mature: 5.77

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
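
The percentages above can be reproduced from the protein sequence. This is a minimal sketch, assuming each value is the count of the named residues divided by the sequence length and rounded to one decimal. The helper name residue_percent is hypothetical, and the example uses a toy sequence, not the protein in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(protein), 1)


# Toy ten-residue sequence, for illustration only.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```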

Secondary structure:

>Translated Secondary Structure
MEHLFASTYGYFRADGREYVITTPFTPRPWGNVISNGDYAMMVSQTGSGYSWRNNAGQNR
CCCCHHCCCCEEEECCCEEEEECCCCCCCCCHHHCCCCEEEEEECCCCCCCCCCCCCHHH
ITRSFQDLIQDNWGKYLYLRDLDSGQYWAATYKPTCHQYDHYQVRHGLGYSSFEQIVQGI
HHHHHHHHHHCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCHHHHCCCHHHHHHHHHHH
HSILTVFVAPDDPVEIFQLTLTNQSDRQRRLDITSYVEWLLGFAPDEHREFHKLFIETTP
HHHHEEEECCCCCEEEEEEEECCCCCHHHHCCHHHHHHHHHCCCCHHHHHHHHHEEECCC
EPEFHALLARKYLWGFADELGRHNNIDWPYVAFMAVSEPLKSFDGDKESFIGLYSSLEHP
CHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC
QAMQQPTLAGRSGRFGDAIAALQVEITLAPGERRTVVFTLGAALQGSEDPLALIQRYTSV
HHHCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCCCCHHHHHHHHHHC
TASDQTLQAVHAFWSRLVDAEHVETPDPALNLMTNYWLKYQAISGRLWGKSAFYQVSAGY
CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHEEEEEEECCEECCCCEEEEEECCC
GFRDQLQDCQIFLVCDPALAKRQILLHAAQQFVEGDVLHWWFSIRGGGPRTNCSDDLLWL
CHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEEH
PFVVDSYLQETGDVAILDEMVPYLNGPAEPLYLHCKRAIERAFSRFSPRGIPLMGDHDWN
HHHHHHHHHHCCCEEEHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCEECCCCCC
DGLNAVGTQLRGESFWVAEFLYTILERWIPLAQQRSDEAFAERCTAVRTTLRWAMNRYGW
CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCC
DGEWFLQATTDAGLPLGSQQNEEGRIFLMPNIWAVISGITDQQRAWQAMQAVSRYLLCDY
CCEEEEEEECCCCCCCCCCCCCCCEEEECCCHHHHHCCCCHHHHHHHHHHHHHHHHHHCC
GTLLNYPAFTRPRSDIGYVTRYAPGLRENGGVYTHAATWSVWAYALLGDVDHAYEAYRRI
HHHHCCCCCCCCCCCCCHHHHCCCCCCCCCCEEEEEHHHHHHHHHHHHCHHHHHHHHHHC
CPPNRSADIERYKAEPYVTPGNIDGPQSPYFGRGGWTWYTGSAQWLHRVATHWILGIRPQ
CCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHEECCCC
IEGLLIDPLIPATWERFTVRRTFRGAIYEIEVLNPNHVNRGVISLEVDGQPLAGTVIPAF
CCCEEECCCCCCCHHHHHHHHHHCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEECC
NDGQTHSVRVVLG
CCCCCEEEEEEEC
>Mature Secondary Structure
MEHLFASTYGYFRADGREYVITTPFTPRPWGNVISNGDYAMMVSQTGSGYSWRNNAGQNR
CCCCHHCCCCEEEECCCEEEEECCCCCCCCCHHHCCCCEEEEEECCCCCCCCCCCCCHHH
ITRSFQDLIQDNWGKYLYLRDLDSGQYWAATYKPTCHQYDHYQVRHGLGYSSFEQIVQGI
HHHHHHHHHHCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCHHHHCCCHHHHHHHHHHH
HSILTVFVAPDDPVEIFQLTLTNQSDRQRRLDITSYVEWLLGFAPDEHREFHKLFIETTP
HHHHEEEECCCCCEEEEEEEECCCCCHHHHCCHHHHHHHHHCCCCHHHHHHHHHEEECCC
EPEFHALLARKYLWGFADELGRHNNIDWPYVAFMAVSEPLKSFDGDKESFIGLYSSLEHP
CHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC
QAMQQPTLAGRSGRFGDAIAALQVEITLAPGERRTVVFTLGAALQGSEDPLALIQRYTSV
HHHCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCCCCHHHHHHHHHHC
TASDQTLQAVHAFWSRLVDAEHVETPDPALNLMTNYWLKYQAISGRLWGKSAFYQVSAGY
CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHEEEEEEECCEECCCCEEEEEECCC
GFRDQLQDCQIFLVCDPALAKRQILLHAAQQFVEGDVLHWWFSIRGGGPRTNCSDDLLWL
CHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEEH
PFVVDSYLQETGDVAILDEMVPYLNGPAEPLYLHCKRAIERAFSRFSPRGIPLMGDHDWN
HHHHHHHHHHCCCEEEHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCEECCCCCC
DGLNAVGTQLRGESFWVAEFLYTILERWIPLAQQRSDEAFAERCTAVRTTLRWAMNRYGW
CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCC
DGEWFLQATTDAGLPLGSQQNEEGRIFLMPNIWAVISGITDQQRAWQAMQAVSRYLLCDY
CCEEEEEEECCCCCCCCCCCCCCCEEEECCCHHHHHCCCCHHHHHHHHHHHHHHHHHHCC
GTLLNYPAFTRPRSDIGYVTRYAPGLRENGGVYTHAATWSVWAYALLGDVDHAYEAYRRI
HHHHCCCCCCCCCCCCCHHHHCCCCCCCCCCEEEEEHHHHHHHHHHHHCHHHHHHHHHHC
CPPNRSADIERYKAEPYVTPGNIDGPQSPYFGRGGWTWYTGSAQWLHRVATHWILGIRPQ
CCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHEECCCC
IEGLLIDPLIPATWERFTVRRTFRGAIYEIEVLNPNHVNRGVISLEVDGQPLAGTVIPAF
CCCEEECCCCCCCHHHHHHHHHHCCEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEECC
NDGQTHSVRVVLG
CCCCCEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 2154461; 11481430 [H]