Definition Ralstonia eutropha JMP134 chromosome 1, complete sequence.
Accession NC_007347
Length 3,806,533

The map label for this gene is fhaB [H]

Identifier: 73542617

GI number: 73542617

Start: 3214946

End: 3217495

Strand: Reverse

Name: fhaB [H]

Synonym: Reut_A2933

Alternate gene names: 73542617

Gene position: 3217495-3214946 (Counterclockwise)
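
The Start and End coordinates above are given on the forward strand of the chromosome, while the gene itself is read from the reverse strand. As a minimal sketch of how such a gene could be recovered from a chromosome sequence, the example below slices a 1-based inclusive interval and takes its reverse complement. The helper name `extract_gene` and the toy chromosome string are illustrative assumptions, not part of this record.

```python
# Hypothetical helper (not part of the record): extract a gene from a
# chromosome string using 1-based inclusive coordinates and a strand label.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    # Convert the 1-based inclusive interval to a Python slice.
    segment = chromosome[start - 1:end]
    if strand.lower() == "reverse":
        # Reverse-strand genes are the reverse complement of the interval.
        return segment.translate(COMPLEMENT)[::-1]
    return segment

# Toy chromosome, used only to illustrate the coordinate handling.
toy = "AAATCACCCGGG"
print(extract_gene(toy, 4, 9, "Reverse"))  # GGGTGA
print(extract_gene(toy, 4, 9, "Forward"))  # TCACCC
```

Applied to the real chromosome with Start 3214946, End 3217495 and Strand Reverse, the same slicing would be expected to yield the 2550-base gene sequence listed in this record.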

Preceding gene: 73542621

Following gene: 73542616

Centisome position: 84.53
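
The centisome position can be checked against the coordinates and chromosome length given above. The sketch below assumes that the centisome value is the gene's 5' coordinate (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. That convention is an assumption, chosen because it reproduces the listed value; the record itself does not state the formula.

```python
# Values taken from this record.
CHROMOSOME_LENGTH = 3_806_533
START, END = 3_214_946, 3_217_495

# Coordinates are inclusive, so the gene length is End - Start + 1.
gene_length = END - START + 1

# Assumption: for a reverse-strand gene, the 5' end is the End coordinate.
five_prime = END
centisome = 100 * five_prime / CHROMOSOME_LENGTH

print(gene_length)          # 2550
print(round(centisome, 2))  # 84.53
```

Using the Start coordinate instead would give roughly 84.46, which does not match the listed value.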

GC content: 64.27
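
GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of the calculation follows; the function name `gc_content` and the short test sequence are illustrative assumptions, and the record does not say which tool produced the listed value.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, rounded to two decimals."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)

# Toy sequence: 4 of 6 bases are G or C.
print(gc_content("GGCCAT"))  # 66.67
```

Running the function on the 2550-base gene sequence (with line breaks removed) would be the way to compare against the 64.27 reported above.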

Gene sequence:

>2550_bases
GTGGTGCGGCAACGTAAGGCAATACAACGATGCGATCGGGTGCGGGTGTACGGGCACAGGCTGCAGCCATCCGCGCTTGC
GTGGGGCGTGGCGTTGCTGGCCAGCCGCGCGCTGGCATCAGGCGTAGTGCCGGACGGTGGCATCAACGCGCCGACTGTCA
CAGCGAGCGCGAGCGGTCACGTCACGGTCAATGTGAACACCCCGGTTGGCGGTGTCTCCACCAACACCTATCGAGACTTC
AACGTATCGCGCGCTGGCGTCGATCTTGACAATCGAGCGGCGAGCGCCCGCACTATCCTGAACCAGGTCACCAGCACCAA
TCCCTCACTGCTGGAAGGACCCCTGGTTGTGCTTGGCTCGCGTGCCAACGTCATCATTGCTAATCCCAATGGGATCAGCG
TCAATGGACTGAGCGTGCAGAACACGGGCAACCTGGCGCTCACTACCGGCCAGGTCTCCTTTAACGATTTCACGACGGCC
AGCGGGCAATTGCAGCGCAACCTGCGCATCGATACCAATCAGGGCGCCATCGAGATCGGGCCCGAGGGCCTGACCGGGAC
GCTCCTGAACCTCGAACTGATGGCCAAACGGCTCCGCATCGGCGGCCCCGTCACAAACCTGTACGACGATCCGAATGCCC
GCGCACGTCTGGTAGCCGGCAATAGCCGCGCGGAGATCGACACCAGCGTCTCGCCCACAGACAATCTCACGCCCTGGATT
ACCTATTCGACGCCAGCCGTCAATCCAGGCCAGGGCATCGCCATTGTCATCACCGCCGCGGGCAGTCTCATGGCGGGACG
CATCGAACTCATGGTCACCGATCAGGGTGCCGGTGTGCGCCATGCCGGCGCGGCGTATGCCACCGCGGGCGACTTTGTGG
TGTCGGGCACCGGTGACTTGCAGCTGGCCTCCGGCAAGATCGACGCAAAGCAAGACGTGCTGGTCGGCAGCGGCGGCTTC
ACCGGTAGCGGTGATGTCAGTGCCGGGCGCCACCTGCAGATTGCGTCGGACCGCGTGGACCTGTCGCAAGCAACGCTTGC
CGCCGGCACGCGGGTACCAGGTGATCTTGTGATCGGTGCCGACGGCCAGGTCCACAGCCAGCCCGTCCGGCTGACGGACT
CAACCGTTACCGCCAGTGGCGGCATCGGCGTCTTCGACGCGGGTGCAGGCCTGGTCTTGGCGGGCACGCAGCTTACGGCC
AACGGCAACGTGGTCGTGGCGGTGCCGAGCCTGACGACGCAAGCTACCGGCCAGCGCACGACGCTCACGTCCAGGTCCGG
CACGGTGTCGATAGCAGCCGACGAGGAGACGCTAGCCGCTACCGATATTGACGGCGTGGGCGGCATCGGCATTACCGCGC
GAAACCTGTCGCTACAGGACTCCAAGGTTCAGTCGTCCGGCGGCGCAGTGGCCATCGACACTAGCGGTGCCTACGCCCAG
CAGGACAGTGATGTGTTGGCGGCCACCGACATACGCTTGCACGCCGACTCAGTGGTGCTCGATTCCGCCAACCGGCAATC
CACGCTTGTTGCCAGCAATGGCGGGGTGCTGATCCAGTCCGATACGGATGTGACCAACAGCGGCGCGTTAATCCAGGGCC
AGACGCGCATCGCTAGTGAGCCGCAATCCGCGGGTGCCGTGACCGTGCGTGCCGGCGGCAATGTCACGAATTCGTCCACG
CCCGCTTACCTCGGCATCCTGTTCGGCGCGGACGACGATGTCGACGTGCAGGCGGGCGGCAATGTCTTGAATCAACATGC
GCGGATGCTCTCGAACGGCTTTCTGCGCGTGCACGCTGATGGCGATGTTTCCAACGAGATCAGCAAGCAGAACGGCGCCA
ATGGAGAGCAGCCTGACTTCTACGTGGCGTCCGGACAGCGCTGGCTGGTGCTGACGAAGCGCTCGGCGGGTTTTGACGTG
GACTACGGCACTGCCGACCGGCCCGGTCAGATTGCCTACCTGCTCTCGGACAAGGGCACGACCATCTCCGGACGCAACGT
CACCAACTATGGCGGCGAGATCTACGCCAACAACGGGCCGATCCGTATCCGGGCCTCCGATACATTCAGGACCGAAGGCG
TTGCAACTGGCGCTGCACACTACGAACGATCCTGTCTGATCTTCTGTCGCACCACGGCCTCGTCAACCACGTCCGTGACT
GGCGGCCTTTTGTCGGCCGGCGGCGATATCGATATCCAGGCTGGCAAGGCGGCGATCAATGTTGGCGGCCGGGTGCTGGC
CATGGGGGACCTGACCGTCACGGCGCCGGTCACCTATGCGAGCGGCATCACCGGCTACACCGCCATCTCGCGCGATCGCG
GCTTCAAGGCCTTCTTCGGGGATACCTGGGCACGCTTGTACGCAGCGGATGTCGGCGGCAGCTGGATGGCCTCCGGCAAG
ACCCAGATCAACGGCGACACCGTGACCGAGGGCGGCAGCTTCGATGGCAACGTTGCGGTCAGCGGCGTGACCACAATGAC
TCGTCCACGCCAGCGTGACCCCGTCACCATCGAGAATCACCTCGGCCTCACTTCGTGGTTGTGGCGATGA

Upstream 100 bases:

>100_bases
CACGCCAACTTTATGCGCGGCGGACGTGGAGCGGATAGCGACTAGCTATTCGCCCATCGTCACAGTTGATCCCAGCCGCG
CCATCCAACGAGGGAGCGCA

Downstream 100 bases:

>100_bases
GGAGGCGTGCTGACTCCTTGCGCGCCGCTGCGTCGGCGGCCGCCCTGGCCTGTACAACGACGTGGGCACAGGTGCCGCCC
GCCGCGCTCGTGCCGGAAGA

Product: filamentous haemagglutinin, N-terminal

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 849; Mature: 849
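
The residue count follows from the gene length: 2550 bases make 850 codons, and the final codon (TGA in the gene sequence above) is a stop codon that contributes no residue. The gene starts with GTG, which as an initiation codon is read as methionine, matching the leading M of the protein sequence. A minimal sketch of the arithmetic, using a hypothetical helper name:

```python
def expected_residues(n_bases: int) -> int:
    """Residues encoded by a complete coding sequence that ends in a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return n_bases // 3 - 1

print(expected_residues(2550))  # 849
```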

Protein sequence:

>849_residues
MVRQRKAIQRCDRVRVYGHRLQPSALAWGVALLASRALASGVVPDGGINAPTVTASASGHVTVNVNTPVGGVSTNTYRDF
NVSRAGVDLDNRAASARTILNQVTSTNPSLLEGPLVVLGSRANVIIANPNGISVNGLSVQNTGNLALTTGQVSFNDFTTA
SGQLQRNLRIDTNQGAIEIGPEGLTGTLLNLELMAKRLRIGGPVTNLYDDPNARARLVAGNSRAEIDTSVSPTDNLTPWI
TYSTPAVNPGQGIAIVITAAGSLMAGRIELMVTDQGAGVRHAGAAYATAGDFVVSGTGDLQLASGKIDAKQDVLVGSGGF
TGSGDVSAGRHLQIASDRVDLSQATLAAGTRVPGDLVIGADGQVHSQPVRLTDSTVTASGGIGVFDAGAGLVLAGTQLTA
NGNVVVAVPSLTTQATGQRTTLTSRSGTVSIAADEETLAATDIDGVGGIGITARNLSLQDSKVQSSGGAVAIDTSGAYAQ
QDSDVLAATDIRLHADSVVLDSANRQSTLVASNGGVLIQSDTDVTNSGALIQGQTRIASEPQSAGAVTVRAGGNVTNSST
PAYLGILFGADDDVDVQAGGNVLNQHARMLSNGFLRVHADGDVSNEISKQNGANGEQPDFYVASGQRWLVLTKRSAGFDV
DYGTADRPGQIAYLLSDKGTTISGRNVTNYGGEIYANNGPIRIRASDTFRTEGVATGAAHYERSCLIFCRTTASSTTSVT
GGLLSAGGDIDIQAGKAAINVGGRVLAMGDLTVTAPVTYASGITGYTAISRDRGFKAFFGDTWARLYAADVGGSWMASGK
TQINGDTVTEGGSFDGNVAVSGVTTMTRPRQRDPVTIENHLGLTSWLWR

Sequences:

>Translated_849_residues
MVRQRKAIQRCDRVRVYGHRLQPSALAWGVALLASRALASGVVPDGGINAPTVTASASGHVTVNVNTPVGGVSTNTYRDF
NVSRAGVDLDNRAASARTILNQVTSTNPSLLEGPLVVLGSRANVIIANPNGISVNGLSVQNTGNLALTTGQVSFNDFTTA
SGQLQRNLRIDTNQGAIEIGPEGLTGTLLNLELMAKRLRIGGPVTNLYDDPNARARLVAGNSRAEIDTSVSPTDNLTPWI
TYSTPAVNPGQGIAIVITAAGSLMAGRIELMVTDQGAGVRHAGAAYATAGDFVVSGTGDLQLASGKIDAKQDVLVGSGGF
TGSGDVSAGRHLQIASDRVDLSQATLAAGTRVPGDLVIGADGQVHSQPVRLTDSTVTASGGIGVFDAGAGLVLAGTQLTA
NGNVVVAVPSLTTQATGQRTTLTSRSGTVSIAADEETLAATDIDGVGGIGITARNLSLQDSKVQSSGGAVAIDTSGAYAQ
QDSDVLAATDIRLHADSVVLDSANRQSTLVASNGGVLIQSDTDVTNSGALIQGQTRIASEPQSAGAVTVRAGGNVTNSST
PAYLGILFGADDDVDVQAGGNVLNQHARMLSNGFLRVHADGDVSNEISKQNGANGEQPDFYVASGQRWLVLTKRSAGFDV
DYGTADRPGQIAYLLSDKGTTISGRNVTNYGGEIYANNGPIRIRASDTFRTEGVATGAAHYERSCLIFCRTTASSTTSVT
GGLLSAGGDIDIQAGKAAINVGGRVLAMGDLTVTAPVTYASGITGYTAISRDRGFKAFFGDTWARLYAADVGGSWMASGK
TQINGDTVTEGGSFDGNVAVSGVTTMTRPRQRDPVTIENHLGLTSWLWR
>Mature_849_residues
MVRQRKAIQRCDRVRVYGHRLQPSALAWGVALLASRALASGVVPDGGINAPTVTASASGHVTVNVNTPVGGVSTNTYRDF
NVSRAGVDLDNRAASARTILNQVTSTNPSLLEGPLVVLGSRANVIIANPNGISVNGLSVQNTGNLALTTGQVSFNDFTTA
SGQLQRNLRIDTNQGAIEIGPEGLTGTLLNLELMAKRLRIGGPVTNLYDDPNARARLVAGNSRAEIDTSVSPTDNLTPWI
TYSTPAVNPGQGIAIVITAAGSLMAGRIELMVTDQGAGVRHAGAAYATAGDFVVSGTGDLQLASGKIDAKQDVLVGSGGF
TGSGDVSAGRHLQIASDRVDLSQATLAAGTRVPGDLVIGADGQVHSQPVRLTDSTVTASGGIGVFDAGAGLVLAGTQLTA
NGNVVVAVPSLTTQATGQRTTLTSRSGTVSIAADEETLAATDIDGVGGIGITARNLSLQDSKVQSSGGAVAIDTSGAYAQ
QDSDVLAATDIRLHADSVVLDSANRQSTLVASNGGVLIQSDTDVTNSGALIQGQTRIASEPQSAGAVTVRAGGNVTNSST
PAYLGILFGADDDVDVQAGGNVLNQHARMLSNGFLRVHADGDVSNEISKQNGANGEQPDFYVASGQRWLVLTKRSAGFDV
DYGTADRPGQIAYLLSDKGTTISGRNVTNYGGEIYANNGPIRIRASDTFRTEGVATGAAHYERSCLIFCRTTASSTTSVT
GGLLSAGGDIDIQAGKAAINVGGRVLAMGDLTVTAPVTYASGITGYTAISRDRGFKAFFGDTWARLYAADVGGSWMASGK
TQINGDTVTEGGSFDGNVAVSGVTTMTRPRQRDPVTIENHLGLTSWLWR

Specific function: Evidence for a role in host-cell binding and infection [H]

COG id: COG3210

COG function: function code U; Large exoproteins involved in heme utilization or adhesion

Gene ontology:

Cell location: Cell surface [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010069
- InterPro:   IPR008619
- InterPro:   IPR008638
- InterPro:   IPR012334
- InterPro:   IPR011050
- InterPro:   IPR011102 [H]

Pfam domain/function: PF05594 Fil_haemagg; PF05860 Haemagg_act [H]

EC number: NA

Molecular weight: Translated: 87155; Mature: 87155

Theoretical pI: Translated: 5.68; Mature: 5.68

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
0.9 %Met     (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
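
The percentages above are residue counts divided by the protein length. The sketch below shows one way to compute them; the function name `cys_met_content` and the toy sequence are illustrative assumptions, not the method used to build this record.

```python
def cys_met_content(protein: str) -> dict:
    """Return Cys, Met and combined content as percentages (one decimal)."""
    seq = protein.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    cys = 100 * seq.count("C") / n
    met = 100 * seq.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }

# Toy 10-residue sequence with one Cys and one Met.
print(cys_met_content("MCAAAAAAAA"))
```

Because the translated and mature proteins in this record are identical, both sets of percentages are the same.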

Secondary structure:

>Translated Secondary Structure
MVRQRKAIQRCDRVRVYGHRLQPSALAWGVALLASRALASGVVPDGGINAPTVTASASGH
CCCHHHHHHHHHHEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCE
VTVNVNTPVGGVSTNTYRDFNVSRAGVDLDNRAASARTILNQVTSTNPSLLEGPLVVLGS
EEEEECCCCCCCCCCCEEECCCEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEC
RANVIIANPNGISVNGLSVQNTGNLALTTGQVSFNDFTTASGQLQRNLRIDTNQGAIEIG
CCCEEEECCCCEEEECEEEECCCCEEEEECEEEECCCCCCCCCEEEEEEEECCCCEEEEC
PEGLTGTLLNLELMAKRLRIGGPVTNLYDDPNARARLVAGNSRAEIDTSVSPTDNLTPWI
CCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCEEECCCCCCCCCCEEE
TYSTPAVNPGQGIAIVITAAGSLMAGRIELMVTDQGAGVRHAGAAYATAGDFVVSGTGDL
EECCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCEEEECCCEEEECCCCE
QLASGKIDAKQDVLVGSGGFTGSGDVSAGRHLQIASDRVDLSQATLAAGTRVPGDLVIGA
EEECCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCCCCCHHHHHHCCCCCCCCEEEEC
DGQVHSQPVRLTDSTVTASGGIGVFDAGAGLVLAGTQLTANGNVVVAVPSLTTQATGQRT
CCCCCCCCEEEECCEEEECCCCEEEECCCCEEEEEEEEECCCCEEEEECCCCCCCCCCEE
TLTSRSGTVSIAADEETLAATDIDGVGGIGITARNLSLQDSKVQSSGGAVAIDTSGAYAQ
EEECCCCEEEEEECCCEEEEECCCCCCCCEEEEECCCCCCCCHHCCCCEEEEECCCCCCC
QDSDVLAATDIRLHADSVVLDSANRQSTLVASNGGVLIQSDTDVTNSGALIQGQTRIASE
CCCCEEEEEEEEEEECEEEEECCCCCEEEEECCCCEEEECCCCCCCCCCEEECCCCCCCC
PQSAGAVTVRAGGNVTNSSTPAYLGILFGADDDVDVQAGGNVLNQHARMLSNGFLRVHAD
CCCCCEEEEEECCCCCCCCCCEEEEEEECCCCCEEEECCCHHHHHHHHHHHCCEEEEEEC
GDVSNEISKQNGANGEQPDFYVASGQRWLVLTKRSAGFDVDYGTADRPGQIAYLLSDKGT
CCCCHHHHHCCCCCCCCCCEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCC
TISGRNVTNYGGEIYANNGPIRIRASDTFRTEGVATGAAHYERSCLIFCRTTASSTTSVT
EEECCEEECCCCEEEECCCCEEEEECCCCCCCCEECCCHHCCCEEEEEEEECCCCCCEEC
GGLLSAGGDIDIQAGKAAINVGGRVLAMGDLTVTAPVTYASGITGYTAISRDRGFKAFFG
CCCEECCCCEEEECCCEEEECCCEEEEEECEEEEECEEECCCCCCEEEEECCCCCEEECC
DTWARLYAADVGGSWMASGKTQINGDTVTEGGSFDGNVAVSGVTTMTRPRQRDPVTIENH
CCEEEEEEECCCCCEEECCCEEECCCEEECCCCCCCCEEEEEEEECCCCCCCCCEEEHHC
LGLTSWLWR
CCCHHHHCC
>Mature Secondary Structure
MVRQRKAIQRCDRVRVYGHRLQPSALAWGVALLASRALASGVVPDGGINAPTVTASASGH
CCCHHHHHHHHHHEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCE
VTVNVNTPVGGVSTNTYRDFNVSRAGVDLDNRAASARTILNQVTSTNPSLLEGPLVVLGS
EEEEECCCCCCCCCCCEEECCCEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEC
RANVIIANPNGISVNGLSVQNTGNLALTTGQVSFNDFTTASGQLQRNLRIDTNQGAIEIG
CCCEEEECCCCEEEECEEEECCCCEEEEECEEEECCCCCCCCCEEEEEEEECCCCEEEEC
PEGLTGTLLNLELMAKRLRIGGPVTNLYDDPNARARLVAGNSRAEIDTSVSPTDNLTPWI
CCCCEEEEEEHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCEEECCCCCCCCCCEEE
TYSTPAVNPGQGIAIVITAAGSLMAGRIELMVTDQGAGVRHAGAAYATAGDFVVSGTGDL
EECCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCEEEECCCEEEECCCCE
QLASGKIDAKQDVLVGSGGFTGSGDVSAGRHLQIASDRVDLSQATLAAGTRVPGDLVIGA
EEECCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCCCCCHHHHHHCCCCCCCCEEEEC
DGQVHSQPVRLTDSTVTASGGIGVFDAGAGLVLAGTQLTANGNVVVAVPSLTTQATGQRT
CCCCCCCCEEEECCEEEECCCCEEEECCCCEEEEEEEEECCCCEEEEECCCCCCCCCCEE
TLTSRSGTVSIAADEETLAATDIDGVGGIGITARNLSLQDSKVQSSGGAVAIDTSGAYAQ
EEECCCCEEEEEECCCEEEEECCCCCCCCEEEEECCCCCCCCHHCCCCEEEEECCCCCCC
QDSDVLAATDIRLHADSVVLDSANRQSTLVASNGGVLIQSDTDVTNSGALIQGQTRIASE
CCCCEEEEEEEEEEECEEEEECCCCCEEEEECCCCEEEECCCCCCCCCCEEECCCCCCCC
PQSAGAVTVRAGGNVTNSSTPAYLGILFGADDDVDVQAGGNVLNQHARMLSNGFLRVHAD
CCCCCEEEEEECCCCCCCCCCEEEEEEECCCCCEEEECCCHHHHHHHHHHHCCEEEEEEC
GDVSNEISKQNGANGEQPDFYVASGQRWLVLTKRSAGFDVDYGTADRPGQIAYLLSDKGT
CCCCHHHHHCCCCCCCCCCEEEECCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCC
TISGRNVTNYGGEIYANNGPIRIRASDTFRTEGVATGAAHYERSCLIFCRTTASSTTSVT
EEECCEEECCCCEEEECCCCEEEEECCCCCCCCEECCCHHCCCEEEEEEEECCCCCCEEC
GGLLSAGGDIDIQAGKAAINVGGRVLAMGDLTVTAPVTYASGITGYTAISRDRGFKAFFG
CCCEECCCCEEEECCCEEEECCCEEEEEECEEEEECEEECCCCCCEEEEECCCCCEEECC
DTWARLYAADVGGSWMASGKTQINGDTVTEGGSFDGNVAVSGVTTMTRPRQRDPVTIENH
CCEEEEEEECCCCCEEECCCEEECCCEEECCCCCCCCEEEEEEEECCCCCCCCCEEEHHC
LGLTSWLWR
CCCHHHHCC
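
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The sketch below separates the two and summarizes the state composition. It assumes the conventional three-state coding (H = helix, E = strand, C = coil), which this record does not define explicitly; the function names are hypothetical.

```python
from collections import Counter

def parse_interleaved(lines):
    """Split alternating residue/state lines into two aligned strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states

def state_fractions(states):
    """Percentage of each state, assuming the H/E/C three-state coding."""
    counts = Counter(states)
    return {s: round(100 * counts.get(s, 0) / len(states), 1) for s in "HEC"}

# Last residue/state pair of the block above.
block = ["LGLTSWLWR", "CCCHHHHCC"]
residues, states = parse_interleaved(block)
print(state_fractions(states))
```

Passing all line pairs of one block (translated or mature) instead of the single pair shown here would give the composition for the full 849-residue prediction.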

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2388559; 12910271; 2539596; 1696934; 1791761 [H]