Definition Novosphingobium aromaticivorans DSM 12444 chromosome, complete genome.
Accession NC_007794
Length 3,561,584

The map label for this gene is yeeB [H]

Identifier: 87198289

GI number: 87198289

Start: 281100

End: 283169

Strand: Direct

Name: yeeB [H]

Synonym: Saro_0264

Alternate gene names: 87198289

Gene position: 281100-283169 (Clockwise)

Preceding gene: 87198288

Following gene: 87198290

Centisome position: 7.89
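
The centisome position can be checked from the coordinates in this record. The sketch below assumes the value is the gene's start coordinate expressed as a percentage of the chromosome length; that convention is an assumption, but it reproduces the listed figure. The function name is illustrative only.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the total genome length."""
    return 100.0 * start / genome_length

# Start (281100) and chromosome length (3,561,584) are taken from this record.
print(round(centisome_position(281100, 3561584), 2))
```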

GC content: 60.19

Gene sequence:

>2070_bases
ATGACCGCCGACAAACCGATCCCTGCCGTTACCTTTCAGACCGCTCACAATGGCGTCTCGGCTAAAGCCAACTCTCTCGG
CATGCGGCCGATGCAGGAGCGCGCGTATAACAAGCGCGGCGAGCAATATCTGCTAATCAAGTCCCCGCCAGCGTCCGGCA
AAAGCCGCGCCCTGATGTTTATTGCACTCGACAAACTCCACAACCAAGGGCTCTCGCAGGCAATTATCGTCGTGCCAGAG
AAATCAATCGGCGGTAGCTTTGCTGATGAGCCACTCACCATGTACGGGTTCTGGGCGGACTGGGTAGTCCAGCCGCAATG
GAACCTTTGCAACGCGCCGGGGCCGGACGAGGTAAAGGTCGATGCCTCGAAGGTGAAGGCGGTCGGCCAGTTCCTGGCGA
GCGATGATAAGGTGCTGGTTTGTACCCACGCCACGTTCCGTTTCGCAGTTGAGGCTCTAGGCATTGAGGCCTTTGACGGT
CGGCTGATCGCAATCGATGAGTTCCACCATGTCTCGGCCAACCCCGACAACAAGCTGGGCAGCCAGCTCACAGCCTTCAT
CGCGCGCGACAAGGTCCACATTATCGCCATGACCGGAAGCTACTTCCGGGGCGATACTGTCGCGGTCCTGACGCCGGAAG
ATGAGGCAAAGTTCGAGACAGTCACCTATACCTACTACGAGCAGCTCAACGGGTACGACCATCTCAAGTCGCTGGCGATC
GGGTATTTCTTCTACACTGGCCGCTACCTCACCGCGATCGAGCACTGCCTCGACCCGGCGCGCAAGACGATCGTCCACAT
CCCCTCGGTCAACTCCAAGGAAAGCACCAAGGACAAGATCAAGGAAGTCGAGGAGATCATGGAATACCTCGGGGACTGGC
AGGGCGCCGATCCTGTGACGGGGTTCCACTTGGTCAAGCTCGCCGATGGCCGGACACTCAAGATCGCGGACCTTGTGGAC
GACAGCGATCCTGCCAAACGTGCCAAGGTGTTGTCCGCGCTCAAGGACCCAGCCCACAAGAACGATCGCGACCACGTCGA
CATCATCATCGCGCTGGGAATGGCCAAGGAAGGCTTCGACTGGATCTGGTGCGAACATGCGCTGACGGTCGGCTATCGGT
CGAGCCTCACCGAGATCATCCAGATCATCGGCCGCGCCACGCGCGATGCGCCCGGAAAGACGGTCGCCACCTTCACCAAC
CTGATCGCCGAGCCGGATGCCTCCGAAGCTGCGGTGGCCGATGCGATCAACGATACGCTCAAAGCCATCGCTGCCAGCTT
GCTGATGGAGCAGGTTCTGACGCCCAAGTTCCAGTTCACGCCCAAGAACACCGGGCCACTGCCTGATTTCAACTATGGCC
CCGGCGGTTACCAAGAAGGCAAGACCAACGTTGGCGTCAACGAGGAGCGGGGCGAATTCCACTTCGAGATCGCCGGACTG
GCTACCCCGAAAAGCCCCGAGGCGGCGCGCATCTGCCAGCAGGATCTAAATGAGGTCATCGCGTCTTACGTCCAGAACAA
AGACGTGATCGGCCCTGGCCTGTTCAACGAGGAAGCTGTCCCGCAGGACCTGACGCAAGTGCAGATGGGCAAGATTGTCC
GTGACCGGTATCCGGACCTCACACCGGAGGATCAGGAGGCCGTGCGCCAACACGCCATCGCTGCGCTGAACCTCACGCAA
AAAGCTAAGGCGATAGCGATGGGCACCGACGATGGCAGCGGCGAGGTCAAGGCCAACACCGCGTTGATCGATGGCGTGAA
GCAGTTTGCGATGGACGTGAAGGAACTCGACATCGACTTGATCGACAGCATCAACCCGTTCCAGGCAGCCTACTCGATCT
TGGCCAAGTCAATGGATGAGGCCACGCTCAAGCAGGTGAAGGCAGCGGTCGCAGCCAAGAAGACCAAGATCACGCCGGAT
GAGGCCAAGGACCTCGCCGGGCGCGCGGTGCGCTTCAAGCGGGAGCGCGGGCGTCTACCGTCGATCATCGCTGCGGACGC
CTGGGAGCAGCGGCTAGCCGAAGGTGCTGCCGCGTTCATGCGGTTCAAGGAGGAAGGCCGCTATGTCTGA

Upstream 100 bases:

>100_bases
CGGCATGGCCGGAACGCACAAAGGCGCTGAGGATGGACAGGCAGCGCGATAACTGTGAATAACGAAAAACACCCGAGCCT
ACCGTTGACGAAGTGACAGA

Downstream 100 bases:

>100_bases
TCTGGACGAGCTAGCAAACGAGCTGGCGGACTTTGCGCCCTCTAAGAAGAAGCAGGCCACCTACGCTCCGCGCGAGGAAC
GCATCATCGCCGGGTTCGAG

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 689; Mature: 688
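
The residue counts follow from the gene coordinates. A minimal sketch, assuming inclusive coordinates, one terminal stop codon, and a mature form that differs from the translated form only by removal of the initiator methionine (consistent with the mature sequence in this record starting at the second residue). The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for inclusive start/end coordinates."""
    return end - start + 1

def protein_lengths(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    codons = n_bases // 3
    translated = codons - 1   # the final codon is the stop codon
    mature = translated - 1   # assumption: initiator Met is removed
    return translated, mature

n = gene_length(281100, 283169)   # coordinates from this record
print(n, protein_lengths(n))
```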

Protein sequence:

>689_residues
MTADKPIPAVTFQTAHNGVSAKANSLGMRPMQERAYNKRGEQYLLIKSPPASGKSRALMFIALDKLHNQGLSQAIIVVPE
KSIGGSFADEPLTMYGFWADWVVQPQWNLCNAPGPDEVKVDASKVKAVGQFLASDDKVLVCTHATFRFAVEALGIEAFDG
RLIAIDEFHHVSANPDNKLGSQLTAFIARDKVHIIAMTGSYFRGDTVAVLTPEDEAKFETVTYTYYEQLNGYDHLKSLAI
GYFFYTGRYLTAIEHCLDPARKTIVHIPSVNSKESTKDKIKEVEEIMEYLGDWQGADPVTGFHLVKLADGRTLKIADLVD
DSDPAKRAKVLSALKDPAHKNDRDHVDIIIALGMAKEGFDWIWCEHALTVGYRSSLTEIIQIIGRATRDAPGKTVATFTN
LIAEPDASEAAVADAINDTLKAIAASLLMEQVLTPKFQFTPKNTGPLPDFNYGPGGYQEGKTNVGVNEERGEFHFEIAGL
ATPKSPEAARICQQDLNEVIASYVQNKDVIGPGLFNEEAVPQDLTQVQMGKIVRDRYPDLTPEDQEAVRQHAIAALNLTQ
KAKAIAMGTDDGSGEVKANTALIDGVKQFAMDVKELDIDLIDSINPFQAAYSILAKSMDEATLKQVKAAVAAKKTKITPD
EAKDLAGRAVRFKRERGRLPSIIAADAWEQRLAEGAAAFMRFKEEGRYV
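
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below assumes the standard codon assignments (which the bacterial translation table shares for elongation codons) and stops at the first stop codon. It is an illustration, not the tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 18 and last 12 bases of the gene sequence in this record.
print(translate("ATGACCGCCGACAAACCG"))
print(translate("CGCTATGTCTGA"))
```

The two fragments should correspond to the first six and last three residues of the protein sequence above.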

Sequences:

>Translated_689_residues
MTADKPIPAVTFQTAHNGVSAKANSLGMRPMQERAYNKRGEQYLLIKSPPASGKSRALMFIALDKLHNQGLSQAIIVVPE
KSIGGSFADEPLTMYGFWADWVVQPQWNLCNAPGPDEVKVDASKVKAVGQFLASDDKVLVCTHATFRFAVEALGIEAFDG
RLIAIDEFHHVSANPDNKLGSQLTAFIARDKVHIIAMTGSYFRGDTVAVLTPEDEAKFETVTYTYYEQLNGYDHLKSLAI
GYFFYTGRYLTAIEHCLDPARKTIVHIPSVNSKESTKDKIKEVEEIMEYLGDWQGADPVTGFHLVKLADGRTLKIADLVD
DSDPAKRAKVLSALKDPAHKNDRDHVDIIIALGMAKEGFDWIWCEHALTVGYRSSLTEIIQIIGRATRDAPGKTVATFTN
LIAEPDASEAAVADAINDTLKAIAASLLMEQVLTPKFQFTPKNTGPLPDFNYGPGGYQEGKTNVGVNEERGEFHFEIAGL
ATPKSPEAARICQQDLNEVIASYVQNKDVIGPGLFNEEAVPQDLTQVQMGKIVRDRYPDLTPEDQEAVRQHAIAALNLTQ
KAKAIAMGTDDGSGEVKANTALIDGVKQFAMDVKELDIDLIDSINPFQAAYSILAKSMDEATLKQVKAAVAAKKTKITPD
EAKDLAGRAVRFKRERGRLPSIIAADAWEQRLAEGAAAFMRFKEEGRYV
>Mature_688_residues
TADKPIPAVTFQTAHNGVSAKANSLGMRPMQERAYNKRGEQYLLIKSPPASGKSRALMFIALDKLHNQGLSQAIIVVPEK
SIGGSFADEPLTMYGFWADWVVQPQWNLCNAPGPDEVKVDASKVKAVGQFLASDDKVLVCTHATFRFAVEALGIEAFDGR
LIAIDEFHHVSANPDNKLGSQLTAFIARDKVHIIAMTGSYFRGDTVAVLTPEDEAKFETVTYTYYEQLNGYDHLKSLAIG
YFFYTGRYLTAIEHCLDPARKTIVHIPSVNSKESTKDKIKEVEEIMEYLGDWQGADPVTGFHLVKLADGRTLKIADLVDD
SDPAKRAKVLSALKDPAHKNDRDHVDIIIALGMAKEGFDWIWCEHALTVGYRSSLTEIIQIIGRATRDAPGKTVATFTNL
IAEPDASEAAVADAINDTLKAIAASLLMEQVLTPKFQFTPKNTGPLPDFNYGPGGYQEGKTNVGVNEERGEFHFEIAGLA
TPKSPEAARICQQDLNEVIASYVQNKDVIGPGLFNEEAVPQDLTQVQMGKIVRDRYPDLTPEDQEAVRQHAIAALNLTQK
AKAIAMGTDDGSGEVKANTALIDGVKQFAMDVKELDIDLIDSINPFQAAYSILAKSMDEATLKQVKAAVAAKKTKITPDE
AKDLAGRAVRFKRERGRLPSIIAADAWEQRLAEGAAAFMRFKEEGRYV

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 helicase C-terminal domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014001 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 75560; Mature: 75428

Theoretical pI: Translated: 5.46; Mature: 5.46

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
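
Percentages of this kind are simple residue counts over sequence length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the listing above; the function name and the toy sequence are illustrative and not taken from this record.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    sequence = sequence.upper()
    hits = sum(sequence.count(r) for r in residues.upper())
    return round(100.0 * hits / len(sequence), 1)

toy = "MACMAAAA"  # toy sequence for illustration only
print(residue_percent(toy, "C"))
print(residue_percent(toy, "M"))
print(residue_percent(toy, "CM"))
```

Passing the translated or mature sequence from this record in place of the toy string would give the corresponding Cys, Met, and Cys+Met values.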

Secondary structure:

>Translated Secondary Structure
MTADKPIPAVTFQTAHNGVSAKANSLGMRPMQERAYNKRGEQYLLIKSPPASGKSRALMF
CCCCCCCCEEEEECCCCCCCCCHHHCCCCHHHHHHHHCCCCEEEEEECCCCCCCCCEEEE
IALDKLHNQGLSQAIIVVPEKSIGGSFADEPLTMYGFWADWVVQPQWNLCNAPGPDEVKV
EEEHHHHCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCEEE
DASKVKAVGQFLASDDKVLVCTHATFRFAVEALGIEAFDGRLIAIDEFHHVSANPDNKLG
CHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHCCEECCCEEEEEECCCCCCCCCCHHHH
SQLTAFIARDKVHIIAMTGSYFRGDTVAVLTPEDEAKFETVTYTYYEQLNGYDHLKSLAI
HHHHHHHHCCCEEEEEEECCCCCCCEEEEECCCCCCCEEEEEEHHHHHHCCHHHHHHHHH
GYFFYTGRYLTAIEHCLDPARKTIVHIPSVNSKESTKDKIKEVEEIMEYLGDWQGADPVT
HHHHHHHHHHHHHHHHHCHHHHHEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCC
GFHLVKLADGRTLKIADLVDDSDPAKRAKVLSALKDPAHKNDRDHVDIIIALGMAKEGFD
CEEEEEECCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC
WIWCEHALTVGYRSSLTEIIQIIGRATRDAPGKTVATFTNLIAEPDASEAAVADAINDTL
EEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHH
KAIAASLLMEQVLTPKFQFTPKNTGPLPDFNYGPGGYQEGKTNVGVNEERGEFHFEIAGL
HHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
ATPKSPEAARICQQDLNEVIASYVQNKDVIGPGLFNEEAVPQDLTQVQMGKIVRDRYPDL
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC
TPEDQEAVRQHAIAALNLTQKAKAIAMGTDDGSGEVKANTALIDGVKQFAMDVKELDIDL
CCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCCCCEEEHHHHHHHHHHHHHHHHHCCHHH
IDSINPFQAAYSILAKSMDEATLKQVKAAVAAKKTKITPDEAKDLAGRAVRFKRERGRLP
HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCC
SIIAADAWEQRLAEGAAAFMRFKEEGRYV
CEEHHHHHHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure 
TADKPIPAVTFQTAHNGVSAKANSLGMRPMQERAYNKRGEQYLLIKSPPASGKSRALMF
CCCCCCCEEEEECCCCCCCCCHHHCCCCHHHHHHHHCCCCEEEEEECCCCCCCCCEEEE
IALDKLHNQGLSQAIIVVPEKSIGGSFADEPLTMYGFWADWVVQPQWNLCNAPGPDEVKV
EEEHHHHCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCEEE
DASKVKAVGQFLASDDKVLVCTHATFRFAVEALGIEAFDGRLIAIDEFHHVSANPDNKLG
CHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHCCEECCCEEEEEECCCCCCCCCCHHHH
SQLTAFIARDKVHIIAMTGSYFRGDTVAVLTPEDEAKFETVTYTYYEQLNGYDHLKSLAI
HHHHHHHHCCCEEEEEEECCCCCCCEEEEECCCCCCCEEEEEEHHHHHHCCHHHHHHHHH
GYFFYTGRYLTAIEHCLDPARKTIVHIPSVNSKESTKDKIKEVEEIMEYLGDWQGADPVT
HHHHHHHHHHHHHHHHHCHHHHHEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCC
GFHLVKLADGRTLKIADLVDDSDPAKRAKVLSALKDPAHKNDRDHVDIIIALGMAKEGFD
CEEEEEECCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCC
WIWCEHALTVGYRSSLTEIIQIIGRATRDAPGKTVATFTNLIAEPDASEAAVADAINDTL
EEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHH
KAIAASLLMEQVLTPKFQFTPKNTGPLPDFNYGPGGYQEGKTNVGVNEERGEFHFEIAGL
HHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
ATPKSPEAARICQQDLNEVIASYVQNKDVIGPGLFNEEAVPQDLTQVQMGKIVRDRYPDL
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC
TPEDQEAVRQHAIAALNLTQKAKAIAMGTDDGSGEVKANTALIDGVKQFAMDVKELDIDL
CCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCCCCEEEHHHHHHHHHHHHHHHHHCCHHH
IDSINPFQAAYSILAKSMDEATLKQVKAAVAAKKTKITPDEAKDLAGRAVRFKRERGRLP
HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCC
SIIAADAWEQRLAEGAAAFMRFKEEGRYV
CEEHHHHHHHHHHHHHHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]