Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
Click here to switch to the map view.
The map label for this gene is aarF [H]
Identifier: 15887669
GI number: 15887669
Start: 312080
End: 313654
Strand: Reverse
Name: aarF [H]
Synonym: Atu0319
Alternate gene names: 15887669
Gene position: 313654-312080 (Counterclockwise)
Preceding gene: 159184255
Following gene: 159184254
Centisome position: 11.04
GC content: 61.14
Gene sequence:
>1575_bases ATGAGTACTTTGGGCGCATATTTCCGCCTCGCTAGGGTTGGCTGGGTTCTCGTGCGCGAAGGCGTCGTGCTGGCTTTGCC GTCCGATGACCTGCCTGCCCCGGCACAATTGTTGAAAGCCGCGCTGAAACCTTTCGCCCGCAGCAAGGCAAAACGGGCGC AGCGCAGCGACCGGCTAGCCGTGGCGGTCGAGCGGCTTGGGCCTTCCTATGTGAAGATGGGCCAGTTTCTGGCCACCCGG CCGGATGTGGTGGGCGCCGATTTCGCCGACGATCTTGCGAGCCTTCAGGACCGCATGGCTTTTTTCCCGGCCGCTGCGGC GAAGGCCAATATCGAAGGTTCGCTCGGACGATCAATTTCGGAGCTTTATAGAGAATTCGGCGACCCGATCGCCGCCGCCT CCATCGCCCAGGTGCACCCGGCCATGGTGGATACACCAAAGGGACCGCGCAAGGTGGCCGTCAAGGTCATTCGACCCGGC GTGCGTCAGCGTTTCCAGAACGATCTCGAGGCGATGTATCTGATTGCCGACCTGCAGCAGCGTTTCGTTCGCTCCGCCCG TCGCCTGCGGCCGGTGGAAGTGACACGCACGCTGGAGCAGACGACCAAGATCGAGATGGATCTGCGGCTCGAGGCCGCCG CACTTTCCGAACTTGCGGAAAATACCAGACAAGATCCCGGCTTCCGTGTTCCCGAAGTGGATTGGGAACGCACCGGCCGC GACGTCGTCACCATGGAATGGATCGACGGCGTCAAGATGTCGGATATCGAGGGACTGAAGGCAGCCGGCCACGACCTGAA CAAGCTGGCCGATACGCTGATCCAGTCCTTCCTGCGGCATACGCTGCGCGACGGTTTTTTCCATGCCGACATGCATCCCG GCAACCTCTTCGTCGATGCGAAGGGCGAGATCGTCGCCGTGGATATGGGCATTGCCGGTCGGCTCGGCAAAAAGGAACGC CGCTTCCTTGCCGAAATCCTCTATGGCTTCATCACGCGTGACTACATGCGGGTGGCGGAAGTGCATTTCGAGGCCGGTTA CGTGCCGGGCCATCACGACAAGGCCAGCTTCGCCCAGGCGATCCGCGCCATTGGCGAACCAATCCACGGCCAGCCGGCCG AGACGATCTCGATGGGCAAGCTCCTGACTCTGCTGTTCGAAGTGACCGAGCTTTTCGACATGGAAACGCGGCCGGAACTG GTGATGCTGCAAAAGACCATGGTGGTGGTGGAAGGCGTGTCGCGCATGCTCAATCCGCGCTTCAATATGTGGAAGGCGGC CGATCCGGTCGTCGGCGGCTGGATCCGTGACAATCTCGGTCCCAAACGCATCGCCACCGATCTGAAGGACGGCGTGAAGG CGGCACTCAAGCTCGCCGAAGCAGTGCCCGAGATCGCCGCCAAGACCGAGAAGCTGCATTCCGAACTGATGTATATGAGC GAAAACGGCCTGCGTTTTGACGCGCAGACGGCGGAAGCCATCGGCAAAGCGGAAGCGCGCCATACCAAATGGGGGCGGAT TGCGCTCTGGGTGATTGCGCTGACGCTTCTCTATATTGCCATCCGAATCAGCTAA
Upstream 100 bases:
>100_bases CCGCGTCACCTATACCAATTATACCGGCGGCATCGCCGCCCTGCATTCCGGCTGGAAGCTTTAAGACCATGCCCGGCAGA CGCAGGAGACAAGCATAAGC
Downstream 100 bases:
>100_bases GAACGTGAAGGTGCCCGGCGTGACCGCCAGGCGCCTCAGTTATCAAAAAAACCGCCTATTAAGCAGACGTAACGACCGCA TGCCATGCTCATGCAATATA
Product: ubiquinone biosynthesis protein
Products: 2-octaprenyl-6-hydroxyphenol [C]
Alternate protein names: NA
Number of amino acids: Translated: 524; Mature: 523
Protein sequence:
>524_residues MSTLGAYFRLARVGWVLVREGVVLALPSDDLPAPAQLLKAALKPFARSKAKRAQRSDRLAVAVERLGPSYVKMGQFLATR PDVVGADFADDLASLQDRMAFFPAAAAKANIEGSLGRSISELYREFGDPIAAASIAQVHPAMVDTPKGPRKVAVKVIRPG VRQRFQNDLEAMYLIADLQQRFVRSARRLRPVEVTRTLEQTTKIEMDLRLEAAALSELAENTRQDPGFRVPEVDWERTGR DVVTMEWIDGVKMSDIEGLKAAGHDLNKLADTLIQSFLRHTLRDGFFHADMHPGNLFVDAKGEIVAVDMGIAGRLGKKER RFLAEILYGFITRDYMRVAEVHFEAGYVPGHHDKASFAQAIRAIGEPIHGQPAETISMGKLLTLLFEVTELFDMETRPEL VMLQKTMVVVEGVSRMLNPRFNMWKAADPVVGGWIRDNLGPKRIATDLKDGVKAALKLAEAVPEIAAKTEKLHSELMYMS ENGLRFDAQTAEAIGKAEARHTKWGRIALWVIALTLLYIAIRIS
Sequences:
>Translated_524_residues MSTLGAYFRLARVGWVLVREGVVLALPSDDLPAPAQLLKAALKPFARSKAKRAQRSDRLAVAVERLGPSYVKMGQFLATR PDVVGADFADDLASLQDRMAFFPAAAAKANIEGSLGRSISELYREFGDPIAAASIAQVHPAMVDTPKGPRKVAVKVIRPG VRQRFQNDLEAMYLIADLQQRFVRSARRLRPVEVTRTLEQTTKIEMDLRLEAAALSELAENTRQDPGFRVPEVDWERTGR DVVTMEWIDGVKMSDIEGLKAAGHDLNKLADTLIQSFLRHTLRDGFFHADMHPGNLFVDAKGEIVAVDMGIAGRLGKKER RFLAEILYGFITRDYMRVAEVHFEAGYVPGHHDKASFAQAIRAIGEPIHGQPAETISMGKLLTLLFEVTELFDMETRPEL VMLQKTMVVVEGVSRMLNPRFNMWKAADPVVGGWIRDNLGPKRIATDLKDGVKAALKLAEAVPEIAAKTEKLHSELMYMS ENGLRFDAQTAEAIGKAEARHTKWGRIALWVIALTLLYIAIRIS >Mature_523_residues STLGAYFRLARVGWVLVREGVVLALPSDDLPAPAQLLKAALKPFARSKAKRAQRSDRLAVAVERLGPSYVKMGQFLATRP DVVGADFADDLASLQDRMAFFPAAAAKANIEGSLGRSISELYREFGDPIAAASIAQVHPAMVDTPKGPRKVAVKVIRPGV RQRFQNDLEAMYLIADLQQRFVRSARRLRPVEVTRTLEQTTKIEMDLRLEAAALSELAENTRQDPGFRVPEVDWERTGRD VVTMEWIDGVKMSDIEGLKAAGHDLNKLADTLIQSFLRHTLRDGFFHADMHPGNLFVDAKGEIVAVDMGIAGRLGKKERR FLAEILYGFITRDYMRVAEVHFEAGYVPGHHDKASFAQAIRAIGEPIHGQPAETISMGKLLTLLFEVTELFDMETRPELV MLQKTMVVVEGVSRMLNPRFNMWKAADPVVGGWIRDNLGPKRIATDLKDGVKAALKLAEAVPEIAAKTEKLHSELMYMSE NGLRFDAQTAEAIGKAEARHTKWGRIALWVIALTLLYIAIRIS
Specific function: Required, probably indirectly, for the hydroxylation of 2-octaprenylphenol to 2-octaprenyl-6-hydroxy-phenol, the fourth step in ubiquinone biosynthesis [H]
COG id: COG0661
COG function: function code R; Predicted unusual protein kinase
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ABC1 family. UbiB subfamily [H]
Homologues:
Organism=Homo sapiens, GI41393593, Length=273, Percent_Identity=30.4029304029304, Blast_Score=107, Evalue=2e-23, Organism=Homo sapiens, GI40254938, Length=282, Percent_Identity=26.5957446808511, Blast_Score=100, Evalue=3e-21, Organism=Homo sapiens, GI217035081, Length=236, Percent_Identity=28.3898305084746, Blast_Score=90, Evalue=4e-18, Organism=Homo sapiens, GI34147522, Length=303, Percent_Identity=24.0924092409241, Blast_Score=76, Evalue=8e-14, Organism=Homo sapiens, GI217416386, Length=248, Percent_Identity=27.0161290322581, Blast_Score=75, Evalue=2e-13, Organism=Homo sapiens, GI27363457, Length=248, Percent_Identity=27.0161290322581, Blast_Score=75, Evalue=2e-13, Organism=Escherichia coli, GI2367309, Length=513, Percent_Identity=35.0877192982456, Blast_Score=292, Evalue=4e-80, Organism=Caenorhabditis elegans, GI17559152, Length=325, Percent_Identity=24.3076923076923, Blast_Score=94, Evalue=1e-19, Organism=Saccharomyces cerevisiae, GI6325148, Length=444, Percent_Identity=23.1981981981982, Blast_Score=107, Evalue=4e-24, Organism=Saccharomyces cerevisiae, GI6321319, Length=324, Percent_Identity=24.0740740740741, Blast_Score=78, Evalue=3e-15, Organism=Saccharomyces cerevisiae, GI6323282, Length=291, Percent_Identity=24.0549828178694, Blast_Score=70, Evalue=6e-13, Organism=Drosophila melanogaster, GI24662575, Length=299, Percent_Identity=29.4314381270903, Blast_Score=125, Evalue=8e-29, Organism=Drosophila melanogaster, GI22024280, Length=256, Percent_Identity=26.5625, Blast_Score=91, Evalue=2e-18,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004147 - InterPro: IPR011009 - InterPro: IPR010232 [H]
Pfam domain/function: PF03109 ABC1 [H]
EC number: NA
Molecular weight: Translated: 58214; Mature: 58083
Theoretical pI: Translated: 9.63; Mature: 9.63
Prosite motif: PS50011 PROTEIN_KINASE_DOM
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 3.6 %Met (Translated Protein) 3.6 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 3.4 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSTLGAYFRLARVGWVLVREGVVLALPSDDLPAPAQLLKAALKPFARSKAKRAQRSDRLA CCCHHHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH VAVERLGPSYVKMGQFLATRPDVVGADFADDLASLQDRMAFFPAAAAKANIEGSLGRSIS HHHHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH ELYREFGDPIAAASIAQVHPAMVDTPKGPRKVAVKVIRPGVRQRFQNDLEAMYLIADLQQ HHHHHHCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH RFVRSARRLRPVEVTRTLEQTTKIEMDLRLEAAALSELAENTRQDPGFRVPEVDWERTGR HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCC DVVTMEWIDGVKMSDIEGLKAAGHDLNKLADTLIQSFLRHTLRDGFFHADMHPGNLFVDA CEEEEHHHCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEEEC KGEIVAVDMGIAGRLGKKERRFLAEILYGFITRDYMRVAEVHFEAGYVPGHHDKASFAQA CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH IRAIGEPIHGQPAETISMGKLLTLLFEVTELFDMETRPELVMLQKTMVVVEGVSRMLNPR HHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCC FNMWKAADPVVGGWIRDNLGPKRIATDLKDGVKAALKLAEAVPEIAAKTEKLHSELMYMS CCCHHCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC ENGLRFDAQTAEAIGKAEARHTKWGRIALWVIALTLLYIAIRIS CCCCEECHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC >Mature Secondary Structure STLGAYFRLARVGWVLVREGVVLALPSDDLPAPAQLLKAALKPFARSKAKRAQRSDRLA CCHHHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH VAVERLGPSYVKMGQFLATRPDVVGADFADDLASLQDRMAFFPAAAAKANIEGSLGRSIS HHHHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH ELYREFGDPIAAASIAQVHPAMVDTPKGPRKVAVKVIRPGVRQRFQNDLEAMYLIADLQQ HHHHHHCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH RFVRSARRLRPVEVTRTLEQTTKIEMDLRLEAAALSELAENTRQDPGFRVPEVDWERTGR HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCC DVVTMEWIDGVKMSDIEGLKAAGHDLNKLADTLIQSFLRHTLRDGFFHADMHPGNLFVDA CEEEEHHHCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCEEEEC KGEIVAVDMGIAGRLGKKERRFLAEILYGFITRDYMRVAEVHFEAGYVPGHHDKASFAQA CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH IRAIGEPIHGQPAETISMGKLLTLLFEVTELFDMETRPELVMLQKTMVVVEGVSRMLNPR HHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCC FNMWKAADPVVGGWIRDNLGPKRIATDLKDGVKAALKLAEAVPEIAAKTEKLHSELMYMS CCCHHCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC ENGLRFDAQTAEAIGKAEARHTKWGRIALWVIALTLLYIAIRIS CCCCEECHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: 2 2-octaprenylphenol; O2 [C]
Specific reaction: 2 2-octaprenylphenol + O2 = 2 2-octaprenyl-6-hydroxyphenol [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA