Definition: Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession: NC_009972
Length: 6,346,587 bp
The map label for this gene is ygcU [H]
Identifier: 159896890
GI number: 159896890
Start: 427529
End: 429130
Strand: Direct
Name: ygcU [H]
Synonym: Haur_0358
Alternate gene names: 159896890
Gene position: 427529-429130 (Clockwise)
Preceding gene: 159896889
Following gene: 159896891
Centisome position: 6.74
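The centisome position and the gene length can be recomputed from the coordinates in this record. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the genome length, which is consistent with the values listed here; the function names are illustrative and not part of any database API.

```python
# Values copied from this record.
GENOME_LENGTH = 6_346_587  # "Length" field, in bases
GENE_START = 427_529       # "Start" field
GENE_END = 429_130         # "End" field


def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene with inclusive, 1-based coordinates."""
    return end - start + 1


def residue_count(n_bases: int) -> int:
    """Number of encoded residues, excluding the terminal stop codon."""
    return n_bases // 3 - 1


if __name__ == "__main__":
    n = gene_length(GENE_START, GENE_END)
    print(round(centisome_position(GENE_START, GENOME_LENGTH), 2))  # 6.74
    print(n)                 # 1602
    print(residue_count(n))  # 533
```

The results agree with the 1602-base gene sequence and the 533-residue protein reported in this record.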
GC content: 54.49
Gene sequence:
>1602_bases ATGCGCCGTTGGAATGGTTGGGGCGACGAGAGCAAAGAGTATCCGGTTAAAGCCGGCATTTTAGGCTTGCTCAAGCAATT AATTGGTGCTGGCACGGCTCCCAGTGATGTTCGTTTGGCCGAGATTGTTGCCCAAGTTCCAGCATCACGCTTGCCACACC ACGAATTGATCAGCACCGATCCTGAGTTGCGGATTCGCCATGCTCGTGGTCAGAGTTTCGCCGATTTGGTGGCTACCCGC AGCGGCGAGCTAGGCCAAATTCCCGATGGCGTGGCGTTTCCCCAATCCAGCCAAGCAGTTCGCGAATTGATCGATTGGGC TAGCGACAACAATGTTAGCCTGATTCCCTATGGCGGTGGAACCAGCGTTGCGGGGCATATTAATCCCGTTGCTGGCGAGC GACCCATTTTAACTGTCAGCCTTGCCAAACTTAATCGGCTGATGGAAATCAATCCAACTGCGCGGCTAGCTCGCTTTGGC GCTGGAATCAAAGGTCCAGATCTCGAAGCCCAATTACGGGCTTTGGGCTTTACGCTGGGCCATTTTCCCCAATCGTTTGA GCTTTCGACGCTGGGCGGCTGGATTGCAACCCGCTCTAGTGGCCAGCAATCGCTTGGTTTTGGGCGGATTGAACAACTTT GGGCAGGCGGGCGGGTTGAAACACCCCGAGGCTCGTTAGAATTAGCGCCCTTTCCGGCTTCGGCGGCTGGCCCTGACCTG CGTGAAATGCTGCTTGGCTCAGAAGGTCGTTATGGCATTATCACCGAAGTCACTGTGCGAATCCGCCCAATTCCTGAGCT TGATGTAGTCCATGCGATCTTTTTTGCAAATTGGGAGCAAGCCCAAACTGCCGCGCGAACAATCGCCCAAGCCAACCTAC CATTGGGCATGTTGCGGCTTAGCACGCCCACCGAAACCATGACCAACCTTGCCTTGGCTGGCCATGAGCGGGTGATTGGT TTGCTCGAAGGCTTTTTGAAATTGCGTGGGGTTGGCAGCGAAAAATGTCTGCTGTTGGTTGGCTTTATTGGCAGCCAAGC TCAAGTGAAGCTCAGTCGTCAAGCGGTGTTGAGCTTGATGCGTCGGCATGGCGGCGTTCACATCGGCCAAAGCTTTGGCA AAGCTTGGCAGGCTGGGCGCTTTCGTGCGCCGTATCTACGCAATCGGCTGTGGGAGTTGGGCTATGGCGTGGATACGGTC GAAACCGCCACGACATGGGAGAATGTTACGCCATTGTTGAATAGATTGGAACATAGCTTGCGCCATGGATTAGCGGCTGA GCACGAACAGGTGCATGTGTTCACCCATCTTTCACATTTTTACCCAACTGGCTCAAGCATCTACACAACCTATGCTTTTC GGCTAGGCAACGATGCGCAGGCAACTTTAGCCCGTTGGCACGCCTTGAAAACCGCTGCAAGCCAAGCAATTGTGGCCGCA GGCGGCACGATTAGCCATCAACATGGCGTGGGGCTTGATCATCGCCCCTATCTCGAAGCTGAAAAAGGCCGCTTAGGAAT TGGCGCATTACAACAACTTGGCCAGCATTTCGACCCGCAGGGCTTGCTCAATCCCGCCAAATTATACGAGGATGGCCAAT GA
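A GC content figure such as the one reported above is the percentage of G and C bases in the sequence. The following is a minimal sketch, assuming the sequence is pasted in as plain text (spaces and line breaks are ignored); `gc_content` is a hypothetical helper name, not a function from any particular library.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only unambiguous bases; this drops spaces, newlines and FASTA debris.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First ten bases of the gene sequence above: six of them are G or C.
    print(gc_content("ATGCGCCGTT"))  # 60.0
```

Passing the full 1602-base gene sequence to the same function gives a value to compare against the GC content field.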
Upstream 100 bases:
>100_bases GCCGATTTAGCTCGCCAAGCCAGCCAATTCGCCATGCAGCAGAGCATCGAAGTTTGGCAATTGTTAGCCGCGAATATCAA TAATACACAAAGGGGATAAT
Downstream 100 bases:
>100_bases ATCCGCAGTTGCAATGGAACGCTGCTTGGCGCGAAACCACTTGGCAAGTGCTCGATCAAGCGTGGGATGTGATTATTATT GGCGGTGGAATTACCGGCGC
Product: FAD linked oxidase domain-containing protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 533; Mature: 533
Protein sequence:
>533_residues MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTDPELRIRHARGQSFADLVATR SGELGQIPDGVAFPQSSQAVRELIDWASDNNVSLIPYGGGTSVAGHINPVAGERPILTVSLAKLNRLMEINPTARLARFG AGIKGPDLEAQLRALGFTLGHFPQSFELSTLGGWIATRSSGQQSLGFGRIEQLWAGGRVETPRGSLELAPFPASAAGPDL REMLLGSEGRYGIITEVTVRIRPIPELDVVHAIFFANWEQAQTAARTIAQANLPLGMLRLSTPTETMTNLALAGHERVIG LLEGFLKLRGVGSEKCLLLVGFIGSQAQVKLSRQAVLSLMRRHGGVHIGQSFGKAWQAGRFRAPYLRNRLWELGYGVDTV ETATTWENVTPLLNRLEHSLRHGLAAEHEQVHVFTHLSHFYPTGSSIYTTYAFRLGNDAQATLARWHALKTAASQAIVAA GGTISHQHGVGLDHRPYLEAEKGRLGIGALQQLGQHFDPQGLLNPAKLYEDGQ
Sequences:
>Translated_533_residues MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTDPELRIRHARGQSFADLVATR SGELGQIPDGVAFPQSSQAVRELIDWASDNNVSLIPYGGGTSVAGHINPVAGERPILTVSLAKLNRLMEINPTARLARFG AGIKGPDLEAQLRALGFTLGHFPQSFELSTLGGWIATRSSGQQSLGFGRIEQLWAGGRVETPRGSLELAPFPASAAGPDL REMLLGSEGRYGIITEVTVRIRPIPELDVVHAIFFANWEQAQTAARTIAQANLPLGMLRLSTPTETMTNLALAGHERVIG LLEGFLKLRGVGSEKCLLLVGFIGSQAQVKLSRQAVLSLMRRHGGVHIGQSFGKAWQAGRFRAPYLRNRLWELGYGVDTV ETATTWENVTPLLNRLEHSLRHGLAAEHEQVHVFTHLSHFYPTGSSIYTTYAFRLGNDAQATLARWHALKTAASQAIVAA GGTISHQHGVGLDHRPYLEAEKGRLGIGALQQLGQHFDPQGLLNPAKLYEDGQ >Mature_533_residues MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTDPELRIRHARGQSFADLVATR SGELGQIPDGVAFPQSSQAVRELIDWASDNNVSLIPYGGGTSVAGHINPVAGERPILTVSLAKLNRLMEINPTARLARFG AGIKGPDLEAQLRALGFTLGHFPQSFELSTLGGWIATRSSGQQSLGFGRIEQLWAGGRVETPRGSLELAPFPASAAGPDL REMLLGSEGRYGIITEVTVRIRPIPELDVVHAIFFANWEQAQTAARTIAQANLPLGMLRLSTPTETMTNLALAGHERVIG LLEGFLKLRGVGSEKCLLLVGFIGSQAQVKLSRQAVLSLMRRHGGVHIGQSFGKAWQAGRFRAPYLRNRLWELGYGVDTV ETATTWENVTPLLNRLEHSLRHGLAAEHEQVHVFTHLSHFYPTGSSIYTTYAFRLGNDAQATLARWHALKTAASQAIVAA GGTISHQHGVGLDHRPYLEAEKGRLGIGALQQLGQHFDPQGLLNPAKLYEDGQ
Specific function: Unknown
COG id: COG0277
COG function: function code C; FAD/FMN-containing dehydrogenases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 FAD-binding PCMH-type domain [H]
Homologues:
Organism=Homo sapiens, GI4501993, Length=564, Percent_Identity=27.1276595744681, Blast_Score=219, Evalue=5e-57
Organism=Homo sapiens, GI37595756, Length=489, Percent_Identity=26.1758691206544, Blast_Score=108, Evalue=2e-23
Organism=Homo sapiens, GI37595754, Length=508, Percent_Identity=25.1968503937008, Blast_Score=100, Evalue=3e-21
Organism=Escherichia coli, GI48994907, Length=452, Percent_Identity=23.4513274336283, Blast_Score=123, Evalue=3e-29
Organism=Escherichia coli, GI1789351, Length=483, Percent_Identity=23.8095238095238, Blast_Score=87, Evalue=3e-18
Organism=Caenorhabditis elegans, GI17556096, Length=573, Percent_Identity=27.0506108202443, Blast_Score=220, Evalue=1e-57
Organism=Caenorhabditis elegans, GI17534361, Length=185, Percent_Identity=26.4864864864865, Blast_Score=67, Evalue=2e-11
Organism=Saccharomyces cerevisiae, GI6320027, Length=219, Percent_Identity=28.310502283105, Blast_Score=89, Evalue=1e-18
Organism=Saccharomyces cerevisiae, GI6320023, Length=194, Percent_Identity=25.7731958762887, Blast_Score=73, Evalue=1e-13
Organism=Saccharomyces cerevisiae, GI6320764, Length=196, Percent_Identity=23.469387755102, Blast_Score=68, Evalue=4e-12
Organism=Drosophila melanogaster, GI24653753, Length=561, Percent_Identity=28.5204991087344, Blast_Score=217, Evalue=2e-56
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016166
- InterPro: IPR016167
- InterPro: IPR016164
- InterPro: IPR016168
- InterPro: IPR004113
- InterPro: IPR006094
- InterPro: IPR016171 [H]
Pfam domain/function: PF02913 FAD-oxidase_C; PF01565 FAD_binding_4 [H]
EC number: NA
Molecular weight: Translated: 57825; Mature: 57825
Theoretical pI: Translated: 9.42; Mature: 9.42
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
1.1 %Met (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
1.1 %Met (Mature Protein)
1.3 %Cys+Met (Mature Protein)
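The Cys/Met percentages above are residue counts divided by the protein length. A minimal sketch of that calculation follows, assuming a one-letter amino acid string as input and rounding to one decimal place as in this record; `cys_met_content` is a hypothetical helper name.

```python
def cys_met_content(protein: str) -> dict:
    """Return %Cys, %Met and %Cys+Met for a one-letter protein sequence."""
    # Drop whitespace so sequences pasted with spaces or line breaks still work.
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty protein sequence")
    n = len(residues)
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


if __name__ == "__main__":
    # First 60 residues of the protein sequence in this record.
    fragment = "MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTD"
    print(cys_met_content(fragment))
```

Running it on the full 533-residue sequence allows a direct comparison with the percentages listed above.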
Secondary structure:
>Translated Secondary Structure
MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTD
CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHCCCHHHHCCC
PELRIRHARGQSFADLVATRSGELGQIPDGVAFPQSSQAVRELIDWASDNNVSLIPYGGG
CCCEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCC
TSVAGHINPVAGERPILTVSLAKLNRLMEINPTARLARFGAGIKGPDLEAQLRALGFTLG
CCCCCCCCCCCCCCCEEEEEHHHHHHHEECCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH
HFPQSFELSTLGGWIATRSSGQQSLGFGRIEQLWAGGRVETPRGSLELAPFPASAAGPDL
CCCCCCEEHHHCCCCEECCCCCCCCCHHHHHHHHCCCCCCCCCCCEEECCCCCCCCCHHH
REMLLGSEGRYGIITEVTVRIRPIPELDVVHAIFFANWEQAQTAARTIAQANLPLGMLRL
HHHHHCCCCCEEEEEEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCEEEEEE
STPTETMTNLALAGHERVIGLLEGFLKLRGVGSEKCLLLVGFIGSQAQVKLSRQAVLSLM
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHH
RRHGGVHIGQSFGKAWQAGRFRAPYLRNRLWELGYGVDTVETATTWENVTPLLNRLEHSL
HHHCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH
RHGLAAEHEQVHVFTHLSHFYPTGSSIYTTYAFRLGNDAQATLARWHALKTAASQAIVAA
HHCCCCCCCEEEEEEHHHHHCCCCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHCEEEEC
GGTISHQHGVGLDHRPYLEAEKGRLGIGALQQLGQHFDPQGLLNPAKLYEDGQ
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHCCCCC
>Mature Secondary Structure
MRRWNGWGDESKEYPVKAGILGLLKQLIGAGTAPSDVRLAEIVAQVPASRLPHHELISTD
CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHCCCHHHHCCC
PELRIRHARGQSFADLVATRSGELGQIPDGVAFPQSSQAVRELIDWASDNNVSLIPYGGG
CCCEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCC
TSVAGHINPVAGERPILTVSLAKLNRLMEINPTARLARFGAGIKGPDLEAQLRALGFTLG
CCCCCCCCCCCCCCCEEEEEHHHHHHHEECCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH
HFPQSFELSTLGGWIATRSSGQQSLGFGRIEQLWAGGRVETPRGSLELAPFPASAAGPDL
CCCCCCEEHHHCCCCEECCCCCCCCCHHHHHHHHCCCCCCCCCCCEEECCCCCCCCCHHH
REMLLGSEGRYGIITEVTVRIRPIPELDVVHAIFFANWEQAQTAARTIAQANLPLGMLRL
HHHHHCCCCCEEEEEEEEEEEECCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCEEEEEE
STPTETMTNLALAGHERVIGLLEGFLKLRGVGSEKCLLLVGFIGSQAQVKLSRQAVLSLM
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHH
RRHGGVHIGQSFGKAWQAGRFRAPYLRNRLWELGYGVDTVETATTWENVTPLLNRLEHSL
HHHCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH
RHGLAAEHEQVHVFTHLSHFYPTGSSIYTTYAFRLGNDAQATLARWHALKTAASQAIVAA
HHCCCCCCCEEEEEEHHHHHCCCCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHCEEEEC
GGTISHQHGVGLDHRPYLEAEKGRLGIGALQQLGQHFDPQGLLNPAKLYEDGQ
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]