Definition | Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome. |
---|---|
Accession | NC_009972 |
Length | 6,346,587 |
Click here to switch to the map view.
The map label for this gene is 159896998
Identifier: 159896998
GI number: 159896998
Start: 543677
End: 545101
Strand: Direct
Name: 159896998
Synonym: Haur_0466
Alternate gene names: NA
Gene position: 543677-545101 (Clockwise)
Preceding gene: 159896997
Following gene: 159897002
Centisome position: 8.57
GC content: 55.02
Gene sequence:
>1425_bases ATGTTTGGAGTTCTGCGTGGGTGCTCGCCGCACCTCGATCAACCGACCCATGCCGCGTGGTGGAGCCACATTTGTGGTAT GTGCCTAACCTTGCGCGACCAGCATGGCCAAGTTGCTCGCATTACCACCAATTACGATGCTGCTTTGCTTTCGGCGTTGT ACGAAGCTCAACGCGCTGAGCAAGGCTCCCGCCGCACGAGTGTTTGCGCGTTACGTGGTTTTCGGGCGCTCGATGTGGTC GCTGCTGATAATCATGGTTCACGCTATGCTGCCGCCGTAAGTTTGCTGATGGCCGCAATTCGGCTGCGCGATAATGTGGC CGATCGCGATGGTTGGGCTGGTAAAGTGCCCTTGGTTGCTAACACGGTTGCTAAACGTTGGGATAAAAAAGCTGAGCAAA CCGCCCGTGAGTTGGGCTTCGAGCCAGCTTTATTGCGCCAACAAGCCGTCGCTCAGGCTGAAGTCGAGGCCTTGGTCAAT GCCGATTTTTGCACCTATTCGGCTCCGACTGAAGCTGCGGTGGGTGCGGCGTTTCGGCATACGGCGATTTTGGCCAACCA ACCGAGCAATGCCGAGCCACTTGAGCAAATGGGCCGTATGTATGGCCGCATGATGTTGCTGCTCGATAGCTACAACGATG TGGCTGAAGATCAAGCCCGTGGCCATTTCAATGCCTTGTGTGCCACGCCCCAAGCCGAATTGCGCACGGTTGCCCATAAA ATCTTCAACGATGCTTTTGCCAATTTGCGCCAACAATGGCAACGTTTGAGCTTGCTGCAAGCACATTTGGTTGAAGTGTT GTTGCTGAATATGTTGCCAGCAATTGGCTCGAAAGCCTTTGGCGCATGCAAACGCCATAGTTTGGCTTGTGCCACGGTCT TGCCAGTTGCGGCGGCTTTATTACCAGCCATGGCCAGCGATTACGATGACGATCCTTCACGGCGCGACCCCAACCGCGCT TATACTGGGCCAACCCGCGCCCTCAAGCCCCACGAAAACCCTGAATATCAAGGTCAATATCCACAACAGCCGCCCTATCA AGGCCAATATCCGCAACAACCATATCAAGGCTATCCACCACAATCCAATCCAAATGATCCCTCGATGTTGCCGCCTGGCT CGTTGCCGCCAGCTAGTTCACACCCGGTTCCGCCGCCAGGCCATTATCAAGATCGTAAGCGCGATGTAGCGATGGGCGCA GCCTTATGCTGTTGTCAGAGTCGTAGCCGTCGTCACCACCACTCGCATCGTTCACGTATCGTGTGTTGTGACGATGATTG CTGCGATTGTTGCCAATGCACCTGCTGCTGTTGCGATGCCACCGATGCCTGCGAAGGCGATGGTTGTAGTTGCTGCGAAT GTGGGGGCTGTGGTGAATGTGGCAGTTGCTGCGAATGTGGCGATTGCTGTAGCTGTGATTGTTAA
Upstream 100 bases:
>100_bases TTTTTTCGCAACAATCCAGCTCTAACCCACGTATGTTTTGCAAGATTTTTAATCCCCATTGGCCGTTTCCTGCGATGTTC TTTTTGAGAAAGGGTCTGCT
Downstream 100 bases:
>100_bases CATAAGCAACCCCGCCAAGCAGCCTAGTCGCTTGGCGGGGTTGCATTGTTTATCTGTAGTTTTTAGGCGCGGGCAGTGAT TTGGCGGCCAGTCAAGCGTG
Product: expression regulator
Products: NA
Alternate protein names: Regulatory Protein; Regulator; Transcription Regulator; LOW QUALITY PROTEIN Expression Regulator
Number of amino acids: Translated: 474; Mature: 474
Protein sequence:
>474_residues MFGVLRGCSPHLDQPTHAAWWSHICGMCLTLRDQHGQVARITTNYDAALLSALYEAQRAEQGSRRTSVCALRGFRALDVV AADNHGSRYAAAVSLLMAAIRLRDNVADRDGWAGKVPLVANTVAKRWDKKAEQTARELGFEPALLRQQAVAQAEVEALVN ADFCTYSAPTEAAVGAAFRHTAILANQPSNAEPLEQMGRMYGRMMLLLDSYNDVAEDQARGHFNALCATPQAELRTVAHK IFNDAFANLRQQWQRLSLLQAHLVEVLLLNMLPAIGSKAFGACKRHSLACATVLPVAAALLPAMASDYDDDPSRRDPNRA YTGPTRALKPHENPEYQGQYPQQPPYQGQYPQQPYQGYPPQSNPNDPSMLPPGSLPPASSHPVPPPGHYQDRKRDVAMGA ALCCCQSRSRRHHHSHRSRIVCCDDDCCDCCQCTCCCCDATDACEGDGCSCCECGGCGECGSCCECGDCCSCDC
Sequences:
>Translated_474_residues MFGVLRGCSPHLDQPTHAAWWSHICGMCLTLRDQHGQVARITTNYDAALLSALYEAQRAEQGSRRTSVCALRGFRALDVV AADNHGSRYAAAVSLLMAAIRLRDNVADRDGWAGKVPLVANTVAKRWDKKAEQTARELGFEPALLRQQAVAQAEVEALVN ADFCTYSAPTEAAVGAAFRHTAILANQPSNAEPLEQMGRMYGRMMLLLDSYNDVAEDQARGHFNALCATPQAELRTVAHK IFNDAFANLRQQWQRLSLLQAHLVEVLLLNMLPAIGSKAFGACKRHSLACATVLPVAAALLPAMASDYDDDPSRRDPNRA YTGPTRALKPHENPEYQGQYPQQPPYQGQYPQQPYQGYPPQSNPNDPSMLPPGSLPPASSHPVPPPGHYQDRKRDVAMGA ALCCCQSRSRRHHHSHRSRIVCCDDDCCDCCQCTCCCCDATDACEGDGCSCCECGGCGECGSCCECGDCCSCDC >Mature_474_residues MFGVLRGCSPHLDQPTHAAWWSHICGMCLTLRDQHGQVARITTNYDAALLSALYEAQRAEQGSRRTSVCALRGFRALDVV AADNHGSRYAAAVSLLMAAIRLRDNVADRDGWAGKVPLVANTVAKRWDKKAEQTARELGFEPALLRQQAVAQAEVEALVN ADFCTYSAPTEAAVGAAFRHTAILANQPSNAEPLEQMGRMYGRMMLLLDSYNDVAEDQARGHFNALCATPQAELRTVAHK IFNDAFANLRQQWQRLSLLQAHLVEVLLLNMLPAIGSKAFGACKRHSLACATVLPVAAALLPAMASDYDDDPSRRDPNRA YTGPTRALKPHENPEYQGQYPQQPPYQGQYPQQPYQGYPPQSNPNDPSMLPPGSLPPASSHPVPPPGHYQDRKRDVAMGA ALCCCQSRSRRHHHSHRSRIVCCDDDCCDCCQCTCCCCDATDACEGDGCSCCECGGCGECGSCCECGDCCSCDC
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 51611; Mature: 51611
Theoretical pI: Translated: 6.62; Mature: 6.62
Prosite motif: PS01208 VWFC_1 ; PS00198 4FE4S_FERREDOXIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
7.6 %Cys (Translated Protein) 2.3 %Met (Translated Protein) 9.9 %Cys+Met (Translated Protein) 7.6 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 9.9 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFGVLRGCSPHLDQPTHAAWWSHICGMCLTLRDQHGQVARITTNYDAALLSALYEAQRAE CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHH QGSRRTSVCALRGFRALDVVAADNHGSRYAAAVSLLMAAIRLRDNVADRDGWAGKVPLVA CCCHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH NTVAKRWDKKAEQTARELGFEPALLRQQAVAQAEVEALVNADFCTYSAPTEAAVGAAFRH HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHCCCCCHHHHHHHHHHH TAILANQPSNAEPLEQMGRMYGRMMLLLDSYNDVAEDQARGHFNALCATPQAELRTVAHK HHHEECCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHCCCHHHHHHHHHH IFNDAFANLRQQWQRLSLLQAHLVEVLLLNMLPAIGSKAFGACKRHSLACATVLPVAAAL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHH LPAMASDYDDDPSRRDPNRAYTGPTRALKPHENPEYQGQYPQQPPYQGQYPQQPYQGYPP HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC QSNPNDPSMLPPGSLPPASSHPVPPPGHYQDRKRDVAMGAALCCCQSRSRRHHHSHRSRI CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE VCCDDDCCDCCQCTCCCCDATDACEGDGCSCCECGGCGECGSCCECGDCCSCDC EEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC >Mature Secondary Structure MFGVLRGCSPHLDQPTHAAWWSHICGMCLTLRDQHGQVARITTNYDAALLSALYEAQRAE CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHH QGSRRTSVCALRGFRALDVVAADNHGSRYAAAVSLLMAAIRLRDNVADRDGWAGKVPLVA CCCHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH NTVAKRWDKKAEQTARELGFEPALLRQQAVAQAEVEALVNADFCTYSAPTEAAVGAAFRH HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHCCCCCHHHHHHHHHHH TAILANQPSNAEPLEQMGRMYGRMMLLLDSYNDVAEDQARGHFNALCATPQAELRTVAHK HHHEECCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHCCCHHHHHHHHHH IFNDAFANLRQQWQRLSLLQAHLVEVLLLNMLPAIGSKAFGACKRHSLACATVLPVAAAL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHH LPAMASDYDDDPSRRDPNRAYTGPTRALKPHENPEYQGQYPQQPPYQGQYPQQPYQGYPP HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC QSNPNDPSMLPPGSLPPASSHPVPPPGHYQDRKRDVAMGAALCCCQSRSRRHHHSHRSRI CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE VCCDDDCCDCCQCTCCCCDATDACEGDGCSCCECGGCGECGSCCECGDCCSCDC EEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA