Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

The map label for this gene is 159899699

Identifier: 159899699

GI number: 159899699

Start: 4022365

End: 4024428

Strand: Reverse

Name: 159899699

Synonym: Haur_3181

Alternate gene names: NA

Gene position: 4024428-4022365 (Counterclockwise)

Preceding gene: 159899702

Following gene: 159899696

Centisome position: 63.41
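
The coordinate fields above are mutually consistent, which can be checked with a short sketch. It assumes that the centisome position is the gene's first base on the reverse strand (4024428) expressed as a percentage of the chromosome length; that definition is inferred from the values in this record, not stated by it. The function names are hypothetical.

```python
def gene_length(start, end):
    """Length in bases of a gene spanning start..end (1-based, inclusive)."""
    return end - start + 1


def protein_length(n_bases):
    """Residue count, assuming the last codon is a stop codon."""
    return n_bases // 3 - 1


def centisome(position, genome_length):
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(4022365, 4024428))   # 2064
print(protein_length(2064))            # 687
print(centisome(4024428, 6346587))     # 63.41
```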

GC content (%): 45.69

Gene sequence:

>2064_bases
GTGGCAAGAAAACCGTTTCGAACGTTGGTTAGTTTGACCCTGGCCTTAGGCGTGTTGATGGGCGGAACGTTGTTGAGTGC
AGTAGCTGACCAATCAGCCACATTGCCCCCAATTAATGGTATCTGGGATAGTTCGCTTGGCAACAGCTTTCAAAAAGCTG
GTTCGATTCATTCGTTGACGGTTGATCCAAGTAATAATCTCTACGTTGCTGGCGATTTCAATTTTATCAATCAAACCGAA
GTTAATGGTTTAGCACGTTGGAATGGCTCAACATGGCAAGGCTATGGGCTTCAGCCAAACGATGCTGGCAAAATTCACAA
GGTATTGCCATTTGAGAATGAGCTATTTGCGATTGGTGATTTTGAGCGTTTGCAGCAAGCTCAAAATAAAATTGCCCGTT
GGAATGGCACAAGTTTTCAGCCAATTGGCAATGGTATTACCGGATTATTGCATCGATTCTCACCTGATATTACGGTTGCA
ACCTTGCATAGCTATAGCGAAACCCTCTATATCGGCGGTGAGTTTAGCCAATTTGCTGGTGAATTTGCCTATGCAATTGG
CCAATGGAATGGCGCGGTTCAACCGCAAGCCACGATCTTTGATGGTAAAGTTACTAGCTTTGCCTCTGATGCTGATGAAT
TAATTGCTGGTGGCTATTTTACCCAAATTGATGGCGTGGATAGCCGTTTGGCACGCTTGGTCGATAATCAATGGGTTTCA
CTCGATGTTGGGATCGCCAATAATGCTTTTACGGTCTACTCAGCGAATAATACGATTTATCTTGTAGGCTATAATCTCAG
TAGTCAAACCTATCGACTCTATCGCTGGGATGGTTCAAGCGCAACCAGTGTGGGTGGTCCACTGGATGTTGCCATTGATA
ATTTAGTTGTTCATGGCACAGATCTCTATATTCAAAGCAATCAGCAACTCTTGAAATTAGAAAATGATCAATGGCATGCT
GCTAATTTGCCAGTGACGATTACCAAACTGACCGCTTTAGCCAGCAATGGCTCAACCCTTTACCTTGGCGGTGAATTGCT
GGTGAATGGCAATCCAAGCCAAATTGTTGCTTGGAACGGTACGCAAGCCCAAAGTTTGGCGAGCTTGACGGTGATTGATG
ATCAACACTTGGTCAGTGGTGATGCTGGTCGGCCAGTGATTATCGATTATGGAATTGGTAGTGCTCAAGGCTCAATTCAA
CGCTGGAATGGCACGAGCTGGGAAACCTTGGCAACTGATACCAACGATTCGTTTGGAATTTCACAGTTTTATCGCGCCAA
CGATCAACTTTACAGCTTTTTCCTTGAGCCACAAGCCTTGGCTACTGGCCAAGCGGCTAGTAATGTTTGGCGCTTGAATG
GCACGACGTGGACGAGCGCTGATATTAATCTTCAAGATTCAGTCTATTGGTATGAGTCAGGCCAACAGATCTTGGCGTAT
CTTTCACAACCGCAGGCAAGCCAACCAATAACTGGCATTTTGGAGTTTGATGGCACGAGCCTTAATCCACGGCTTCAACC
AGCTTGGTTTAATCGCTCAGATTATCTCTTCTTCTTCGAGAATAATTTCTATGCGATCAATCTGCTGAATGATGTAGACT
CGAACTTTTTAGAAATTCAGCGCTGGGATGGCCAAACCTGGCAAGAAATTCAGTTTATGGAAGTGCCTAAGGCCCGTTAT
AGCCTGAAACTGTGGCGTGGTCAGCTGTTTATGGCCAATACTGCTGGCAATTTCTACGAAATTAATCCTGATGGTAGTTT
GGATGAAATTGCTACGGCTGATGGTGGCATTTATACTTTGGCTGGGCGTGATGATGGTTCGCTCTATCTTGGTGGCGATT
TTAGCACCATTGATGCAGCGGCAACAGGTCTAATCGCTAGCTTTGATGGCACGAATTTCCGGGGCTTGGTCAGTCAACCC
AATGGACGCGTCATCTCACTGAGCGTTGATCATAATTATGTGTATGTTGCTGGCGATTTCACCAAAGTGGGCACAGTAGC
ATCGTTGGGCGTGGCGGTCTTTGCGCCAACCAAGCAGGTCTATTTGCCGATGGCAATTCGTTAA

Upstream 100 bases:

>100_bases
CCCCAACTACCGATCCCCAAGATCGAAATCGTGCGCACAATCCGATTTACCTAGCGTATAGTAAAGCAATCGTGTGTGTG
TAGTAGGAAGGATGATTGCA

Downstream 100 bases:

>100_bases
TGTTAAATTCCCTCACCCCCTGGCCCCCTCATCCCGCACGCGGGAGAGGGGGCCTATGCTCATTTTGCTCCCCTCGCCCG
CCGCAGTGGGAGAAGGCTGG

Product: hypothetical protein

Products: NA

Alternate protein names: None

Number of amino acids: Translated: 687; Mature: 686

Protein sequence:

>687_residues
MARKPFRTLVSLTLALGVLMGGTLLSAVADQSATLPPINGIWDSSLGNSFQKAGSIHSLTVDPSNNLYVAGDFNFINQTE
VNGLARWNGSTWQGYGLQPNDAGKIHKVLPFENELFAIGDFERLQQAQNKIARWNGTSFQPIGNGITGLLHRFSPDITVA
TLHSYSETLYIGGEFSQFAGEFAYAIGQWNGAVQPQATIFDGKVTSFASDADELIAGGYFTQIDGVDSRLARLVDNQWVS
LDVGIANNAFTVYSANNTIYLVGYNLSSQTYRLYRWDGSSATSVGGPLDVAIDNLVVHGTDLYIQSNQQLLKLENDQWHA
ANLPVTITKLTALASNGSTLYLGGELLVNGNPSQIVAWNGTQAQSLASLTVIDDQHLVSGDAGRPVIIDYGIGSAQGSIQ
RWNGTSWETLATDTNDSFGISQFYRANDQLYSFFLEPQALATGQAASNVWRLNGTTWTSADINLQDSVYWYESGQQILAY
LSQPQASQPITGILEFDGTSLNPRLQPAWFNRSDYLFFFENNFYAINLLNDVDSNFLEIQRWDGQTWQEIQFMEVPKARY
SLKLWRGQLFMANTAGNFYEINPDGSLDEIATADGGIYTLAGRDDGSLYLGGDFSTIDAAATGLIASFDGTNFRGLVSQP
NGRVISLSVDHNYVYVAGDFTKVGTVASLGVAVFAPTKQVYLPMAIR

Sequences:

>Translated_687_residues
MARKPFRTLVSLTLALGVLMGGTLLSAVADQSATLPPINGIWDSSLGNSFQKAGSIHSLTVDPSNNLYVAGDFNFINQTE
VNGLARWNGSTWQGYGLQPNDAGKIHKVLPFENELFAIGDFERLQQAQNKIARWNGTSFQPIGNGITGLLHRFSPDITVA
TLHSYSETLYIGGEFSQFAGEFAYAIGQWNGAVQPQATIFDGKVTSFASDADELIAGGYFTQIDGVDSRLARLVDNQWVS
LDVGIANNAFTVYSANNTIYLVGYNLSSQTYRLYRWDGSSATSVGGPLDVAIDNLVVHGTDLYIQSNQQLLKLENDQWHA
ANLPVTITKLTALASNGSTLYLGGELLVNGNPSQIVAWNGTQAQSLASLTVIDDQHLVSGDAGRPVIIDYGIGSAQGSIQ
RWNGTSWETLATDTNDSFGISQFYRANDQLYSFFLEPQALATGQAASNVWRLNGTTWTSADINLQDSVYWYESGQQILAY
LSQPQASQPITGILEFDGTSLNPRLQPAWFNRSDYLFFFENNFYAINLLNDVDSNFLEIQRWDGQTWQEIQFMEVPKARY
SLKLWRGQLFMANTAGNFYEINPDGSLDEIATADGGIYTLAGRDDGSLYLGGDFSTIDAAATGLIASFDGTNFRGLVSQP
NGRVISLSVDHNYVYVAGDFTKVGTVASLGVAVFAPTKQVYLPMAIR
>Mature_686_residues
ARKPFRTLVSLTLALGVLMGGTLLSAVADQSATLPPINGIWDSSLGNSFQKAGSIHSLTVDPSNNLYVAGDFNFINQTEV
NGLARWNGSTWQGYGLQPNDAGKIHKVLPFENELFAIGDFERLQQAQNKIARWNGTSFQPIGNGITGLLHRFSPDITVAT
LHSYSETLYIGGEFSQFAGEFAYAIGQWNGAVQPQATIFDGKVTSFASDADELIAGGYFTQIDGVDSRLARLVDNQWVSL
DVGIANNAFTVYSANNTIYLVGYNLSSQTYRLYRWDGSSATSVGGPLDVAIDNLVVHGTDLYIQSNQQLLKLENDQWHAA
NLPVTITKLTALASNGSTLYLGGELLVNGNPSQIVAWNGTQAQSLASLTVIDDQHLVSGDAGRPVIIDYGIGSAQGSIQR
WNGTSWETLATDTNDSFGISQFYRANDQLYSFFLEPQALATGQAASNVWRLNGTTWTSADINLQDSVYWYESGQQILAYL
SQPQASQPITGILEFDGTSLNPRLQPAWFNRSDYLFFFENNFYAINLLNDVDSNFLEIQRWDGQTWQEIQFMEVPKARYS
LKLWRGQLFMANTAGNFYEINPDGSLDEIATADGGIYTLAGRDDGSLYLGGDFSTIDAAATGLIASFDGTNFRGLVSQPN
GRVISLSVDHNYVYVAGDFTKVGTVASLGVAVFAPTKQVYLPMAIR
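
In this record the mature sequence differs from the translated one only by the missing initiator methionine (687 versus 686 residues). A minimal sketch of that relationship follows; the function name is hypothetical, and it assumes that no other processing, such as signal-peptide cleavage, is applied.

```python
def mature_sequence(translated):
    """Drop the initiator methionine, if present (assumed processing rule)."""
    if translated.startswith("M"):
        return translated[1:]
    return translated


# First residues of the translated sequence shown above.
print(mature_sequence("MARKPFRTLV"))  # ARKPFRTLV
```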

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight (Da): Translated: 74965; Mature: 74834

Theoretical pI: Translated: 4.30; Mature: 4.30

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
0.7 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.6 %Met     (Mature Protein)
0.6 %Cys+Met (Mature Protein)
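
The percentages above can be reproduced by counting residues in the one-letter sequence. The sketch below assumes the values are simple residue counts divided by sequence length and rounded to one decimal place; the function name is hypothetical.

```python
def residue_percent(sequence, residues):
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy input: one Met and one Cys in five residues.
print(residue_percent("MARKC", "M"))   # 20.0
print(residue_percent("MARKC", "CM"))  # 40.0
```

Under that assumption, the translated sequence above contains 5 Met in 687 residues (0.7 %) and the mature sequence contains 4 Met in 686 residues (0.6 %). Neither contains Cys.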

Secondary structure:

>Translated Secondary Structure
MARKPFRTLVSLTLALGVLMGGTLLSAVADQSATLPPINGIWDSSLGNSFQKAGSIHSLT
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCEEEEE
VDPSNNLYVAGDFNFINQTEVNGLARWNGSTWQGYGLQPNDAGKIHKVLPFENELFAIGD
ECCCCCEEEEECCCCCCCCCCCEEEEECCCCEECCCCCCCCCCCEEEECCCCCCEEEECC
FERLQQAQNKIARWNGTSFQPIGNGITGLLHRFSPDITVATLHSYSETLYIGGEFSQFAG
HHHHHHHHHHHEECCCCCCCCCCCCHHHHHHHCCCCEEEEEEECCCCEEEECCCHHHHHH
EFAYAIGQWNGAVQPQATIFDGKVTSFASDADELIAGGYFTQIDGVDSRLARLVDNQWVS
HHEEEEECCCCCCCCCEEEECCCHHHCCCCHHHHHCCCEEEEECCHHHHHHHHHCCCEEE
LDVGIANNAFTVYSANNTIYLVGYNLSSQTYRLYRWDGSSATSVGGPLDVAIDNLVVHGT
EEEEECCCEEEEEECCCEEEEEEECCCCCEEEEEEECCCCCCCCCCCEEEEECCEEEEEE
DLYIQSNQQLLKLENDQWHAANLPVTITKLTALASNGSTLYLGGELLVNGNPSQIVAWNG
EEEEECCCEEEEECCCCEEECCCCEEEEEEEEEECCCCEEEECCEEEECCCCCEEEEECC
TQAQSLASLTVIDDQHLVSGDAGRPVIIDYGIGSAQGSIQRWNGTSWETLATDTNDSFGI
CCCCCEEEEEEECCCEEECCCCCCEEEEEECCCCCCCCEEECCCCCCEEEEECCCCCCCH
SQFYRANDQLYSFFLEPQALATGQAASNVWRLNGTTWTSADINLQDSVYWYESGQQILAY
HHHHHCCCEEEEEEECCHHHCCCCCCCCEEEECCCEEECEECCCCCCEEEECCCHHHHHH
LSQPQASQPITGILEFDGTSLNPRLQPAWFNRSDYLFFFENNFYAINLLNDVDSNFLEIQ
HCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCEEEEEECCEEEEEEECCCCCCEEEEE
RWDGQTWQEIQFMEVPKARYSLKLWRGQLFMANTAGNFYEINPDGSLDEIATADGGIYTL
ECCCCCHHEEEEEECCCCCEEEEEEECEEEEEECCCCEEEECCCCCHHHHCCCCCCEEEE
AGRDDGSLYLGGDFSTIDAAATGLIASFDGTNFRGLVSQPNGRVISLSVDHNYVYVAGDF
ECCCCCCEEECCCCCHHHHHHHCEEEECCCCCCEEEEECCCCEEEEEEECCCEEEEEECC
TKVGTVASLGVAVFAPTKQVYLPMAIR
HHHCHHHHCCEEEEECCCEEEEEEEEC
>Mature Secondary Structure 
ARKPFRTLVSLTLALGVLMGGTLLSAVADQSATLPPINGIWDSSLGNSFQKAGSIHSLT
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCEEEEE
VDPSNNLYVAGDFNFINQTEVNGLARWNGSTWQGYGLQPNDAGKIHKVLPFENELFAIGD
ECCCCCEEEEECCCCCCCCCCCEEEEECCCCEECCCCCCCCCCCEEEECCCCCCEEEECC
FERLQQAQNKIARWNGTSFQPIGNGITGLLHRFSPDITVATLHSYSETLYIGGEFSQFAG
HHHHHHHHHHHEECCCCCCCCCCCCHHHHHHHCCCCEEEEEEECCCCEEEECCCHHHHHH
EFAYAIGQWNGAVQPQATIFDGKVTSFASDADELIAGGYFTQIDGVDSRLARLVDNQWVS
HHEEEEECCCCCCCCCEEEECCCHHHCCCCHHHHHCCCEEEEECCHHHHHHHHHCCCEEE
LDVGIANNAFTVYSANNTIYLVGYNLSSQTYRLYRWDGSSATSVGGPLDVAIDNLVVHGT
EEEEECCCEEEEEECCCEEEEEEECCCCCEEEEEEECCCCCCCCCCCEEEEECCEEEEEE
DLYIQSNQQLLKLENDQWHAANLPVTITKLTALASNGSTLYLGGELLVNGNPSQIVAWNG
EEEEECCCEEEEECCCCEEECCCCEEEEEEEEEECCCCEEEECCEEEECCCCCEEEEECC
TQAQSLASLTVIDDQHLVSGDAGRPVIIDYGIGSAQGSIQRWNGTSWETLATDTNDSFGI
CCCCCEEEEEEECCCEEECCCCCCEEEEEECCCCCCCCEEECCCCCCEEEEECCCCCCCH
SQFYRANDQLYSFFLEPQALATGQAASNVWRLNGTTWTSADINLQDSVYWYESGQQILAY
HHHHHCCCEEEEEEECCHHHCCCCCCCCEEEECCCEEECEECCCCCCEEEECCCHHHHHH
LSQPQASQPITGILEFDGTSLNPRLQPAWFNRSDYLFFFENNFYAINLLNDVDSNFLEIQ
HCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCEEEEEECCEEEEEEECCCCCCEEEEE
RWDGQTWQEIQFMEVPKARYSLKLWRGQLFMANTAGNFYEINPDGSLDEIATADGGIYTL
ECCCCCHHEEEEEECCCCCEEEEEEECEEEEEECCCCEEEECCCCCHHHHCCCCCCEEEE
AGRDDGSLYLGGDFSTIDAAATGLIASFDGTNFRGLVSQPNGRVISLSVDHNYVYVAGDF
ECCCCCCEEECCCCCHHHHHHHCEEEECCCCCCEEEEECCCCEEEEEEECCCEEEEEECC
TKVGTVASLGVAVFAPTKQVYLPMAIR
HHHCHHHHCCEEEEECCCEEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA