Definition: Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession: NC_009972
Length: 6,346,587

The map label for this gene is 159898390

Identifier: 159898390

GI number: 159898390

Start: 2201175

End: 2203613

Strand: Reverse

Name: 159898390

Synonym: Haur_1866

Alternate gene names: NA

Gene position: 2203613-2201175 (Counterclockwise)

Preceding gene: 159898391

Following gene: 159898379

Centisome position: 34.72
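
The coordinate fields above are related by simple arithmetic. The record does not say how the centisome value is derived, so the following Python sketch rests on an assumption: that it is the gene's first base (the larger coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. Under that assumption the listed values are reproduced. The function names are illustrative and not part of the source database.

```python
# Values copied from this record.
GENOME_LENGTH = 6_346_587          # NC_009972, in bases
START, END = 2_201_175, 2_203_613  # 1-based, inclusive coordinates
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


# Assumption: a reverse-strand gene begins at its larger coordinate.
gene_start = END if STRAND == "Reverse" else START

length = gene_length(START, END)
centi = round(centisome(gene_start, GENOME_LENGTH), 2)
print(length, centi)
```

Using the smaller coordinate instead gives roughly 34.68, which is why the sketch takes the reverse-strand start as the reference point.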

GC content: 47.81

Gene sequence:

>2439_bases
ATGAGTGAAAGCTTCGGTTACTGGCTTAAACAGCGGCGTAAAGAGCTTAATTTTACCCAAGAATATTTAGCCGAGTTGGT
AAGCTGTTCAACCATTACTATTCGCAAAATCGAGTCGAATGAACGGCGACCTTCGCGCCAGATTGCGGCTCGGATCGCTA
AATTTTGTCAAGTAGAAGCCAATCGGGCCTTTGTTGATGCAGCATGGGCTGGGCAATCGCCTAGCCCATCTGATGGTGGC
TCGCCACCTGAGCCAGCTCCTTCGAACTTACTGCCCCCATTTAGTTCAATCATTGGCCGCGATTCGGCGATTGAATCAAT
TTGTGTCCAATTTCAAGCCCAAAAAGCCCGTTTAGTTACGATTGTCGGCTCACCTGGCGTTGGCAAAACCCGCCTGGCCC
AAGCCATTGGCCAACAGTTACTCACACATTTTAGCGATGGCGTTTTTTGGATTAGCCTCGATCCAATCGTCAATGCTAGC
CTTGTCCCATCGTTAATTACGCGGGTACTCGGCATTCACGAAAACCCCAATCAATCGATCGAAGAAACAATTTTCAACTG
GCTCAAAAATCGCCATTTGCTCCTCATTCTCGATAATTGTGAGCATATTATTGAGTTGCGCCAGTTTGTAAACCAACTGT
TAAGTTATTGTCCAACCCTCTCGATTCTTGCGACCAGCCGCGAAGTGTTGCATTTGCGCTGGGAACAGCGCTTTCCATTG
CGCCCGCTGACGGTTCCAGTACGCGGTATGCAGCTTGATCTCGCGCAACTGGCCCAAATTCCAGCGATTGCGCTATTTTT
AGAGCGCAGTCGGGCGATCAATCCTCAGGCCGAGTTGAATGCATCGAATGCCCGAGCAATTAGCACGATTTGTATGCAGC
TTGAAGGCTTGCCGCTAAGTATTGAGTTAATTGCTGCGCGTAGCGCCATGCTCAGCCCTCAAATGCTGGTGCATCGGCTG
AATAATCAATTGAATGTACTGACCCAAGGCTCGCGCGATTTGCCTCATCGCCAACAAACCTTGCGCAATGCCATCCAATG
GAGTATCGATTTGCTCGATAGTGCCGAGCAATTTCTGCTGGTAGCGCTGGCATTAGCTCCCGAAAGTTGTACCCTCCTGA
GCCTAGAAGCGCTTGCTGATTGCTATAGCCCGTGGCCGTGGTCGATTTTCGATGGCCTAACCAACTTGTTCGATAAAAGC
CTAATTTGGATTCAGCAGCAGCAAACTGATGAGCCACGCTTTGGGATGTTGCGGGTTTTGCGTGAATATGTGCTTGAGCA
GCTTGCTGAGCCAACAACCATTCAGCAATTACGCCAAAGTTTTGCCAGCTACTACCTGAATATTGCCGAAACCATTTATC
AAAAGATGCTCAACTCGCGTACCAATAGCCTTTTTCAAGAGATTGCCGCTGAGTATTATAATTTTCACACCGTTATCACA
TGGTGCCTTGAACCACCATATGATCTGGAAAATGCGATTAAGCTAATTGCGACGTTAATCGATTTTCTACATCTCTATGG
CTATCAACGCGAGGGGATTAGTTGGTTACAACACATTTTGGGCCTGATTGAACAACAAACAGTCACGCTTAGCCCAGCGA
TTCTGGCCGATGCCTATAACGCCTTAGGCTTTTTATACTACCATCAGGGCAATATTAACCAAGCCCAACACTTTTTTGAG
CGCGTATTGGAGCTTATTGGCGGCCACACATCGTTTAAACATGCACGAATTTTGTATAATTTAGGTTTAGTTAAAAAGAA
CAAAGGCGAATTTCTTCAGGCCGAGGCCGATTTACAAGCCAGTTTAGCAAGTTGGCGCACCCTTGGTTTACAGCCAGGCG
AAGCCTATTCGCTCTGGGGGTTGGGCAGTTTAGCCCTCGACCAAGGCCTCTATACTCATGGGTTAACCTACCTGCAACAA
AGCTTGGCAATTTGGCAAACGCTTGAATCAACTCATGGACAAGTGATGGTGTTAAGTGATTTGGCCGAGTTAGCCTTACT
ACAAGCCAATCCGCATGAGGCTGAGCAAATATTAGCCCAGATTAAAACGATTGTTGAGGCCAGCAATTATACAATCACAA
GTTCACGTATAGCCTTGCTCGAAGGTAAATGTGCGATGCAACGCCACGATTTTAGCCATGCCCAAACCTGCTTCGAAGAA
GCCGAGGAGATCGCTGAAGAACAGCAATCAACCGCCTATTTAGCCAAAATCCACCTCGAACAGGCTAAACTGGCTTTGGT
GCAGGCACACTATCATCAGGCCAGTTATCATGGCTATGAAGGGTTGCGCCTAGCGACCATGCTTGAACATCAGACTGGGA
TTGCCAAGGCCCACCACGTGCTGGCCCAAGTCTATCAGCAGTTGGCCAATCCGAGTCAAGCCGAGCAACATTGGCAAGCT
TATGCAGCAATTTATCAACACGTTGGTTTAGTGCCATAA
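
The GC content field is the percentage of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows; `gc_content` is an illustrative name, and the example input is only the first 24 bases of the sequence above, so its result is not the whole-gene figure of 47.81.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 24 bases of the gene sequence in this record.
example = "ATGAGTGAAAGCTTCGGTTACTGG"
print(round(gc_content(example), 2))
```

To check the whole-gene value, join the FASTA lines into one string (dropping the `>2439_bases` header and line breaks) and pass it to the same function.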

Upstream 100 bases:

>100_bases
CGTTCACCAGGCTGCGTCACCATGGTAGGTGGAGAATGTTGTGAGATCCAGCTCGATACCAAATAGCTAGCAGATAACCC
ATCACGACTAGGCACGCTAT

Downstream 100 bases:

>100_bases
ACTGCCACACCTTTGTGGGCCTAGCTTCAGCATCTGTGACCACAGATTGATCGTGCTGCCGTAGAATTCCGTTAGCATTA
TCTGCGGCAGCACGCTATCT

Product: XRE family transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 812; Mature: 811
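
The residue counts follow from the gene length: 2439 bases are 813 codons, one of which is the stop codon, leaving 812 translated residues; the mature form in this record lacks the initiator methionine, giving 811. The Python sketch below illustrates this with a compact standard genetic code table. It assumes the gene sequence is already listed in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is applied. `translate` and the variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


GENE_LENGTH = 2439                 # from the ">2439_bases" header
codons = GENE_LENGTH // 3          # includes the stop codon
translated = codons - 1            # stop codon encodes no residue
mature = translated - 1            # assumes initiator Met is removed

first_codons = "ATGAGTGAAAGCTTCGGTTACTGG"  # first 24 bases of the gene
last_codons = "GTTGGTTTAGTGCCATAA"         # last 18 bases of the gene
print(translate(first_codons), translate(last_codons), translated, mature)
```

The two fragments translate to the start (`MSESFGYW`) and end (`VGLVP`) of the protein sequence listed in this record.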

Sequences:

>Translated_812_residues
MSESFGYWLKQRRKELNFTQEYLAELVSCSTITIRKIESNERRPSRQIAARIAKFCQVEANRAFVDAAWAGQSPSPSDGG
SPPEPAPSNLLPPFSSIIGRDSAIESICVQFQAQKARLVTIVGSPGVGKTRLAQAIGQQLLTHFSDGVFWISLDPIVNAS
LVPSLITRVLGIHENPNQSIEETIFNWLKNRHLLLILDNCEHIIELRQFVNQLLSYCPTLSILATSREVLHLRWEQRFPL
RPLTVPVRGMQLDLAQLAQIPAIALFLERSRAINPQAELNASNARAISTICMQLEGLPLSIELIAARSAMLSPQMLVHRL
NNQLNVLTQGSRDLPHRQQTLRNAIQWSIDLLDSAEQFLLVALALAPESCTLLSLEALADCYSPWPWSIFDGLTNLFDKS
LIWIQQQQTDEPRFGMLRVLREYVLEQLAEPTTIQQLRQSFASYYLNIAETIYQKMLNSRTNSLFQEIAAEYYNFHTVIT
WCLEPPYDLENAIKLIATLIDFLHLYGYQREGISWLQHILGLIEQQTVTLSPAILADAYNALGFLYYHQGNINQAQHFFE
RVLELIGGHTSFKHARILYNLGLVKKNKGEFLQAEADLQASLASWRTLGLQPGEAYSLWGLGSLALDQGLYTHGLTYLQQ
SLAIWQTLESTHGQVMVLSDLAELALLQANPHEAEQILAQIKTIVEASNYTITSSRIALLEGKCAMQRHDFSHAQTCFEE
AEEIAEEQQSTAYLAKIHLEQAKLALVQAHYHQASYHGYEGLRLATMLEHQTGIAKAHHVLAQVYQQLANPSQAEQHWQA
YAAIYQHVGLVP
>Mature_811_residues
SESFGYWLKQRRKELNFTQEYLAELVSCSTITIRKIESNERRPSRQIAARIAKFCQVEANRAFVDAAWAGQSPSPSDGGS
PPEPAPSNLLPPFSSIIGRDSAIESICVQFQAQKARLVTIVGSPGVGKTRLAQAIGQQLLTHFSDGVFWISLDPIVNASL
VPSLITRVLGIHENPNQSIEETIFNWLKNRHLLLILDNCEHIIELRQFVNQLLSYCPTLSILATSREVLHLRWEQRFPLR
PLTVPVRGMQLDLAQLAQIPAIALFLERSRAINPQAELNASNARAISTICMQLEGLPLSIELIAARSAMLSPQMLVHRLN
NQLNVLTQGSRDLPHRQQTLRNAIQWSIDLLDSAEQFLLVALALAPESCTLLSLEALADCYSPWPWSIFDGLTNLFDKSL
IWIQQQQTDEPRFGMLRVLREYVLEQLAEPTTIQQLRQSFASYYLNIAETIYQKMLNSRTNSLFQEIAAEYYNFHTVITW
CLEPPYDLENAIKLIATLIDFLHLYGYQREGISWLQHILGLIEQQTVTLSPAILADAYNALGFLYYHQGNINQAQHFFER
VLELIGGHTSFKHARILYNLGLVKKNKGEFLQAEADLQASLASWRTLGLQPGEAYSLWGLGSLALDQGLYTHGLTYLQQS
LAIWQTLESTHGQVMVLSDLAELALLQANPHEAEQILAQIKTIVEASNYTITSSRIALLEGKCAMQRHDFSHAQTCFEEA
EEIAEEQQSTAYLAKIHLEQAKLALVQAHYHQASYHGYEGLRLATMLEHQTGIAKAHHVLAQVYQQLANPSQAEQHWQAY
AAIYQHVGLVP

Specific function: Unknown

COG id: COG3903

COG function: function code R; Predicted ATPase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH luxR-type DNA-binding domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000767
- InterPro:   IPR016032
- InterPro:   IPR011990
- InterPro:   IPR000792
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00196 GerE [H]

EC number: NA

Molecular weight: Translated: 91491; Mature: 91360

Theoretical pI: Translated: 6.27; Mature: 6.27

Prosite motif: PS50005 TPR; PS50293 TPR_REGION; PS50943 HTH_CROC1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
1.2 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
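
These percentages are residue counts divided by protein length. A minimal Python sketch follows; `residue_percent` is an illustrative name, and the input is only the first 30 residues of the translated protein, so the printed values differ from the whole-protein figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# First 30 residues of the translated protein in this record.
fragment = "MSESFGYWLKQRRKELNFTQEYLAELVSCS"
cys = residue_percent(fragment, "C")
met = residue_percent(fragment, "M")
cys_met = residue_percent(fragment, "CM")
print(round(cys, 1), round(met, 1), round(cys_met, 1))
```

The mature protein differs from the translated one only by the missing initiator methionine, which is consistent with the Met percentage dropping slightly while the Cys percentage stays the same.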

Secondary structure:

>Translated Secondary Structure
MSESFGYWLKQRRKELNFTQEYLAELVSCSTITIRKIESNERRPSRQIAARIAKFCQVEA
CCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHC
NRAFVDAAWAGQSPSPSDGGSPPEPAPSNLLPPFSSIIGRDSAIESICVQFQAQKARLVT
CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHCCEEEEE
IVGSPGVGKTRLAQAIGQQLLTHFSDGVFWISLDPIVNASLVPSLITRVLGIHENPNQSI
EECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCHH
EETIFNWLKNRHLLLILDNCEHIIELRQFVNQLLSYCPTLSILATSREVLHLRWEQRFPL
HHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCC
RPLTVPVRGMQLDLAQLAQIPAIALFLERSRAINPQAELNASNARAISTICMQLEGLPLS
CCCEECCCCCEECHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCHHHHHHHHHHHCCCCEE
IELIAARSAMLSPQMLVHRLNNQLNVLTQGSRDLPHRQQTLRNAIQWSIDLLDSAEQFLL
HHHHHHHHHHCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
VALALAPESCTLLSLEALADCYSPWPWSIFDGLTNLFDKSLIWIQQQQTDEPRFGMLRVL
HHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHH
REYVLEQLAEPTTIQQLRQSFASYYLNIAETIYQKMLNSRTNSLFQEIAAEYYNFHTVIT
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
WCLEPPYDLENAIKLIATLIDFLHLYGYQREGISWLQHILGLIEQQTVTLSPAILADAYN
EECCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCEEECHHHHHHHHH
ALGFLYYHQGNINQAQHFFERVLELIGGHTSFKHARILYNLGLVKKNKGEFLQAEADLQA
HHEEEEEECCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCEECCCCCEEECHHHHHH
SLASWRTLGLQPGEAYSLWGLGSLALDQGLYTHGLTYLQQSLAIWQTLESTHGQVMVLSD
HHHHHHHCCCCCCCCEEECCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEHHH
LAELALLQANPHEAEQILAQIKTIVEASNYTITSSRIALLEGKCAMQRHDFSHAQTCFEE
HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCEEECCCEEEECCCHHHHHCCCHHHHHHHHH
AEEIAEEQQSTAYLAKIHLEQAKLALVQAHYHQASYHGYEGLRLATMLEHQTGIAKAHHV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
LAQVYQQLANPSQAEQHWQAYAAIYQHVGLVP
HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure 
SESFGYWLKQRRKELNFTQEYLAELVSCSTITIRKIESNERRPSRQIAARIAKFCQVEA
CCCHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHC
NRAFVDAAWAGQSPSPSDGGSPPEPAPSNLLPPFSSIIGRDSAIESICVQFQAQKARLVT
CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHCCEEEEE
IVGSPGVGKTRLAQAIGQQLLTHFSDGVFWISLDPIVNASLVPSLITRVLGIHENPNQSI
EECCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCHH
EETIFNWLKNRHLLLILDNCEHIIELRQFVNQLLSYCPTLSILATSREVLHLRWEQRFPL
HHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCC
RPLTVPVRGMQLDLAQLAQIPAIALFLERSRAINPQAELNASNARAISTICMQLEGLPLS
CCCEECCCCCEECHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCHHHHHHHHHHHCCCCEE
IELIAARSAMLSPQMLVHRLNNQLNVLTQGSRDLPHRQQTLRNAIQWSIDLLDSAEQFLL
HHHHHHHHHHCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
VALALAPESCTLLSLEALADCYSPWPWSIFDGLTNLFDKSLIWIQQQQTDEPRFGMLRVL
HHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHH
REYVLEQLAEPTTIQQLRQSFASYYLNIAETIYQKMLNSRTNSLFQEIAAEYYNFHTVIT
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
WCLEPPYDLENAIKLIATLIDFLHLYGYQREGISWLQHILGLIEQQTVTLSPAILADAYN
EECCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCEEECHHHHHHHHH
ALGFLYYHQGNINQAQHFFERVLELIGGHTSFKHARILYNLGLVKKNKGEFLQAEADLQA
HHEEEEEECCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCEECCCCCEEECHHHHHH
SLASWRTLGLQPGEAYSLWGLGSLALDQGLYTHGLTYLQQSLAIWQTLESTHGQVMVLSD
HHHHHHHCCCCCCCCEEECCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEHHH
LAELALLQANPHEAEQILAQIKTIVEASNYTITSSRIALLEGKCAMQRHDFSHAQTCFEE
HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCEEECCCEEEECCCHHHHHCCCHHHHHHHHH
AEEIAEEQQSTAYLAKIHLEQAKLALVQAHYHQASYHGYEGLRLATMLEHQTGIAKAHHV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH
LAQVYQQLANPSQAEQHWQAYAAIYQHVGLVP
HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC
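
Each secondary-structure block pairs a line of residues with a line of per-residue state letters. The record does not define the letters; the sketch below assumes the common convention of H for helix, E for strand, and C for coil, and summarises a state string as percentages. `ss_composition` is an illustrative name and the input string is a made-up toy example, not data from this record.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states.upper())
    # Assumption: H = helix, E = strand, C = coil.
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


toy = "CCHHHHEECC"  # hypothetical 10-residue prediction
print(ss_composition(toy))
```

To summarise the prediction in this record, concatenate the state lines (every second line of a block) into one string before calling the function.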

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036 [H]