Definition: Haemophilus influenzae PittGG chromosome, complete genome.
Accession: NC_009567
Length: 1,887,192

The map label for this gene is rep [H]

Identifier: 148827827

GI number: 148827827

Start: 1190377

End: 1192389

Strand: Reverse

Name: rep [H]

Synonym: CGSHiGG_06525

Alternate gene names: 148827827

Gene position: 1192389-1190377 (Counterclockwise)
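
The Start and End fields can be cross-checked against the ">2013_bases" header and the 670-residue protein length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but it is the only reading that reproduces 2013 bases); the names `gene_length`, `START` and `END` are illustrative.

```python
# Coordinates copied from the Start/End fields of this record.
# Assumption: 1-based, inclusive coordinates.
START = 1190377
END = 1192389


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate interval."""
    return abs(end - start) + 1


length = gene_length(START, END)
codons = length // 3      # total codons, including the terminal stop codon
residues = codons - 1     # amino acids, excluding the stop codon

print(length, codons, residues)  # 2013 671 670
```

The result agrees with the gene sequence header (2013 bases) and with the "Number of amino acids" field (670).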

Preceding gene: 148827828

Following gene: 148827826

Centisome position: 63.18
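
The centisome value expresses the gene's location as a percentage of the chromosome length. The sketch below assumes it is computed from the first coordinate of the "Gene position" field (1192389, the start of this reverse-strand gene) divided by the genome length from the header; the record does not document the formula, but this choice reproduces 63.18, whereas the other coordinate (1190377) would give 63.08. The function name `centisome` is illustrative.

```python
GENOME_LENGTH = 1_887_192      # "Length" field of NC_009567
GENE_FIRST_COORD = 1_192_389   # first coordinate in the "Gene position" field


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


value = centisome(GENE_FIRST_COORD, GENOME_LENGTH)
print(f"{value:.2f}")  # 63.18
```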

GC content: 40.64
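
GC content is the percentage of G and C bases in the gene sequence. As a hedged illustration of the calculation (not necessarily the exact procedure used to produce this record), the helper below is applied only to the final 13-base line of the gene sequence, so its output is not the 40.64 reported for the full 2013-base gene. The name `gc_percent` is illustrative.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence in this record (13 bases, ends in the TAA stop codon).
fragment = "TTTGAAAAGTTAA"
print(f"{gc_percent(fragment):.2f}")  # 15.38
```

Applying the same function to the complete sequence, with line breaks removed, should give the value in the "GC content" field if the record uses this definition.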

Gene sequence:

>2013_bases
ATGAAACTCAATCCTCAACAACAACAAGCCGTTGAATATGTAACAGGCCCTTGTCTTGTGCTTGCTGGCGCAGGCTCTGG
CAAAACTCGCGTAATTATTAATAAAATCGCCCATTTAATTGAAAAGTGCGGGTATTCTCCGAAACAAATTGCTGCCGTCA
CTTTTACGAATAAAGCCGCACGCGAGATGAAAGAGCGTGTAGCACATTCCATTGGCAAAGAGCAATCCAAAGGCTTACTT
GTTTCCACTTTTCATACGCTTGGTTTTGACATTATTAAGCGTGAATATAAAGCGTTGGGCTTTAAATCAAATATGACTTT
ATTTGATGAACATGATCAATTTGCGTTGTTAAAAGAGCTAACCGCTGATGTGTTAAAAGAAGATAAGGATTTATTGCGTG
AGTTAATTTCAGTGATTTCTAACTGGAAGAACGATTTGATTTCGCCAAAACAGGCGTTTGCGTTGGCACGTGATGCTAAA
TATCAAACTTTCGCAAAATGTTATGAGCGTTACGCCACACAAATTCGATCTTACAACGCCCTAGATTTTGATGATTTGAT
TATGCTGCCGACGTTGTTGTTCAAGCAAAATGAAGAAGTGCGGTCAAAATGGCAGGCAAAAATTCGTTATTTGTTGGTGG
ATGAATATCAAGATACCAATACCAGTCAATATGAGCTGATTAAACTTTTAGTGGGGGAGCGTGCATGTTTCACTGTGGTG
GGCGATGATGACCAATCTATTTATTCATGGCGTGGCGCACGACCAGAAAATATGGTGCGTTTACGCGATGATTTCCCTCG
TTTGAACGTGATTAAGCTAGAGCAAAATTATCGTTCAACCCAACGCATTCTGCATTGTGCCAATATCTTGATTGATAACA
ATGACCACGTGTTTGATAAGAAACTCTTTTCAACCATTGGCGAAGGGGAAAAATTGCTTATTATCGAAGCAAAAAATGAA
GAACACGAAGCAGAACGGATTGTCGCTGAGTTGATCGCCCATCGTTTTAGCCGTAAAACCAAATATAAAGATTATGCGAT
TTTATATCGAGGCAATCATCAATCCCGATTACTCGAAAAAGTACTGATGCAAAACCGTATTCCTTATAAAATTTCGGGCG
GTACTTCTTTTTTCTCCCGTGCAGAAATTAAAGATATGATGGCGTATTTACGCTTGGTGGTGAATCAAGATGATGACGCC
GCATTCCTACGTATTGTGAATACCCCGAAACGTGAAATTGGCACCGCAACGTTACAAAAACTTGGAGAGTTGGCTCAAGA
AAAACACATCAGTTTGTTTGAGGCTATTTTTGAGTTTGCGCTTATTCAACGCATCACGCCAAAAGCCTATGATTCATTGC
AAAAATTTGGCCGTTGGATTGTAGAACTTAATGATGAAATTCAACGTTCTGAACCAGAACGAGCGGTACGTTCAATGTTA
TCCGCAATTCATTATGAAGAATATTTGTACGAATATGCAACAAGTCCTAAAGCGGCAGAAATGCAAAGTAAGAATGTTGC
CACGCTATTTGACTGGGTTGCGGATATGTTAAAAGGCGATGAAACCAATGAGCCGATGAACCTTAATCAAGTAGTAACCC
GCCTGACATTACGCGATATGTTGGAGCGAGGCGAAGACGATGACGACAGCGATCAAGTTCAACTGATGACATTGCACGCA
TCTAAAGGATTGGAATTTCCTTATGTTTATTTGATTGGTATGGAAGAGGGCATTTTGCCCCACCAAACCAGCATTGATGA
AGACAACGTGGAAGAAGAACGCCGCTTGGCTTATGTGGGTATCACAAGAGCACAAAAAGAACTCACTTTTTCCTTGTGTC
GTGAGCGTCGTCAATATGGCGAATTAGTTCGCCCAGAACCAAGCCGATTTTTAGCTGAATTACCTAATGACGATGTGCTA
TGGGAACGCGATAAACCAAAACTCACCACCGAGCAAAAACAAGAAAAAACACAAAACCAACTTGATAGATTGAGGGCGAT
TTTGAAAAGTTAA

Upstream 100 bases:

>100_bases
TTGAAATGGCCCAACAATGGGGATTGCCAATCTTAGATTTGCCCCTCTCATTTTTATTAGACACCGTGTTACTGCCCTAT
GCGTGGGCTCAATAATTTTT

Downstream 100 bases:

>100_bases
TAAAAATCGGCAGGGAAATTCTCTGCCGATTTTTTATTTTAAAAGTGCGGTCATTTTTCGTTTGATTTCCACCAATTTTT
TACCGAATTTTCAATTCCAG

Product: ATP-dependent DNA helicase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 670; Mature: 670

Protein sequence:

>670_residues
MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAAREMKERVAHSIGKEQSKGLL
VSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKELTADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAK
YQTFAKCYERYATQIRSYNALDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV
GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDKKLFSTIGEGEKLLIIEAKNE
EHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEKVLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDA
AFLRIVNTPKREIGTATLQKLGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML
SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDMLERGEDDDDSDQVQLMTLHA
SKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVGITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVL
WERDKPKLTTEQKQEKTQNQLDRLRAILKS

Specific function: Rep helicase is a single-stranded DNA-dependent ATPase involved in DNA replication; it can initiate unwinding at a nick in the DNA. It binds to the single-stranded DNA and acts in a progressive fashion along the DNA in the 3' to 5' direction [H]

COG id: COG0210

COG function: function code L; Superfamily I DNA and RNA helicases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 uvrD-like helicase C-terminal domain [H]

Homologues:

Organism=Escherichia coli, GI48994965, Length=670, Percent_Identity=67.0149253731343, Blast_Score=940, Evalue=0.0,
Organism=Escherichia coli, GI2367296, Length=643, Percent_Identity=39.0357698289269, Blast_Score=417, Evalue=1e-117,
Organism=Escherichia coli, GI1787196, Length=348, Percent_Identity=27.8735632183908, Blast_Score=101, Evalue=2e-22,
Organism=Saccharomyces cerevisiae, GI6322369, Length=733, Percent_Identity=27.012278308322, Blast_Score=184, Evalue=5e-47,
Organism=Saccharomyces cerevisiae, GI6324477, Length=679, Percent_Identity=23.5640648011782, Blast_Score=97, Evalue=1e-20,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005752
- InterPro:   IPR013986
- InterPro:   IPR014017
- InterPro:   IPR000212
- InterPro:   IPR014016 [H]

Pfam domain/function: PF00580 UvrD-helicase [H]

EC number: 3.6.4.12 [H]

Molecular weight: Translated: 77690; Mature: 77690

Theoretical pI: Translated: 6.47; Mature: 6.47

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
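
The Cys/Met percentages above are residue counts divided by protein length. A minimal sketch of that calculation follows, assuming a simple character count over the one-letter sequence; it is applied only to the first 60 residues of the protein (the first row of the secondary-structure listing), so the numbers differ from the full-length values in this record. The name `residue_percent` is illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return 100.0 * hits / len(protein)


# First 60 residues of the protein sequence in this record.
n_term = "MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAA"

print(f"{residue_percent(n_term, 'C'):.1f} %Cys")       # 3.3 %Cys
print(f"{residue_percent(n_term, 'M'):.1f} %Met")       # 1.7 %Met
print(f"{residue_percent(n_term, 'CM'):.1f} %Cys+Met")  # 5.0 %Cys+Met
```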

Secondary structure:

>Translated Secondary Structure
MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAA
CCCCCHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCHHHEEEEEECCHHH
REMKERVAHSIGKEQSKGLLVSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKEL
HHHHHHHHHHHCCHHCCCEEEEHHHHHHHHHHHHHHHHCCCCCCCCEECCCHHHHHHHHH
TADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAKYQTFAKCYERYATQIRSYNA
HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCC
LDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV
CCHHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHHHCCCEEEEEE
GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDK
CCCCCHHHHCCCCCCHHHEEHHCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCCCHHHH
KLFSTIGEGEKLLIIEAKNEEHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEK
HHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHH
VLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDAAFLRIVNTPKREIGTATLQK
HHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHH
LGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML
HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHH
SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDM
HHHHHHHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH
LERGEDDDDSDQVQLMTLHASKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVG
HHCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHH
ITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVLWERDKPKLTTEQKQEKTQNQ
HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHH
LDRLRAILKS
HHHHHHHHCC
>Mature Secondary Structure
MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAA
CCCCCHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCHHHEEEEEECCHHH
REMKERVAHSIGKEQSKGLLVSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKEL
HHHHHHHHHHHCCHHCCCEEEEHHHHHHHHHHHHHHHHCCCCCCCCEECCCHHHHHHHHH
TADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAKYQTFAKCYERYATQIRSYNA
HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCC
LDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV
CCHHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHHHCCCEEEEEE
GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDK
CCCCCHHHHCCCCCCHHHEEHHCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCCCHHHH
KLFSTIGEGEKLLIIEAKNEEHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEK
HHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHH
VLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDAAFLRIVNTPKREIGTATLQK
HHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHH
LGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML
HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHH
SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDM
HHHHHHHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH
LERGEDDDDSDQVQLMTLHASKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVG
HHCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHH
ITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVLWERDKPKLTTEQKQEKTQNQ
HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHH
LDRLRAILKS
HHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7542800 [H]