Definition | Haemophilus influenzae PittGG chromosome, complete genome. |
---|---|
Accession | NC_009567 |
Length | 1,887,192 |
The map label for this gene is rep [H]
Identifier: 148827827
GI number: 148827827
Start: 1190377
End: 1192389
Strand: Reverse
Name: rep [H]
Synonym: CGSHiGG_06525
Alternate gene names: 148827827
Gene position: 1192389-1190377 (Counterclockwise)
Preceding gene: 148827828
Following gene: 148827826
Centisome position: 63.18
GC content: 40.64
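The coordinate fields above are related by simple arithmetic, and the following minimal sketch reproduces them. It assumes that Start and End are 1-based inclusive coordinates, and that the centisome position is the gene's first base in the direction of transcription (here the End coordinate, since the gene is on the reverse strand) expressed as a percentage of the chromosome length. That second assumption is inferred from the numbers, not stated in the record. The function names are illustrative only.

```python
GENOME_LENGTH = 1_887_192          # "Length" field of NC_009567
START, END = 1_190_377, 1_192_389  # "Start" and "End" fields


def gene_length(start, end):
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the chromosome length, two decimals."""
    return round(100 * position / genome_length, 2)


n_bases = gene_length(START, END)
n_codons = n_bases // 3
n_residues = n_codons - 1  # the final codon is the stop codon

print(n_bases, n_residues, centisome(END, GENOME_LENGTH))
```

Under these assumptions the values agree with the 2013-base gene sequence, the 670-residue protein, and the listed centisome position.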
Gene sequence:
>2013_bases ATGAAACTCAATCCTCAACAACAACAAGCCGTTGAATATGTAACAGGCCCTTGTCTTGTGCTTGCTGGCGCAGGCTCTGG CAAAACTCGCGTAATTATTAATAAAATCGCCCATTTAATTGAAAAGTGCGGGTATTCTCCGAAACAAATTGCTGCCGTCA CTTTTACGAATAAAGCCGCACGCGAGATGAAAGAGCGTGTAGCACATTCCATTGGCAAAGAGCAATCCAAAGGCTTACTT GTTTCCACTTTTCATACGCTTGGTTTTGACATTATTAAGCGTGAATATAAAGCGTTGGGCTTTAAATCAAATATGACTTT ATTTGATGAACATGATCAATTTGCGTTGTTAAAAGAGCTAACCGCTGATGTGTTAAAAGAAGATAAGGATTTATTGCGTG AGTTAATTTCAGTGATTTCTAACTGGAAGAACGATTTGATTTCGCCAAAACAGGCGTTTGCGTTGGCACGTGATGCTAAA TATCAAACTTTCGCAAAATGTTATGAGCGTTACGCCACACAAATTCGATCTTACAACGCCCTAGATTTTGATGATTTGAT TATGCTGCCGACGTTGTTGTTCAAGCAAAATGAAGAAGTGCGGTCAAAATGGCAGGCAAAAATTCGTTATTTGTTGGTGG ATGAATATCAAGATACCAATACCAGTCAATATGAGCTGATTAAACTTTTAGTGGGGGAGCGTGCATGTTTCACTGTGGTG GGCGATGATGACCAATCTATTTATTCATGGCGTGGCGCACGACCAGAAAATATGGTGCGTTTACGCGATGATTTCCCTCG TTTGAACGTGATTAAGCTAGAGCAAAATTATCGTTCAACCCAACGCATTCTGCATTGTGCCAATATCTTGATTGATAACA ATGACCACGTGTTTGATAAGAAACTCTTTTCAACCATTGGCGAAGGGGAAAAATTGCTTATTATCGAAGCAAAAAATGAA GAACACGAAGCAGAACGGATTGTCGCTGAGTTGATCGCCCATCGTTTTAGCCGTAAAACCAAATATAAAGATTATGCGAT TTTATATCGAGGCAATCATCAATCCCGATTACTCGAAAAAGTACTGATGCAAAACCGTATTCCTTATAAAATTTCGGGCG GTACTTCTTTTTTCTCCCGTGCAGAAATTAAAGATATGATGGCGTATTTACGCTTGGTGGTGAATCAAGATGATGACGCC GCATTCCTACGTATTGTGAATACCCCGAAACGTGAAATTGGCACCGCAACGTTACAAAAACTTGGAGAGTTGGCTCAAGA AAAACACATCAGTTTGTTTGAGGCTATTTTTGAGTTTGCGCTTATTCAACGCATCACGCCAAAAGCCTATGATTCATTGC AAAAATTTGGCCGTTGGATTGTAGAACTTAATGATGAAATTCAACGTTCTGAACCAGAACGAGCGGTACGTTCAATGTTA TCCGCAATTCATTATGAAGAATATTTGTACGAATATGCAACAAGTCCTAAAGCGGCAGAAATGCAAAGTAAGAATGTTGC CACGCTATTTGACTGGGTTGCGGATATGTTAAAAGGCGATGAAACCAATGAGCCGATGAACCTTAATCAAGTAGTAACCC GCCTGACATTACGCGATATGTTGGAGCGAGGCGAAGACGATGACGACAGCGATCAAGTTCAACTGATGACATTGCACGCA TCTAAAGGATTGGAATTTCCTTATGTTTATTTGATTGGTATGGAAGAGGGCATTTTGCCCCACCAAACCAGCATTGATGA AGACAACGTGGAAGAAGAACGCCGCTTGGCTTATGTGGGTATCACAAGAGCACAAAAAGAACTCACTTTTTCCTTGTGTC GTGAGCGTCGTCAATATGGCGAATTAGTTCGCCCAGAACCAAGCCGATTTTTAGCTGAATTACCTAATGACGATGTGCTA 
TGGGAACGCGATAAACCAAAACTCACCACCGAGCAAAAACAAGAAAAAACACAAAACCAACTTGATAGATTGAGGGCGAT TTTGAAAAGTTAA
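Because the gene lies on the reverse strand, the sequence above reads as the reverse complement of the chromosome's forward strand over the same interval. The sketch below shows one way a GC percentage and a reverse complement could be computed for a sequence pasted from this record. The helper names are hypothetical, and the record does not say how its GC content value was derived.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove the spaces and line breaks used to wrap the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases, rounded to two decimals."""
    seq = clean(seq)
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


def reverse_complement(seq):
    """Sequence of the opposite strand, read 5' to 3'."""
    return clean(seq).translate(_COMPLEMENT)[::-1]
```

GC content is the same for a sequence and its reverse complement, so the strand does not affect that figure.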
Upstream 100 bases:
>100_bases TTGAAATGGCCCAACAATGGGGATTGCCAATCTTAGATTTGCCCCTCTCATTTTTATTAGACACCGTGTTACTGCCCTAT GCGTGGGCTCAATAATTTTT
Downstream 100 bases:
>100_bases TAAAAATCGGCAGGGAAATTCTCTGCCGATTTTTTATTTTAAAAGTGCGGTCATTTTTCGTTTGATTTCCACCAATTTTT TACCGAATTTTCAATTCCAG
Product: ATP-dependent DNA helicase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 670; Mature: 670
Protein sequence:
>670_residues MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAAREMKERVAHSIGKEQSKGLL VSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKELTADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAK YQTFAKCYERYATQIRSYNALDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDKKLFSTIGEGEKLLIIEAKNE EHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEKVLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDA AFLRIVNTPKREIGTATLQKLGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDMLERGEDDDDSDQVQLMTLHA SKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVGITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVL WERDKPKLTTEQKQEKTQNQLDRLRAILKS
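The protein sequence corresponds to the gene sequence read codon by codon up to the stop codon. As a minimal sketch, the translation below uses the standard genetic code, which is assumed here to be adequate for this gene; the record itself does not name a translation table. `translate` is an illustrative helper, not part of any tool referenced by the record.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate("ATGAAACTCAATCCTCAACAACAACAAGCC"))
```

The first ten codons give MKLNPQQQQA, matching the start of the protein sequence, and the final three codons (AAA AGT TAA) give KS followed by the stop.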
Sequences:
>Translated_670_residues MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAAREMKERVAHSIGKEQSKGLL VSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKELTADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAK YQTFAKCYERYATQIRSYNALDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDKKLFSTIGEGEKLLIIEAKNE EHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEKVLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDA AFLRIVNTPKREIGTATLQKLGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDMLERGEDDDDSDQVQLMTLHA SKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVGITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVL WERDKPKLTTEQKQEKTQNQLDRLRAILKS >Mature_670_residues MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAAREMKERVAHSIGKEQSKGLL VSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKELTADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAK YQTFAKCYERYATQIRSYNALDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDKKLFSTIGEGEKLLIIEAKNE EHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEKVLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDA AFLRIVNTPKREIGTATLQKLGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDMLERGEDDDDSDQVQLMTLHA SKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVGITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVL WERDKPKLTTEQKQEKTQNQLDRLRAILKS
Specific function: Rep helicase is a single-stranded DNA-dependent ATPase involved in DNA replication; it can initiate unwinding at a nick in the DNA. It binds to single-stranded DNA and acts in a processive fashion along the DNA in the 3' to 5' direction. [H]
COG id: COG0210
COG function: function code L; Superfamily I DNA and RNA helicases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 uvrD-like helicase C-terminal domain [H]
Homologues:
- Organism=Escherichia coli, GI48994965, Length=670, Percent_Identity=67.0149253731343, Blast_Score=940, Evalue=0.0
- Organism=Escherichia coli, GI2367296, Length=643, Percent_Identity=39.0357698289269, Blast_Score=417, Evalue=1e-117
- Organism=Escherichia coli, GI1787196, Length=348, Percent_Identity=27.8735632183908, Blast_Score=101, Evalue=2e-22
- Organism=Saccharomyces cerevisiae, GI6322369, Length=733, Percent_Identity=27.012278308322, Blast_Score=184, Evalue=5e-47
- Organism=Saccharomyces cerevisiae, GI6324477, Length=679, Percent_Identity=23.5640648011782, Blast_Score=97, Evalue=1e-20
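The long decimal Percent_Identity values look like unrounded ratios. As a hedged illustration, the sketch below assumes each value is the number of identical positions divided by the listed Length, times 100. The match counts used (449, 251, 97) are inferred from the percentages and do not appear in the record.

```python
def percent_identity(identical_positions, aligned_length):
    """Identical positions as a percentage of the aligned length."""
    return 100 * identical_positions / aligned_length


# (inferred identical positions, Length from the record, listed Percent_Identity)
examples = [
    (449, 670, 67.0149253731343),
    (251, 643, 39.0357698289269),
    (97, 348, 27.8735632183908),
]

for matches, length, listed in examples:
    computed = percent_identity(matches, length)
    print(f"{matches}/{length} = {computed:.4f} (listed {listed:.4f})")
```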
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR005752
- InterPro: IPR013986
- InterPro: IPR014017
- InterPro: IPR000212
- InterPro: IPR014016 [H]
Pfam domain/function: PF00580 UvrD-helicase [H]
EC number: 3.6.4.12 [H]
Molecular weight: Translated: 77690; Mature: 77690
Theoretical pI: Translated: 6.47; Mature: 6.47
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.9 %Cys (Translated Protein)
- 2.2 %Met (Translated Protein)
- 3.1 %Cys+Met (Translated Protein)
- 0.9 %Cys (Mature Protein)
- 2.2 %Met (Mature Protein)
- 3.1 %Cys+Met (Mature Protein)
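These percentages are residue counts divided by the protein length. The sketch below shows how such a figure could be computed from a pasted protein sequence. `residue_percent` is a hypothetical helper, and rounding to one decimal is an assumption based on how the values are displayed.

```python
def residue_percent(protein, residues):
    """Percentage of positions occupied by any of the given residues."""
    protein = "".join(protein.split()).upper()
    hits = sum(1 for amino_acid in protein if amino_acid in residues)
    return round(100 * hits / len(protein), 1)


# Toy sequence: 1 Cys and 1 Met in 10 residues.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

Applied to the 670-residue sequence in this record, the same function would be called with "C", "M", and "CM" to obtain the three values.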
Secondary structure:
>Translated Secondary Structure MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAA CCCCCHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCHHHEEEEEECCHHH REMKERVAHSIGKEQSKGLLVSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKEL HHHHHHHHHHHCCHHCCCEEEEHHHHHHHHHHHHHHHHCCCCCCCCEECCCHHHHHHHHH TADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAKYQTFAKCYERYATQIRSYNA HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCC LDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV CCHHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHHHCCCEEEEEE GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDK CCCCCHHHHCCCCCCHHHEEHHCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCCCHHHH KLFSTIGEGEKLLIIEAKNEEHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEK HHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHH VLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDAAFLRIVNTPKREIGTATLQK HHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHH LGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHH SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDM HHHHHHHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH LERGEDDDDSDQVQLMTLHASKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVG HHCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHH ITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVLWERDKPKLTTEQKQEKTQNQ HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHH LDRLRAILKS HHHHHHHHCC >Mature Secondary Structure MKLNPQQQQAVEYVTGPCLVLAGAGSGKTRVIINKIAHLIEKCGYSPKQIAAVTFTNKAA CCCCCHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCHHHEEEEEECCHHH REMKERVAHSIGKEQSKGLLVSTFHTLGFDIIKREYKALGFKSNMTLFDEHDQFALLKEL HHHHHHHHHHHCCHHCCCEEEEHHHHHHHHHHHHHHHHCCCCCCCCEECCCHHHHHHHHH TADVLKEDKDLLRELISVISNWKNDLISPKQAFALARDAKYQTFAKCYERYATQIRSYNA HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCC LDFDDLIMLPTLLFKQNEEVRSKWQAKIRYLLVDEYQDTNTSQYELIKLLVGERACFTVV CCHHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEECCCCCCCHHHHHHHHHHCCCEEEEEE GDDDQSIYSWRGARPENMVRLRDDFPRLNVIKLEQNYRSTQRILHCANILIDNNDHVFDK 
CCCCCHHHHCCCCCCHHHEEHHCCCCCCEEEEEHHHHHHHHHHHHHHHHHCCCCCCHHHH KLFSTIGEGEKLLIIEAKNEEHEAERIVAELIAHRFSRKTKYKDYAILYRGNHQSRLLEK HHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHH VLMQNRIPYKISGGTSFFSRAEIKDMMAYLRLVVNQDDDAAFLRIVNTPKREIGTATLQK HHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHH LGELAQEKHISLFEAIFEFALIQRITPKAYDSLQKFGRWIVELNDEIQRSEPERAVRSML HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHH SAIHYEEYLYEYATSPKAAEMQSKNVATLFDWVADMLKGDETNEPMNLNQVVTRLTLRDM HHHHHHHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH LERGEDDDDSDQVQLMTLHASKGLEFPYVYLIGMEEGILPHQTSIDEDNVEEERRLAYVG HHCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHH ITRAQKELTFSLCRERRQYGELVRPEPSRFLAELPNDDVLWERDKPKLTTEQKQEKTQNQ HHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCEECCCCCCCCCHHHHHHHHHH LDRLRAILKS HHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7542800 [H]