| Definition | Haemophilus influenzae Rd KW20 chromosome, complete genome. |
|---|---|
| Accession | NC_000907 |
| Length | 1,830,138 bp |
The map label for this gene is yfgE [C]
Identifier: 16273145
GI number: 16273145
Start: 1294919
End: 1295614
Strand: Reverse
Name: yfgE [C]
Synonym: HI1225.1
Alternate gene names: 16273145
Gene position: 1295614-1294919 (Counterclockwise)
Preceding gene: 16273146
Following gene: 16273137
Centisome position: 70.79
GC content: 33.48
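The Start, End, Centisome position and GC content fields above are related by simple arithmetic. The sketch below is a minimal illustration, not the database's own code: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the centisome is computed from the higher coordinate (the 5' end of a reverse-strand gene). That assumption reproduces the reported 70.79, whereas the lower coordinate would give about 70.76.

```python
GENOME_LENGTH = 1_830_138          # Length field of NC_000907
START, END = 1_294_919, 1_295_614  # Start and End fields (assumed 1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100.0 * position / genome_length, 2)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


print(gene_length(START, END))        # 696, matching the ">696_bases" header
print(centisome(END, GENOME_LENGTH))  # 70.79, matching the Centisome position field
```

`gc_percent` can be applied to the gene sequence listed below to check the GC content field.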
Gene sequence:
>696_bases TTGAATAAGCAACTCCCCTTACCTATTCATCAAATTGATGATGCTACATTAGAGAACTTTTACGGCGATAATAATCTTTT ATTGCTCGATTCTTTACGCAAAAATTCATCTGATTTAAAACAACCGTTTTTCTACATTTGGGGAGATAAAGGCTCTGGTA AAACCCATTTACTTAGAGCATTCAGTAACGAATATTTAATCAATCAACGTACTGCTATTTATGTACCCCTTAGCAAATCT CAATATTTTTCTACCGCAGTTCTTGAAAATTTAGAGCAACAAGAATTAGTATGCTTAGATGATTTACAAAGTGTAATCGG AAATGATGAATGGGAATTAGCTATTTTTGATCTGTTTAATCGAATTAAAGCAAGCGGGAAAACGCTTTTACTAATTAGTG CAGATAAATCGCCTTCCGCACTTTCTGTCAAATTGCCTGATTTAAACTCTCGCTTAACTTGGGGGGAAATTTATCAATTA AATTCATTAACAGACGAACAAAAAATCAAAGTACTACAACTCGCAGCTTACCAACGAGGATTCCAATTATCTGACGAAAC TGCAAATTTTTTAATTACACGACTAGCACGAGATATGCATACTCTATTTGAAGCACTTGATCTACTAGATAAAGCATCAT TACAAGCTCAACGAAATCTTACGATTCCTTTCGTCAAAAAAATTTTGAATCTCTAA
Upstream 100 bases:
>100_bases TTGTTTTACCGAAAGCAAAAAATGAAGTAGAATAAGCCCCATTTCAAAGGGTTAAATTATCATTTAGCCCTTTCTTACTT AATCTGTACTACATAAACAT
Downstream 100 bases:
>100_bases CAAAAAAGGCACTAATGCGCCTTTTTAATCATCCACCTGATAATTTAACTTTAAACCCTTTTTGCTCTAATAATTGCTTG AGCAAATCTCGTTTTTCCCC
Product: DNA replication initiation factor
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 231; Mature: 231
Protein sequence:
>231_residues MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRAFSNEYLINQRTAIYVPLSKS QYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFNRIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQL NSLTDEQKIKVLQLAAYQRGFQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL
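The 231-residue protein follows from the 696-base gene: 232 codons, the last of which is a stop. The sketch below is a minimal illustration of that translation, assuming the bacterial genetic code (NCBI translation table 11), in which an alternative start codon such as TTG is read as methionine at the first position. `translate_cds` and its `is_start` parameter are hypothetical names used for illustration only.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    If is_start is True, a recognised start codon at position 0 is read as 'M'.
    """
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and is_start and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the gene sequence listed above.
print(translate_cds("TTGAATAAGCAACTCCCCTTACCT"))  # MNKQLPLP
```

The last nine bases of the gene, `AATCTCTAA`, translate to `NL` followed by a stop, which agrees with the C-terminal `...ILNL` of the protein.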
Sequences:
>Translated_231_residues MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRAFSNEYLINQRTAIYVPLSKS QYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFNRIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQL NSLTDEQKIKVLQLAAYQRGFQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL
>Mature_231_residues MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRAFSNEYLINQRTAIYVPLSKS QYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFNRIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQL NSLTDEQKIKVLQLAAYQRGFQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL
Specific function: Unknown
COG id: COG0593
COG function: function code L; ATPase involved in DNA replication initiation
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the DnaA family. HdA subfamily.
Homologues:
Organism=Escherichia coli, GI226510964, Length=228, Percent_Identity=46.9298245614035, Blast_Score=214, Evalue=3e-57
Organism=Escherichia coli, GI2367267, Length=229, Percent_Identity=25.3275109170306, Blast_Score=62, Evalue=4e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y122B_HAEIN (O86235)
Other databases:
- EMBL: L42023
- RefSeq: NP_439382.1
- ProteinModelPortal: O86235
- SMR: O86235
- GeneID: 950792
- GenomeReviews: L42023_GR
- KEGG: hin:HI1225.1
- NMPDR: fig|71421.1.peg.1172
- TIGR: HI_1225.1
- HOGENOM: HBG508398
- OMA: TRGTRSM
- ProtClustDB: PRK06893
- BioCyc: HINF71421:HI_1225.1-MONOMER
- InterPro: IPR003593
- InterPro: IPR020591
- InterPro: IPR017788
- InterPro: IPR013317
- PRINTS: PR00051
- SMART: SM00382
- TIGRFAMs: TIGR03420
Pfam domain/function: PF00308 Bac_DnaA
EC number: NA
Molecular weight: Translated: 26367; Mature: 26367
Theoretical pI: Translated: 5.34; Mature: 5.34
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
0.9 %Met (Mature Protein)
1.3 %Cys+Met (Mature Protein)
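The Cys/Met percentages above can be checked directly against the protein sequence in this record, which contains one cysteine and two methionines among its 231 residues. The sketch below is a minimal illustration; `residue_percent` is a hypothetical helper, and rounding to one decimal place is an assumption chosen to match the reported values.

```python
from collections import Counter

# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRAFSNEYLINQRTAIYVPLSKS"
    "QYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFNRIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQL"
    "NSLTDEQKIKVLQLAAYQRGFQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    counts = Counter(protein)
    total = sum(counts[r] for r in residues)
    return round(100.0 * total / len(protein), 1)


print(len(PROTEIN))                    # 231
print(residue_percent(PROTEIN, "C"))   # 0.4
print(residue_percent(PROTEIN, "M"))   # 0.9
print(residue_percent(PROTEIN, "CM"))  # 1.3
```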
Secondary structure:
>Translated Secondary Structure
MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRA
CCCCCCCCHHHCCCHHHHHHCCCCCEEEEECCCCCCCHHCCCEEEEECCCCCCHHHHHHH
FSNEYLINQRTAIYVPLSKSQYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFN
HCCCEEEECCEEEEEECCCCHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCEEHHHHHHH
RIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQLNSLTDEQKIKVLQLAAYQRG
HHHCCCCEEEEEECCCCCCEEEEECCCCCCCCCHHHEEEECCCCCHHHHHHHHHHHHHCC
FQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCC
>Mature Secondary Structure
MNKQLPLPIHQIDDATLENFYGDNNLLLLDSLRKNSSDLKQPFFYIWGDKGSGKTHLLRA
CCCCCCCCHHHCCCHHHHHHCCCCCEEEEECCCCCCCHHCCCEEEEECCCCCCHHHHHHH
FSNEYLINQRTAIYVPLSKSQYFSTAVLENLEQQELVCLDDLQSVIGNDEWELAIFDLFN
HCCCEEEECCEEEEEECCCCHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCEEHHHHHHH
RIKASGKTLLLISADKSPSALSVKLPDLNSRLTWGEIYQLNSLTDEQKIKVLQLAAYQRG
HHHCCCCEEEEEECCCCCCEEEEECCCCCCCCCHHHEEEECCCCCHHHHHHHHHHHHHCC
FQLSDETANFLITRLARDMHTLFEALDLLDKASLQAQRNLTIPFVKKILNL
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7542800