Definition Haemophilus influenzae PittGG chromosome, complete genome.
Accession NC_009567
Length 1,887,192


The map label for this gene is yheS [C]

Identifier: 148827837

GI number: 148827837

Start: 1198544

End: 1200460

Strand: Reverse

Name: yheS [C]

Synonym: CGSHiGG_06575

Alternate gene names: 148827837

Gene position: 1200460-1198544 (Counterclockwise)

Preceding gene: 148827838

Following gene: 148827836

Centisome position: 63.61
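
The centisome value appears to be the gene's start coordinate expressed as a percentage of the chromosome length. The sketch below assumes that definition and uses the coordinate listed first under "Gene position" (1200460, the start on the reverse strand); the function name `centisome` is illustrative and not part of the record.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a coordinate as a percentage (0-100) of the genome length."""
    if not 0 < position <= genome_length:
        raise ValueError("position must lie within the genome")
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 1_887_192  # "Length" field of NC_009567
GENE_START = 1_200_460     # first coordinate under "Gene position"

print(centisome(GENE_START, GENOME_LENGTH))  # 63.61
```

Under this assumption the result matches the listed value; using the other coordinate (1198544) would give a slightly lower figure.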

GC content: 40.64
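
GC content is conventionally the percentage of G and C bases in the sequence. The sketch below assumes that convention, with every character counted in the denominator; the record does not say how its figure was computed, and `gc_content` is a hypothetical helper. The sample input is only the first ten bases of the gene sequence, so its result is not the value listed above.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop FASTA line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene sequence in this record.
print(gc_content("ATGATTATAT"))  # 10.0
```

Passing the full 1917-base sequence, with the header line removed, should give a figure comparable to the one in the record.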

Gene sequence:

>1917_bases
ATGATTATATTTAGTAACTTATCCTTAAAACGAGGGCAAACGGAACTTCTCGAAAATGCTTCAGCTACGATTAACCCCAA
GCAAAAAGTCGGCTTAGTGGGGAAAAATGGTTGTGGAAAATCTTCACTTTTTGCCTTATTAAAAAAAGAATTAATGCCAG
AGGGCGGAGAGGTGAATTATCCAGCAAATTGGCGAGTCTCTTGGGTAAATCAAGAAACGCCTGCATTGGATATTTCTGCG
ATTGATTATGTAATTCAAGGGGATCGTGAATATTGCCGTTTGCAACAAGAGCTTGAACGTGCAAATGAACGCAATGACGG
TAACGCTATTGCACGTATTCATGGGCAATTAGAAACCTTGGATGCGTGGACAATTCAATCGCGTGCCGCTTCTTTATTGC
ATGGTTTAGGATTTAGCCAAGAAGAAACAATACAGCCAGTGAAAGCCTTTTCGGGCGGTTGGCGGATGCGTTTGAATTTG
GCGCAAGCTCTGCTTTGTCCGTCAGATTTATTATTACTTGATGAACCGACCAACCATTTGGATTTGGATGCGGTTATTTG
GTTGGAGCGTTGGTTAGTGCAATATCAAGGCACCTTGGTATTAATTTCTCACGACCGTGATTTTCTAGACCCGATTGTGA
CAAAAATCCTCCATATCGAAAATCAGAAGCTCAACGAATACACGGGCGATTATTCTTCCTTTGAAGTGCAACGAGCCACT
AAATTGGCTCAACAAACAGCCATGTATCGTCAGCAACAACAAAAGATTTCCCATTTACAAAAATATATTGATCGCTTTAA
AGCCAAAGCCACCAAAGCCAAACAGGCACAAAGCCGTATGAAAGCATTGGAAAGAATGGAGCTGATTGCGCCTGCTTATG
TGGATAATCCTTTTACTTTTAAATTTCGTCCGCCGCAATCCTTGCCGAATCCTTTGGTGATGATTGAACAGGCAAGTGCA
GGTTATGGCAGCGGAGAAAGTGCGGTAGAAATTTTAAGTAAAATTAAACTGAATTTAGTGCCTGGTTCGCGCATTGGTTT
GCTCGGGAAAAATGGTGCAGGAAAATCAACCTTGATTAAACTTTTAGCTGGAGAACTGACCGCACTTTCAGGCACAGTAC
AATTAGCTAAAGGTGTGCAACTTGGCTATTTTGCTCAGCATCAATTAGATACTTTACGCGCAGACGAATCTGCTCTGTGG
CATATGCAAAAACTCGCACCTGAGCAAACGGAGCAACAAGTTCGAGATTATTTAGGCAGTTTTGCGTTTCACGGCGATAA
AGTAAATCAAGCAGTGAAATCTTTTTCTGGAGGAGAAAAAGCTCGTTTGGTACTTGCGCTAATTGTTTGGCAACGTCCAA
ACCTATTACTGCTCGATGAACCAACTAACCATTTGGATTTGGATATGCGTCAAGCATTAACGGAAGCATTGGTGGATTAC
GAAGGTTCTTTGGTGGTGGTGTCGCACGATCGTCATCTATTACGCAATACCGTGGAAGAATTTTATTTAGTTCACGATAA
AAAAGTGGAAGAATTCAAAGGCGATTTAGAGGATTATCAAAAATGGCTGAGTGAACAAAATAGCACATCTGAAAATAAAG
TTTCGGAAAAAGTGGGCGATAATGAAAATTCGGTTCAAAATCGTAAGGAACAAAAACGCCGTGAAGCAGAGTTACGCCAA
CAAACTGCACCATTACGTAAAAAAATCACGCAATTAGAAGAAAAAATGAATAAATTTTCTTCTGAGCTTGCGAACATAGA
AAATCAACTGGCTGATGCTGAATTATATAATGCCGAAAATAAAGAAAAATTGACCGCACTTTTGGCTCAACAAGTGGATG
TGAAGAAAGCATTAGATGATGTAGAAACGGAATGGATGACTGCACAGGAAGAATTGGAAGAAATGCTTCAAGCATAA

Upstream 100 bases:

>100_bases
GTGGCTTTTTTGACTAGAAAATCGATAAAATCGGAATAGACTGTTCCGAGAAAAGAAAAAATATTGCAAAATAGCCTCAA
ATTAAGACCTATGTAGAAAA

Downstream 100 bases:

>100_bases
GGATAGATTATGAATCAAAGCCTTTTTCATCATAGCAAACAACAGGAATATTGCCCTCAATGTGGTGCACCTTTACAAAT
TAAACAAGGCAAAAAAGGGC

Product: ABC transporter ATP-binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 638; Mature: 638
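
The residue count is consistent with the gene coordinates: the inclusive span from Start to End is 1917 bases, or 639 codons, and the final codon (TAA) is a stop. A minimal sketch of that arithmetic follows, assuming a single uninterrupted coding sequence; the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given its two coordinates."""
    return abs(end - start) + 1


def residues_from_cds(n_bases: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


n_bases = gene_length(1_198_544, 1_200_460)
print(n_bases)                     # 1917
print(residues_from_cds(n_bases))  # 638
```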

Protein sequence:

>638_residues
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNYPANWRVSWVNQETPALDISA
IDYVIQGDREYCRLQQELERANERNDGNAIARIHGQLETLDAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNL
AQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTFKFRPPQSLPNPLVMIEQASA
GYGSGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIKLLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALW
HMQKLAPEQTEQQVRDYLGSFAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGDNENSVQNRKEQKRREAELRQ
QTAPLRKKITQLEEKMNKFSSELANIENQLADAELYNAENKEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA

Sequences:

>Translated_638_residues
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNYPANWRVSWVNQETPALDISA
IDYVIQGDREYCRLQQELERANERNDGNAIARIHGQLETLDAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNL
AQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTFKFRPPQSLPNPLVMIEQASA
GYGSGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIKLLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALW
HMQKLAPEQTEQQVRDYLGSFAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGDNENSVQNRKEQKRREAELRQ
QTAPLRKKITQLEEKMNKFSSELANIENQLADAELYNAENKEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA
>Mature_638_residues
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNYPANWRVSWVNQETPALDISA
IDYVIQGDREYCRLQQELERANERNDGNAIARIHGQLETLDAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNL
AQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTFKFRPPQSLPNPLVMIEQASA
GYGSGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIKLLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALW
HMQKLAPEQTEQQVRDYLGSFAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGDNENSVQNRKEQKRREAELRQ
QTAPLRKKITQLEEKMNKFSSELANIENQLADAELYNAENKEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA

Specific function: Unknown

COG id: COG0488

COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 ABC transporter domains [H]

Homologues:

Organism=Homo sapiens, GI27881506, Length=539, Percent_Identity=35.8070500927644, Blast_Score=332, Evalue=7e-91,
Organism=Homo sapiens, GI148612853, Length=530, Percent_Identity=34.7169811320755, Blast_Score=332, Evalue=8e-91,
Organism=Homo sapiens, GI10947137, Length=539, Percent_Identity=35.8070500927644, Blast_Score=331, Evalue=1e-90,
Organism=Homo sapiens, GI10947135, Length=553, Percent_Identity=32.7305605786618, Blast_Score=250, Evalue=3e-66,
Organism=Homo sapiens, GI69354671, Length=553, Percent_Identity=32.7305605786618, Blast_Score=249, Evalue=5e-66,
Organism=Escherichia coli, GI1789751, Length=636, Percent_Identity=68.5534591194968, Blast_Score=915, Evalue=0.0,
Organism=Escherichia coli, GI1787041, Length=528, Percent_Identity=33.1439393939394, Blast_Score=315, Evalue=6e-87,
Organism=Escherichia coli, GI1787182, Length=625, Percent_Identity=28.16, Blast_Score=243, Evalue=3e-65,
Organism=Escherichia coli, GI2367384, Length=531, Percent_Identity=29.9435028248588, Blast_Score=209, Evalue=4e-55,
Organism=Escherichia coli, GI1788165, Length=196, Percent_Identity=32.6530612244898, Blast_Score=86, Evalue=6e-18,
Organism=Escherichia coli, GI48994997, Length=227, Percent_Identity=24.2290748898678, Blast_Score=66, Evalue=7e-12,
Organism=Escherichia coli, GI1787758, Length=206, Percent_Identity=24.2718446601942, Blast_Score=64, Evalue=2e-11,
Organism=Escherichia coli, GI1788002, Length=186, Percent_Identity=32.258064516129, Blast_Score=64, Evalue=4e-11,
Organism=Caenorhabditis elegans, GI17553372, Length=537, Percent_Identity=37.243947858473, Blast_Score=345, Evalue=3e-95,
Organism=Caenorhabditis elegans, GI17555318, Length=531, Percent_Identity=33.5216572504708, Blast_Score=312, Evalue=3e-85,
Organism=Caenorhabditis elegans, GI17559834, Length=534, Percent_Identity=32.3970037453184, Blast_Score=282, Evalue=3e-76,
Organism=Caenorhabditis elegans, GI17541710, Length=198, Percent_Identity=29.7979797979798, Blast_Score=69, Evalue=1e-11,
Organism=Caenorhabditis elegans, GI17556528, Length=236, Percent_Identity=25.4237288135593, Blast_Score=67, Evalue=3e-11,
Organism=Saccharomyces cerevisiae, GI6321121, Length=545, Percent_Identity=35.045871559633, Blast_Score=337, Evalue=4e-93,
Organism=Saccharomyces cerevisiae, GI6320874, Length=532, Percent_Identity=34.5864661654135, Blast_Score=320, Evalue=4e-88,
Organism=Saccharomyces cerevisiae, GI6325030, Length=395, Percent_Identity=29.3670886075949, Blast_Score=154, Evalue=3e-38,
Organism=Saccharomyces cerevisiae, GI6324314, Length=388, Percent_Identity=29.3814432989691, Blast_Score=131, Evalue=3e-31,
Organism=Saccharomyces cerevisiae, GI6323278, Length=388, Percent_Identity=27.8350515463918, Blast_Score=128, Evalue=3e-30,
Organism=Drosophila melanogaster, GI24666836, Length=528, Percent_Identity=36.3636363636364, Blast_Score=372, Evalue=1e-103,
Organism=Drosophila melanogaster, GI24642252, Length=546, Percent_Identity=36.996336996337, Blast_Score=352, Evalue=3e-97,
Organism=Drosophila melanogaster, GI18859989, Length=546, Percent_Identity=36.996336996337, Blast_Score=352, Evalue=3e-97,
Organism=Drosophila melanogaster, GI24641342, Length=540, Percent_Identity=32.7777777777778, Blast_Score=298, Evalue=7e-81,
Organism=Drosophila melanogaster, GI116007184, Length=159, Percent_Identity=36.4779874213836, Blast_Score=69, Evalue=1e-11,
Organism=Drosophila melanogaster, GI221500365, Length=159, Percent_Identity=36.4779874213836, Blast_Score=69, Evalue=1e-11,
Organism=Drosophila melanogaster, GI45550390, Length=233, Percent_Identity=28.755364806867, Blast_Score=66, Evalue=7e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003439
- InterPro:   IPR017871
- InterPro:   IPR003593 [H]

Pfam domain/function: PF00005 ABC_tran [H]

EC number: NA

Molecular weight: Translated: 72176; Mature: 72176

Theoretical pI: Translated: 5.84; Mature: 5.84

Prosite motif: PS00211 ABC_TRANSPORTER_1; PS50893 ABC_TRANSPORTER_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
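
These percentages are presumably residue counts divided by the protein length and rounded to one decimal place. The sketch below assumes that and introduces a hypothetical helper, `residue_percent`; it is run on only the first 60 residues of the protein, so its output differs from the whole-protein figures above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()  # drop line breaks and spaces
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# First 60 residues of the protein sequence in this record.
first_row = "MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNY"

print(residue_percent(first_row, "C"))   # 1.7
print(residue_percent(first_row, "M"))   # 3.3
print(residue_percent(first_row, "CM"))  # 5.0
```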

Secondary structure:

>Translated Secondary Structure
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNY
CEEEECCCCCCCHHHHHHCCCCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCC
PANWRVSWVNQETPALDISAIDYVIQGDREYCRLQQELERANERNDGNAIARIHGQLETL
CCCCEEEEECCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHH
DAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNLAQALLCPSDLLLLDEPTNHL
HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCEEEHHHHHHHHCCCCEEEEECCCCCC
DLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHHHH
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE
KFRPPQSLPNPLVMIEQASAGYGSGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIK
EECCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHEECCCCCEEEEECCCCCHHHHHH
LLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALWHMQKLAPEQTEQQVRDYLGS
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHCHHHHHHHHHHHHHH
FAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
HHHCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHCC
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGD
CCCEEEEECCHHHHHHHHHHHHHHHCCHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHCCC
NENSVQNRKEQKRREAELRQQTAPLRKKITQLEEKMNKFSSELANIENQLADAELYNAEN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
KEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNY
CEEEECCCCCCCHHHHHHCCCCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCC
PANWRVSWVNQETPALDISAIDYVIQGDREYCRLQQELERANERNDGNAIARIHGQLETL
CCCCEEEEECCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHH
DAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNLAQALLCPSDLLLLDEPTNHL
HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCEEEHHHHHHHHCCCCEEEEECCCCCC
DLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
CHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCHHHHHCCCCCHHHHHHHH
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE
KFRPPQSLPNPLVMIEQASAGYGSGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIK
EECCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHEECCCCCEEEEECCCCCHHHHHH
LLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALWHMQKLAPEQTEQQVRDYLGS
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHCHHHHHHHHHHHHHH
FAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
HHHCCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHCC
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGD
CCCEEEEECCHHHHHHHHHHHHHHHCCHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHCCC
NENSVQNRKEQKRREAELRQQTAPLRKKITQLEEKMNKFSSELANIENQLADAELYNAEN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
KEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7542800; 10675023 [H]