Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is modF [H]
Identifier: 157160235
GI number: 157160235
Start: 818919
End: 820391
Strand: Reverse
Name: modF [H]
Synonym: EcHS_A0814
Alternate gene names: 157160235
Gene position: 820391-818919 (Counterclockwise)
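The Start, End and Strand fields together fix the gene's length and orientation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the 1473-base gene sequence in this record is consistent with that reading). The function name `gene_span` is illustrative and does not belong to any database API.

```python
def gene_span(start: int, end: int, strand: str) -> dict:
    """Summarise a gene from 1-based, inclusive genome coordinates (assumed)."""
    length = end - start + 1  # inclusive coordinates: both end points count
    reverse = strand.strip().lower() == "reverse"
    # On the reverse strand the coding sequence begins at the higher coordinate.
    five_prime = end if reverse else start
    three_prime = start if reverse else end
    return {"length": length, "five_prime": five_prime, "three_prime": three_prime}


span = gene_span(818919, 820391, "Reverse")
print(span)
```

With the values from this record the computed length is 1473 and the 5' end falls at 820391, which matches the order in which the Gene position field lists the coordinates.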
Preceding gene: 157160236
Following gene: 157160234
Centisome position: 17.67
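A centisome position expresses a map coordinate as a percentage of the genome length. The sketch below assumes the value is derived from the first coordinate of the Gene position field (820391, the 5' end on the reverse strand) and the genome length given in the header. That choice is inferred from the numbers in this record and is not a documented rule.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a map position as a percentage of the genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * position / genome_length


value = centisome(820391, 4643538)
print(round(value, 2))  # 17.67
```

Using the lower coordinate (818919) instead gives 17.64, which is why the higher coordinate is assumed here.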
GC content: 52.68
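GC content is the percentage of G and C bases in the gene sequence. The sketch below assumes the sequence contains only unambiguous A/C/G/T symbols and ignores whitespace. The short test sequence is made up for illustration and is not taken from this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


example = gc_content("ATGC GGCC")  # illustrative 8-base sequence
print(example)  # 75.0
```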
Gene sequence:
>1473_bases ATGTCATCGTTGCAAATTTTGCAAGGCACGTTTCGTCTTAGCGACACAAAAACGCTGCAATTGCCTCAGCTAACGTTAAA CGCGGGTGATAGTTGGGCGTTTGTCGGTTCGAATGGAAGCGGGAAATCGGCCCTGGCCCGCGCGCTGGCGGGGGAACTTC CGCTTTTGAAAGGTGAACGGCAAAGCCAGTTTTCCCACATCACTCGTCTCTCCTTCGAGCAATTGCAAAAGCTCGTCAGC GACGAATGGCAACGGAATAACACCGATATGCTCGGCCCTGGCGAAGATGACACCGGACGCACTACGGCTGAGATTATTCA GGATGAAGTAAAGGATGCACCGCGTTGCATGCAACTGGCGCAGCAGTTCGGTATTACCGCCCTCCTCGACCGACGCTTTA AATACCTTTCCACTGGCGAGACGCGAAAAACCCTGCTGTGTCAGGCGCTGATGTCGGAGCCTGACTTGTTGATTCTTGAT GAGCCGTTCGATGGCCTGGATGTTGCTTCACGTCAGCAGCTGGCTGAGCGACTCGCCTCGTTACATCAGTCCGGTATTAC TCTGGTACTGGTGCTCAATCGCTTCGATGAGATCCCGGAATTTGTCCAGTTTGCTGGCGTGCTGGCGGATTGTACGTTAG CGGAAACGGGCGCTAAAGAGGAACTGCTCCAGCAAGCACTCGTCGCGCAACTGGCGCATAGCGAACAGCTTGAAGGTGTG CAACTGCCGGAACCGGATGAACCTTCAGCACGTCACGCCTTACCCGCCAACGAACCGCGCATTGTGCTGAACAATGGCGT GGTTTCTTATAACGATCGCCCCATTCTTAATAACCTTAGCTGGCAGGTGAATCCAGGCGAACACTGGCAAATTGTCGGGC CAAATGGTGCGGGAAAATCGACGTTATTAAGCCTGATTACTGGCGATCATCCGCAAGGTTACAGCAACGATTTGACGCTT TTCGGCCGACGTCGTGGCAGCGGCGAAACCATCTGGGATATCAAAAAGCATATCGGTTACGTCAGCAGTAGTTTGCATCT GGATTACCGGGTCAGCACTACCGTGCGTAATGTGATTCTTTCTGGCTATTTCGATTCGATTGGCATTTATCAAGCCGTTT CGGATCGCCAGCAAAAACTGGTGCAGCAGTGGCTGGATATTCTCGGCATTGATAAACGCACGGCTGACGCTCCGTTCCAT AGTCTTTCCTGGGGACAGCAGCGTCTGGCGCTGATCGTCCGCGCACTGGTGAAACATCCGACGTTGCTTATTCTCGATGA ACCACTACAGGGGCTTGATCCGCTCAATCGCCAGCTTATCCGCCGTTTTGTTGATGTGCTGATTAGCGAAGGTGAAACGC AATTGTTGTTTGTTTCGCACCACGCTGAAGATGCGCCTGCCTGTATTACCCATCGCCTTGAGTTCGTGCCGGACGGTGGA CTCTATCGCTATGTGCTGACAAAAATATATTGA
Upstream 100 bases:
>100_bases GCCGACAGCGTGATTATCGCCACGCTGTGCTAAGCGTGTTGACAATTTGTTATGAAACACGTATCCCTGTCAGTAATCGC TGCACAAAATGGGATATAAA
Downstream 100 bases:
>100_bases GTCGGTAGTGCTGACCTTGCCGGAGGCGGCCTAAGCGCCCTCTCCGGCCAACGGTTCGACGCATGCAGGCATTAAACCGC GTCTTTTTTTCAGATAAAAA
Product: putative molybdenum transport ATP-binding protein ModF
Products: NA
Alternate protein names: Photorepair protein PhrA [H]
Number of amino acids: Translated: 490; Mature: 489
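The two residue counts follow from the gene length: the coding sequence is read in codons of three bases, the last codon is a stop, and the mature form in this record lacks the initiator methionine (its sequence begins at the second residue). A small sketch of that arithmetic, with a hypothetical function name:

```python
from typing import Tuple


def residue_counts(n_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    translated = n_bases // 3 - 1  # the final codon is a stop, not a residue
    mature = translated - 1        # assumes the initiator Met is removed
    return translated, mature


counts = residue_counts(1473)
print(counts)  # (490, 489)
```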
Protein sequence:
>490_residues MSSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGERQSQFSHITRLSFEQLQKLVS DEWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLAQQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILD EPFDGLDVASRQQLAERLASLHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGV QLPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKSTLLSLITGDHPQGYSNDLTL FGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVILSGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFH SLSWGQQRLALIVRALVKHPTLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGG LYRYVLTKIY
Sequences:
>Translated_490_residues MSSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGERQSQFSHITRLSFEQLQKLVS DEWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLAQQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILD EPFDGLDVASRQQLAERLASLHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGV QLPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKSTLLSLITGDHPQGYSNDLTL FGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVILSGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFH SLSWGQQRLALIVRALVKHPTLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGG LYRYVLTKIY >Mature_489_residues SSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGERQSQFSHITRLSFEQLQKLVSD EWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLAQQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILDE PFDGLDVASRQQLAERLASLHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGVQ LPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKSTLLSLITGDHPQGYSNDLTLF GRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVILSGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFHS LSWGQQRLALIVRALVKHPTLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGGL YRYVLTKIY
Specific function: Involved in the transport of molybdenum into the cell. Involved in photorepair. Could act on UV-induced DNA damage other than pyrimidine dimers [H]
COG id: COG1119
COG function: function code P; ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA
Gene ontology:
Cell location: Cell inner membrane; Peripheral membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Homo sapiens, GI27262624, Length=167, Percent_Identity=28.1437125748503, Blast_Score=66, Evalue=8e-11
Organism=Homo sapiens, GI27262626, Length=167, Percent_Identity=28.1437125748503, Blast_Score=66, Evalue=8e-11
Organism=Escherichia coli, GI1786975, Length=490, Percent_Identity=99.7959183673469, Blast_Score=998, Evalue=0.0
Organism=Escherichia coli, GI48994943, Length=379, Percent_Identity=27.7044854881267, Blast_Score=103, Evalue=2e-23
Organism=Escherichia coli, GI1788897, Length=502, Percent_Identity=23.3067729083665, Blast_Score=100, Evalue=4e-22
Organism=Escherichia coli, GI1787041, Length=526, Percent_Identity=23.1939163498099, Blast_Score=93, Evalue=5e-20
Organism=Escherichia coli, GI1789991, Length=475, Percent_Identity=24.8421052631579, Blast_Score=91, Evalue=1e-19
Organism=Escherichia coli, GI48995001, Length=463, Percent_Identity=23.5421166306695, Blast_Score=90, Evalue=3e-19
Organism=Escherichia coli, GI1790525, Length=474, Percent_Identity=21.9409282700422, Blast_Score=80, Evalue=3e-16
Organism=Escherichia coli, GI1790190, Length=480, Percent_Identity=23.5416666666667, Blast_Score=80, Evalue=3e-16
Organism=Escherichia coli, GI1787792, Length=489, Percent_Identity=24.1308793456033, Blast_Score=79, Evalue=5e-16
Organism=Escherichia coli, GI2367384, Length=483, Percent_Identity=24.4306418219462, Blast_Score=77, Evalue=2e-15
Organism=Escherichia coli, GI1787029, Length=206, Percent_Identity=30.0970873786408, Blast_Score=73, Evalue=5e-14
Organism=Escherichia coli, GI145693107, Length=496, Percent_Identity=23.3870967741935, Blast_Score=72, Evalue=1e-13
Organism=Escherichia coli, GI1789593, Length=223, Percent_Identity=28.2511210762332, Blast_Score=72, Evalue=1e-13
Organism=Escherichia coli, GI1786319, Length=193, Percent_Identity=31.0880829015544, Blast_Score=70, Evalue=3e-13
Organism=Escherichia coli, GI1788225, Length=180, Percent_Identity=28.3333333333333, Blast_Score=70, Evalue=4e-13
Organism=Escherichia coli, GI87081782, Length=203, Percent_Identity=27.5862068965517, Blast_Score=69, Evalue=8e-13
Organism=Escherichia coli, GI1786563, Length=197, Percent_Identity=30.4568527918782, Blast_Score=67, Evalue=3e-12
Organism=Escherichia coli, GI1790544, Length=184, Percent_Identity=30.4347826086957, Blast_Score=66, Evalue=4e-12
Organism=Escherichia coli, GI1786253, Length=201, Percent_Identity=27.8606965174129, Blast_Score=64, Evalue=2e-11
Organism=Escherichia coli, GI1788472, Length=480, Percent_Identity=22.0833333333333, Blast_Score=64, Evalue=2e-11
Organism=Escherichia coli, GI87082268, Length=198, Percent_Identity=28.2828282828283, Blast_Score=64, Evalue=2e-11
Organism=Escherichia coli, GI1788761, Length=219, Percent_Identity=26.9406392694064, Blast_Score=63, Evalue=4e-11
Organism=Escherichia coli, GI1787112, Length=230, Percent_Identity=27.3913043478261, Blast_Score=63, Evalue=4e-11
Organism=Caenorhabditis elegans, GI193208177, Length=236, Percent_Identity=29.2372881355932, Blast_Score=71, Evalue=1e-12
Organism=Caenorhabditis elegans, GI115533608, Length=229, Percent_Identity=27.9475982532751, Blast_Score=69, Evalue=4e-12
Organism=Saccharomyces cerevisiae, GI6320266, Length=393, Percent_Identity=29.5165394402036, Blast_Score=161, Evalue=2e-40
Organism=Drosophila melanogaster, GI24650853, Length=237, Percent_Identity=32.0675105485232, Blast_Score=77, Evalue=2e-14
Organism=Drosophila melanogaster, GI24650855, Length=237, Percent_Identity=32.0675105485232, Blast_Score=77, Evalue=3e-14
Organism=Drosophila melanogaster, GI281362751, Length=237, Percent_Identity=32.0675105485232, Blast_Score=76, Evalue=7e-14
Organism=Drosophila melanogaster, GI28574744, Length=196, Percent_Identity=28.0612244897959, Blast_Score=71, Evalue=1e-12
Organism=Drosophila melanogaster, GI221512771, Length=202, Percent_Identity=26.2376237623762, Blast_Score=71, Evalue=2e-12
Organism=Drosophila melanogaster, GI19920532, Length=183, Percent_Identity=29.5081967213115, Blast_Score=69, Evalue=5e-12
Organism=Drosophila melanogaster, GI24580930, Length=183, Percent_Identity=29.5081967213115, Blast_Score=69, Evalue=5e-12
Organism=Drosophila melanogaster, GI85725262, Length=214, Percent_Identity=29.9065420560748, Blast_Score=66, Evalue=5e-11
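Each homologue entry carries the same six fields (organism, GI number, alignment length, percent identity, BLAST score, E-value), so the listing can be turned into structured records with a regular expression. This is a sketch under the assumption that every entry follows the `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern seen here. The names `HIT_PATTERN` and `parse_hits` are illustrative, and the sample text is the first and third entries of the listing.

```python
import re
from typing import Dict, List

# One homologue entry; \s* lets an entry continue across a line break.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Dict]:
    """Extract homologue entries from the flat listing into dictionaries."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match.group("organism").strip(),
            "gi": match.group("gi"),  # kept as text: it is an identifier
            "length": int(match.group("length")),
            "identity": float(match.group("identity")),
            "score": int(match.group("score")),
            "evalue": float(match.group("evalue")),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI27262624, Length=167, "
    "Percent_Identity=28.1437125748503, Blast_Score=66, Evalue=8e-11, "
    "Organism=Escherichia coli, GI1786975, Length=490, "
    "Percent_Identity=99.7959183673469, Blast_Score=998, Evalue=0.0,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["gi"], best["score"])
```

Sorting or filtering the resulting list by `score` or `evalue` then separates the near-identical E. coli match from the weaker hits in other organisms.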
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439
- InterPro: IPR003593 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 54551; Mature: 54419
Theoretical pI: Translated: 5.45; Mature: 5.45
Prosite motif: PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
0.8 %Met (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
0.6 %Met (Mature Protein)
1.4 %Cys+Met (Mature Protein)
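The Cys/Met figures are residue counts expressed as a percentage of the protein length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in this record. The peptide used in the example is made up for illustration.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


toy = "MACDEFGHIK"  # hypothetical 10-residue peptide: one Met, one Cys
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
both = residue_percent(toy, "CM")
print(cys, met, both)  # 10.0 10.0 20.0
```

The translated sequence in this record contains 4 Cys and 4 Met in 490 residues, which gives 0.8, 0.8 and 1.6. Removing the initiator Met leaves 3 Met in 489 residues, hence 0.6 and 1.4 for the mature protein.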
Secondary structure:
>Translated Secondary Structure MSSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGER CCCCHHHHHHEEECCCCEEECCEEEECCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCCH QSQFSHITRLSFEQLQKLVSDEWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLA HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHH QQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILDEPFDGLDVASRQQLAERLAS HHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHH LHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGV HHHCCCEEEEEHHHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHHHHHHHHHHHCCCC QLPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKS CCCCCCCCCCCCCCCCCCCEEEEECCEEEECCCCCCCCCCEEECCCCCEEEECCCCCCHH TLLSLITGDHPQGYSNDLTLFGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVIL HHHHHHHCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCEEEEEEHHHHHHHHHH SGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFHSLSWGQQRLALIVRALVKHP HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC TLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGG EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHCCCCCC LYRYVLTKIY HHHHHHHHCC >Mature Secondary Structure SSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGER CCCHHHHHHEEECCCCEEECCEEEECCCCCEEEECCCCCCHHHHHHHHHCCCCCCCCCH QSQFSHITRLSFEQLQKLVSDEWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLA HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHH QQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILDEPFDGLDVASRQQLAERLAS HHHCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHH LHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGV HHHCCCEEEEEHHHHHHHHHHHHHHHHHHHCHHHHCCCHHHHHHHHHHHHHHHHHHCCCC QLPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKS CCCCCCCCCCCCCCCCCCCEEEEECCEEEECCCCCCCCCCEEECCCCCEEEECCCCCCHH TLLSLITGDHPQGYSNDLTLFGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVIL HHHHHHHCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCEEEEEEHHHHHHHHHH SGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFHSLSWGQQRLALIVRALVKHP HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC TLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGG 
EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHCCCCCC LYRYVLTKIY HHHHHHHHCC
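The secondary-structure listing alternates a block of residues with a block of per-residue states of equal length. The sketch below separates the two and tallies the states. It assumes the `>... Secondary Structure` header has already been stripped, and that H, E and C denote helix, strand and coil, which is the usual convention but is not defined in this record. The function names are hypothetical, and the example input is the first 12 residues and states of the translated listing, split into two blocks for brevity.

```python
from collections import Counter
from typing import Dict, Tuple


def split_interleaved(listing: str) -> Tuple[str, str]:
    """Split alternating residue/state blocks into two aligned strings."""
    blocks = listing.split()
    residues = "".join(blocks[0::2])  # even-numbered blocks: amino acids
    states = "".join(blocks[1::2])    # odd-numbered blocks: H/E/C states
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> Dict[str, float]:
    """Fraction of positions assigned to each state."""
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in sorted(counts)}


residues, states = split_interleaved("MSSLQI CCCCHH LQGTFR HHHHEE")
fractions = state_fractions(states)
print(residues, states, fractions)
```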
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8564363; 8550508; 8905232; 9278503; 8310005 [H]