Definition | Mycobacterium tuberculosis F11, complete genome. |
---|---|
Accession | NC_009565 |
Length | 4,424,435 |
Click here to switch to the map view.
The map label for this gene is 148823772
Identifier: 148823772
GI number: 148823772
Start: 2914235
End: 2915824
Strand: Direct
Name: 148823772
Synonym: TBFG_12597
Alternate gene names: NA
Gene position: 2914235-2915824 (Clockwise)
Preceding gene: 148823770
Following gene: 148823774
Centisome position: 65.87
GC content: 64.47
Gene sequence:
>1590_bases GTGGGCGCCGATCTGAAGCAGCCGCAGGATGCCGATTCACCCCCGAAAGGGGTTAGCCGCCGTAGGTTCCTGACGACGGG CGCGGCAGCGGTTGTTGGGACAGGTGTCGGCGCGGGCGGGACCGCGCTGCTGTCGTCACACCCCCGGGGTCCTGCCGTCT GGTATCAACGTGGTCGGAGCGGCGCGCCTCCGGTGGGTGGTCTGCACCTGCAGTTCGGCCGGAATGCCAGCACCGAAATG GTGGTGTCCTGGCATACCACGGACACCGTCGGCAATCCGCGAGTCATGCTGGGCACGCCAACCTCTGGCTTCGGCAGCGT CGTGGTGGCCGAGACCCGGTCGTACCGGGATGCGAAGTCCAATACCGAGGTGCGCGTCAACCACGCTCACCTGACCAACC TGACACCCGATACCGACTACGTCTACGCCGCGGTGCACGACGGTACAACTCCGGAGCTCGGGACCGCACGGACCGCACCG TCGGGTCGAAAACCGCTACGCTTCACCAGCTTCGGTGATCAGTCCACTCCCGCGTTGGGCAGACTGGCCGACGGGAGGTA CGTCAGCGACAACATCGGATCCCCCTTCGCCGGTGACATCACGATTGCGATCGAGCGTATTGCCCCGTTGTTCAACCTGA TCAACGGTGACCTGTGTTACGCCAACCTGGCACAAGACCGAATTCGCACCTGGTCGGACTGGTTTGACAACAACACCCGC TCGGCGCGCTACCGGCCGTGGATGCCGGCAGCGGGCAATCACGAGAACGAAGTCGGTAACGGGCCAATCGGTTATGACGC CTATCAGACCTACTTTGCGGTACCCGACTCGGGATCCAGCCCGCAACTGCGCGGGCTATGGTACTCGTTCACCGCCGGCT CGGTGCGGGTGATCAGCCTGCACAACGATGATGTGTGCTACCAGGACGGTGGCAACTCCTACGTACGCGGCTATTCGGGC GGCGAACAACGGCGCTGGCTGCAAGCCGAACTCGCCAACGCTCGGCGCGACTCGGAAATCGACTGGGTGGTCGTCTGCAT GCATCAGACCGCGATCTCCACCGCCGACGACAACAACGGTGCCGACCTCGGAATCCGGCAGGAATGGCTACCGCTGTTCG ACCAGTACCAGGTCGACCTGGTGGTGTGCGGCCACGAACACCACTACGAGCGGTCACATCCGCTGCGCGGGGCCCTGGGC ACCGATACCCGAACACCGATACCCGTCGACACCCGCAGCGACCTCATCGACTCAACCCGGGGAACCGTGCACCTGGTAAT CGGTGGGGGCGGCACGTCGAAGCCGACCAACGCGCTGCTCTTCCCGCAGCCTCGGTGCCAGGTGATAACCGGCGTCGGGG ATTTTGATCCCGCGATCCGGCGTAAGCCGTCCATATTCGTGCTCGAGGATGCGCCGTGGTCGGCGTTCCGCGACCGCGAT AATCCTTACGGCTTCGTGGCCTTCGACGTCGACCCGGGTCAACCCGGCGGCACTACCTCGATCAAGGCGACGTATTACGC GGTGACTGGGCCGTTCGGGGGACTCACCGTCATCGACCAATTCACCTTGACCAAGCCGCGCGGCGGATAG
Upstream 100 bases:
>100_bases TACGCAGTTCAGAAAGCCTTTCCGAGCAACGCGCCGAGGTAACTTCAGATTTCGGCAGCCGGTTTACCCGCAGGTAAACC AGGGCGGGTATGAAACGTGA
Downstream 100 bases:
>100_bases CTCAGAACAGGGTCGCCTGAACGGGTACCAGTGCCGCTTCGGTCTCCGGCGGCGCCGGGCGATGATCACCCGCCAACCGA TACTTTGCGATCAGCGGTGC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 529; Mature: 528
Protein sequence:
>529_residues MGADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALLSSHPRGPAVWYQRGRSGAPPVGGLHLQFGRNASTEM VVSWHTTDTVGNPRVMLGTPTSGFGSVVVAETRSYRDAKSNTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAP SGRKPLRFTSFGDQSTPALGRLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTR SARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISLHNDDVCYQDGGNSYVRGYSG GEQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNGADLGIRQEWLPLFDQYQVDLVVCGHEHHYERSHPLRGALG TDTRTPIPVDTRSDLIDSTRGTVHLVIGGGGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRD NPYGFVAFDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG
Sequences:
>Translated_529_residues MGADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALLSSHPRGPAVWYQRGRSGAPPVGGLHLQFGRNASTEM VVSWHTTDTVGNPRVMLGTPTSGFGSVVVAETRSYRDAKSNTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAP SGRKPLRFTSFGDQSTPALGRLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTR SARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISLHNDDVCYQDGGNSYVRGYSG GEQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNGADLGIRQEWLPLFDQYQVDLVVCGHEHHYERSHPLRGALG TDTRTPIPVDTRSDLIDSTRGTVHLVIGGGGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRD NPYGFVAFDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG >Mature_528_residues GADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALLSSHPRGPAVWYQRGRSGAPPVGGLHLQFGRNASTEMV VSWHTTDTVGNPRVMLGTPTSGFGSVVVAETRSYRDAKSNTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAPS GRKPLRFTSFGDQSTPALGRLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTRS ARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISLHNDDVCYQDGGNSYVRGYSGG EQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNGADLGIRQEWLPLFDQYQVDLVVCGHEHHYERSHPLRGALGT DTRTPIPVDTRSDLIDSTRGTVHLVIGGGGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRDN PYGFVAFDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG
Specific function: Unknown
COG id: COG1409
COG function: function code R; Predicted phosphohydrolases
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI228008321, Length=241, Percent_Identity=26.1410788381743, Blast_Score=68, Evalue=2e-11, Organism=Caenorhabditis elegans, GI32566472, Length=350, Percent_Identity=25.7142857142857, Blast_Score=77, Evalue=3e-14,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y2577_MYCTU (Q50644)
Other databases:
- EMBL: BX842580 - EMBL: AE000516 - PIR: H70724 - RefSeq: NP_217093.1 - RefSeq: NP_337153.1 - ProteinModelPortal: Q50644 - EnsemblBacteria: EBMYCT00000000938 - EnsemblBacteria: EBMYCT00000072039 - GeneID: 888207 - GeneID: 925659 - GenomeReviews: AE000516_GR - GenomeReviews: AL123456_GR - KEGG: mtc:MT2654 - KEGG: mtu:Rv2577 - TIGR: MT2654 - TubercuList: Rv2577 - GeneTree: EBGT00050000017187 - HOGENOM: HBG367301 - OMA: TIHLILG - ProtClustDB: CLSK872076 - InterPro: IPR004843 - InterPro: IPR008963 - InterPro: IPR015914 - InterPro: IPR006311 - Gene3D: G3DSA:2.60.40.380 - TIGRFAMs: TIGR01409
Pfam domain/function: PF00149 Metallophos; SSF49363 Purple_Pase_N
EC number: NA
Molecular weight: Translated: 57272; Mature: 57141
Theoretical pI: Translated: 6.75; Mature: 6.75
Prosite motif: PS51318 TAT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein) 0.9 %Met (Translated Protein) 1.9 %Cys+Met (Translated Protein) 0.9 %Cys (Mature Protein) 0.8 %Met (Mature Protein) 1.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MGADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALLSSHPRGPAVWYQRGRS CCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEECCCCCCCHHHCCCCCCCCCHHHHCCCC GAPPVGGLHLQFGRNASTEMVVSWHTTDTVGNPRVMLGTPTSGFGSVVVAETRSYRDAKS CCCCCCCEEEEECCCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCEEEEECCCCCCCCC NTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAPSGRKPLRFTSFGDQSTPALG CCEEEEEEEEECCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCHH RLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTR HCCCCCEEECCCCCCCCCCEEEEHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHCCCCC SARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISL CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEEEEEECCCCEEEEEE HNDDVCYQDGGNSYVRGYSGGEQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNG CCCCEEEECCCCCEEECCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEEHHEECCCCCCCC ADLGIRQEWLPLFDQYQVDLVVCGHEHHYERSHPLRGALGTDTRTPIPVDTRSDLIDSTR CCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCC GTVHLVIGGGGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRD CEEEEEEECCCCCCCCCEEECCCCCEEEEECCCCCCHHHHCCCCEEEEECCCCHHHCCCC NPYGFVAFDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG CCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCCEEEEEEEEECCCCCC >Mature Secondary Structure GADLKQPQDADSPPKGVSRRRFLTTGAAAVVGTGVGAGGTALLSSHPRGPAVWYQRGRS CCCCCCCCCCCCCCCCCCCCEEEECCCCEEEECCCCCCCHHHCCCCCCCCCHHHHCCCC GAPPVGGLHLQFGRNASTEMVVSWHTTDTVGNPRVMLGTPTSGFGSVVVAETRSYRDAKS CCCCCCCEEEEECCCCCCEEEEEEECCCCCCCCEEEEECCCCCCCCEEEEECCCCCCCCC NTEVRVNHAHLTNLTPDTDYVYAAVHDGTTPELGTARTAPSGRKPLRFTSFGDQSTPALG CCEEEEEEEEECCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCHH RLADGRYVSDNIGSPFAGDITIAIERIAPLFNLINGDLCYANLAQDRIRTWSDWFDNNTR HCCCCCEEECCCCCCCCCCEEEEHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHCCCCC SARYRPWMPAAGNHENEVGNGPIGYDAYQTYFAVPDSGSSPQLRGLWYSFTAGSVRVISL CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEEEEEECCCCEEEEEE HNDDVCYQDGGNSYVRGYSGGEQRRWLQAELANARRDSEIDWVVVCMHQTAISTADDNNG CCCCEEEECCCCCEEECCCCCHHHHHHHHHHHHCCCCCCCCEEEEEEEHHEECCCCCCCC ADLGIRQEWLPLFDQYQVDLVVCGHEHHYERSHPLRGALGTDTRTPIPVDTRSDLIDSTR CCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCC GTVHLVIGGGGTSKPTNALLFPQPRCQVITGVGDFDPAIRRKPSIFVLEDAPWSAFRDRD CEEEEEEECCCCCCCCCEEECCCCCEEEEECCCCCCHHHHCCCCEEEEECCCCHHHCCCC NPYGFVAFDVDPGQPGGTTSIKATYYAVTGPFGGLTVIDQFTLTKPRGG CCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCCEEEEEEEEECCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036