| Definition | Mycobacterium tuberculosis H37Ra, complete genome. |
|---|---|
| Accession | NC_009525 |
| Length | 4,419,977 |
The map label for this gene is 148659865
Identifier: 148659865
GI number: 148659865
Start: 119072
End: 121057
Strand: Direct
Name: 148659865
Synonym: MRA_0107
Alternate gene names: NA
Gene position: 119072-121057 (Clockwise)
Preceding gene: 148659864
Following gene: 148659867
Centisome position: 2.69
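The centisome value appears consistent with the gene's start coordinate expressed as a percentage of the genome length. The following is a minimal sketch under that assumption; the record does not state its formula, and `centisome_position` is a hypothetical helper name.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length, rounded to 2 places.

    Assumption: the record derives the centisome position from the start
    coordinate; the source does not state the formula it uses.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * start / genome_length, 2)


# Values taken from this record: Start 119072, genome Length 4,419,977.
print(centisome_position(119072, 4_419_977))
```

With the start coordinate this gives 2.69, matching the listed value; using the end coordinate instead would give 2.74.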
GC content: 65.06
Gene sequence:
>1986_bases ATGGGGACGCACGGGGCTACCAAGAGTGCGACGTCGGCTGTGCCAACGCCCCGGTCGAACTCCATGGCGATGGTACGGCT GGCAATTGGCCTGCTGGGTGTGTGCGCGGTGGTCGCGGCCTTCGGGCTGGTGTCGGGAGCGCGCCGCTACGCTGAGGCCG GCAATCCCTATCCGGGCGCCTTCGTCAGCGTCGCCGAGCCGGTCGGGTTCTTCGCCGCGTCGCTGGCCGGTGCGCTGTGT CTGGGCGCGCTGATCCACGTGGTCATGACGGCCAAACCCGAGCCGGATGGCTTAATCGACGCCGCGGCGTTCCGGATTCA CCTGCTGGCAGAACGTGTTTCAGGTCTCTGGTTGGGGCTAGCCGCGACCATGGTGGTCATTCAGGCCGCCCACGATACTG GAGTGGGGCCCGCGAGACTGCTGGCTAGTGGGGCACTATCGGACTCCGTCGCCGCCTCCGAGATGGCACGCGGGTGGATT GTTGCGGCGATCTGCGCGCTGGTGGTTGCGACGGCGCTGCGGCTGTACACTCGCTGGCTCGGGCACGTTGTGCTGCTTGT CCCCACTGTGCTTGCCGTCGTCGCCACCGCGGTGACCGGTAACCCGGGACAGGGACCCGACCATGACTACGCGACCAGCG CCGCGATCGTGTTCGCGGTCGCGTTCGCCACCTTGACCGGGCTCAAGATCGCTGCGGCGTTGGCGGGAACGACGCCAAGC CGCGCTGTGCTGGTAACGCAGGTCACCTGTGGAGCGCTCGCGTTGGCATACGGAGCGATGCTGCTTTATCTCTTCATCCC GGGCTGGGCGGTCGATTCGGATTTTGCCCGCCTTGGTCTGCTTGCGGGGGTAATCCTGACGTCGGTGTGGTTGTTTGACT GCTGGCGGCTGTTGGTCAGGCCGCCACATGCGGGCCGTCGCCGCGGTGGTGGCTCCGGTGCCGCACTGGCCATGATGGCC GCCATGGCTTCGATAGCTGCCATGGCCGTTATGACCGCGCCGCGATTTCTCACCCACGCGTTCACGGCTTGGGATGTCTT CCTCGGCTATGAACTGCCGCAACCGCCGACCATAGCCCGGGTGCTCACCGTGTGGCGCTTCGATAGCCTGATCGGAGCCG CTGGTGTGGTTCTCGCGATCGGGTATGCGGCGGGCTTCGCCGCGCTGCGGCGCCGAGGTAACTCTTGGCCGGTGGGCAGA TTGATCGCCTGGCTGACTGGTTGCGCCGCACTGGTATTCACCAGCGGCTCCGGTGTACGGGCCTATGGTTCGGCGATGTT CAGCGTCCACATGGCCGAACACATGACACTGAACATGTTCATCCCGGTCCTGTTGGTGCTCGGTGGCCCGGTCACGCTGG CGCTGCGGGTGCTGCCGGTAACGGGTGATGGACGGCCGCCGGGGGCTCGCGAATGGCTGACCTGGCTGCTGCACTCCCGG GTGACAACTTTCCTGTCGCACCCGATCACCGCATTCGTCCTCTTTGTGGCCTCGCCCTATATCGTCTATTTCACACCGCT GTTCGATACCTTCGTCCGCTATCACTGGGGCCACGAGTTCATGGCGATCCATTTCCTGGTGGTCGGGTACTTGTTCTACT GGGCGATCATCGGCATCGACCCAGGGCCGCGCCGACTGCCCTACCCGGGCCGGATCGGGCTGTTGTTCGCGGTGATGCCG TTCCACGCCTTCTTCGGGATCGCGCTGATGACGATGTCGTCTACGGTGGGCGCTACGTTCTATCGTTCCGTCAATCTGCC GTGGTTGTCGAGCATCATCGCCGACCAGCATCTCGGCGGTGGAATTGCTTGGAGCCTAACGGAATTGCCGGTCATCATGG TCATCGTGGCGCTGGTTACCCAATGGGCGCGCCAAGACCGCCGAGTCGCGTCCCGCGAAGACCGGCATGCCGACAGCGAC 
TACGCCGACGACGAGCTGGAAGCCTACAACGCGATGCTTCGCGAGTTGTCGCGAATGCGGCGCTGA
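The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as it appears in this record, that is, an optional `>`-prefixed header token followed by whitespace-separated blocks of bases. `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(record: str) -> float:
    """Percentage of G and C bases in a nucleotide record, rounded to 2 places.

    Assumption: tokens starting with '>' are FASTA-style headers and are
    skipped; all remaining whitespace-separated tokens are sequence blocks.
    """
    tokens = [t for t in record.split() if not t.startswith(">")]
    bases = "".join(tokens).upper()
    if not bases:
        raise ValueError("record contains no bases")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


# Toy input for illustration only (not the gene above): 2 of 4 bases are G/C.
print(gc_content(">4_bases AT GC"))
```

Passing the full 1986-base record to `gc_content` would allow a check against the listed 65.06.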
Upstream 100 bases:
>100_bases GAGCAGAATATGTGGTTGATGGCCACTAGGCCGGTACCGGGGAACTGGCGGTTCCCGGCCGATGAGCATCGGCCCTGACG CGCGGCCGTAAGCTCCAGGA
Downstream 100 bases:
>100_bases ATGTGCAGATGATTTTGGAAGCGGTTGGCGTATCTGCCCGTGCTCGGCTACACCAGGACCGCGGGGCGCTGGCACGCGAA CGATCCGGCGAGGAGGTGGG
Product: putative integral membrane protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 661; Mature: 660
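The coordinate, base and residue counts in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates, a single terminal stop codon, and removal of the initiator methionine in the mature form. The last assumption is inferred from the mature sequence in this record, which lacks the leading M. The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive 1-based coordinates."""
    return end - start + 1


def translated_residues(n_bases: int) -> int:
    """Residues encoded by a coding sequence that ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


n_bases = gene_length(119072, 121057)      # Start and End from this record
translated = translated_residues(n_bases)  # full translation
mature = translated - 1                    # assumes initiator Met is removed
print(n_bases, translated, mature)
```

This prints `1986 661 660`, in agreement with the sequence header and the translated and mature counts listed in the record.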
Protein sequence:
>661_residues MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALC LGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMA AMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGR LIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMP FHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSD YADDELEAYNAMLRELSRMRR
Sequences:
>Translated_661_residues MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALC LGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMA AMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGR LIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMP FHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSD YADDELEAYNAMLRELSRMRR >Mature_660_residues GTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALCL GALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIV AAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPSR AVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMAA MASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRL IAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSRV TTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMPF HAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDY ADDELEAYNAMLRELSRMRR
Specific function: Unknown
COG id: COG3336
COG function: function code S; Predicted membrane protein
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential)
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: To M.leprae ML1998
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y102_MYCTU (P64689)
Other databases:
- EMBL: BX842572
- EMBL: AE000516
- PIR: F70751
- RefSeq: NP_214616.1
- RefSeq: NP_334519.1
- ProteinModelPortal: P64689
- EnsemblBacteria: EBMYCT00000003504
- EnsemblBacteria: EBMYCT00000069269
- GeneID: 886926
- GeneID: 922960
- GenomeReviews: AE000516_GR
- GenomeReviews: AL123456_GR
- KEGG: mtc:MT0111
- KEGG: mtu:Rv0102
- TIGR: MT0111
- TubercuList: Rv0102
- GeneTree: EBGT00050000017144
- HOGENOM: HBG579710
- OMA: ATVGNPY
- ProtClustDB: CLSK790265
- GO: GO:0040007
- InterPro: IPR008457
- InterPro: IPR019108
Pfam domain/function: PF09678 Caa3_CtaG; PF05425 CopD
EC number: NA
Molecular weight: Translated: 70357; Mature: 70226
Theoretical pI: Translated: 9.60; Mature: 9.60
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
16 regions listed; coordinates not available in this record
Cys/Met content:
0.9 %Cys (Translated Protein)
3.5 %Met (Translated Protein)
4.4 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
3.3 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
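The Cys/Met percentages are residue counts divided by protein length. The sketch below assumes rounding to one decimal place, as the listed values suggest; `residue_percent` is a hypothetical helper name. The demonstration uses a synthetic 661-residue string with 6 Cys and 23 Met, which are the counts obtained by tallying the translated sequence in this record, with alanine as a filler. It is not the real protein.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(seq), 1)


# Synthetic stand-in with the same composition as the translated protein:
# 6 Cys + 23 Met + 632 filler residues = 661 residues.
translated = "C" * 6 + "M" * 23 + "A" * 632
mature = translated.replace("M", "", 1)  # assumes one initiator Met is removed

for label, seq in (("Translated", translated), ("Mature", mature)):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```

For these counts the sketch yields 0.9, 3.5 and 4.4 for the translated protein, and 0.9, 3.3 and 4.2 for the mature protein, matching the listed values.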
Secondary structure:
>Translated Secondary Structure MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGA CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC FVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH AATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIVAAICALVVATALRLYTRWL HHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH GHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVR CEEEEEHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC PPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIAR CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH VLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRLIAWLTGCAALVFTSGSGVR HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCH AYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGID HHHHHHHHHHHHHHHHHCCHHEEEHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC PGPRRLPYPGRIGLLFAVMPFHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGG CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC GIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDYADDELEAYNAMLRELSRMR CCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC R H >Mature Secondary Structure GTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGA CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC FVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH AATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIVAAICALVVATALRLYTRWL HHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH GHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVR 
CEEEEEHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC PPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIAR CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH VLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRLIAWLTGCAALVFTSGSGVR HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCH AYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGID HHHHHHHHHHHHHHHHHCCHHEEEHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC PGPRRLPYPGRIGLLFAVMPFHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGG CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC GIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDYADDELEAYNAMLRELSRMR CCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC R H
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036