Definition: Mycobacterium tuberculosis H37Ra, complete genome.
Accession: NC_009525
Length: 4,419,977 bp

The map label for this gene is 148659865

Identifier: 148659865

GI number: 148659865

Start: 119072

End: 121057

Strand: Direct

Name: 148659865

Synonym: MRA_0107

Alternate gene names: NA

Gene position: 119072-121057 (Clockwise)

Preceding gene: 148659864

Following gene: 148659867

Centisome position: 2.69
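
The gene length and centisome position can be reproduced from the coordinates above. The sketch below assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state the formula, but this assumption reproduces the reported value. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_419_977  # length of NC_009525 in bases
GENE_START = 119_072       # 1-based, inclusive
GENE_END = 121_057         # 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))                # 1986
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 2.69
```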

GC content: 65.06
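
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch follows, assuming that definition is the one used for this record; `gc_content` is a hypothetical helper name, and the short test string is just the first six codons of the gene sequence listed below.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 18 bases of the gene sequence in this record.
first_codons = "ATGGGGACGCACGGGGCT"
print(round(gc_content(first_codons), 2))
```

Passing the full 1986-base gene sequence (with line breaks removed) to the same function would give the value to compare against the reported 65.06.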

Gene sequence:

>1986_bases
ATGGGGACGCACGGGGCTACCAAGAGTGCGACGTCGGCTGTGCCAACGCCCCGGTCGAACTCCATGGCGATGGTACGGCT
GGCAATTGGCCTGCTGGGTGTGTGCGCGGTGGTCGCGGCCTTCGGGCTGGTGTCGGGAGCGCGCCGCTACGCTGAGGCCG
GCAATCCCTATCCGGGCGCCTTCGTCAGCGTCGCCGAGCCGGTCGGGTTCTTCGCCGCGTCGCTGGCCGGTGCGCTGTGT
CTGGGCGCGCTGATCCACGTGGTCATGACGGCCAAACCCGAGCCGGATGGCTTAATCGACGCCGCGGCGTTCCGGATTCA
CCTGCTGGCAGAACGTGTTTCAGGTCTCTGGTTGGGGCTAGCCGCGACCATGGTGGTCATTCAGGCCGCCCACGATACTG
GAGTGGGGCCCGCGAGACTGCTGGCTAGTGGGGCACTATCGGACTCCGTCGCCGCCTCCGAGATGGCACGCGGGTGGATT
GTTGCGGCGATCTGCGCGCTGGTGGTTGCGACGGCGCTGCGGCTGTACACTCGCTGGCTCGGGCACGTTGTGCTGCTTGT
CCCCACTGTGCTTGCCGTCGTCGCCACCGCGGTGACCGGTAACCCGGGACAGGGACCCGACCATGACTACGCGACCAGCG
CCGCGATCGTGTTCGCGGTCGCGTTCGCCACCTTGACCGGGCTCAAGATCGCTGCGGCGTTGGCGGGAACGACGCCAAGC
CGCGCTGTGCTGGTAACGCAGGTCACCTGTGGAGCGCTCGCGTTGGCATACGGAGCGATGCTGCTTTATCTCTTCATCCC
GGGCTGGGCGGTCGATTCGGATTTTGCCCGCCTTGGTCTGCTTGCGGGGGTAATCCTGACGTCGGTGTGGTTGTTTGACT
GCTGGCGGCTGTTGGTCAGGCCGCCACATGCGGGCCGTCGCCGCGGTGGTGGCTCCGGTGCCGCACTGGCCATGATGGCC
GCCATGGCTTCGATAGCTGCCATGGCCGTTATGACCGCGCCGCGATTTCTCACCCACGCGTTCACGGCTTGGGATGTCTT
CCTCGGCTATGAACTGCCGCAACCGCCGACCATAGCCCGGGTGCTCACCGTGTGGCGCTTCGATAGCCTGATCGGAGCCG
CTGGTGTGGTTCTCGCGATCGGGTATGCGGCGGGCTTCGCCGCGCTGCGGCGCCGAGGTAACTCTTGGCCGGTGGGCAGA
TTGATCGCCTGGCTGACTGGTTGCGCCGCACTGGTATTCACCAGCGGCTCCGGTGTACGGGCCTATGGTTCGGCGATGTT
CAGCGTCCACATGGCCGAACACATGACACTGAACATGTTCATCCCGGTCCTGTTGGTGCTCGGTGGCCCGGTCACGCTGG
CGCTGCGGGTGCTGCCGGTAACGGGTGATGGACGGCCGCCGGGGGCTCGCGAATGGCTGACCTGGCTGCTGCACTCCCGG
GTGACAACTTTCCTGTCGCACCCGATCACCGCATTCGTCCTCTTTGTGGCCTCGCCCTATATCGTCTATTTCACACCGCT
GTTCGATACCTTCGTCCGCTATCACTGGGGCCACGAGTTCATGGCGATCCATTTCCTGGTGGTCGGGTACTTGTTCTACT
GGGCGATCATCGGCATCGACCCAGGGCCGCGCCGACTGCCCTACCCGGGCCGGATCGGGCTGTTGTTCGCGGTGATGCCG
TTCCACGCCTTCTTCGGGATCGCGCTGATGACGATGTCGTCTACGGTGGGCGCTACGTTCTATCGTTCCGTCAATCTGCC
GTGGTTGTCGAGCATCATCGCCGACCAGCATCTCGGCGGTGGAATTGCTTGGAGCCTAACGGAATTGCCGGTCATCATGG
TCATCGTGGCGCTGGTTACCCAATGGGCGCGCCAAGACCGCCGAGTCGCGTCCCGCGAAGACCGGCATGCCGACAGCGAC
TACGCCGACGACGAGCTGGAAGCCTACAACGCGATGCTTCGCGAGTTGTCGCGAATGCGGCGCTGA

Upstream 100 bases:

>100_bases
GAGCAGAATATGTGGTTGATGGCCACTAGGCCGGTACCGGGGAACTGGCGGTTCCCGGCCGATGAGCATCGGCCCTGACG
CGCGGCCGTAAGCTCCAGGA

Downstream 100 bases:

>100_bases
ATGTGCAGATGATTTTGGAAGCGGTTGGCGTATCTGCCCGTGCTCGGCTACACCAGGACCGCGGGGCGCTGGCACGCGAA
CGATCCGGCGAGGAGGTGGG

Product: putative integral membrane protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 661; Mature: 660
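
The two residue counts follow from the gene length. The sketch below assumes that the final codon is a stop codon (the gene sequence ends in TGA) and that the mature form differs only by removal of the initiator methionine, which matches the mature sequence listed below.

```python
GENE_LENGTH_BASES = 1986  # from the ">1986_bases" header above

# 1986 bases form 662 complete codons with no bases left over.
codons, remainder = divmod(GENE_LENGTH_BASES, 3)

# The stop codon does not encode an amino acid.
translated_residues = codons - 1

# Assumption: the mature protein lacks only the initiator Met.
mature_residues = translated_residues - 1

print(codons, remainder, translated_residues, mature_residues)  # 662 0 661 660
```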

Protein sequence:

>661_residues
MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALC
LGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI
VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS
RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMA
AMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGR
LIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR
VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMP
FHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSD
YADDELEAYNAMLRELSRMRR
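
The protein sequence above can be checked against the gene sequence by translating it with the standard genetic code. This is a minimal sketch, not the tool used to build the record; `translate` and `CODON_TABLE` are illustrative names, and unknown codons are reported as `X`.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First six and last four codons of the gene sequence in this record.
print(translate("ATGGGGACGCACGGGGCT"))  # MGTHGA
print(translate("ATGCGGCGCTGA"))        # MRR
```

Both fragments agree with the start (MGTHGA) and end (MRR) of the 661-residue sequence.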

Sequences:

>Translated_661_residues
MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALC
LGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI
VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS
RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMA
AMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGR
LIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR
VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMP
FHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSD
YADDELEAYNAMLRELSRMRR
>Mature_660_residues
GTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALCL
GALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIV
AAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPSR
AVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMAA
MASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRL
IAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSRV
TTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMPF
HAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDY
ADDELEAYNAMLRELSRMRR

Specific function: Unknown

COG id: COG3336

COG function: function code S; Predicted membrane protein

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential)

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: To M. leprae ML1998

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): P64689 (Y102_MYCTU)

Other databases:

- EMBL:   BX842572
- EMBL:   AE000516
- PIR:   F70751
- RefSeq:   NP_214616.1
- RefSeq:   NP_334519.1
- ProteinModelPortal:   P64689
- EnsemblBacteria:   EBMYCT00000003504
- EnsemblBacteria:   EBMYCT00000069269
- GeneID:   886926
- GeneID:   922960
- GenomeReviews:   AE000516_GR
- GenomeReviews:   AL123456_GR
- KEGG:   mtc:MT0111
- KEGG:   mtu:Rv0102
- TIGR:   MT0111
- TubercuList:   Rv0102
- GeneTree:   EBGT00050000017144
- HOGENOM:   HBG579710
- OMA:   ATVGNPY
- ProtClustDB:   CLSK790265
- GO:   GO:0040007
- InterPro:   IPR008457
- InterPro:   IPR019108

Pfam domain/function: PF09678 Caa3_CtaG; PF05425 CopD

EC number: NA

Molecular weight: Translated: 70357; Mature: 70226
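
A molecular weight of this kind is usually estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that approach; the record does not say which method or mass table was used, so small differences from the reported values are possible. The masses are standard average residue masses in daltons, and `molecular_weight` is a hypothetical helper name.

```python
# Average residue masses (Da), i.e. amino acid mass minus one water.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER_MASS


# The reported values differ by one Met residue, as expected if the
# mature form lacks only the initiator methionine.
print(70357 - 70226, round(RESIDUE_MASS["M"]))  # 131 131
```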

Theoretical pI: Translated: 9.60; Mature: 9.60

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

16 predicted regions; coordinates unavailable

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)
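
These percentages are residue counts divided by sequence length. A minimal sketch of that calculation follows, assuming plain counting over the one-letter sequence; `residue_percent` is an illustrative name, and the example uses a toy peptide rather than the full protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


toy = "MACM"  # toy peptide, not taken from the record
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Applied to the translated and mature sequences, the Met percentage differs because the mature form has one fewer methionine and one fewer residue overall.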

Secondary structure:

>Translated Secondary Structure
MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGA
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
FVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
AATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIVAAICALVVATALRLYTRWL
HHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVR
CEEEEEHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC
PPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIAR
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH
VLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRLIAWLTGCAALVFTSGSGVR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCH
AYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH
VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGID
HHHHHHHHHHHHHHHHHCCHHEEEHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC
PGPRRLPYPGRIGLLFAVMPFHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGG
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC
GIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDYADDELEAYNAMLRELSRMR
CCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC
R
H
>Mature Secondary Structure
GTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGA
CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
FVSVAEPVGFFAASLAGALCLGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
AATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWIVAAICALVVATALRLYTRWL
HHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVR
CEEEEEHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC
PPHAGRRRGGGSGAALAMMAAMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIAR
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH
VLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGRLIAWLTGCAALVFTSGSGVR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCH
AYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH
VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGID
HHHHHHHHHHHHHHHHHCCHHEEEHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC
PGPRRLPYPGRIGLLFAVMPFHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGG
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC
GIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSDYADDELEAYNAMLRELSRMR
CCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC
R
H

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9634230; 12218036