Definition: Mycobacterium tuberculosis H37Ra, complete genome.
Accession: NC_009525
Length: 4,419,977

The map label for this gene is 148659834

Identifier: 148659834

GI number: 148659834

Start: 84106

End: 85341

Strand: Direct

Name: 148659834

Synonym: MRA_0076

Alternate gene names: NA

Gene position: 84106-85341 (Clockwise)

Preceding gene: 148659833

Following gene: 148659835

Centisome position: 1.9
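
The coordinates above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and assumes the centisome position is the start coordinate expressed as a percentage of the genome length. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_419_977
START, END = 84_106, 85_341


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(START, END)
codons = length // 3          # includes the terminal stop codon
residues = codons - 1         # residues in the translated protein

print(length, residues, round(centisome(START, GENOME_LENGTH), 1))
```

Under these assumptions the result agrees with the 1236-base gene sequence, the 411-residue translated protein, and the centisome position of 1.9 given in this record.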

GC content: 66.02

Gene sequence:

>1236_bases
ATGGGCGATCTGAGCATTAGCCAGGTGTCGGCGCGTCCGGGACGGATCGGGATTCGCGCTAGGCAAATGTTCGACGGATA
CCGGTTTCAGCGTGGTCCCGTGCTGGTCGTGGTCGAGGATGGTCGGATCAGCGCGGTCGATTTTGCTGGCTCCGCCTGCC
CCGATATGAACCTGGTTGATCTGGGTGAATCGACTTTGTTGCCGGGTCTGGTGGATGCGCATGCGCATTTGTGCTGGGAC
CCCGACGGTAGGCCAGAGGATTTGGCCGGCGACCCCCATGCGGTGCTGGTGGGACGGGCGCGACGGCACGCCGCGGCCGC
GTTGCGCTCCGGGATCACCACGATTCGCGATCTCGGCGACCGTGACTATGCGGCCTTGGCGCTGCGGGAGGAGTATCGGC
AGAAAACGACGGTGGGGCCGGAACTGGTGGTTTCTGGGCCACCATTGACTCGCAGCGGCGGGCATTGCTGGTTCCTCGGC
GGCGTGGCCGATAGCGTCGAGGAGCTGGTTGATGCGGTGCAGGAGCGGGCCGCGCGGGGAGCGGATTGGATCAAGGTGAT
GGCCACGGGCGGATTCGTTACCACAGCATCCGATCCGTGGCAGCCGCAGTACGGCAGCGGCCAACTGGCCGCGGTGGTGG
CGGCCGCCGAGCAGGTAGGTCTACCGGTGACCGCACATGCACATGCCACCGCAGGGATCGCCGCGGCGGTCGCCGCGGGT
GTTGACGGCATCGAGCACTGCACGTTCTTGAGCGAAGGCAGCGCCGCCGCCAGCCCGGATGTTGTTGAAGCGATTGTTGC
CCAAGGTGTGTGGTGCGGTATGACGATTCCCCGGGTGTATCCGGAGATGCCGGAGAACCTTGTCGCGGTTGTGCAGGATG
GATGGCGAAACATCCGCCGGCTCATCGACGCCGGTGCGCGTGTCGCCCTGTCCACCGACGCTGGAGTCGCCCCGGGCAGA
CGCCATGACGTGCTCCCCGACGATTTGGTGTATCTGTCTCGACACGGGTTCACCAGCACAGAGGTGCTGACCGGCGCCAC
CGCAGCGGCCGCTGCCAGCTGTGGGCTCGGCCACCGCAAGGGTCGCATCGCGCCGGGCTACGACGCTGATCTGCTGGCTG
TTGCGGCAGGTGTGGACCATGACCCCGCCGGACTCTGCGACGTCAAAGCCGTCTGGCGCAGCGGAACCCAGGTACCGCTA
CAAGCATCCGCTGTGGGCTACAACACCCCGTCATAA
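
The GC content field above is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, and the demonstration uses only the first 12 bases of the gene rather than the full sequence. For reference, a value of 66.02 over 1236 bases would correspond to 816 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence above, used as a small demonstration.
print(round(gc_content("ATGGGCGATCTG"), 2))
```

Passing the full 1236-base sequence (with its line breaks) to the same function should give a value that can be compared with the reported GC content.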

Upstream 100 bases:

>100_bases
CGTGCGCTTGCCAACGACTAACCCGGCTTGGCCGGAACTAGCCACTGCCGGGGCAGCGGTGGCGGTTCACACCGCGTGCG
CGTTTGGAGGTCCCTGAGCG

Downstream 100 bases:

>100_bases
CCCCGTCATAAAATGCAGGACAGCATCTTCAATCTGTTGACCGAGGAACAGCTTCGGGGTCGCAACACGCTCAAGTGGAA
CTATTTCGGGCCCGATGTAG

Product: hypothetical protein

Products: NA

Alternate protein names: Amidohydrolase Family Protein; Xaa-Pro Dipeptidase; Secreted Hydrolase; Aryldialkylphosphatase Related Protein; Amidohydrolase Family; Imidazolonepropionase; Aryldialkylphosphatase; Metal-Dependent Amidohydrolase; Amidohydrolase Domain Protein; Imidazolonepropionase Related Amidohydrolase; Metal-Dependent Hydrolase; Amidohydrolase Imidazolonepropionase; Xaa-Arg/Lys Peptidase; Hydrolase; Parathion Hydrolase; Pro-Hyp Dipeptidase; Xaa-Pro Dipeptidase Family; Prolidase

Number of amino acids: Translated: 411; Mature: 410

Protein sequence:

>411_residues
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWD
PDGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLG
GVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAG
VDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGR
RHDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL
QASAVGYNTPS
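
The protein sequence above is the conceptual translation of the gene sequence. The sketch below shows how such a translation can be reproduced, assuming the standard codon assignments (which the bacterial translation table shares for elongation). `translate` is a hypothetical helper, and only the first 24 and last 15 bases of the gene are used here.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":      # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGGCGATCTGAGCATTAGCCAG"))  # start of the gene
print(translate("AACACCCCGTCATAA"))           # end of the gene, including TAA
```

The two fragments translate to the first eight and last four residues of the protein sequence listed above.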

Sequences:

>Translated_411_residues
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWD
PDGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLG
GVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAG
VDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGR
RHDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL
QASAVGYNTPS
>Mature_410_residues
GDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWDP
DGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLGG
VADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAGV
DGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGRR
HDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPLQ
ASAVGYNTPS

Specific function: Unknown

COG id: COG1228

COG function: function code Q; Imidazolonepropionase and related amidohydrolases

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Saccharomyces cerevisiae, GI6322248, Length=317, Percent_Identity=30.2839116719243, Blast_Score=105, Evalue=2e-23

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 42792; Mature: 42660

Theoretical pI: Translated: 5.37; Mature: 5.37

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
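
The percentages above can be reproduced by counting residues in the protein sequence. The following is a sketch under one assumption: the mature protein is the translated protein with only the initiator methionine removed, as the Mature_410_residues sequence in this record suggests. `percent` is a hypothetical helper.

```python
# Translated protein sequence copied from this record (411 residues).
TRANSLATED = (
    "MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVDLGESTLLPGLVDAHAHLCWD"
    "PDGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGDRDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLG"
    "GVADSVEELVDAVQERAARGADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAG"
    "VDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRRLIDAGARVALSTDAGVAPGR"
    "RHDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRKGRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPL"
    "QASAVGYNTPS"
)
# Assumption: the mature form lacks only the initiator Met.
MATURE = TRANSLATED[1:]


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The translated sequence contains 7 Cys and 6 Met residues, which gives the values listed above for both forms.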

Secondary structure:

>Translated Secondary Structure
MGDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVD
CCCCCCHHHCCCCCCCCHHHHHHHCCCEECCCCEEEEECCCCEEEEECCCCCCCCCCEEE
LGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGD
CCCHHCCCCHHHCCCEEEECCCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCC
RDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLGGVADSVEELVDAVQERAARG
CCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHCC
ADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAG
CCEEEEEEECCEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEECCHHHHHHHHHHHHC
VDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR
CCCHHHHEEECCCCCCCCHHHHHHHHHCCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHH
LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRK
HHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHCCHHHHHHHCCCCCCC
GRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPLQASAVGYNTPS
CCCCCCCCCCHHHEEECCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCCC
>Mature Secondary Structure
GDLSISQVSARPGRIGIRARQMFDGYRFQRGPVLVVVEDGRISAVDFAGSACPDMNLVD
CCCCCHHHCCCCCCCCHHHHHHHCCCEECCCCEEEEECCCCEEEEECCCCCCCCCCEEE
LGESTLLPGLVDAHAHLCWDPDGRPEDLAGDPHAVLVGRARRHAAAALRSGITTIRDLGD
CCCHHCCCCHHHCCCEEEECCCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCC
RDYAALALREEYRQKTTVGPELVVSGPPLTRSGGHCWFLGGVADSVEELVDAVQERAARG
CCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHCC
ADWIKVMATGGFVTTASDPWQPQYGSGQLAAVVAAAEQVGLPVTAHAHATAGIAAAVAAG
CCEEEEEEECCEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEECCHHHHHHHHHHHHC
VDGIEHCTFLSEGSAAASPDVVEAIVAQGVWCGMTIPRVYPEMPENLVAVVQDGWRNIRR
CCCHHHHEEECCCCCCCCHHHHHHHHHCCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHH
LIDAGARVALSTDAGVAPGRRHDVLPDDLVYLSRHGFTSTEVLTGATAAAAASCGLGHRK
HHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHCCHHHHHHHCCCCCCC
GRIAPGYDADLLAVAAGVDHDPAGLCDVKAVWRSGTQVPLQASAVGYNTPS
CCCCCCCCCCHHHEEECCCCCCCCCHHHHHHHHCCCCCCEEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA