Definition: Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession: NC_008769
Length: 4,374,522 bp

The map label for this gene is ureC

Identifier: 121637753

GI number: 121637753

Start: 2098920

End: 2100653

Strand: Direct

Name: ureC

Synonym: BCG_1886

Alternate gene names: 121637753

Gene position: 2098920-2100653 (Clockwise)

Preceding gene: 121637752

Following gene: 121637754

Centisome position: 47.98
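
The gene length and centisome position can be recomputed from the coordinates listed above. A minimal Python sketch, assuming 1-based inclusive coordinates and assuming the centisome value is the start coordinate expressed as a percentage of the genome length (the function names are illustrative and not part of the record):

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
length = gene_length(2098920, 2100653)
position = centisome(2098920, 4374522)
print(length, round(position, 2))  # 1734 47.98
```

Under these assumptions the results agree with the 1734-base gene sequence and the centisome position of 47.98 given in the record.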

GC content: 65.8

Gene sequence:

>1734_bases
ATGGCGCGACTGTCAAGGGAGCGCTACGCACAGCTGTACGGACCTACCACCGGCGACCGGATACGGCTGGCCGACACCAA
CCTGCTGGTTGAGGTCACCGAAGACCGGTGTGGGGGACCGGGACTGGCCGGTGACGAGGCGGTGTTCGGCGGCGGCAAGG
TGCTGCGCGAGTCCATGGGCCAGGGCCGTGCGAGCCGGGCCGACGGTGCCCCCGACACCGTGATCACCGGTGCGGTGATC
ATCGACTACTGGGGAATCATCAAGGCCGACATCGGGATTCGCGATGGCCGCATCGTCGGGATCGGAAAGGCCGGCAATCC
CGACATCATGACAGGTGTGCATCGGGATCTCGTCGTCGGGCCGTCCACCGAAATCATCAGCGGCAACCGTCGAATCGTCA
CCGCAGGCACCGTCGACTGTCACGTGCACTTGATCTGTCCGCAGATCATCGTCGAAGCCTTGGCCGCGGGCACCACCACG
ATCATCGGCGGTGGCACCGGACCCGCCGAGGGCACCAAGGCCACCACAGTCACTCCCGGCGAGTGGCACCTGGCCCGGAT
GCTGGAGTCACTGGACGGTTGGCCGGTGAACTTCGCGCTGCTCGGCAAGGGAAACACCGTGAATCCCGACGCACTGTGGG
AACAGTTGCGCGGTGGCGCATCGGGTTTCAAACTCCACGAAGACTGGGGATCGACCCCGGCGGCCATCGACACCTGCTTG
GCGGTCGCCGACGTGGCCGGGGTGCAGGTTGCGCTGCACTCCGACACTCTCAATGAGACCGGATTCGTCGAGGACACCAT
CGGCGCGATCGCCGGACGTTCGATTCACGCCTACCACACCGAGGGCGCCGGCGGCGGGCACGCACCGGACATCATTACCG
TCGCGGCGCAACCGAATGTACTGCCCAGCTCGACCAATCCGACCCGCCCGCATACGGTGAACACCCTTGACGAGCATCTC
GACATGCTGATGGTGTGCCACCACCTCAACCCCCGGATCCCGGAGGACCTCGCGTTTGCCGAAAGCCGGATCCGACCGTC
CACCATTGCGGCAGAAGATGTGTTGCACGATATGGGGGCAATCTCGATGATTGGCAGCGATTCCCAGGCGATGGGCCGTG
TCGGCGAGGTGGTGCTGCGCACCTGGCAGACCGCGCACGTGATGAAAGCCCGCCGCGGGGCACTGGAAGGTGACCCGTCT
GGTAGCCAAGCCGCCGACAACAACCGGGTCCGCCGCTACATCGCCAAATACACCATCTGCCCGGCCATCGCACACGGCAT
GGATCACCTGATCGGTTCGGTGGAGGTGGGAAAGTTGGCCGACCTGGTGTTGTGGGAGCCGGCGTTTTTCGGGGTTCGCC
CGCACGTCGTGCTCAAAGGTGGGGCGATCGCCTGGGCAGCGATGGGCGATGCGAACGCGTCAATCCCGACCCCGCAACCG
GTGCTCCCGCGACCGATGTTCGGCGCGGCCGCGGCAACCGCGGCGGCGACCTCGGTGCACTTCGTCGCGCCGCAATCCAT
CGACGCGCGCCTGGCGGACCGGCTCGCGGTCAATCGGGGACTAGCGCCGGTGGCCGACGTGCGCGCAGTGGGCAAGACCG
ACCTGCCGCTCAATGATGCCCTACCGAGCATCGAGGTCGATCCCGACACCTTCACCGTGCGAATCGACGGCCAGGTGTGG
CAACCGCAGCCGGCCGCCGAACTACCTATGACACAACGGTATTTCCTGTTCTAA
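
The GC content field can be checked against the gene sequence above. A minimal Python sketch, assuming GC content is the percentage of G and C bases over the whole sequence, with line breaks ignored (`gc_content` is an illustrative name):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(sequence.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input; paste the full 1734-base sequence to check the record.
print(gc_content("ATGC"))  # 50.0
```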

Upstream 100 bases:

>100_bases
TCGAGCCGGGCATTCCCCAAATCGTCGGGTTGGTTCCGTTGGGCGGACGGCGCGAGGTACCCGGTCTGACGCTAAATCCG
CCCGGACGGTTGGACCGCTG

Downstream 100 bases:

>100_bases
TGACCTCGCTGGCCGTGCTGCTCACCCTCGCCGACTCGCGGCTGCCCACGGGTGCGCACGTGCACTCGGGCGGCATCGAA
GAAGCCATCGCCGCCGGCTT
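
The coordinates of the 100-base flanking windows are not listed, but they can be derived from the gene position. A minimal Python sketch, assuming a direct-strand gene, 1-based inclusive coordinates, and windows that do not wrap around the origin of the circular genome (`flanking_windows` is an illustrative name):

```python
def flanking_windows(start: int, end: int, flank: int = 100):
    """Return (upstream, downstream) coordinate pairs for a direct-strand gene."""
    upstream = (start - flank, start - 1)   # bases immediately before the start
    downstream = (end + 1, end + flank)     # bases immediately after the end
    return upstream, downstream


up, down = flanking_windows(2098920, 2100653)
print(up, down)  # (2098820, 2098919) (2100654, 2100753)
```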

Product: urease subunit alpha

Products: NA

Alternate protein names: Urea amidohydrolase subunit alpha

Number of amino acids: Translated: 577; Mature: 576

Protein sequence:

>577_residues
MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVI
IDYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTT
IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL
AVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHL
DMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPS
GSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQP
VLPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVW
QPQPAAELPMTQRYFLF
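
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal Python sketch, assuming the standard genetic code (which shares its codon assignments with the bacterial table) and translation that stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGGCGCGACTGTCAAGG"))  # MARLSR
```

The 1734 bases give 578 codons; the last one (TAA) is a stop codon, which is consistent with the 577 residues reported.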

Sequences:

>Translated_577_residues
MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVI
IDYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTT
IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL
AVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHL
DMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPS
GSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQP
VLPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVW
QPQPAAELPMTQRYFLF
>Mature_576_residues
ARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVII
DYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTTI
IGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCLA
VADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHLD
MLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPSG
SQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQPV
LPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVWQ
PQPAAELPMTQRYFLF
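
The mature sequence above appears to be the translated sequence with the initiator methionine removed (577 versus 576 residues). A minimal Python sketch of that relationship; `remove_initiator_met` is an illustrative name, and the rule itself is inferred from the two listings rather than stated in the record:

```python
def remove_initiator_met(translated: str) -> str:
    """Drop a leading methionine, if present, to obtain the mature sequence."""
    return translated[1:] if translated.startswith("M") else translated


translated_start = "MARLSRERYAQLYGPTTGDR"  # first 20 translated residues
print(remove_initiator_met(translated_start))  # ARLSRERYAQLYGPTTGDR

# Difference between the translated and mature molecular weights in this record.
print(60826 - 60695)  # 131
```

The 131 Da difference matches the mass of one methionine residue (about 131.2 Da), which supports this reading.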

Specific function: Unknown

COG id: COG0804

COG function: function code E; Urea amidohydrolase (urease) alpha subunit

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 urease domain

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): URE1_MYCBO (P0A661)

Other databases:

- EMBL:   BX248340
- RefSeq:   NP_855533.1
- ProteinModelPortal:   P0A661
- SMR:   P0A661
- MEROPS:   M38.982
- EnsemblBacteria:   EBMYCT00000015447
- GeneID:   1093004
- GenomeReviews:   BX248333_GR
- KEGG:   mbo:Mb1881
- GeneTree:   EBGT00050000017361
- HOGENOM:   HBG357507
- OMA:   TIHAFHT
- ProtClustDB:   PRK13206
- BioCyc:   MBOV233413:MB1881-MONOMER
- BRENDA:   3.5.1.5
- GO:   GO:0005737
- HAMAP:   MF_01953
- InterPro:   IPR006680
- InterPro:   IPR011059
- InterPro:   IPR011612
- InterPro:   IPR005848
- InterPro:   IPR017951
- InterPro:   IPR017950
- PRINTS:   PR01752
- TIGRFAMs:   TIGR01792

Pfam domain/function: PF01979 Amidohydro_1; PF00449 Urease_alpha; SSF51338 Metalo_hydrolase

EC number: 3.5.1.5

Molecular weight: Translated: 60826; Mature: 60695

Theoretical pI: Translated: 5.78; Mature: 5.78

Prosite motif: PS01120 UREASE_1; PS00145 UREASE_2; PS51368 UREASE_3; PS00403 UTEROGLOBIN_1

Important sites: ACT_SITE 327-327; BINDING 226-226
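
The site positions can be mapped onto the 80-residue lines of the protein sequence listing. A minimal Python sketch, assuming the positions are 1-based and refer to the translated (577-residue) sequence rather than the mature one; `locate` is an illustrative name, and the two strings are lines 3 and 5 of the translated listing:

```python
def locate(position: int, width: int = 80):
    """Map a 1-based residue position to (line number, column), both 1-based."""
    line_index, column_index = divmod(position - 1, width)
    return line_index + 1, column_index + 1


# Lines 3 and 5 of the translated protein sequence in this record.
LINES = {
    3: "IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL",
    5: "DMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPS",
}

for label, position in (("ACT_SITE", 327), ("BINDING", 226)):
    line, column = locate(position)
    print(label, position, (line, column), LINES[line][column - 1])
```

Under this numbering assumption both positions fall on histidine residues.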

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
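
These percentages can be recomputed from either protein sequence. A minimal Python sketch, assuming each value is the residue count divided by the sequence length and rounded only for display (`cys_met_content` is an illustrative name):

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M) and their sum in a protein sequence."""
    seq = "".join(protein.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# Small illustrative input; paste a full sequence from the record to check it.
print(cys_met_content("MCAA"))  # {'Cys': 25.0, 'Met': 25.0, 'Cys+Met': 50.0}
```

Summing before rounding would explain why the translated Cys+Met value (3.5) is not the sum of the rounded Cys and Met values (1.0 and 2.4).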

Secondary structure:

>Translated Secondary Structure
MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMG
CCCCCHHHHHHHCCCCCCCEEEEECCEEEEEECCCCCCCCCCCCCCEEECCHHHHHHHHC
QGRASRADGAPDTVITGAVIIDYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVG
CCCCCCCCCCCCCEEEHHHHEEHHHHEEECCCCCCCEEEEECCCCCCHHHHCCCCCEEEC
PSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTTIIGGGTGPAEGTKATTVTPG
CCCCEECCCCEEEEECCEEEEEEEECHHHHHHHHHCCCEEEEECCCCCCCCCEEEEECCC
EWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL
CHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHCCCCCCEEECCCCCCHHHHHHHH
AVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNV
HHHHHCCEEEEEECCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCCCCEEEEECCCCC
LPSSTNPTRPHTVNTLDEHLDMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGA
CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCC
ISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPSGSQAADNNRVRRYIAKYTIC
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH
PAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQP
HHHHHHHHHHHCCCCHHHHHHHEEECCHHCCCCCEEEEECCEEEEEEECCCCCCCCCCCC
VLPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDA
CCCCCCHHHHHHHHHHCEEEEECCCCHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCCCCC
LPSIEVDPDTFTVRIDGQVWQPQPAAELPMTQRYFLF
CCCEEECCCEEEEEECCEEECCCCCCCCCCCCCCCCC
>Mature Secondary Structure
ARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMG
CCCCHHHHHHHCCCCCCCEEEEECCEEEEEECCCCCCCCCCCCCCEEECCHHHHHHHHC
QGRASRADGAPDTVITGAVIIDYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVG
CCCCCCCCCCCCCEEEHHHHEEHHHHEEECCCCCCCEEEEECCCCCCHHHHCCCCCEEEC
PSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTTIIGGGTGPAEGTKATTVTPG
CCCCEECCCCEEEEECCEEEEEEEECHHHHHHHHHCCCEEEEECCCCCCCCCEEEEECCC
EWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL
CHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHCCCCCCEEECCCCCCHHHHHHHH
AVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNV
HHHHHCCEEEEEECCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCCCCEEEEECCCCC
LPSSTNPTRPHTVNTLDEHLDMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGA
CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCC
ISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPSGSQAADNNRVRRYIAKYTIC
EEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH
PAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQP
HHHHHHHHHHHCCCCHHHHHHHEEECCHHCCCCCEEEEECCEEEEEEECCCCCCCCCCCC
VLPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDA
CCCCCCHHHHHHHHHHCEEEEECCCCHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCCCCC
LPSIEVDPDTFTVRIDGQVWQPQPAAELPMTQRYFLF
CCCEEECCCEEEEEECCEEECCCCCCCCCCCCCCCCC
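
The prediction strings can be summarised as the share of each state, which gives context for the "Alpha Beta" structure class below. A minimal Python sketch, assuming the common three-state notation (H = helix, E = strand, C = coil), which the record itself does not define; `ss_composition` is an illustrative name:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    cleaned = "".join(states.split()).upper()  # drop line breaks and spaces
    if not cleaned:
        raise ValueError("empty prediction string")
    counts = Counter(cleaned)
    return {state: 100.0 * counts.get(state, 0) / len(cleaned) for state in "HEC"}


# Small illustrative input; pass the concatenated prediction lines to check the record.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```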

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12788972