Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is ureC

Identifier: 15889672

GI number: 15889672

Start: 2373013

End: 2374722

Strand: Reverse

Name: ureC

Synonym: Atu2401

Alternate gene names: 15889672

Gene position: 2374722-2373013 (Counterclockwise)

Preceding gene: 159185198

Following gene: 159185197

Centisome position: 83.57
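
The coordinate fields above are mutually consistent, which a short Python check can illustrate. This is a sketch rather than the database's documented method. It assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene's 5' end (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The variable names are illustrative.

```python
# Values copied from the record above.
CHROMOSOME_LENGTH = 2_841_580
START, END = 2_373_013, 2_374_722

# Assumption: coordinates are 1-based and inclusive.
gene_length = END - START + 1

# Assumption: centisome = 100 * (5' end of the gene) / chromosome length.
# For a reverse-strand gene the 5' end is the larger coordinate (END).
centisome = round(100 * END / CHROMOSOME_LENGTH, 2)

print(gene_length, centisome)
```

Under these assumptions the check reproduces the 1,710-base gene length and the centisome value of 83.57 listed in the record.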

GC content: 61.52

Gene sequence:

>1710_bases
ATGCCTTACAAGATTTCCCGCGCCGCTTATGCCGGCATGTTCGGCCCGACCGTTGGCGACAAGGTGCGGCTTGCCGATAC
CGAACTCTTCATCGAGATCGAGAAGGACCACACGACCTATGGCGAGGAAGTGAAATTCGGCGGCGGCAAGGTCATTCGCG
ACGGCATGGGCCAGAGCCAGGCAACGCGGGCCGAAGGGGCTGTCGATACCGTCATCACCAATGCCGTCATCGTCGACCAC
AGCGGCATCTACAAGGCGGATGTGGGCCTGAAGAACGGCCGCATCCACGCCATCGGCAAGGCTGGCAACCCGGATACGCA
GCCGGGTGTGACGATCATCGTCGGCCCCTCGACGGAGGCAATTGCCGGTGAAGGCAAGATACTAACCGCCGGCGGCATGG
ACGCGCATATCCATTACATCTGCCCGCAGCAGATCGAGGAAGCGCTGATGAGCGGCGTGACGTGCATGCTGGGCGGCGGT
TCGGGCCCCGCGCATGGCACGCTCGCCACCACCTGCACCGGCGCGTGGCACATCGAGCGCATGATCGAAAGCTTCGACGC
TTTCCCGATGAACCTCGCACTCGCGGGTAAAGGCAATGCCTCGCTGCCCGCACCGCTGGAAGAAATGATCCTTGCCGGCG
CTTCCTCGCTGAAGCTGCATGAGGACTGGGGCACGACACCCGCCGCCATCGACAATTGCCTGACGGTGGCCGATGAATAT
GATGTGCAGGTGATGATCCACACCGACACGCTGAATGAAAGCGGTTTCGTCGAAGACACCGTCGCCGCCATCCGGGGCCG
CACCATCCATGCCTTCCACACCGAAGGTGCGGGTGGCGGGCACGCGCCTGATATCATCAAGGTCTGCGGCAACCCGAACG
TCATTCCGTCCTCCACCAACCCGACGCGGCCCTATACCGTCAATACACTTGCCGAACATCTGGACATGCTGATGGTGTGT
CATCACCTGTCGCCGTCCATTCCTGAGGATATTGCCTTCGCCGAAAGCCGCATCCGCAAGGAAACCATTGCGGCGGAAGA
TATTCTCCACGATATCGGCGCGTTTTCGATCATCTCGTCGGACAGCCAGGCCATGGGCCGCGTGGGCGAAGTGGCGATCC
GCACCTGGCAGACCGCCGACAAGATGAAGCGCCAGCGCGGCCGTCTGAAGGAGGAGACGGGCGAAAACGACAATTTCCGG
GTGCGCCGTTACATCGCCAAATATACCATCAACCCGGCCATTGCCCAGGGTGTCAGCCACGAGATCGGCTCGGTCGAAGT
CGGCAAGCGCGCCGATCTTGTCTTGTGGAACCCGGCCTTTTTCGGCGTGAAGCCGGAAATGGTGCTGCTTGGCGGTTCGA
TTGCGGCAGCCCCGATGGGTGATCCGAATGCCTCCATTCCCACACCGCAGCCGATGCACTACCGGCCGATGTTTGCCGCC
TACGGCAAGCTGCGCACCAATTCCTCGGTCACTTTCGTGTCGCAGGCGTCGCTGGATGGTGGCCTTGCCCAGCGCCTCGG
CGTTGCCAAGAAGCTGCTGGCGGTGAAGAATGTGCGTGGCGGCATTTCCAAGGCGTCGATGATCCACAATTCGCTCACCC
CGCATATCGAGGTCGATCCCGAGACTTATGAGGTGCGCGCGGATGGCGAGTTGCTGACCTGCGAACCGGCGACTGTGCTG
CCGATGGCGCAGCGTTATTTCCTGTTTTAA
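
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the function name `gc_content` is hypothetical and not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove line breaks and spaces so FASTA-style wrapped text can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100 * gc / len(seq)


# Example on the first 12 bases of the gene sequence above.
print(gc_content("ATGCCTTACAAG"))
```

For reference, a GC content of 61.52 over 1,710 bases corresponds to about 1,052 G or C bases.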

Upstream 100 bases:

>100_bases
GCTGCCGAACAACAACTGGTCGATACCCTCTCAAACGAGGGCCGGCTGGCACGAATTTCTCCCACATCGATTTTCTCCGC
GTCCCAAGGAGCCTGAACCC

Downstream 100 bases:

>100_bases
CGGACGTCGCGTTGAGAACTGTCCTCAGATGGATCGGCAGCGGCTTGCTGCTGGTGGTCCTGCTTCTCGTGCTCGGCACC
GTGGTGCCGCGCCCTTTTTT

Product: urease subunit alpha

Products: NA

Alternate protein names: Urea amidohydrolase subunit alpha

Number of amino acids: Translated: 569; Mature: 568

Protein sequence:

>569_residues
MPYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQATRAEGAVDTVITNAVIVDH
SGIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEAIAGEGKILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGG
SGPAHGTLATTCTGAWHIERMIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADEY
DVQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSSTNPTRPYTVNTLAEHLDMLMVC
HHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGENDNFR
VRRYIAKYTINPAIAQGVSHEIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFAA
YGKLRTNSSVTFVSQASLDGGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVDPETYEVRADGELLTCEPATVL
PMAQRYFLF
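
The protein sequence follows from the gene sequence by codon translation: 1,710 bases give 570 codons, that is 569 residues plus a stop codon. The sketch below assumes the standard genetic code amino acid assignments and builds the codon table from the conventional TCAG ordering; `translate` is an illustrative helper, not the tool used to produce this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate wrapped FASTA-style text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six and last six codons of the gene sequence listed in this record.
print(translate("ATGCCTTACAAGATTTCC"))  # MPYKIS
print(translate("CGTTATTTCCTGTTTTAA"))  # RYFLF
```

The two fragments match the start (MPYKIS) and end (RYFLF) of the 569-residue protein sequence above.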

Sequences:

>Translated_569_residues
MPYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQATRAEGAVDTVITNAVIVDH
SGIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEAIAGEGKILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGG
SGPAHGTLATTCTGAWHIERMIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADEY
DVQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSSTNPTRPYTVNTLAEHLDMLMVC
HHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGENDNFR
VRRYIAKYTINPAIAQGVSHEIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFAA
YGKLRTNSSVTFVSQASLDGGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVDPETYEVRADGELLTCEPATVL
PMAQRYFLF
>Mature_568_residues
PYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQATRAEGAVDTVITNAVIVDHS
GIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEAIAGEGKILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGGS
GPAHGTLATTCTGAWHIERMIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADEYD
VQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSSTNPTRPYTVNTLAEHLDMLMVCH
HLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGENDNFRV
RRYIAKYTINPAIAQGVSHEIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFAAY
GKLRTNSSVTFVSQASLDGGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVDPETYEVRADGELLTCEPATVLP
MAQRYFLF

Specific function: Unknown

COG id: COG0804

COG function: function code E; Urea amidohydrolase (urease) alpha subunit

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 urease domain

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): URE1_AGRT5 (Q8UCT2)

Other databases:

- EMBL:   AE007869
- PIR:   A97648
- PIR:   AG2871
- RefSeq:   NP_355353.1
- ProteinModelPortal:   Q8UCT2
- SMR:   Q8UCT2
- STRING:   Q8UCT2
- MEROPS:   M38.982
- GeneID:   1134439
- GenomeReviews:   AE007869_GR
- KEGG:   atu:Atu2401
- eggNOG:   COG0804
- HOGENOM:   HBG357507
- OMA:   TIHAFHT
- ProtClustDB:   PRK13207
- BioCyc:   ATUM176299-1:ATU2401-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_01953
- InterPro:   IPR006680
- InterPro:   IPR011059
- InterPro:   IPR011612
- InterPro:   IPR005848
- InterPro:   IPR017951
- InterPro:   IPR017950
- PRINTS:   PR01752
- TIGRFAMs:   TIGR01792

Pfam domain/function: PF01979 Amidohydro_1; PF00449 Urease_alpha; SSF51338 Metalo_hydrolase

EC number: 3.5.1.5

Molecular weight: Translated: 60816; Mature: 60685
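
The gap between the translated and mature molecular weights is consistent with removal of the initiator methionine, which also matches the one-residue difference between the 569- and 568-residue sequences. A small check, assuming an average methionine residue mass of roughly 131.2 Da (this constant is not given in the record):

```python
# Molecular weights copied from the record above.
TRANSLATED_MW = 60816
MATURE_MW = 60685

# Assumption: approximate average mass of one Met residue in a peptide chain.
MET_RESIDUE_MASS = 131.2

difference = TRANSLATED_MW - MATURE_MW
print(difference)  # 131
```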

Theoretical pI: Translated: 6.08; Mature: 6.08

Prosite motif: PS01120 UREASE_1; PS00145 UREASE_2; PS51368 UREASE_3

Important sites: ACT_SITE 321-321 BINDING 220-220

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
4.6 %Cys+Met (Mature Protein)
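
The percentages above can be recomputed from residue counts. The sketch below assumes each value is the count of the named residues divided by the sequence length, rounded to one decimal place; `residue_percent` and `percent` are hypothetical helper names. The counts of 7 Cys and 20 Met are what a manual count of the translated sequence in this record suggests, with one Met fewer in the mature form.

```python
def percent(count: int, length: int) -> float:
    """Return count as a percentage of length, rounded to one decimal."""
    return round(100 * count / length, 1)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()  # tolerate wrapped text
    return percent(sum(aa in residues for aa in protein), len(protein))


# Translated protein: 569 residues, assumed 7 Cys and 20 Met.
print(percent(7, 569), percent(20, 569), percent(27, 569))  # 1.2 3.5 4.7

# Mature protein: 568 residues, initiator Met removed.
print(percent(7, 568), percent(19, 568), percent(26, 568))  # 1.2 3.3 4.6

# The same calculation applied directly to a sequence fragment
# (first 20 residues of the translated protein).
print(residue_percent("MPYKISRAAYAGMFGPTVGD", "CM"))  # 10.0
```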

Secondary structure:

>Translated Secondary Structure
MPYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQ
CCEECCHHHHCCCCCCCCCCEEEEECEEEEEEEECCCCCCCCEEEECCCEEEECCCCCCH
ATRAEGAVDTVITNAVIVDHSGIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEA
HHHHCCCHHHHEEEEEEEECCCCEEECCCCCCCEEEEEECCCCCCCCCCEEEEECCCCCE
IAGEGKILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGGSGPAHGTLATTCTGAWHIER
ECCCCEEEEECCCCEEEEEECHHHHHHHHHCCCEEEEECCCCCCCCCEEEEECCHHHHHH
MIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADEY
HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHCCCCCEEEECCCCCCHHHHHHHHEECCCC
DVQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSSTN
CEEEEEEECCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCC
PTRPYTVNTLAEHLDMLMVCHHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS
CCCCEEHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC
DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGENDNFRVRRYIAKYTINPAIAQGVSH
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCHHHEEEEHHHCCCHHHHHCCCC
EIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFAA
CCCCEECCCCCCEEEECCCCCCCCCCEEEECCCEEECCCCCCCCCCCCCCCCCCCCHHHH
YGKLRTNSSVTFVSQASLDGGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVDP
HHHEECCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEECC
ETYEVRADGELLTCEPATVLPMAQRYFLF
CEEEEEECCCEEEECCCHHHHHHHHHHCC
>Mature Secondary Structure 
PYKISRAAYAGMFGPTVGDKVRLADTELFIEIEKDHTTYGEEVKFGGGKVIRDGMGQSQ
CEECCHHHHCCCCCCCCCCEEEEECEEEEEEEECCCCCCCCEEEECCCEEEECCCCCCH
ATRAEGAVDTVITNAVIVDHSGIYKADVGLKNGRIHAIGKAGNPDTQPGVTIIVGPSTEA
HHHHCCCHHHHEEEEEEEECCCCEEECCCCCCCEEEEEECCCCCCCCCCEEEEECCCCCE
IAGEGKILTAGGMDAHIHYICPQQIEEALMSGVTCMLGGGSGPAHGTLATTCTGAWHIER
ECCCCEEEEECCCCEEEEEECHHHHHHHHHCCCEEEEECCCCCCCCCEEEEECCHHHHHH
MIESFDAFPMNLALAGKGNASLPAPLEEMILAGASSLKLHEDWGTTPAAIDNCLTVADEY
HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHCCCCCEEEECCCCCCHHHHHHHHEECCCC
DVQVMIHTDTLNESGFVEDTVAAIRGRTIHAFHTEGAGGGHAPDIIKVCGNPNVIPSSTN
CEEEEEEECCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCC
PTRPYTVNTLAEHLDMLMVCHHLSPSIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS
CCCCEEHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC
DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGENDNFRVRRYIAKYTINPAIAQGVSH
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCHHHEEEEHHHCCCHHHHHCCCC
EIGSVEVGKRADLVLWNPAFFGVKPEMVLLGGSIAAAPMGDPNASIPTPQPMHYRPMFAA
CCCCEECCCCCCEEEECCCCCCCCCCEEEECCCEEECCCCCCCCCCCCCCCCCCCCHHHH
YGKLRTNSSVTFVSQASLDGGLAQRLGVAKKLLAVKNVRGGISKASMIHNSLTPHIEVDP
HHHEECCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEECC
ETYEVRADGELLTCEPATVLPMAQRYFLF
CEEEEEECCCEEEECCCHHHHHHHHHHCC
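
The secondary structure strings can be summarised as per-state percentages. This sketch assumes the conventional three-state alphabet (H for helix, E for strand, C for coil), which the record does not define explicitly; `state_percentages` is an illustrative name.

```python
from collections import Counter


def state_percentages(states: str) -> dict[str, float]:
    """Return the percentage of each secondary structure state in a string."""
    states = "".join(states.split()).upper()  # tolerate wrapped text
    counts = Counter(states)
    return {s: round(100 * n / len(states), 1) for s, n in counts.items()}


# Example on the final prediction line of the translated protein above.
print(state_percentages("CEEEEEECCCEEEECCCHHHHHHHHHHCC"))
```

Concatenating all prediction lines of one block before calling the function gives the composition for the whole protein, which can be compared with the "Alpha Beta" structure class reported below.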

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11743193; 11743194