Definition Agrobacterium vitis S4 plasmid pAtS4e, complete sequence.
Accession NC_011981
Length 631,775

The map label for this gene is hyuA

Identifier: 222102422

GI number: 222102422

Start: 14987

End: 17008

Strand: Direct

Name: hyuA

Synonym: Avi_7016

Alternate gene names: NA

Gene position: 14987-17008 (Clockwise)

Preceding gene: 222102421

Following gene: 222102423

Centisome position: 2.37
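The coordinates above can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the replicon length; the function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start coordinate as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


# Values taken from this record: Start 14987, End 17008, Length 631,775.
print(gene_length(14987, 17008))           # 2022
print(round(centisome(14987, 631775), 2))  # 2.37
```

Both results agree with the 2022-base gene sequence and the centisome position listed in this record.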

GC content: 59.69
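A GC content figure of this kind can be recomputed from the gene sequence with a sketch like the following. It assumes GC content is the percentage of G and C bases over the full sequence length; the helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be passed in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input; pass the full 2022-base gene sequence to compare with the record.
print(gc_content("ATGC"))  # 50.0
```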

Gene sequence:

>2022_bases
ATGGCTACCCGAATTGGTGTCGATATCGGCGGCACCTTCACAGACCTTGTTTACTTCGATGAAGAAAATGGCAGAACGGT
TGAAGGAAAAGTTCCCACCGTTCCATCAGCGCCGGAGGAAGGTGTTGCACATGCGATCCGTGCTCATGTGCCCCAGGACA
TTATCGACAGCGCTGCGTATTTCCTGCATGGAACAACAGTGGGCCTCAATGCTCTGCTTGAGCGCCGTGGCGCCAAAGTC
GGCCTGATCACGACACAGGGCTTCCGCGACGTCCTTGAAATCCGGCGCGGCGACCGCGCGGAAATGTACAATCTGTTCTG
GAAGCAAACGGTGCCGCTCGTTCCGCGCCGCCTTCGTCTGGAGGTCGATGGACGCATTCGGGGCGATGGCGCGGTTCTAT
CCCCGGTGACGCGAGAAAGTGTTCTGGCGGCCTATGAACTGCTGGCGGCAGAAAAGGTCGACAGTATTGCCGTTTGCCTG
ATCAGCGCCTTTGCCAATCCGCAGCACGAGCTCGAGATCGAAGCCATTCTTCGTGATGCGGGGTATGAAGGCGGCATTTC
GCTCTCCCACAAGATCTCCGGAGAATACCGGGAATATGAGCGCACGTCGACAACGGTCATCGACGCGTTCGTTCGCGGTC
GCATGTCGAATTATCTCAGGCGGCTGGAAAACACGCTGCGCAATCTCGGCTTTAAGGGGCAGTGCCTGATTACGCGTTCC
GGCAGTGGCTCCATGACATTTTCCGAGGCGGAAGACCGACCGTTCGAAACGATTATGTCCGGGCCTGTCGGCGGCGCTCA
GGGGGCAGGTGAGCTTGCCAAGATGCTTGGGCGGAAGGCCATGATCACCGCCGATGTCGGCGGAACGAGTTTCGACACGG
CCGTCATCATCAATGGCGAACCCCAGGTTCTGTTTGAGGGCATGATCGACAATATGCCGGTGCAGACCCCATGGGTCGAC
GTGCGCTCGATCGGTTCGGGCGGCGGGTCGCTTGCCTATGTTGATGTGGGCAACCTCATGCGTGTAGGGCCACAAAGCGC
CGGCGCCGTTCCGGGGCCTGCCTGCTACGGCAAAGGCGGCACCCAGCCCGCGCTCACTGACGCTGCGGCCTACCTTGGCA
TGCTCGGTCCGGGTAATCTCGCCTCCGGCATCCATCTCGATATCGGCAAGGCGGAGGCTGCACTTGCCCCTGTCGCATCG
GCAATCGGACAAGATATCGAAATGACGGCGGCGGGCGTTCTGCGCATTGCCAGCTCGGCCATGGCCAATGCCATGCGGGA
GATTTCTCTGGACCAGGGTTTCGACCCTCGCGAGATGACGTTGCTACCCTTTGGTGGTGCGGGACCGCTGATGGCGACCC
TGCTTGCTGACGAGTTGAAGATGAACGAGATCGTCGTGCCACCACTCGCTGGCAATTTCTCTGCCTGGGGACTTCTCGGG
GCGGACATGGTGCAGTCGGCAGCACGTACCCGCATCCTGGATCTGACGGATGACAGCCTGAAGATCGCCAATGAGGTGCT
GGAAGGGCTGTTTGTCGCGATCCGGTCGCGAAGCGAGCGTTCGTTCGCCGAAGCGGTCAATGCCGTTCGTCTCGACCTGC
GCTACAAGGGGCAGGAGCACCGGCTGTCTATCGTTGCCGATAACGAGAATGGCAGGATCTCTGAGGGCGCGGAGTCCATC
AAGACGAAGTTCCGCGCCGAATACAGCAGGACATTCGGCAGCACGATGAACGACGAAATCGAGGTGGTTTCCATTCGTGC
CACGGTGCGCGTGCCCTTGCCGCGGCGGGAAATCCGCTTCACGCACGAGCCGGAAGCCGTTGCAGAGTTCCAGATGCTTG
AGGCTTTTTCGTTCGAGAAAGGAAAGCGCATGTCCTTTGCGATCGTACAGCGCCAGGCGATCGTCGGTAAGCTCTCAGGC
CCCGCCATCATCACCGAAGGAACCTGTACGACCTATCTGGACGCCGATTGGACGGCGCGGATCGGGACTGTCGGCGAAAT
CATTCTGGAACGGAAGAATTAA

Upstream 100 bases:

>100_bases
TGTTGAACGCCGATATTTCCCTCTCGGCGAGCGACAGAAACAACGTCGTGTCGGGCGCACAGAGACGAAGCCTTCTGCGG
CACATCAGGAGAAAATACGC

Downstream 100 bases:

>100_bases
TCATGAAGCGGAAAGATCCAGTCCTTCACTTGGCAGGGGTTCCCAGCAAGGGGCTTGCCGTCAACTCCGATCCAATCACG
ACGGAAGTCGTGCGGCATGC

Product: hydantoin utilization protein

Products: ADP; phosphate; N-carbamoylsarcosine

Alternate protein names: NA

Number of amino acids: Translated: 673; Mature: 672
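The two counts follow from the gene length: 2022 bases are 674 codons, one of which is the stop codon, leaving 673 translated residues; the mature form listed in this record lacks the initiator methionine, giving 672. The sketch below translates a coding sequence with the standard codon assignments (an assumption; the record does not name a translation table) and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the gene sequence in this record.
print(translate("ATGGCTACCCGAATTGGTGTCGATATC"))  # MATRIGVDI

# Residue counts implied by the gene length.
translated = 2022 // 3 - 1   # drop the stop codon
mature = translated - 1      # drop the initiator methionine
print(translated, mature)    # 673 672
```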

Protein sequence:

>673_residues
MATRIGVDIGGTFTDLVYFDEENGRTVEGKVPTVPSAPEEGVAHAIRAHVPQDIIDSAAYFLHGTTVGLNALLERRGAKV
GLITTQGFRDVLEIRRGDRAEMYNLFWKQTVPLVPRRLRLEVDGRIRGDGAVLSPVTRESVLAAYELLAAEKVDSIAVCL
ISAFANPQHELEIEAILRDAGYEGGISLSHKISGEYREYERTSTTVIDAFVRGRMSNYLRRLENTLRNLGFKGQCLITRS
GSGSMTFSEAEDRPFETIMSGPVGGAQGAGELAKMLGRKAMITADVGGTSFDTAVIINGEPQVLFEGMIDNMPVQTPWVD
VRSIGSGGGSLAYVDVGNLMRVGPQSAGAVPGPACYGKGGTQPALTDAAAYLGMLGPGNLASGIHLDIGKAEAALAPVAS
AIGQDIEMTAAGVLRIASSAMANAMREISLDQGFDPREMTLLPFGGAGPLMATLLADELKMNEIVVPPLAGNFSAWGLLG
ADMVQSAARTRILDLTDDSLKIANEVLEGLFVAIRSRSERSFAEAVNAVRLDLRYKGQEHRLSIVADNENGRISEGAESI
KTKFRAEYSRTFGSTMNDEIEVVSIRATVRVPLPRREIRFTHEPEAVAEFQMLEAFSFEKGKRMSFAIVQRQAIVGKLSG
PAIITEGTCTTYLDADWTARIGTVGEIILERKN

Sequences:

>Translated_673_residues
MATRIGVDIGGTFTDLVYFDEENGRTVEGKVPTVPSAPEEGVAHAIRAHVPQDIIDSAAYFLHGTTVGLNALLERRGAKV
GLITTQGFRDVLEIRRGDRAEMYNLFWKQTVPLVPRRLRLEVDGRIRGDGAVLSPVTRESVLAAYELLAAEKVDSIAVCL
ISAFANPQHELEIEAILRDAGYEGGISLSHKISGEYREYERTSTTVIDAFVRGRMSNYLRRLENTLRNLGFKGQCLITRS
GSGSMTFSEAEDRPFETIMSGPVGGAQGAGELAKMLGRKAMITADVGGTSFDTAVIINGEPQVLFEGMIDNMPVQTPWVD
VRSIGSGGGSLAYVDVGNLMRVGPQSAGAVPGPACYGKGGTQPALTDAAAYLGMLGPGNLASGIHLDIGKAEAALAPVAS
AIGQDIEMTAAGVLRIASSAMANAMREISLDQGFDPREMTLLPFGGAGPLMATLLADELKMNEIVVPPLAGNFSAWGLLG
ADMVQSAARTRILDLTDDSLKIANEVLEGLFVAIRSRSERSFAEAVNAVRLDLRYKGQEHRLSIVADNENGRISEGAESI
KTKFRAEYSRTFGSTMNDEIEVVSIRATVRVPLPRREIRFTHEPEAVAEFQMLEAFSFEKGKRMSFAIVQRQAIVGKLSG
PAIITEGTCTTYLDADWTARIGTVGEIILERKN
>Mature_672_residues
ATRIGVDIGGTFTDLVYFDEENGRTVEGKVPTVPSAPEEGVAHAIRAHVPQDIIDSAAYFLHGTTVGLNALLERRGAKVG
LITTQGFRDVLEIRRGDRAEMYNLFWKQTVPLVPRRLRLEVDGRIRGDGAVLSPVTRESVLAAYELLAAEKVDSIAVCLI
SAFANPQHELEIEAILRDAGYEGGISLSHKISGEYREYERTSTTVIDAFVRGRMSNYLRRLENTLRNLGFKGQCLITRSG
SGSMTFSEAEDRPFETIMSGPVGGAQGAGELAKMLGRKAMITADVGGTSFDTAVIINGEPQVLFEGMIDNMPVQTPWVDV
RSIGSGGGSLAYVDVGNLMRVGPQSAGAVPGPACYGKGGTQPALTDAAAYLGMLGPGNLASGIHLDIGKAEAALAPVASA
IGQDIEMTAAGVLRIASSAMANAMREISLDQGFDPREMTLLPFGGAGPLMATLLADELKMNEIVVPPLAGNFSAWGLLGA
DMVQSAARTRILDLTDDSLKIANEVLEGLFVAIRSRSERSFAEAVNAVRLDLRYKGQEHRLSIVADNENGRISEGAESIK
TKFRAEYSRTFGSTMNDEIEVVSIRATVRVPLPRREIRFTHEPEAVAEFQMLEAFSFEKGKRMSFAIVQRQAIVGKLSGP
AIITEGTCTTYLDADWTARIGTVGEIILERKN

Specific function: Unknown

COG id: COG0145

COG function: function code EQ; N-methylhydantoinase A/acetone carboxylase, beta subunit

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the oxoprolinase family [H]

Homologues:

Organism=Homo sapiens, GI48314820, Length=741, Percent_Identity=28.3400809716599, Blast_Score=172, Evalue=9e-43,
Organism=Caenorhabditis elegans, GI133901900, Length=716, Percent_Identity=24.1620111731844, Blast_Score=157, Evalue=2e-38,
Organism=Saccharomyces cerevisiae, GI6322634, Length=604, Percent_Identity=28.476821192053, Blast_Score=181, Evalue=3e-46,
Organism=Drosophila melanogaster, GI45550492, Length=728, Percent_Identity=26.2362637362637, Blast_Score=186, Evalue=4e-47,
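Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI token. A minimal parsing sketch, assuming exactly this layout (the function name and the choice of a plain dict are illustrative):

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI number appears without a key, e.g. "GI48314820".
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Homo sapiens, GI48314820, Length=741, "
        "Percent_Identity=28.3400809716599, Blast_Score=172, Evalue=9e-43,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Homo sapiens 48314820 172
```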

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008040
- InterPro:   IPR002821 [H]

Pfam domain/function: PF05378 Hydant_A_N; PF01968 Hydantoinase_A [H]

EC number: 3.5.2.14

Molecular weight: Translated: 72432; Mature: 72300

Theoretical pI: Translated: 5.06; Mature: 5.06
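The record does not state how the molecular weights were derived. One common approach, sketched here as an assumption, sums average residue masses and adds one water for the free termini. Under that approach the translated and mature forms differ by one methionine residue (about 131 Da), which is close to the 132 Da difference listed above; exact values may differ by a few daltons depending on the mass table and rounding used.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain, in daltons."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Toy check: free glycine.
print(round(molecular_weight("G"), 2))  # 75.07
```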

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
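Percentages like these can be recomputed from either protein sequence. The sketch below assumes each value is the residue count divided by the chain length, rounded to one decimal place; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


# Toy input; pass the 673- or 672-residue sequence to compare with the record.
print(residue_percent("MCAA", "C"))   # 25.0
print(residue_percent("MCAA", "CM"))  # 50.0
```

Removing the initiator methionine lowers the Met count by one, which is why the mature %Met is slightly below the translated value.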

Secondary structure:

>Translated Secondary Structure
MATRIGVDIGGTFTDLVYFDEENGRTVEGKVPTVPSAPEEGVAHAIRAHVPQDIIDSAAY
CCEEEEEECCCCEEEEEEEECCCCCEEECCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHH
FLHGTTVGLNALLERRGAKVGLITTQGFRDVLEIRRGDRAEMYNLFWKQTVPLVPRRLRL
HHCCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEEEE
EVDGRIRGDGAVLSPVTRESVLAAYELLAAEKVDSIAVCLISAFANPQHELEIEAILRDA
EECCEEECCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GYEGGISLSHKISGEYREYERTSTTVIDAFVRGRMSNYLRRLENTLRNLGFKGQCLITRS
CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEC
GSGSMTFSEAEDRPFETIMSGPVGGAQGAGELAKMLGRKAMITADVGGTSFDTAVIINGE
CCCCEEECCCCCCCHHHHHCCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCEEEEECCC
PQVLFEGMIDNMPVQTPWVDVRSIGSGGGSLAYVDVGNLMRVGPQSAGAVPGPACYGKGG
CCEEEHHHHCCCCCCCCCHHHHCCCCCCCCEEEEECCCHHCCCCCCCCCCCCCCCCCCCC
TQPALTDAAAYLGMLGPGNLASGIHLDIGKAEAALAPVASAIGQDIEMTAAGVLRIASSA
CCCHHHHHHHHHHCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
MANAMREISLDQGFDPREMTLLPFGGAGPLMATLLADELKMNEIVVPPLAGNFSAWGLLG
HHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHH
ADMVQSAARTRILDLTDDSLKIANEVLEGLFVAIRSRSERSFAEAVNAVRLDLRYKGQEH
HHHHHHHHHCEEEECCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHEEEEEEECCCCC
RLSIVADNENGRISEGAESIKTKFRAEYSRTFGSTMNDEIEVVSIRATVRVPLPRREIRF
EEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCHHHCEE
THEPEAVAEFQMLEAFSFEKGKRMSFAIVQRQAIVGKLSGPAIITEGTCTTYLDADWTAR
CCCCHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHCCCCCEEEECCCEEEEECCCCCCC
IGTVGEIILERKN
CCCCEEEEEECCC
>Mature Secondary Structure
ATRIGVDIGGTFTDLVYFDEENGRTVEGKVPTVPSAPEEGVAHAIRAHVPQDIIDSAAY
CEEEEEECCCCEEEEEEEECCCCCEEECCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHH
FLHGTTVGLNALLERRGAKVGLITTQGFRDVLEIRRGDRAEMYNLFWKQTVPLVPRRLRL
HHCCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEEEE
EVDGRIRGDGAVLSPVTRESVLAAYELLAAEKVDSIAVCLISAFANPQHELEIEAILRDA
EECCEEECCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GYEGGISLSHKISGEYREYERTSTTVIDAFVRGRMSNYLRRLENTLRNLGFKGQCLITRS
CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEC
GSGSMTFSEAEDRPFETIMSGPVGGAQGAGELAKMLGRKAMITADVGGTSFDTAVIINGE
CCCCEEECCCCCCCHHHHHCCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCEEEEECCC
PQVLFEGMIDNMPVQTPWVDVRSIGSGGGSLAYVDVGNLMRVGPQSAGAVPGPACYGKGG
CCEEEHHHHCCCCCCCCCHHHHCCCCCCCCEEEEECCCHHCCCCCCCCCCCCCCCCCCCC
TQPALTDAAAYLGMLGPGNLASGIHLDIGKAEAALAPVASAIGQDIEMTAAGVLRIASSA
CCCHHHHHHHHHHCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
MANAMREISLDQGFDPREMTLLPFGGAGPLMATLLADELKMNEIVVPPLAGNFSAWGLLG
HHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHH
ADMVQSAARTRILDLTDDSLKIANEVLEGLFVAIRSRSERSFAEAVNAVRLDLRYKGQEH
HHHHHHHHHCEEEECCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHEEEEEEECCCCC
RLSIVADNENGRISEGAESIKTKFRAEYSRTFGSTMNDEIEVVSIRATVRVPLPRREIRF
EEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCHHHCEE
THEPEAVAEFQMLEAFSFEKGKRMSFAIVQRQAIVGKLSGPAIITEGTCTTYLDADWTAR
CCCCHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHCCCCCEEEECCCEEEEECCCCCCC
IGTVGEIILERKN
CCCCEEEEEECCC
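The secondary structure blocks above alternate a line of residues with a line of predicted states. A short sketch for separating the two tracks and summarising the states follows; it assumes the conventional reading H = helix, E = strand, C = coil, which the record does not spell out, and the function names are illustrative.

```python
from collections import Counter


def split_interleaved(lines):
    """Split alternating residue/state lines into two aligned strings."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    if len(lines) % 2:
        raise ValueError("expected an even number of lines")
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state tracks differ in length")
    return residues, states


def state_fractions(states: str) -> dict:
    """Fraction of positions assigned to each of H, E and C."""
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


# Last pair of lines from the block above.
residues, states = split_interleaved(["IGTVGEIILERKN", "CCCCEEEEEECCC"])
print(state_fractions(states))
```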

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: ATP; N-methylimidazolidine-2,4-dione; H2O

Specific reaction: ATP + N-methylimidazolidine-2,4-dione + 2 H2O = ADP + phosphate + N-carbamoylsarcosine

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8688087 [H]