Definition Azorhizobium caulinodans ORS 571, complete genome.
Accession NC_009937
Length 5,369,772

The map label for this gene is 158423666

Identifier: 158423666

GI number: 158423666

Start: 2354593

End: 2356002

Strand: Direct

Name: 158423666

Synonym: AZC_2042

Alternate gene names: NA

Gene position: 2354593-2356002 (Clockwise)

Preceding gene: 158423663

Following gene: 158423668

Centisome position: 43.85
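
The coordinates above are enough to recompute the gene length and the centisome position. The sketch below assumes that the centisome is the start coordinate expressed as a percentage of the genome length. That definition is an assumption rather than something the record states, but it reproduces the reported value. The function and constant names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 5_369_772   # Length field
GENE_START = 2_354_593      # Start field
GENE_END = 2_356_002        # End field


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))                  # 1410
print(round(centisome(GENE_START, GENOME_LENGTH), 2))     # 43.85
```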

GC content: 71.06

Gene sequence:

>1410_bases
ATGGCCGGGCTTTCTTCGTTTCTCGCCAAGGATGAAGGACCGCAGGTGGCGGCCCTCACGCAGGCTTTTGCGCGCCGCAC
ACTGTCCCCCGTGGAAGTGACGGAGGCCGCCCTCGCGCGCGCCGAAGAGATCAACCCGGCGCTCAATGCATTCCTGTCCA
TCGATCATGAGCGCGCGCTTGCCGCTGCGCGGGCGGCGGAAGCGCGCTGGGCCAACGGGACGGCGCTCTCGCCCATCGAT
GGCATTCCGACGACCCTCAAGGACATCGTGTGGGTGAAGGACTGGTCGGTCCGCTACGGCAGCGGCACGACGCCCGCCCA
GCCCTATGCCGAGGATGCGCCCTCCGTTCGGCGCCTGCGCAGCGCCGGCGCGGTCTTCATCGGCCTTACCTCCTCGCCTG
AGTTCGGCTGGAAGGCGGTGACGGACAGCCCCGCCTGCGGCATCACGCGCAATCCCCATGATCCCAGCCGCACGCCGGGC
GGGTCCTCCGGCGGTGCGGCGGTCGCGGCCGCTGCGGGAGCGGGCGTGCTCCATCTCGGCACCGACGGAGGCGGCTCGAT
CCGCGTCCCCTCGGCCTTTTCAGGTATCGCCGGCCTGAAGCCGACGTTCGGCCGCGTTCCCGCTTTCCCCGCGAGCGCCT
TCGGCACCGTGGCCCATATCGGTCCCATGGCTCGCCACGCCGCAGACCTCGCCCCCATGCTGGCGGCCATGTCGGGCCGT
GATCTTGCAGACTGGGCGCAGGGAGCGGGCGCGCTGGCGCCGCTGGGCACGCGCCTTCATCCGGATTTTCCGAAGGGAGC
GCGCATCGGCGTCTGGTCCACGCCCCCTTGTGGCGCCGTGGATGCCCCGGTGGCCGTGGCCTTCCAGGCCGCGCTGAAGG
TGCTGGAGGCGCAGGGAGCGATTCTGGAGCCCATCGACCTGCCGCGGGCCGATCGGGTGACCGGCGTGTTCGAGGCCCAT
TGGTACGGCGGGGCCGCGGCCCGCCTTGCAGCCATTCCAGAGGACGATCGTGGCGGCATCGACCCCGGGTTCCTGGAGAT
CGCGGCGGAGGGCGCGCGTCAGTCGGCCATCGATCTGATGCGGGCACAGGCGGAGCGCGGGGCCTTCGGTGCCGCCATGG
ATGCCCTGCTCGAGTCCTACGACTTCATCGTCTCGCCGGGCGTCGCGGTGCTTCCGTTCAGCGCCGGGGCGCTGGTGCCA
GAGGGGAGCGGATTGAAGCGCTGGCATCTGTGGGCCGGCTTCAGTTTCCCCATCAATCTCAGCCAGCAGCCGGCCGCGGT
CGTGCCGCTGGCGTCCACGGCGGAGGGACTTCCGCGCTCCTTCCAGATTGTTGGCGCCCGCGGCGCGGACGGAGCGGTGC
TGGCTGCAGCCGAGGCGCTGGAGCCGCTGCTCAAGGATGCCGGGCGATGA
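
The GC content field and the reading frame can be checked against the gene sequence above. The 1410 bases correspond to 470 codons, that is, 469 residues plus the terminal TGA stop codon. The following is a minimal sketch with hypothetical helper names. The ORF check is deliberately simplified: it accepts only ATG as a start codon, although bacterial genes may also start with GTG or TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_orf(seq: str) -> bool:
    """True if the length is a multiple of 3, starts with ATG and ends in a stop codon."""
    seq = clean(seq)
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS
```

Passing the full 1410-base sequence (with its line breaks) to `gc_content` should give a value that can be compared with the GC content field of this record.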

Upstream 100 bases:

>100_bases
GTTCTCCGAATAATACATTTTGTGGGAGGCCTCGGGAAGGTCATTCTGCCTTTCAAAGGTGCAGCGGAAATTTTGCTGCA
TTTGCGAGCAGGGGTTGCGT

Downstream 100 bases:

>100_bases
GCGGCCTCAGCCTTTCGATTCTTCGCTCTCGCGGGCGGCAAGGAGCTTGACCACATGCGCGCGCGCTACACCGATATGAT
GGTCGATGGCCGCGCACCAG

Product: amidase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 469; Mature: 468

Protein sequence:

>469_residues
MAGLSSFLAKDEGPQVAALTQAFARRTLSPVEVTEAALARAEEINPALNAFLSIDHERALAAARAAEARWANGTALSPID
GIPTTLKDIVWVKDWSVRYGSGTTPAQPYAEDAPSVRRLRSAGAVFIGLTSSPEFGWKAVTDSPACGITRNPHDPSRTPG
GSSGGAAVAAAAGAGVLHLGTDGGGSIRVPSAFSGIAGLKPTFGRVPAFPASAFGTVAHIGPMARHAADLAPMLAAMSGR
DLADWAQGAGALAPLGTRLHPDFPKGARIGVWSTPPCGAVDAPVAVAFQAALKVLEAQGAILEPIDLPRADRVTGVFEAH
WYGGAAARLAAIPEDDRGGIDPGFLEIAAEGARQSAIDLMRAQAERGAFGAAMDALLESYDFIVSPGVAVLPFSAGALVP
EGSGLKRWHLWAGFSFPINLSQQPAAVVPLASTAEGLPRSFQIVGARGADGAVLAAAEALEPLLKDAGR

Sequences:

>Translated_469_residues
MAGLSSFLAKDEGPQVAALTQAFARRTLSPVEVTEAALARAEEINPALNAFLSIDHERALAAARAAEARWANGTALSPID
GIPTTLKDIVWVKDWSVRYGSGTTPAQPYAEDAPSVRRLRSAGAVFIGLTSSPEFGWKAVTDSPACGITRNPHDPSRTPG
GSSGGAAVAAAAGAGVLHLGTDGGGSIRVPSAFSGIAGLKPTFGRVPAFPASAFGTVAHIGPMARHAADLAPMLAAMSGR
DLADWAQGAGALAPLGTRLHPDFPKGARIGVWSTPPCGAVDAPVAVAFQAALKVLEAQGAILEPIDLPRADRVTGVFEAH
WYGGAAARLAAIPEDDRGGIDPGFLEIAAEGARQSAIDLMRAQAERGAFGAAMDALLESYDFIVSPGVAVLPFSAGALVP
EGSGLKRWHLWAGFSFPINLSQQPAAVVPLASTAEGLPRSFQIVGARGADGAVLAAAEALEPLLKDAGR
>Mature_468_residues
AGLSSFLAKDEGPQVAALTQAFARRTLSPVEVTEAALARAEEINPALNAFLSIDHERALAAARAAEARWANGTALSPIDG
IPTTLKDIVWVKDWSVRYGSGTTPAQPYAEDAPSVRRLRSAGAVFIGLTSSPEFGWKAVTDSPACGITRNPHDPSRTPGG
SSGGAAVAAAAGAGVLHLGTDGGGSIRVPSAFSGIAGLKPTFGRVPAFPASAFGTVAHIGPMARHAADLAPMLAAMSGRD
LADWAQGAGALAPLGTRLHPDFPKGARIGVWSTPPCGAVDAPVAVAFQAALKVLEAQGAILEPIDLPRADRVTGVFEAHW
YGGAAARLAAIPEDDRGGIDPGFLEIAAEGARQSAIDLMRAQAERGAFGAAMDALLESYDFIVSPGVAVLPFSAGALVPE
GSGLKRWHLWAGFSFPINLSQQPAAVVPLASTAEGLPRSFQIVGARGADGAVLAAAEALEPLLKDAGR

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the amidase family [H]

Homologues:

Organism=Homo sapiens, GI195972892, Length=239, Percent_Identity=32.6359832635983, Blast_Score=114, Evalue=2e-25,
Organism=Homo sapiens, GI222831590, Length=505, Percent_Identity=23.5643564356436, Blast_Score=88, Evalue=1e-17,
Organism=Caenorhabditis elegans, GI17537465, Length=480, Percent_Identity=25.8333333333333, Blast_Score=125, Evalue=6e-29,
Organism=Caenorhabditis elegans, GI17556264, Length=245, Percent_Identity=30.6122448979592, Blast_Score=101, Evalue=1e-21,
Organism=Caenorhabditis elegans, GI17556278, Length=191, Percent_Identity=34.0314136125654, Blast_Score=90, Evalue=3e-18,
Organism=Caenorhabditis elegans, GI17556276, Length=191, Percent_Identity=34.0314136125654, Blast_Score=90, Evalue=3e-18,
Organism=Caenorhabditis elegans, GI17538252, Length=190, Percent_Identity=31.5789473684211, Blast_Score=89, Evalue=5e-18,
Organism=Caenorhabditis elegans, GI71990152, Length=330, Percent_Identity=27.2727272727273, Blast_Score=86, Evalue=4e-17,
Organism=Caenorhabditis elegans, GI17543272, Length=130, Percent_Identity=38.4615384615385, Blast_Score=67, Evalue=3e-11,
Organism=Caenorhabditis elegans, GI17538254, Length=258, Percent_Identity=27.906976744186, Blast_Score=66, Evalue=3e-11,
Organism=Saccharomyces cerevisiae, GI6320448, Length=280, Percent_Identity=33.9285714285714, Blast_Score=98, Evalue=3e-21,
Organism=Saccharomyces cerevisiae, GI6319685, Length=433, Percent_Identity=25.4041570438799, Blast_Score=75, Evalue=2e-14,
Organism=Drosophila melanogaster, GI45550774, Length=233, Percent_Identity=36.0515021459227, Blast_Score=131, Evalue=9e-31,
Organism=Drosophila melanogaster, GI24648435, Length=233, Percent_Identity=36.0515021459227, Blast_Score=131, Evalue=1e-30,
Organism=Drosophila melanogaster, GI24648437, Length=233, Percent_Identity=36.0515021459227, Blast_Score=131, Evalue=1e-30,
Organism=Drosophila melanogaster, GI24644968, Length=483, Percent_Identity=26.5010351966874, Blast_Score=123, Evalue=2e-28,
Organism=Drosophila melanogaster, GI24652985, Length=240, Percent_Identity=34.5833333333333, Blast_Score=115, Evalue=6e-26,
Organism=Drosophila melanogaster, GI19922090, Length=240, Percent_Identity=34.5833333333333, Blast_Score=115, Evalue=6e-26,
Organism=Drosophila melanogaster, GI24652981, Length=240, Percent_Identity=34.5833333333333, Blast_Score=115, Evalue=6e-26,
Organism=Drosophila melanogaster, GI24652983, Length=240, Percent_Identity=34.5833333333333, Blast_Score=115, Evalue=6e-26,
Organism=Drosophila melanogaster, GI24648441, Length=169, Percent_Identity=39.0532544378698, Blast_Score=107, Evalue=1e-23,
Organism=Drosophila melanogaster, GI24648439, Length=169, Percent_Identity=39.0532544378698, Blast_Score=107, Evalue=1e-23,
Organism=Drosophila melanogaster, GI21356731, Length=475, Percent_Identity=26.7368421052632, Blast_Score=107, Evalue=1e-23,
Organism=Drosophila melanogaster, GI161078093, Length=241, Percent_Identity=32.3651452282158, Blast_Score=107, Evalue=2e-23,
Organism=Drosophila melanogaster, GI24648113, Length=314, Percent_Identity=32.1656050955414, Blast_Score=94, Evalue=2e-19,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000120
- InterPro:   IPR020556 [H]

Pfam domain/function: PF01425 Amidase [H]

EC number: 3.5.1.4 [H]

Molecular weight: Translated: 47923; Mature: 47791

Theoretical pI: Translated: 5.89; Mature: 5.89

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.5 %Cys+Met (Mature Protein)
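
These percentages follow from simple residue counts. By a manual count, the translated sequence contains 2 Cys and 6 Met in 469 residues. The mature form lacks the initiator Met, leaving 5 Met in 468 residues. The sketch below shows the calculation. The function name is hypothetical, and rounding to one decimal place is assumed from the figures shown.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`, to 1 decimal."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


# Arithmetic behind the reported values, using the manual counts above.
print(round(100.0 * 2 / 469, 1))  # 0.4  %Cys     (translated)
print(round(100.0 * 6 / 469, 1))  # 1.3  %Met     (translated)
print(round(100.0 * 8 / 469, 1))  # 1.7  %Cys+Met (translated)
print(round(100.0 * 5 / 468, 1))  # 1.1  %Met     (mature)
print(round(100.0 * 7 / 468, 1))  # 1.5  %Cys+Met (mature)
```

Calling `residue_percent(sequence, "C")`, `residue_percent(sequence, "M")` and `residue_percent(sequence, "CM")` on the protein sequences given in this record is one way to confirm the counts.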

Secondary structure:

>Translated Secondary Structure
MAGLSSFLAKDEGPQVAALTQAFARRTLSPVEVTEAALARAEEINPALNAFLSIDHERAL
CCCHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHH
AAARAAEARWANGTALSPIDGIPTTLKDIVWVKDWSVRYGSGTTPAQPYAEDAPSVRRLR
HHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEEEECCEEECCCCCCCCCCCCCCHHHHHHH
SAGAVFIGLTSSPEFGWKAVTDSPACGITRNPHDPSRTPGGSSGGAAVAAAAGAGVLHLG
HCCEEEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEHHCCCCEEEEE
TDGGGSIRVPSAFSGIAGLKPTFGRVPAFPASAFGTVAHIGPMARHAADLAPMLAAMSGR
CCCCCCEECCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
DLADWAQGAGALAPLGTRLHPDFPKGARIGVWSTPPCGAVDAPVAVAFQAALKVLEAQGA
CHHHHHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCC
ILEPIDLPRADRVTGVFEAHWYGGAAARLAAIPEDDRGGIDPGFLEIAAEGARQSAIDLM
EECCCCCCCCCCCCEEEEECCCCCHHHHEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH
RAQAERGAFGAAMDALLESYDFIVSPGVAVLPFSAGALVPEGSGLKRWHLWAGFSFPINL
HHHHHCCCHHHHHHHHHHHCCHHCCCCCEEEECCCCCCCCCCCCCCEEEEEECEECCCCC
SQQPAAVVPLASTAEGLPRSFQIVGARGADGAVLAAAEALEPLLKDAGR
CCCCCEEEECHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
AGLSSFLAKDEGPQVAALTQAFARRTLSPVEVTEAALARAEEINPALNAFLSIDHERAL
CCHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHH
AAARAAEARWANGTALSPIDGIPTTLKDIVWVKDWSVRYGSGTTPAQPYAEDAPSVRRLR
HHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEEEECCEEECCCCCCCCCCCCCCHHHHHHH
SAGAVFIGLTSSPEFGWKAVTDSPACGITRNPHDPSRTPGGSSGGAAVAAAAGAGVLHLG
HCCEEEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEHHCCCCEEEEE
TDGGGSIRVPSAFSGIAGLKPTFGRVPAFPASAFGTVAHIGPMARHAADLAPMLAAMSGR
CCCCCCEECCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
DLADWAQGAGALAPLGTRLHPDFPKGARIGVWSTPPCGAVDAPVAVAFQAALKVLEAQGA
CHHHHHHCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCC
ILEPIDLPRADRVTGVFEAHWYGGAAARLAAIPEDDRGGIDPGFLEIAAEGARQSAIDLM
EECCCCCCCCCCCCEEEEECCCCCHHHHEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH
RAQAERGAFGAAMDALLESYDFIVSPGVAVLPFSAGALVPEGSGLKRWHLWAGFSFPINL
HHHHHCCCHHHHHHHHHHHCCHHCCCCCEEEECCCCCCCCCCCCCCEEEEEECEECCCCC
SQQPAAVVPLASTAEGLPRSFQIVGARGADGAVLAAAEALEPLLKDAGR
CCCCCEEEECHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9389475 [H]