Definition Escherichia coli UTI89 chromosome, complete genome.
Accession NC_007946
Length 5,065,741

The map label for this gene is mycA [H]

Identifier: 91212389

GI number: 91212389

Start: 3334167

End: 3335900

Strand: Direct

Name: mycA [H]

Synonym: UTI89_C3397

Alternate gene names: 91212389

Gene position: 3334167-3335900 (Clockwise)

Preceding gene: 91212387

Following gene: 91212390

Centisome position: 65.82
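
The gene length and centisome value can be reproduced from the coordinates in this record. The following is a minimal sketch, assuming the centisome position is the start coordinate expressed as a percentage of the chromosome length (the record does not state the definition, so this is an assumption). The function names are illustrative only.

```python
GENOME_LENGTH = 5_065_741            # chromosome length from the record header
START, END = 3_334_167, 3_335_900    # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given 1-based start and end coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1734
print(centisome(START, GENOME_LENGTH))  # 65.82
```

Under this assumption both values agree with the record: 1734 bases and a centisome position of 65.82.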

GC content: 52.48

Gene sequence:

>1734_bases
GTGGTGTATATGTCTAATAAAATCTTTACGCATTCCCTACCTATGCGCTATGCCGATTTTCCAACGCTGGTTGATGCTTT
GGACTACGCCGCTCTGAGTAGCGCCGGAATGAATTTTTATGACAGACGTTGCCAACTTGAAGATCAACTGGAATATCAGA
CGTTAAAAGCGCGTGCCGAAGCTGGTGCGAAAAGGTTGTTATCGCTGAACCTGAAAAAAGGCGATCGCGTGGCGCTGATT
GCCGAAACGAGTAGCGGGTTTGTAGAGGCTTTTTTTGCCTGCCAGTATGCCGGCTTAGTCGCCGTCCCGTTGGCGATTCC
AATGGGCGTAGGTCAGCGGGATTCCTGGAGCGCCAAACTGCAGGGTTTACTGGCAAGTTGCCAGCCCGCAGCCATTATCA
CTGGTGATGAGTGGTTGCCACTGGTCAATGCCGCGACGCATAACAACAACCCCGAATTACATGTTTTAAGCCACGCCTGG
TTTAAGGCATTACCGGAAGCCGATGTTGTGCTCCAGCGTCCAGTTCCAAACGATATCGCCTACCTCCAGTACACCTCCGG
CAGCACCCGTTTTCCCCGTGGCGTCATTATCACCCATCACGAAGTGATGGCTAATCTACGTGCTATAAGCCACGATGGGA
TTAAATTACGCCCTGGCGACCGCTGCGTCTCCTGGCTGCCTTTCTACCATGATATGGGACTGGTCGGCTTTCTCCTGACC
CCCGTCGCCACGCAGCTTTCAGTAGATTATTTGCGCACTCAGGATTTTGCCATGCGTCCTCTGCAATGGCTTAAATTGAT
CAGTAAAAATCGTGGCACCGTTTCCGTTGCACCGCCGTTTGGCTATGAATTGTGCCAGCGCCGCGTGAATGAAAAAGATC
TCGCTGAACTGGATCTTTCCTGCTGGCGCGTCGCTGGTATTGGCGCAGAACCCATATCCGCAGAACAACTCCATCAATTC
GCTGAATGTTTCCGTCAGGTTAACTTTGACGATAAAACTTTCATGCCGTGCTACGGACTGGCAGAAAATGCGCTGGCTGT
CAGCTTCTCTGATGAAGCCTGCGGGGTTGTGGTTAACGAAGTGGATCGCGACATCCTCGAATACCAGGGTAAAGCCGTCG
CGCCGGGTGCTGAAACACGCGCCGTATCGACTTTCGTCAACTGCGGTAAAGCGTTGCCGGAACATGGCATTGAAATCCGC
AATGAAGCAGATATACCTGTCGCGGAACGTGTGGTAGGCCATATTTGCATCTCCGGCCCCAGCCTGATGAGCGGGTACTT
TGGCGACCAGGTTTCGCAAGACGAGATTGCCGCGACGGGCTGGTTAGACACCGGCGACCTTGGTTATCTGCTGGACGGTT
ATCTGTATGTCACCGGACGCATTAAAGATCTGATTATTATTCGTGGCCGTAATATCTGGCCGCAGGATATTGAATACATT
GCGGAACAGGAGCCGGAAATTCATTCTGGCGATGCGATTGCTTTTGTTACCGCCCAGGAAAAAATCATTTTGCAGATCCA
GTGTCGGATCAGCGACGAAGAACGTCGCGGGCAGCTTATCCACGCGCTGGCAGCTCGGATCCAAAGCGAATTTGGCGTTA
CCGCGGATATCGATCTGTTGCCGCCCCACAGTATTCCCCGAACATCCTCCGGCAAGCCTGCCCGTGCGGAAGCGAAAAAA
CGTTATCAGAAGGCTTATGCTGCCAGTCTTCATGTGCAGGAATCCCTGGCATGA
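
The GC content field is a simple base-composition statistic. A minimal sketch of how such a value could be computed from a FASTA-style sequence block is shown here; the helper name gc_content is hypothetical, and the record does not say which tool produced its figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a multi-line
    sequence block can be passed in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 12 bases of the gene sequence: 5 G, 0 C out of 12.
print(round(gc_content("GTGGTGTATATG"), 2))  # 41.67
```

Passing the full 1734-base sequence to the same function would give the whole-gene figure.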

Upstream 100 bases:

>100_bases
TGTTCATTTTCACACCGATGAAAATACTATCTCATTTATCTTTTATGCCTATATAATCCACCGCATTATTGTCATTTTAT
TTTTGATATCCAATTTTTTC

Downstream 100 bases:

>100_bases
ATCAAACTGTCGCGGTGACGGGCGCTACCGGGTTTATCGGTAAATATATTATTGATAACCTGCTCGCCCGCGGCTTTCAT
GTTCGCGCATTGACGCGTAC

Product: acyl-CoA synthetase

Products: AMP; diphosphate; an acyl-CoA [C]

Alternate protein names: Glutamate-1-semialdehyde aminotransferase; GSA-AT; ATP-dependent asparagine adenylase 1; AsnA 1; Asparagine activase 1 [H]

Number of amino acids: Translated: 577; Mature: 577

Protein sequence:

>577_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAGAKRLLSLNLKKGDRVALI
AETSSGFVEAFFACQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHNNNPELHVLSHAW
FKALPEADVVLQRPVPNDIAYLQYTSGSTRFPRGVIITHHEVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLT
PVATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQF
AECFRQVNFDDKTFMPCYGLAENALAVSFSDEACGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIR
NEADIPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYI
AEQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTADIDLLPPHSIPRTSSGKPARAEAKK
RYQKAYAASLHVQESLA
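
The relationship between the 1734-base gene and the 577-residue protein (578 codons, the last being a stop codon) can be illustrated with a short translation sketch. It assumes the standard genetic code and treats the first codon as an initiator, so the leading GTG is read as Met; the function and constant names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    If is_start is True, the first codon is treated as an initiator
    and emitted as Met regardless of its usual meaning.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if is_start and protein:
        protein[0] = "M"
    return "".join(protein)


print(translate("GTGGTGTATATGTCTAATAAA"))  # MVYMSNK
```

The first seven codons of the gene sequence give MVYMSNK, matching the start of the protein sequence, and 1734 / 3 - 1 = 577 residues once the terminal TGA stop codon is excluded.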

Sequences:

>Translated_577_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAGAKRLLSLNLKKGDRVALI
AETSSGFVEAFFACQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHNNNPELHVLSHAW
FKALPEADVVLQRPVPNDIAYLQYTSGSTRFPRGVIITHHEVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLT
PVATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQF
AECFRQVNFDDKTFMPCYGLAENALAVSFSDEACGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIR
NEADIPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYI
AEQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTADIDLLPPHSIPRTSSGKPARAEAKK
RYQKAYAASLHVQESLA
>Mature_577_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAGAKRLLSLNLKKGDRVALI
AETSSGFVEAFFACQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHNNNPELHVLSHAW
FKALPEADVVLQRPVPNDIAYLQYTSGSTRFPRGVIITHHEVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLT
PVATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQF
AECFRQVNFDDKTFMPCYGLAENALAVSFSDEACGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIR
NEADIPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYI
AEQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTADIDLLPPHSIPRTSSGKPARAEAKK
RYQKAYAASLHVQESLA

Specific function: This protein is a multifunctional enzyme that can activate a long-chain fatty acid and link it to the amino acid Asn as part of mycosubtilin synthesis. The activation sites consist of individual domains. [H]

COG id: COG0318

COG function: function code IQ; Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II

Gene ontology:

Cell location: Partially Membrane-Associated [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 4 acyl carrier domains [H]

Homologues:

Organism=Homo sapiens, GI44888818, Length=563, Percent_Identity=22.202486678508, Blast_Score=90, Evalue=5e-18,
Organism=Homo sapiens, GI156151445, Length=564, Percent_Identity=22.8723404255319, Blast_Score=86, Evalue=1e-16,
Organism=Homo sapiens, GI225735629, Length=541, Percent_Identity=23.4750462107209, Blast_Score=82, Evalue=1e-15,
Organism=Homo sapiens, GI45827692, Length=541, Percent_Identity=23.4750462107209, Blast_Score=82, Evalue=1e-15,
Organism=Homo sapiens, GI225735625, Length=541, Percent_Identity=23.4750462107209, Blast_Score=81, Evalue=2e-15,
Organism=Homo sapiens, GI225735627, Length=436, Percent_Identity=23.6238532110092, Blast_Score=75, Evalue=1e-13,
Organism=Homo sapiens, GI45827694, Length=436, Percent_Identity=23.6238532110092, Blast_Score=75, Evalue=1e-13,
Organism=Homo sapiens, GI45827696, Length=436, Percent_Identity=23.6238532110092, Blast_Score=75, Evalue=1e-13,
Organism=Homo sapiens, GI45827698, Length=393, Percent_Identity=24.4274809160305, Blast_Score=72, Evalue=2e-12,
Organism=Homo sapiens, GI55749758, Length=457, Percent_Identity=22.5382932166302, Blast_Score=66, Evalue=8e-11,
Organism=Escherichia coli, GI1788107, Length=587, Percent_Identity=23.8500851788756, Blast_Score=117, Evalue=3e-27,
Organism=Caenorhabditis elegans, GI17559526, Length=551, Percent_Identity=22.6860254083485, Blast_Score=100, Evalue=4e-21,
Organism=Caenorhabditis elegans, GI17531443, Length=555, Percent_Identity=22.8828828828829, Blast_Score=93, Evalue=5e-19,
Organism=Drosophila melanogaster, GI18859661, Length=399, Percent_Identity=24.0601503759398, Blast_Score=105, Evalue=7e-23,
Organism=Drosophila melanogaster, GI24654656, Length=605, Percent_Identity=23.1404958677686, Blast_Score=98, Evalue=2e-20,
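
The homologue lines follow a regular comma-separated key=value layout, so they can be loaded into records for filtering or sorting. This is a minimal sketch under that assumption; parse_homologue is a hypothetical helper, not part of any database tooling.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field written without '='.
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1788107, Length=587, "
        "Percent_Identity=23.8500851788756, Blast_Score=117, Evalue=3e-27,")
hit = parse_homologue(line)
print(hit["Organism"], hit["Blast_Score"], hit["Evalue"])
```

With the lines parsed this way, hits can be ranked by Blast_Score or filtered by an E-value cutoff.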

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010071
- InterPro:   IPR009081
- InterPro:   IPR005814
- InterPro:   IPR020845
- InterPro:   IPR000873
- InterPro:   IPR023213
- InterPro:   IPR001242
- InterPro:   IPR018201
- InterPro:   IPR014031
- InterPro:   IPR014030
- InterPro:   IPR006163
- InterPro:   IPR020806
- InterPro:   IPR006162
- InterPro:   IPR015424
- InterPro:   IPR015421
- InterPro:   IPR015422
- InterPro:   IPR016039
- InterPro:   IPR016038 [H]

Pfam domain/function: PF00202 Aminotran_3; PF00501 AMP-binding; PF00668 Condensation; PF00109 ketoacyl-synt; PF02801 Ketoacyl-synt_C; PF00550 PP-binding [H]

EC number: 6.2.1.3 [C]

Molecular weight: Translated: 63781; Mature: 63781

Theoretical pI: Translated: 5.61; Mature: 5.61

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.1 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
2.1 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
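
The Cys/Met figures are residue counts expressed as a percentage of the protein length. A minimal sketch of that calculation follows; residue_percent is a hypothetical helper, and the example uses only the first 20 residues of the protein rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in cleaned if aa in residues.upper())
    return 100.0 * hits / len(cleaned)


fragment = "MVYMSNKIFTHSLPMRYADF"  # first 20 residues of the protein
print(residue_percent(fragment, "M"))   # 15.0
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "CM"))  # 15.0
```

Applying the same function to the full 577-residue sequence would give the whole-protein percentages.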

Secondary structure:

>Translated Secondary Structure
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAE
CEEECCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHH
AGAKRLLSLNLKKGDRVALIAETSSGFVEAFFACQYAGLVAVPLAIPMGVGQRDSWSAKL
HHHHEEEEEECCCCCEEEEEEECCCHHHHHHHHHHHCCCEEEHHHHCCCCCCCCCHHHHH
QGLLASCQPAAIITGDEWLPLVNAATHNNNPELHVLSHAWFKALPEADVVLQRPVPNDIA
HHHHHCCCCCEEEECCCCCEEEEEECCCCCCCEEEEEHHHHHHCCCCCEEEECCCCCCEE
YLQYTSGSTRFPRGVIITHHEVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLT
EEEECCCCCCCCCCEEEEHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHH
PVATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLS
HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCHHHHHHHCHH
CWRVAGIGAEPISAEQLHQFAECFRQVNFDDKTFMPCYGLAENALAVSFSDEACGVVVNE
HHEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHH
VDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRNEADIPVAERVVGHICISGP
HHHHHHHHCCCEECCCCHHHHHHHHHHHHHHCCCCCCEECCCCCCCHHHHHHHHHEECCC
SLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYI
HHHHCCCCCCCCCCHHEEECCCCCCCCHHEECCEEEEEECEEEEEEECCCCCCHHHHHHH
AEQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTADIDLL
HCCCCCCCCCCEEEEEEECCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECC
PPHSIPRTSSGKPARAEAKKRYQKAYAASLHVQESLA
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAE
CEEECCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHH
AGAKRLLSLNLKKGDRVALIAETSSGFVEAFFACQYAGLVAVPLAIPMGVGQRDSWSAKL
HHHHEEEEEECCCCCEEEEEEECCCHHHHHHHHHHHCCCEEEHHHHCCCCCCCCCHHHHH
QGLLASCQPAAIITGDEWLPLVNAATHNNNPELHVLSHAWFKALPEADVVLQRPVPNDIA
HHHHHCCCCCEEEECCCCCEEEEEECCCCCCCEEEEEHHHHHHCCCCCEEEECCCCCCEE
YLQYTSGSTRFPRGVIITHHEVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLT
EEEECCCCCCCCCCEEEEHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHH
PVATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLS
HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCHHHHHHHCHH
CWRVAGIGAEPISAEQLHQFAECFRQVNFDDKTFMPCYGLAENALAVSFSDEACGVVVNE
HHEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHH
VDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRNEADIPVAERVVGHICISGP
HHHHHHHHCCCEECCCCHHHHHHHHHHHHHHCCCCCCEECCCCCCCHHHHHHHHHEECCC
SLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYI
HHHHCCCCCCCCCCHHEEECCCCCCCCHHEECCEEEEEECEEEEEEECCCCCCHHHHHHH
AEQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTADIDLL
HCCCCCCCCCCEEEEEEECCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECC
PPHSIPRTSSGKPARAEAKKRYQKAYAASLHVQESLA
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC
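
The secondary structure block alternates residue lines with state lines. A minimal sketch for summarising the state lines is given here, assuming the usual three-state convention in which H is helix, E is strand, and C is coil (the record does not define the letters, so this reading is an assumption). The function name is hypothetical.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of positions in each predicted state (H, E, C)."""
    cleaned = "".join(states.split()).upper()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {state: counts.get(state, 0) / len(cleaned) for state in "HEC"}


# Toy state string, not taken from the record: 4 H, 2 E, 2 C.
print(state_fractions("CCHHHHEE"))
```

Concatenating every state line of the block and passing the result to this function would summarise the full prediction.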

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: Salts [C]

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): 0.0024 {myristate}; 0.0021 {myristate}; 0.0012 {myristate}; 0.0002 {myristate}; 0.0408 {octanoate}; 0.0125 {octanoate}; 0.0059 {octanoate}; 0.00072 {octanoate}; 0.0112 {hexanoate}; 0.0083 {decanoate}; 0.0063 {decanoate}; 0.0043 {dec

Substrates: ATP; a long-chain carboxylic acid; CoA [C]

Specific reaction: ATP + a long-chain carboxylic acid + CoA = AMP + diphosphate + an acyl-CoA [C]

General reaction: Acid-thiol ligation; Phosphorylation [C]

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 10557314 [H]