Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is mycA [H]

Identifier: 157162455

GI number: 157162455

Start: 3168187

End: 3169917

Strand: Direct

Name: mycA [H]

Synonym: EcHS_A3155

Alternate gene names: 157162455

Gene position: 3168187-3169917 (Clockwise)

Preceding gene: 157162453

Following gene: 157162456

Centisome position: 68.23
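
The coordinate fields above are consistent with each other. The sketch below checks this under one assumption that the record does not state: that the centisome position is the start coordinate expressed as a percentage of the genome length. The function names are illustrative only.

```python
# Values copied from the record's Length, Start and End fields.
GENOME_LENGTH = 4_643_538
GENE_START = 3_168_187
GENE_END = 3_169_917


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    n_bases = gene_length(GENE_START, GENE_END)
    print(n_bases)                                          # 1731
    print(n_bases // 3 - 1)                                 # 576 codons once the stop codon is excluded
    print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 68.23
```

The 1731-base length matches the header of the gene sequence, and 577 codons minus the stop codon gives the 576 residues reported for the protein.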

GC content: 52.92

Gene sequence:

>1731_bases
GTGGTGTATATGTCTAATAAAATCTTTACGCATTCCCTACCTATGCGCTATGCCGATTTTCCAACGCTGGTTGATGCTTT
GGACTACGCCGCTCTGAGTAGCGCCGGAATGAATTTTTATGACAGACGTTGCCAACTTGAAGATCAACTGGAATATCAGA
CGTTAAAAGCACGTGCCGAAGCTGTTGCGAAGCGGTTGTTATCGCTGAACCTGAAAAAAGGCGATCGCGTGGCACTGATT
GCCGAAACAAGTAGCGGGTTCGTAGAGGCTTTTTTTGTCTGCCAGTATGCCGGCTTAGTCGCCGTCCCGTTGGCGATTCC
AATGGGCGTTGGTCAGCGGGATTCCTGGAGCGCCAAATTGCAGGGTTTACTGGCAAGTTGCCAGCCCGCAGCCATTATCA
CTGGTGATGAGTGGTTGCCACTGGTCAATGCCGCGACGCATGACAACCCCGAATTACATGTTTTAAGCCACGCCTGGTTT
AAGGCATTACCGGAAGCCGATGTTGCGCTCCAGCGTCCAGTTCCGAACGATATCGCCTACCTCCAGTACACCTCCGGCAG
CACCCGTTTTCCCCGTGGCGTCATTATCACCCATCGCGAAGTAATGGCTAATCTACGTGCTATAAGCCACGACGGCATTA
AATTACGCCCTGGCGACCGCTGCGTCTCCTGGCTGCCTTTCTACCATGATATGGGACTGGTCGGCTTTCTCCTGACCCCC
GTCGCCACGCAGCTTTCAGTAGATTATTTGCGCACTCAGGATTTTGCCATGCGTCCTCTGCAATGGCTTAAATTGATCAG
TAAAAATCGCGGCACCGTTTCCGTTGCGCCGCCGTTTGGCTATGAATTGTGCCAGCGCCGCGTGAATGAAAAAGATCTCG
CTGAACTGGATCTTTCCTGCTGGCGCGTCGCTGGTATTGGTGCTGAGCCGATCTCCGCAGAACAACTCCATCAATTCGCT
GAATGTTTCCGTCAGGTTAACTTTGACGATAAAACGTTCATGCCGTGCTACGGACTGGCAGAAAATGCGCTGGCTGTCAG
CTTCTCTGATGAAGCCTCCGGGGTTGTGGTTAACGAAGTGGATCGCGACATCCTCGAATATCAGGGCAAAGCCGTCGCGC
CGGGTGCAGAGACACGCGCCGTATCGACTTTCGTCAACTGCGGCAAAGCGTTGCCGGAACATGGTATTGAAATCCGCAAT
GAAGCAGGTATGCCGGTCGCGGAACGTGTGGTAGGCCATATTTGCATCTCCGGTCCCAGTCTGATGAGCGGTTACTTTGG
CGACCAGGTTTCGCAAGACGAGATTGCCGCGACGGGCTGGTTAGACACCGGCGACCTCGGTTATCTGCTGGACGGTTATC
TGTATGTCACCGGACGCATTAAAGATCTGATTATTATTCGTGGCCGTAATATCTGGCCGCAGGATATTGAATATATTGCG
GAACAAGAACCGGAAATTCATTCTGGCGATGCGATTGCTTTTGTTACCGCCCAGGAAAAAATCATTTTGCAGATCCAGTG
TCGGATCAGCGACGAAGAACGTCGCGGGCAGCTTATCCACGCGCTGGCGGCACGGATCCAAAGCGAATTTGGCGTGACCG
CGGCTATCGATCTGTTGCCGCCCCACAGTATTCCCCGAACGTCCTCCGGCAAGCCTGCCCGTGCGGAAGCGAAAAAACGT
TATCAGAAGGCTTATGCTGCCAGTCTTAATGTGCAGGAATCCCTGGCATGA
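
A value like the GC content field can be recomputed from a sequence such as the one above. This is a minimal sketch, assuming GC content is the percentage of G and C bases over the full gene sequence; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Toy input; paste the 1731-base gene sequence here to compare with the record.
    print(gc_content("ATGC"))  # 50.0
```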

Upstream 100 bases:

>100_bases
TGTTCATTTTCACACTGATGAAAATACTATCTCATTTATCTTTTATGCCTATATAATCCACCGCATTATTGTCATTTTAT
TTTGGATATCCAATTTTTTC

Downstream 100 bases:

>100_bases
ATCAAACTGTCGCGGTGACGGGCGCTACCGGGTTTATCGGTAAATATATTATCGATAACCTGCTCGCCCGCGGCTTTCAC
GTTCGCGCATTGACGCGTAC

Product: acyl-CoA synthetase

Products: AMP; diphosphate; an acyl-CoA [C]

Alternate protein names: Glutamate-1-semialdehyde aminotransferase; GSA-AT; ATP-dependent asparagine adenylase 1; AsnA 1; Asparagine activase 1 [H]

Number of amino acids: Translated: 576; Mature: 576

Protein sequence:

>576_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAVAKRLLSLNLKKGDRVALI
AETSSGFVEAFFVCQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHDNPELHVLSHAWF
KALPEADVALQRPVPNDIAYLQYTSGSTRFPRGVIITHREVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTP
VATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQFA
ECFRQVNFDDKTFMPCYGLAENALAVSFSDEASGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRN
EAGMPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYIA
EQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTAAIDLLPPHSIPRTSSGKPARAEAKKR
YQKAYAASLNVQESLA
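
The protein sequence follows from the gene sequence by standard translation. The sketch below builds the standard codon table and treats the first codon as the initiator, so the leading GTG is read as Met, as is conventional for bacterial start codons. `translate_cds` is an illustrative name, and the approach is a plain reimplementation, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the initiator codon as Met."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = ["M"]  # initiator codon (ATG, GTG or TTG in bacteria) is read as Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence plus its TGA stop codon.
    print(translate_cds("GTGGTGTATATGTCTAATAAATGA"))  # MVYMSNK
```

The output matches the first seven residues of the protein sequence above.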

Sequences:

>Translated_576_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAVAKRLLSLNLKKGDRVALI
AETSSGFVEAFFVCQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHDNPELHVLSHAWF
KALPEADVALQRPVPNDIAYLQYTSGSTRFPRGVIITHREVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTP
VATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQFA
ECFRQVNFDDKTFMPCYGLAENALAVSFSDEASGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRN
EAGMPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYIA
EQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTAAIDLLPPHSIPRTSSGKPARAEAKKR
YQKAYAASLNVQESLA
>Mature_576_residues
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAEAVAKRLLSLNLKKGDRVALI
AETSSGFVEAFFVCQYAGLVAVPLAIPMGVGQRDSWSAKLQGLLASCQPAAIITGDEWLPLVNAATHDNPELHVLSHAWF
KALPEADVALQRPVPNDIAYLQYTSGSTRFPRGVIITHREVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTP
VATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSCWRVAGIGAEPISAEQLHQFA
ECFRQVNFDDKTFMPCYGLAENALAVSFSDEASGVVVNEVDRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRN
EAGMPVAERVVGHICISGPSLMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYIA
EQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTAAIDLLPPHSIPRTSSGKPARAEAKKR
YQKAYAASLNVQESLA

Specific function: This protein is a multifunctional enzyme, able to activate a long-chain fatty acid and link it with the amino acid Asn as part of the synthesis of mycosubtilin. The activation sites consist of individual domains [H]

COG id: COG0318

COG function: function code IQ; Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II

Gene ontology:

Cell location: Partially Membrane-Associated [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 4 acyl carrier domains [H]

Homologues:

Organism=Homo sapiens, GI156151445, Length=565, Percent_Identity=22.8318584070796, Blast_Score=86, Evalue=8e-17,
Organism=Homo sapiens, GI45827692, Length=570, Percent_Identity=24.0350877192982, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI225735629, Length=570, Percent_Identity=24.0350877192982, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI55749758, Length=572, Percent_Identity=21.8531468531469, Blast_Score=75, Evalue=2e-13,
Organism=Escherichia coli, GI1788107, Length=587, Percent_Identity=24.5315161839864, Blast_Score=118, Evalue=1e-27,
Organism=Escherichia coli, GI145693145, Length=478, Percent_Identity=25.3138075313808, Blast_Score=74, Evalue=2e-14,
Organism=Caenorhabditis elegans, GI17559526, Length=551, Percent_Identity=22.6860254083485, Blast_Score=103, Evalue=4e-22,
Organism=Caenorhabditis elegans, GI17531443, Length=558, Percent_Identity=23.2974910394265, Blast_Score=95, Evalue=1e-19,
Organism=Caenorhabditis elegans, GI32563687, Length=509, Percent_Identity=22.5933202357564, Blast_Score=87, Evalue=2e-17,
Organism=Drosophila melanogaster, GI18859661, Length=399, Percent_Identity=23.8095238095238, Blast_Score=106, Evalue=4e-23,
Organism=Drosophila melanogaster, GI24654656, Length=602, Percent_Identity=23.0897009966777, Blast_Score=99, Evalue=1e-20,
Organism=Drosophila melanogaster, GI24581924, Length=388, Percent_Identity=24.2268041237113, Blast_Score=76, Evalue=8e-14,
Organism=Drosophila melanogaster, GI20130357, Length=539, Percent_Identity=23.191094619666, Blast_Score=70, Evalue=4e-12,
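
The homologue lines above follow a regular comma-separated layout, so they can be loaded into records for sorting or filtering. The sketch below assumes exactly the layout shown (an `Organism=` field, a bare `GI…` token, then `key=value` pairs); the `Homologue` class and `parse_homologue` function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Homologue:
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


def parse_homologue(line: str) -> Homologue:
    """Parse one 'Organism=..., GI..., Length=..., ...' line from the record."""
    fields = {}
    for part in line.strip().rstrip(",").split(", "):
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare token: GI number without a key
    return Homologue(
        organism=fields["Organism"],
        gi=fields["GI"],
        length=int(fields["Length"]),
        percent_identity=float(fields["Percent_Identity"]),
        blast_score=int(fields["Blast_Score"]),
        evalue=float(fields["Evalue"]),
    )


if __name__ == "__main__":
    hit = parse_homologue(
        "Organism=Escherichia coli, GI1788107, Length=587, "
        "Percent_Identity=24.5315161839864, Blast_Score=118, Evalue=1e-27,"
    )
    print(hit.organism, hit.gi, round(hit.percent_identity, 2))
```

Parsed records can then be ranked, for example with `sorted(hits, key=lambda h: h.evalue)`.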

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010071
- InterPro:   IPR009081
- InterPro:   IPR005814
- InterPro:   IPR020845
- InterPro:   IPR000873
- InterPro:   IPR023213
- InterPro:   IPR001242
- InterPro:   IPR018201
- InterPro:   IPR014031
- InterPro:   IPR014030
- InterPro:   IPR006163
- InterPro:   IPR020806
- InterPro:   IPR006162
- InterPro:   IPR015424
- InterPro:   IPR015421
- InterPro:   IPR015422
- InterPro:   IPR016039
- InterPro:   IPR016038 [H]

Pfam domain/function: PF00202 Aminotran_3; PF00501 AMP-binding; PF00668 Condensation; PF00109 ketoacyl-synt; PF02801 Ketoacyl-synt_C; PF00550 PP-binding [H]

EC number: 6.2.1.3 [C]

Molecular weight: Translated: 63606; Mature: 63606

Theoretical pI: Translated: 5.64; Mature: 5.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
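
Percentages like the ones above can be recomputed from a protein sequence. This is a minimal sketch, assuming each value is the count of the named residues divided by the total number of residues; `residue_percent` is an illustrative helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any residue in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)


if __name__ == "__main__":
    # Toy input; paste the 576-residue sequence here to compare with the record.
    toy = "MCAA"
    print(residue_percent(toy, "C"))   # 25.0
    print(residue_percent(toy, "M"))   # 25.0
    print(residue_percent(toy, "CM"))  # 50.0
```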

Secondary structure:

>Translated Secondary Structure
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAE
CEEECCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHH
AVAKRLLSLNLKKGDRVALIAETSSGFVEAFFVCQYAGLVAVPLAIPMGVGQRDSWSAKL
HHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
QGLLASCQPAAIITGDEWLPLVNAATHDNPELHVLSHAWFKALPEADVALQRPVPNDIAY
HHHHHCCCCCEEEECCCCCCEEECCCCCCCCEEEEHHHHHHHCCCCCCEECCCCCCCEEE
LQYTSGSTRFPRGVIITHREVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTP
EEECCCCCCCCCCEEEECHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHHH
VATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSC
HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCHHHHHHHCHHH
WRVAGIGAEPISAEQLHQFAECFRQVNFDDKTFMPCYGLAENALAVSFSDEASGVVVNEV
HEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEHHH
DRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRNEAGMPVAERVVGHICISGPS
HHHHHHHCCCEECCCCHHHHHHHHHHHHHHCCCCCCEECCCCCCCHHHHHHHHHEECCCH
LMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYIA
HHHCCCCCCCCCCHHEEECCCCCCCCHHEECCEEEEEECEEEEEEECCCCCCHHHHHHHH
EQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTAAIDLLP
CCCCCCCCCCEEEEEEECCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECCC
PHSIPRTSSGKPARAEAKKRYQKAYAASLNVQESLA
CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHCC
>Mature Secondary Structure
MVYMSNKIFTHSLPMRYADFPTLVDALDYAALSSAGMNFYDRRCQLEDQLEYQTLKARAE
CEEECCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHH
AVAKRLLSLNLKKGDRVALIAETSSGFVEAFFVCQYAGLVAVPLAIPMGVGQRDSWSAKL
HHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH
QGLLASCQPAAIITGDEWLPLVNAATHDNPELHVLSHAWFKALPEADVALQRPVPNDIAY
HHHHHCCCCCEEEECCCCCCEEECCCCCCCCEEEEHHHHHHHCCCCCCEECCCCCCCEEE
LQYTSGSTRFPRGVIITHREVMANLRAISHDGIKLRPGDRCVSWLPFYHDMGLVGFLLTP
EEECCCCCCCCCCEEEECHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHHHHH
VATQLSVDYLRTQDFAMRPLQWLKLISKNRGTVSVAPPFGYELCQRRVNEKDLAELDLSC
HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCHHHHHHHCHHH
WRVAGIGAEPISAEQLHQFAECFRQVNFDDKTFMPCYGLAENALAVSFSDEASGVVVNEV
HEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEHHH
DRDILEYQGKAVAPGAETRAVSTFVNCGKALPEHGIEIRNEAGMPVAERVVGHICISGPS
HHHHHHHCCCEECCCCHHHHHHHHHHHHHHCCCCCCEECCCCCCCHHHHHHHHHEECCCH
LMSGYFGDQVSQDEIAATGWLDTGDLGYLLDGYLYVTGRIKDLIIIRGRNIWPQDIEYIA
HHHCCCCCCCCCCHHEEECCCCCCCCHHEECCEEEEEECEEEEEEECCCCCCHHHHHHHH
EQEPEIHSGDAIAFVTAQEKIILQIQCRISDEERRGQLIHALAARIQSEFGVTAAIDLLP
CCCCCCCCCCEEEEEEECCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECCC
PHSIPRTSSGKPARAEAKKRYQKAYAASLNVQESLA
CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHCC
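
The per-residue state strings above are easier to read as runs of consecutive identical states. The sketch below collapses a state string into such runs. It assumes the conventional three-state code (H for helix, E for strand, C for coil), which the record itself does not define; `structure_runs` is an illustrative name.

```python
from itertools import groupby


def structure_runs(states: str) -> list[tuple[str, int]]:
    """Collapse a per-residue state string into (state, run_length) pairs."""
    return [(state, len(list(group))) for state, group in groupby(states)]


if __name__ == "__main__":
    # First 11 states of the predicted secondary structure above.
    print(structure_runs("CEEECCEEEEE"))
    # [('C', 1), ('E', 3), ('C', 2), ('E', 5)]
```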

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: Salts [C]

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): 0.0024 {myristate}; 0.0021 {myristate}; 0.0012 {myristate}; 0.0002 {myristate}; 0.0408 {octanoate}; 0.0125 {octanoate}; 0.0059 {octanoate}; 0.00072 {octanoate}; 0.0112 {hexanoate}; 0.0083 {decanoate}; 0.0063 {decanoate}; 0.0043 {dec

Substrates: ATP; a long-chain carboxylic acid; CoA [C]

Specific reaction: ATP + a long-chain carboxylic acid + CoA = AMP + diphosphate + an acyl-CoA [C]

General reaction: Acid-thiol ligation; Phosphorylation [C]

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 10557314 [H]