Definition: Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession: NC_003062
Length: 2,841,580

The map label for this gene is nifS [H]

Identifier: 159184937

GI number: 159184937

Start: 1804777

End: 1805946

Strand: Reverse

Name: nifS [H]

Synonym: Atu1825

Alternate gene names: 159184937

Gene position: 1805946-1804777 (Counterclockwise)

Preceding gene: 159184938

Following gene: 159184936

Centisome position: 63.55
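
The coordinate fields above agree with one another, which a short sketch can check. It assumes 1-based inclusive coordinates, and it assumes the centisome position is the first base of the gene in reading direction (the higher coordinate here, because the gene lies on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative and are not taken from the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the replicon length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Values taken from this record.
START, END = 1804777, 1805946
GENOME_LENGTH = 2841580  # NC_003062

length = gene_length(START, END)
# Assumption: for a reverse-strand gene the centisome is reported from the
# higher coordinate, which is where the coding sequence begins.
position = centisome(END, GENOME_LENGTH)

print(length)    # 1170
print(position)  # 63.55
```

Using the lower coordinate (1804777) instead gives 63.51, so the listed value of 63.55 appears to follow the reading-direction convention assumed here.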

GC content: 63.93

Gene sequence:

>1170_bases
ATGACGGGTTTAACGCGTACATATATGGACTGGAACGCCACGGCGCCGCTTCTGCCTGCCGTGCGGGACATCCTTGTGTC
CGCGCTTGATCTTGCCGGCAATCCGTCTTCCGTCCACCGGGAAGGGCGCGCTGCCCGCGCCGCCGTGGAGGCTGCCCGCC
GTGACGTCGCCGCCCTTGCCGGCGCGCAGGCATCCCATGTCACCTTCACCAGCGGTGCGACCGAAGCGGCCAATCTGGTT
CTGACGACGGACTTCAAAATGGGGCGCGCGCCTGTGCGTTATGGCCGGCTTTACGTGTCCGCCATCGAGCACCCCGCGTT
TCGCGAAGGCGGCCGCTTTGAGAAGGATGACGTTACGGAAGTTTCCGTTACGTCAGCGGGCGTCATCGATCTTACCGCGC
TTGAGGCGCTGCTGTCGTCGCATGACAAATCCGCTGGCTTGCCGATGGTCGCCTGCATGCTGGTCAATAACGAGACCGGC
ATTCTCCAGCCGGTGGCGGAAGCCGCGCGGCTCGTGCACGCCGCTGGCGGCTTGATGGTTGTCGATGCCGTGCAGGCGGC
GGGCCGCATTCCGCTCGATATCAATGATCTCGATGCGGATTTCCTCGTCCTCTCGTCCCACAAGCTTGGTGGCCCCAAGG
GTGCCGGCGCTCTCATCTCGCGCGGCGAGGTGATGATGCCGAAGCCGCTGATCCATGGCGGCGGGCAGGAAAAGGGACAT
CGTTCCGGCACGGAAAATACTCTCTCGGTCATCGGTTTTGGTGCCGCCGCTGCCGTGGCGGCCGAGTACCTGGCGGGTGA
AGCGGCGCGTCTTGGCGCGCTGCGGGCGAAGCTGGAGGACGGCATGCGGGTTAATGCGCCTGATGTTATCATCCATGGTG
CGGATGTTGCCCGTGTGGGCAATACGACGTTCTTCACCCTGCCGGGCCTGAAAGCGGAAACGGGACAGATCGCCTTCGAC
ATAGAAGGCATTGCACTCTCGGCCGGTTCGGCCTGCTCCTCGGGCAAGGTGGGCGAAAGCCATGTGCTGACGGCGATGGG
GCACGATCCCAAGCTGGGCGCGCTCAGGATTTCCCTCGGTCACGCGACTGACGAGGCGGATATAGACAGGACGCTTGTGG
CGTTCACGAAAATTGCCGGGCGGCGTAAGCTGTCCGGTCAGGCCGCGTAA

Upstream 100 bases:

>100_bases
GTGTGCTCAAGCACTCGGGTCGTGTCATAAGGCAAGCGCAACTGTAATTTCAAGAAAAATGCGTTTTGGCGCATATAAGA
TTGAGCGAAGCTTTTCCGCT

Downstream 100 bases:

>100_bases
AATGCGGCAAAGAATTGCGGAATAGCAATTTTCCGCTTGCCGGGATGTGTGAAAAGCGCCATCCCGCACTTACATGGGTG
TAAGCTCCGCCCCGTTTATA

Product: cysteine desulfurase

Products: NA

Alternate protein names: Nitrogenase metalloclusters biosynthesis protein NifS [H]

Number of amino acids: Translated: 389; Mature: 388

Protein sequence:

>389_residues
MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV
LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG
ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD
IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA

Sequences:

>Translated_389_residues
MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV
LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG
ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD
IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA
>Mature_388_residues
TGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLVL
TTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGI
LQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGHR
SGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFDI
EGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA
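
The translated and mature sequences above are related in a simple way: the 1170-base gene encodes 389 codons plus a stop codon, and the mature form lacks the initiator methionine. The sketch below illustrates this under two assumptions: the standard codon assignments are used, and the mature protein is obtained only by removing a leading Met. The names `translate` and `mature` are illustrative.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Assumption: maturation here means only removal of the initiator Met."""
    return protein[1:] if protein.startswith("M") else protein


# First 30 bases of the gene sequence in this record, plus its TAA stop codon.
demo = translate("ATGACGGGTTTAACGCGTACATATATGGAC" + "TAA")
print(demo)          # MTGLTRTYMD
print(mature(demo))  # TGLTRTYMD
```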

Specific function: Catalyzes the removal of elemental sulfur atoms from cysteine to produce alanine. Seems to participate in the biosynthesis of the nitrogenase metalloclusters by providing the inorganic sulfur required for the Fe-S core formation [H]

COG id: COG1104

COG function: function code E; Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-V pyridoxal-phosphate-dependent aminotransferase family. NifS/IscS subfamily [H]

Homologues:

Organism=Homo sapiens, GI32307132, Length=373, Percent_Identity=36.9973190348526, Blast_Score=208, Evalue=6e-54
Organism=Homo sapiens, GI156713448, Length=403, Percent_Identity=31.5136476426799, Blast_Score=158, Evalue=1e-38
Organism=Escherichia coli, GI48994898, Length=396, Percent_Identity=34.8484848484849, Blast_Score=192, Evalue=4e-50
Organism=Escherichia coli, GI1787970, Length=371, Percent_Identity=25.3369272237197, Blast_Score=82, Evalue=7e-17
Organism=Escherichia coli, GI1789175, Length=390, Percent_Identity=27.4358974358974, Blast_Score=78, Evalue=9e-16
Organism=Caenorhabditis elegans, GI25143064, Length=375, Percent_Identity=33.6, Blast_Score=180, Evalue=9e-46
Organism=Caenorhabditis elegans, GI17533177, Length=311, Percent_Identity=31.1897106109325, Blast_Score=122, Evalue=3e-28
Organism=Saccharomyces cerevisiae, GI6319831, Length=387, Percent_Identity=33.5917312661499, Blast_Score=179, Evalue=6e-46
Organism=Drosophila melanogaster, GI20129463, Length=372, Percent_Identity=33.8709677419355, Blast_Score=181, Evalue=5e-46
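
The homologue entries above follow a regular comma-separated layout, so they can be turned into records for sorting or filtering. This is a minimal sketch that assumes every entry has the fields seen in this record and that organism names contain no commas; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' entry into a dict."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",")]
    record = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # the GI number has no 'key=' prefix
    # Convert the numeric fields; a missing field raises KeyError.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = float(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI32307132, Length=373, "
    "Percent_Identity=36.9973190348526, Blast_Score=208, Evalue=6e-54,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

A trailing comma on the entry is tolerated, so the function works on the lines with or without it.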

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000192
- InterPro:   IPR020578
- InterPro:   IPR017772
- InterPro:   IPR016454
- InterPro:   IPR015424
- InterPro:   IPR015421
- InterPro:   IPR015422 [H]

Pfam domain/function: PF00266 Aminotran_5 [H]

EC number: 2.8.1.7 [H]

Molecular weight: Translated: 40102; Mature: 39971
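
The 131 Da gap between the translated and mature molecular weights above corresponds to one methionine residue. The sketch below shows how such weights can be estimated from a sequence. It assumes standard average residue masses plus one water molecule per chain; the source database may use a slightly different mass table, so small differences from the listed values are possible.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing the initiator Met lowers the weight by one Met residue mass.
with_met = molecular_weight("MTGLTRTYMD")
without_met = molecular_weight("TGLTRTYMD")
print(round(with_met - without_met, 2))  # 131.19
```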

Theoretical pI: Translated: 6.65; Mature: 6.65

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
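
The percentages above can be reproduced by counting residues in the protein sequence of this record. A minimal sketch follows, assuming each value is the residue count divided by the sequence length and rounded to one decimal; `cys_met_content` is an illustrative name.

```python
# Translated protein sequence copied from this record (389 residues).
TRANSLATED = (
    "MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV"
    "LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG"
    "ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH"
    "RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD"
    "IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA"
)


def cys_met_content(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(cys_met_content(TRANSLATED))      # (0.5, 2.6, 3.1)
# Assumption: the mature protein is the translated one without its first Met.
print(cys_met_content(TRANSLATED[1:]))  # (0.5, 2.3, 2.8)
```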

Secondary structure:

>Translated Secondary Structure
MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALA
CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
GAQASHVTFTSGATEAANLVLTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTE
CCCCCEEEEECCCCCCCEEEEEECCCCCCCCHHHHHEEEEHHCCCCCCCCCCCCCCCCCE
VSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGILQPVAEAARLVHAAGGLMV
EEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCEEE
VDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
EEHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCEECCCHHCCCCCCCCCC
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVG
CCCCCCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHCCCEECCCEEEEECCCEEECC
NTTFFTLPGLKAETGQIAFDIEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLG
CEEEEEECCCCCCCCEEEEEECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEC
HATDEADIDRTLVAFTKIAGRRKLSGQAA
CCCCCHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure 
TGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALA
CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
GAQASHVTFTSGATEAANLVLTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTE
CCCCCEEEEECCCCCCCEEEEEECCCCCCCCHHHHHEEEEHHCCCCCCCCCCCCCCCCCE
VSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGILQPVAEAARLVHAAGGLMV
EEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCEEE
VDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
EEHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCEECCCHHCCCCCCCCCC
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVG
CCCCCCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHCCCEECCCEEEEECCCEEECC
NTTFFTLPGLKAETGQIAFDIEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLG
CEEEEEECCCCCCCCEEEEEECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEC
HATDEADIDRTLVAFTKIAGRRKLSGQAA
CCCCCHHHHHHHHHHHHHHCCCCCCCCCC
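
The state strings above can be summarised as a composition, which is one way to relate them to a structure class such as the one listed in this record. The sketch assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record itself does not spell out; `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()  # join wrapped lines
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {
        state: round(100.0 * counts.get(state, 0) / len(states), 1)
        for state in "HEC"
    }


# Toy input; pass the concatenated H/E/C lines of the record for real values.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Only the state lines should be passed in, not the interleaved residue lines.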

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9163424 [H]