Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is nifS [H]
Identifier: 159184937
GI number: 159184937
Start: 1804777
End: 1805946
Strand: Reverse
Name: nifS [H]
Synonym: Atu1825
Alternate gene names: 159184937
Gene position: 1805946-1804777 (Counterclockwise)
Preceding gene: 159184938
Following gene: 159184936
Centisome position: 63.55
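The centisome value above can be reproduced from the coordinates in this record. The following is a minimal sketch, assuming the centisome is the gene's 5' coordinate expressed as a percentage of the chromosome length; because the gene is on the reverse strand, the 5' end is taken to be the higher coordinate (the `End` field). The function and constant names are illustrative and do not come from the database.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a coordinate as a percentage (0-100) of the chromosome length."""
    if not 1 <= position <= genome_length:
        raise ValueError("position must lie within the chromosome")
    return 100.0 * position / genome_length


# Values copied from the record above.
GENOME_LENGTH = 2_841_580
GENE_START, GENE_END = 1_804_777, 1_805_946

# Assumption: for a reverse-strand gene the 5' end is the higher coordinate.
value = centisome(GENE_END, GENOME_LENGTH)
print(f"{value:.2f}")  # 63.55
```

Using the `Start` coordinate instead gives 63.51, which is why the higher coordinate is assumed here.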
GC content: 63.93
Gene sequence:
>1170_bases ATGACGGGTTTAACGCGTACATATATGGACTGGAACGCCACGGCGCCGCTTCTGCCTGCCGTGCGGGACATCCTTGTGTC CGCGCTTGATCTTGCCGGCAATCCGTCTTCCGTCCACCGGGAAGGGCGCGCTGCCCGCGCCGCCGTGGAGGCTGCCCGCC GTGACGTCGCCGCCCTTGCCGGCGCGCAGGCATCCCATGTCACCTTCACCAGCGGTGCGACCGAAGCGGCCAATCTGGTT CTGACGACGGACTTCAAAATGGGGCGCGCGCCTGTGCGTTATGGCCGGCTTTACGTGTCCGCCATCGAGCACCCCGCGTT TCGCGAAGGCGGCCGCTTTGAGAAGGATGACGTTACGGAAGTTTCCGTTACGTCAGCGGGCGTCATCGATCTTACCGCGC TTGAGGCGCTGCTGTCGTCGCATGACAAATCCGCTGGCTTGCCGATGGTCGCCTGCATGCTGGTCAATAACGAGACCGGC ATTCTCCAGCCGGTGGCGGAAGCCGCGCGGCTCGTGCACGCCGCTGGCGGCTTGATGGTTGTCGATGCCGTGCAGGCGGC GGGCCGCATTCCGCTCGATATCAATGATCTCGATGCGGATTTCCTCGTCCTCTCGTCCCACAAGCTTGGTGGCCCCAAGG GTGCCGGCGCTCTCATCTCGCGCGGCGAGGTGATGATGCCGAAGCCGCTGATCCATGGCGGCGGGCAGGAAAAGGGACAT CGTTCCGGCACGGAAAATACTCTCTCGGTCATCGGTTTTGGTGCCGCCGCTGCCGTGGCGGCCGAGTACCTGGCGGGTGA AGCGGCGCGTCTTGGCGCGCTGCGGGCGAAGCTGGAGGACGGCATGCGGGTTAATGCGCCTGATGTTATCATCCATGGTG CGGATGTTGCCCGTGTGGGCAATACGACGTTCTTCACCCTGCCGGGCCTGAAAGCGGAAACGGGACAGATCGCCTTCGAC ATAGAAGGCATTGCACTCTCGGCCGGTTCGGCCTGCTCCTCGGGCAAGGTGGGCGAAAGCCATGTGCTGACGGCGATGGG GCACGATCCCAAGCTGGGCGCGCTCAGGATTTCCCTCGGTCACGCGACTGACGAGGCGGATATAGACAGGACGCTTGTGG CGTTCACGAAAATTGCCGGGCGGCGTAAGCTGTCCGGTCAGGCCGCGTAA
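The GC content field can be recomputed from a sequence line in the layout used above (a `>` header token followed by space-separated blocks of bases). This is a rough sketch under that layout assumption; `gc_content` is a hypothetical helper, not a tool used by the database.

```python
def gc_content(record: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in a sequence record.

    Tokens starting with '>' are treated as headers and skipped; whitespace
    between sequence blocks is ignored.
    """
    tokens = [t for t in record.split() if not t.startswith(">")]
    bases = [b for b in "".join(tokens).upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input in the same layout as the record (header, then spaced blocks).
print(gc_content(">10_bases GGCC AATT GC"))  # 60.0
```

Passing the full 1170-base gene sequence line to this function should give a value comparable to the reported 63.93, assuming the database uses the same definition.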
Upstream 100 bases:
>100_bases GTGTGCTCAAGCACTCGGGTCGTGTCATAAGGCAAGCGCAACTGTAATTTCAAGAAAAATGCGTTTTGGCGCATATAAGA TTGAGCGAAGCTTTTCCGCT
Downstream 100 bases:
>100_bases AATGCGGCAAAGAATTGCGGAATAGCAATTTTCCGCTTGCCGGGATGTGTGAAAAGCGCCATCCCGCACTTACATGGGTG TAAGCTCCGCCCCGTTTATA
Product: cysteine desulfurase
Products: NA
Alternate protein names: Nitrogenase metalloclusters biosynthesis protein NifS [H]
Number of amino acids: Translated: 389; Mature: 388
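The two residue counts follow from the gene coordinates. A small sketch of the arithmetic, assuming the coordinates are 1-based and inclusive, that the final codon is a stop codon (the gene sequence ends in TAA), and that the mature form differs only by removal of the initiator methionine (consistent with the mature sequence listed in this record, which starts at the second residue):

```python
def protein_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (bases, translated residues, mature residues) for a gene.

    Coordinates are assumed to be 1-based and inclusive.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    translated = n_bases // 3 - 1   # drop the stop codon
    mature = translated - 1         # assumed loss of the initiator Met
    return n_bases, translated, mature


print(protein_lengths(1_804_777, 1_805_946))  # (1170, 389, 388)
```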
Protein sequence:
>389_residues MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA
Sequences:
>Translated_389_residues MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA
>Mature_388_residues TGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLVL TTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGI LQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGHR SGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFDI EGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA
Specific function: Catalyzes the removal of elemental sulfur atoms from cysteine to produce alanine. Seems to participate in the biosynthesis of the nitrogenase metalloclusters by providing the inorganic sulfur required for the Fe-S core formation [H]
COG id: COG1104
COG function: function code E; Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-V pyridoxal-phosphate-dependent aminotransferase family. NifS/IscS subfamily [H]
Homologues:
Organism=Homo sapiens, GI32307132, Length=373, Percent_Identity=36.9973190348526, Blast_Score=208, Evalue=6e-54
Organism=Homo sapiens, GI156713448, Length=403, Percent_Identity=31.5136476426799, Blast_Score=158, Evalue=1e-38
Organism=Escherichia coli, GI48994898, Length=396, Percent_Identity=34.8484848484849, Blast_Score=192, Evalue=4e-50
Organism=Escherichia coli, GI1787970, Length=371, Percent_Identity=25.3369272237197, Blast_Score=82, Evalue=7e-17
Organism=Escherichia coli, GI1789175, Length=390, Percent_Identity=27.4358974358974, Blast_Score=78, Evalue=9e-16
Organism=Caenorhabditis elegans, GI25143064, Length=375, Percent_Identity=33.6, Blast_Score=180, Evalue=9e-46
Organism=Caenorhabditis elegans, GI17533177, Length=311, Percent_Identity=31.1897106109325, Blast_Score=122, Evalue=3e-28
Organism=Saccharomyces cerevisiae, GI6319831, Length=387, Percent_Identity=33.5917312661499, Blast_Score=179, Evalue=6e-46
Organism=Drosophila melanogaster, GI20129463, Length=372, Percent_Identity=33.8709677419355, Blast_Score=181, Evalue=5e-46
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000192
- InterPro: IPR020578
- InterPro: IPR017772
- InterPro: IPR016454
- InterPro: IPR015424
- InterPro: IPR015421
- InterPro: IPR015422 [H]
Pfam domain/function: PF00266 Aminotran_5 [H]
EC number: 2.8.1.7 [H]
Molecular weight: Translated: 40102; Mature: 39971
Theoretical pI: Translated: 6.65; Mature: 6.65
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
2.8 %Cys+Met (Mature Protein)
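The Cys/Met percentages can be checked directly against the protein sequence given in this record. A minimal sketch, assuming each percentage is the residue count divided by the sequence length, rounded to one decimal place, and that the mature protein is the translated sequence without its first residue; `cys_met_content` is an illustrative name.

```python
# Translated protein sequence copied from the record (389 residues).
TRANSLATED = "".join((
    "MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALAGAQASHVTFTSGATEAANLV",
    "LTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTEVSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETG",
    "ILQPVAEAARLVHAAGGLMVVDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH",
    "RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVGNTTFFTLPGLKAETGQIAFD",
    "IEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLGHATDEADIDRTLVAFTKIAGRRKLSGQAA",
))
MATURE = TRANSLATED[1:]  # assumed: initiator Met removed


def cys_met_content(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    pct = lambda count: round(100.0 * count / n, 1)
    return pct(cys), pct(met), pct(cys + met)


print(cys_met_content(TRANSLATED))  # (0.5, 2.6, 3.1)
print(cys_met_content(MATURE))      # (0.5, 2.3, 2.8)
```

The combined figure is computed from the summed counts rather than by adding the two rounded percentages, which avoids compounding the rounding error.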
Secondary structure:
>Translated Secondary Structure
MTGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALA
CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
GAQASHVTFTSGATEAANLVLTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTE
CCCCCEEEEECCCCCCCEEEEEECCCCCCCCHHHHHEEEEHHCCCCCCCCCCCCCCCCCE
VSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGILQPVAEAARLVHAAGGLMV
EEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCEEE
VDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
EEHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCEECCCHHCCCCCCCCCC
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVG
CCCCCCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHCCCEECCCEEEEECCCEEECC
NTTFFTLPGLKAETGQIAFDIEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLG
CEEEEEECCCCCCCCEEEEEECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEC
HATDEADIDRTLVAFTKIAGRRKLSGQAA
CCCCCHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
TGLTRTYMDWNATAPLLPAVRDILVSALDLAGNPSSVHREGRAARAAVEAARRDVAALA
CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
GAQASHVTFTSGATEAANLVLTTDFKMGRAPVRYGRLYVSAIEHPAFREGGRFEKDDVTE
CCCCCEEEEECCCCCCCEEEEEECCCCCCCCHHHHHEEEEHHCCCCCCCCCCCCCCCCCE
VSVTSAGVIDLTALEALLSSHDKSAGLPMVACMLVNNETGILQPVAEAARLVHAAGGLMV
EEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCEEE
VDAVQAAGRIPLDINDLDADFLVLSSHKLGGPKGAGALISRGEVMMPKPLIHGGGQEKGH
EEHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHCCCCEECCCHHCCCCCCCCCC
RSGTENTLSVIGFGAAAAVAAEYLAGEAARLGALRAKLEDGMRVNAPDVIIHGADVARVG
CCCCCCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHCCCEECCCEEEEECCCEEECC
NTTFFTLPGLKAETGQIAFDIEGIALSAGSACSSGKVGESHVLTAMGHDPKLGALRISLG
CEEEEEECCCCCCCCEEEEEECCEEEECCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEC
HATDEADIDRTLVAFTKIAGRRKLSGQAA
CCCCCHHHHHHHHHHHHHHCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9163424 [H]