Definition | Nocardioides sp. JS614 chromosome, complete genome. |
---|---|
Accession | NC_008699 |
Length | 4,985,871 |
The map label for this gene is smf [C]
Identifier: 119717470
GI number: 119717470
Start: 3447925
End: 3449061
Strand: Reverse
Name: smf [C]
Synonym: Noca_3246
Alternate gene names: 119717470
Gene position: 3449061-3447925 (Counterclockwise)
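As a consistency check, the gene length can be derived from the coordinates above. This is a minimal sketch that assumes Start and End are 1-based and inclusive, which agrees with the 1137-base sequence listed in this record; the function name `gene_length` is illustrative and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates.

    The order of the two coordinates does not matter, so the
    reverse-strand notation (3449061-3447925) works as well.
    """
    lo, hi = sorted((start, end))
    return hi - lo + 1


print(gene_length(3447925, 3449061))  # 1137
```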
Preceding gene: 119717471
Following gene: 119717469
Centisome position: 69.18
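The centisome value appears to be the gene's position expressed as a percentage of the chromosome length. The sketch below assumes the position used is the 5' end of the reverse-strand gene (3449061) and that the chromosome length is the 4,985,871 bp given in the header; both the assumption and the function name are illustrative.

```python
GENOME_LENGTH = 4_985_871  # chromosome length from the record header


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the chromosome, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


# Assumption: the reverse-strand gene start (3449061) is the reference point.
print(centisome(3449061))  # 69.18
```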
GC content: 76.69
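GC content is the percentage of G and C bases in the gene sequence. The following is a minimal sketch, not necessarily the method used by the database; it strips whitespace first because the sequences in this record are wrapped with spaces. The example input is only the first 12 bases of the gene, so its result differs from the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G+C in a nucleotide sequence, rounded to 2 decimals."""
    seq = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First 12 bases of the gene sequence, split by a space as in the record.
print(gc_content("ATGAGC GCGTCC"))
```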
Gene sequence:
>1137_bases ATGAGCGCGTCCGAGGAGGACCGCCTGCTTCGGGTCGCGCTGTCGGCGCTCGGTGAGCCCGGCGACCCCCGGCTCCTGGA CTGGGTCACCCGGCTCGGGCCAGCCGCGGTCCACGAGCTGCTGCTGACCGGGCACACGCTCGAGGAGCTCCAGACCGATC CCGACGCGCGCCGCGCGGCCGAGGACCCGCAGCGGACCCTGGCCCGGGCAGCCGACCGAGGCCTGCGCTTCGTCGTCCCG GGAGACGAGGAGTGGCCGACCCAGCTCGACGACCTGGCCACGGCCGCTCCACTGCAGAACCGCGGCGGCGCCCCACTCGG CCTGTGGGTGCGCGGCCCGGCCCGGCTCGACGACCTCGCCGGGTCCGTCGCCGTCGTCGGGTCCCGGTCCGCCACGACCT ACGGCGCCGAGGTCGCCACCGACCTCGCGGCTGCGATCGCGGCCGTCGACCGCGCCGTCGTCTCCGGTGCCGCCTACGGG ATCGACCAGGCCGCGCACCGCGGGGCGCTCGTCGGCGGCGGGTCGACGGTGGCGGTGCTCGCGTGCGGCGCCGACCGGTG CTATCCCGCCGCACACCGAGAGCTGATCGAATACATCGCCGGCCACGGTGCGGTCGTGTCCGAGGCGCTGCCGGGCTGGT CGCCGACCCAGCTCCGGTTCCTCGCGCGCAACCGGCTGATCGCCGCGCTGAGCGTCGGCACCGTGGTCGTGGAGGCCGCG GCGCGCAGCGGCGCGCTCAACACGGCGAACTGGGCCACCCGCCTCAACCGGCACGTGCTCGGCGTGCCCGGGCCGGTGAC CAGCGCGCCGTCGGAGGGCGTGCACCACCTGATCCGCTCCCAGGCCGCCGTCCTGGTGACCTCGGGCGCGGACGTGCTCG AGGTGGTCAGTGCCACGGGCCAGCACCTCGCCGCCGAGCCGCGCGGCCCGGTCCGGTCCCGCGACCGACTCACCACCCGC CAGCGGCAGGTGCTCGACGCGGTGCCGGTGGCCCAAGCGGCCCCCGAGACATCGATCGCCCGCACCGCCGGCCTCGGTCT CACCGAAGCCCGCAAGGTGCTCGTCGCGCTCGCCGAGCGCGGCCTCGCCGAGCAGGTCCCGGGCGGTTGGCGGCTCGCCG CACTGGCTCACCAGTAG
Upstream 100 bases:
>100_bases ACCTGCGACACCTCGACCAGCCCGGTCTCGACGAGGTCGACACCGCGCTGCGCCTCCGCACCGGCGACCTCCTGCTGCTG TCGATGCTGGAGCGGGTCGG
Downstream 100 bases:
>100_bases CGCCCGGCCTCCTTCCTAGACTCGGGGCGTGAGCGAGGAGGAGCAGTCGGCGCGGCTTCCCGAGGCGATGGCCCGGGCGC TCGGCGACTACGAGCGTCAC
Product: DNA protecting protein DprA
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 378; Mature: 377
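The two residue counts follow from the length of the coding sequence. This sketch assumes the 1137 bases include a terminal stop codon (the gene sequence ends in TAG) and that the mature form differs only by removal of the initiator methionine; the function name is hypothetical.

```python
def residue_counts(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a CDS length in bases."""
    if cds_bases % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    translated = cds_bases // 3 - 1  # the stop codon encodes no residue
    mature = translated - 1          # assumes the initiator Met is removed
    return translated, mature


print(residue_counts(1137))  # (378, 377)
```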
Protein sequence:
>378_residues MSASEEDRLLRVALSALGEPGDPRLLDWVTRLGPAAVHELLLTGHTLEELQTDPDARRAAEDPQRTLARAADRGLRFVVP GDEEWPTQLDDLATAAPLQNRGGAPLGLWVRGPARLDDLAGSVAVVGSRSATTYGAEVATDLAAAIAAVDRAVVSGAAYG IDQAAHRGALVGGGSTVAVLACGADRCYPAAHRELIEYIAGHGAVVSEALPGWSPTQLRFLARNRLIAALSVGTVVVEAA ARSGALNTANWATRLNRHVLGVPGPVTSAPSEGVHHLIRSQAAVLVTSGADVLEVVSATGQHLAAEPRGPVRSRDRLTTR QRQVLDAVPVAQAAPETSIARTAGLGLTEARKVLVALAERGLAEQVPGGWRLAALAHQ
Sequences:
>Translated_378_residues MSASEEDRLLRVALSALGEPGDPRLLDWVTRLGPAAVHELLLTGHTLEELQTDPDARRAAEDPQRTLARAADRGLRFVVP GDEEWPTQLDDLATAAPLQNRGGAPLGLWVRGPARLDDLAGSVAVVGSRSATTYGAEVATDLAAAIAAVDRAVVSGAAYG IDQAAHRGALVGGGSTVAVLACGADRCYPAAHRELIEYIAGHGAVVSEALPGWSPTQLRFLARNRLIAALSVGTVVVEAA ARSGALNTANWATRLNRHVLGVPGPVTSAPSEGVHHLIRSQAAVLVTSGADVLEVVSATGQHLAAEPRGPVRSRDRLTTR QRQVLDAVPVAQAAPETSIARTAGLGLTEARKVLVALAERGLAEQVPGGWRLAALAHQ >Mature_377_residues SASEEDRLLRVALSALGEPGDPRLLDWVTRLGPAAVHELLLTGHTLEELQTDPDARRAAEDPQRTLARAADRGLRFVVPG DEEWPTQLDDLATAAPLQNRGGAPLGLWVRGPARLDDLAGSVAVVGSRSATTYGAEVATDLAAAIAAVDRAVVSGAAYGI DQAAHRGALVGGGSTVAVLACGADRCYPAAHRELIEYIAGHGAVVSEALPGWSPTQLRFLARNRLIAALSVGTVVVEAAA RSGALNTANWATRLNRHVLGVPGPVTSAPSEGVHHLIRSQAAVLVTSGADVLEVVSATGQHLAAEPRGPVRSRDRLTTRQ RQVLDAVPVAQAAPETSIARTAGLGLTEARKVLVALAERGLAEQVPGGWRLAALAHQ
Specific function: Unknown
COG id: COG0758
COG function: function code LU; Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Escherichia coli, GI48994932, Length=359, Percent_Identity=32.3119777158774, Blast_Score=112, Evalue=3e-26,
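A homologue entry like the one above can be split into named fields for downstream filtering. This is a sketch that assumes the comma-separated `key=value` layout shown here, with the GI number as the only bare token; `parse_hit` is an illustrative helper, not a database API.

```python
def parse_hit(line: str) -> dict[str, str]:
    """Parse one homologue line into a dict of string fields."""
    fields: dict[str, str] = {}
    for token in line.strip().strip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            fields["GI"] = token[2:]  # bare token such as GI48994932
    return fields


hit = parse_hit(
    "Organism=Escherichia coli, GI48994932, Length=359, "
    "Percent_Identity=32.3119777158774, Blast_Score=112, Evalue=3e-26,"
)
print(hit["Organism"], float(hit["Evalue"]))
```

Values are kept as strings so the caller can decide how to convert them, for example `float(hit["Percent_Identity"])`.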
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003488
- InterPro: IPR011991 [H]
Pfam domain/function: PF02481 DNA_processg_A [H]
EC number: NA
Molecular weight: Translated: 39263; Mature: 39132
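A protein's average molecular weight is commonly estimated by summing average residue masses and adding one water molecule. The sketch below uses standard average residue masses in daltons; whether the database used exactly this table is an assumption. It illustrates why the translated and mature values differ by about 131 Da, the mass of one methionine residue.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594,
    "N": 114.1038, "D": 115.0886, "Q": 128.1307, "K": 128.1741,
    "E": 129.1155, "M": 131.1926, "H": 137.1411, "F": 147.1766,
    "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def average_mass(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing the initiator Met lowers the mass by one Met residue,
# consistent with the reported 39263 - 39132 = 131 difference.
peptide = "MSASEEDR"  # first eight residues of the translated protein
delta = average_mass(peptide) - average_mass(peptide[1:])
print(round(delta))
```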
Theoretical pI: Translated: 6.68; Mature: 6.68
Prosite motif: PS00398 RECOMBINASES_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
0.3 %Met (Translated Protein)
0.8 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
0.0 %Met (Mature Protein)
0.5 %Cys+Met (Mature Protein)
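These percentages can be reproduced by counting residues. The sketch below is illustrative only: to stay short it uses a synthetic 378-residue stand-in with the same composition the translated protein appears to have (one Met at the N-terminus, two Cys), not the real sequence, and it assumes the mature form simply lacks the first residue.

```python
def cys_met_percent(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a protein, rounded to 1 decimal."""
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# Synthetic stand-in: 1 Met + 375 Ala + 2 Cys = 378 residues (NOT the real protein).
translated = "M" + "A" * 375 + "CC"
mature = translated[1:]  # assumes the initiator Met is removed

print(cys_met_percent(translated))
print(cys_met_percent(mature))
```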
Secondary structure:
>Translated Secondary Structure MSASEEDRLLRVALSALGEPGDPRLLDWVTRLGPAAVHELLLTGHTLEELQTDPDARRAA CCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCHHHHHHHHHCCCCHHHHCCCCHHHHHH EDPQRTLARAADRGLRFVVPGDEEWPTQLDDLATAAPLQNRGGAPLGLWVRGPARLDDLA CCHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCHHHHHHC GSVAVVGSRSATTYGAEVATDLAAAIAAVDRAVVSGAAYGIDQAAHRGALVGGGSTVAVL CCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCEEEEE ACGADRCYPAAHRELIEYIAGHGAVVSEALPGWSPTQLRFLARNRLIAALSVGTVVVEAA EECCCCCCHHHHHHHHHHHHCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH ARSGALNTANWATRLNRHVLGVPGPVTSAPSEGVHHLIRSQAAVLVTSGADVLEVVSATG HHCCCCCCHHHHHHHCCCEECCCCCCCCCCHHHHHHHHHCCCEEEEECCHHHHHHHHHCC QHLAAEPRGPVRSRDRLTTRQRQVLDAVPVAQAAPETSIARTAGLGLTEARKVLVALAER CCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHC GLAEQVPGGWRLAALAHQ CHHHHCCCCEEEEEECCC >Mature Secondary Structure SASEEDRLLRVALSALGEPGDPRLLDWVTRLGPAAVHELLLTGHTLEELQTDPDARRAA CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCHHHHHHHHHCCCCHHHHCCCCHHHHHH EDPQRTLARAADRGLRFVVPGDEEWPTQLDDLATAAPLQNRGGAPLGLWVRGPARLDDLA CCHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHCCCCCCCCCCEEEEECCCHHHHHHC GSVAVVGSRSATTYGAEVATDLAAAIAAVDRAVVSGAAYGIDQAAHRGALVGGGSTVAVL CCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCEEEEE ACGADRCYPAAHRELIEYIAGHGAVVSEALPGWSPTQLRFLARNRLIAALSVGTVVVEAA EECCCCCCHHHHHHHHHHHHCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH ARSGALNTANWATRLNRHVLGVPGPVTSAPSEGVHHLIRSQAAVLVTSGADVLEVVSATG HHCCCCCCHHHHHHHCCCEECCCCCCCCCCHHHHHHHHHCCCEEEEECCHHHHHHHHHCC QHLAAEPRGPVRSRDRLTTRQRQVLDAVPVAQAAPETSIARTAGLGLTEARKVLVALAER CCCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHC GLAEQVPGGWRLAALAHQ CHHHHCCCCEEEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036 [H]