Definition: Bradyrhizobium sp. ORS278 chromosome, complete genome.
Accession: NC_009445
Length: 7,456,587 bp

The map label for this gene is 146342751

Identifier: 146342751

GI number: 146342751

Start: 6143133

End: 6144824

Strand: Reverse

Name: 146342751

Synonym: BRADO5923

Alternate gene names: NA

Gene position: 6144824-6143133 (Counterclockwise)
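
The coordinates and strand above imply that the gene sequence is the reverse complement of the chromosome's forward strand between positions 6143133 and 6144824. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates; the function names (gene_length, reverse_complement, extract_gene) are hypothetical and not part of the source database.

```python
# Hypothetical helpers; coordinates are assumed to be 1-based and inclusive.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return abs(end - start) + 1


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    """Slice a gene out of a forward-strand chromosome sequence.

    For a gene on the reverse strand, the forward-strand slice is
    reverse-complemented so the result reads 5' to 3' along the gene.
    """
    low, high = sorted((start, end))
    segment = chromosome[low - 1:high]
    if strand.lower() == "reverse":
        return reverse_complement(segment)
    return segment


# The span of this record agrees with the ">1692_bases" header.
print(gene_length(6143133, 6144824))
```

With the full NC_009445 sequence loaded as a string, extract_gene(chromosome, 6143133, 6144824, "Reverse") would be expected to reproduce the gene sequence listed in this record, provided the coordinate convention assumed here matches the database's.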

Preceding gene: 146342755

Following gene: 146342741

Centisome position: 82.41
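
The centisome value can be reproduced as the gene's position expressed as a percentage of the chromosome length. The record does not state its formula; the sketch below assumes 100 * position / genome length, which matches 82.41 when the reverse-strand start coordinate (6144824) is used. The function name centisome is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on a circular chromosome as a percentage of its length.

    Assumed formula: 100 * position / genome_length, rounded to 2 decimals.
    """
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 7_456_587  # from the "Length" field of this record

# Using the reverse-strand start of the gene reproduces the record's value.
print(centisome(6_144_824, GENOME_LENGTH))
```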

GC content: 68.2
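
GC content is conventionally the percentage of G and C bases in the sequence. The following sketch assumes that definition and one decimal place of rounding; the record does not say whether ambiguous bases are handled differently, and the function name gc_percent is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


# Toy input; pass the 1692-base gene sequence to compare with the record.
print(gc_percent("ATGC"))
```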

Gene sequence:

>1692_bases
ATGTCCCGCCTTGCCTCGCTGCTGCTCCTGCTTGTTTTCGCCGCGCCGGTTGCCGCCGGCCCGCTCCTGTCCGACGACCT
CGTCAACTGCCGCAACCGATCGGCGGACGGCAATAGCCCCGCCCCGCCGCGGCTGGATGCCTGCGAGCGGCTGCTCGCCA
GCGGCAACCTGACCGGCAAGGACCTTGCCGCCGCACTCGAAGTGCGCGGCAACGCCGCGATCAGCAGGCGCGACTACGAC
AAGGCGATCAGCGCCCTGACCGGTGCGATCGCCGCCGATCCGGACAATGCCGGTCTCTACGACCGGCGCGGCGTCGCCTA
TTGGCGCAAGGGCCAGGATGACCTCGCCATCGCCGATTATGCGGTGGCGTTGCAGAAGCGGTCGAACTATGCCGCTCCCT
ACAACAACCGCGGCGTGATCTTCTTGCGACGCGGCGCGCTGCAGAGCGCGCTCGATGATTTCAACGCCGCGGTCCGGATA
TCGCCCACGATGTATGTCGCCAGGGCCAACCGCGGCCGGGTGCGGGCGATGATGAAGGATTTCGAGGGCGCGTTGGCCGA
CTTCGCCGAGGCCGACAAGATCGATCCCGAATCGACCCAGGCGGCGACCTATCGCTGCGACGCGTTCAACGCGATGGGCA
AGTTCGACGACGCCATCGCCAACTGCAACGCCGTGCTCGCCAGCCTGTCCAAATCGCAATACGCGCTGAACAGCCGCGCC
GAGGCCTTCATCGCCAAGGGCGATCTTGGCGCCGCGCTGAAGGACCTGAACACCGTGCTGGCGCTCAACCCCAACAATGT
CCGCGCCCATGCCGATCGCGGCCAGCTGTTCGAGCGCCGGCGTGATCTCGCCCAGGCGCGCGCCGACTACCGTTCGGCGG
CGTTCGCGCTGACGCCGTTCGACGAGATCGACGTGGTGAACGCGCGCAAGCTGGCACAGGACCGGCTCGCGGCGCTGAGC
CCGCAGGGCCCGGGCGCCGCCGCAACGGTGCGTCGGGTTGCCCTGGTGGTCGGCAATGGCGACTACAAATCCGTGCCCGC
GCTGCCGAACCCGCCGCGCGACGCCAAGCTGATCGCCGACACCTTGCGCGGCCTCGGCTTCCAGTCGGTCACGCTCGCCA
ACGACCTGTCGCGCGACAAGTTCTTCGAGGCGCTGCGCAGCTTCGAGGCCGAGGCCGAGAAGGCCGACTGGGCGGTGGTC
TACTACGCCGGCCACGGCTTCGAGATCGGCGGCGTCAACTATCTGGTTCCGGTCGATGCAAGGCTTGCGGCCGACAAGGA
CGCGGAGACGGAGGCGGTCGCGCTGGAGCAGGTGCTGGCCGCGGTCGGCGCCGCGCGCAAGCTGCGTCTCGTCATTCTCG
ATGCCTGCCGGGACAATCCGTTCGCGCCGACGATGAAGCAGACGCTGGCGCTGAAGCTCGTCGACAAGGGCTTCTCCAAC
ATCGAGCCGGGCGCCGGCATGATGGTGGTCTACGCCGCCAAGCACGGCGCCACCGCGCTCGACGGCAACGGCACCAACAG
CCCGTTCGCGACCGCGCTGGCCAAGGACATCAAGGAGCCCAGGGTCGAGATCCGCAAACTGTTCGACATCGTCCGCGACG
ACGTCTGGGCGGCCACACAGCATCAGCAGCAGCCCTTCACCTACGGCTCGCCGCCGGGCCGCGAGGATTTCTTCTTCGTG
GCGGCGAAGTGA

Upstream 100 bases:

>100_bases
CGGCGGGCACCAGTGCCCGGCCCGCTGGCCCTGCTCGAACCGGAGCGCTATTCGTTCGACCAATCGACGTCCACAGCTCC
GATCGAGATCCGCCGCCCCG

Downstream 100 bases:

>100_bases
GCGATGTGCGGTTGCTGCATATTGCATCCGGCGCAATTGAGACGGGTGATACCTGCGCCATCCTCCACTGTCGTCCCGGG
CTTGACCCGGGACCCATACT

Product: hypothetical protein

Products: NA

Alternate protein names: TPR Repeat-Containing Protein; TPR Domain-Containing Protein; Tetratricopeptide Repeat Domain Protein; Tetratricopeptide Repeat-Containing Protein; Tetratricopeptide TPR_2 Repeat Protein; TPR Repeat-Containing Serine/Threonine Protein Kinase; O-Linked GlcNAc Transferase; Tetratricopeptide Repeat Family; Caspase Domain Protein; Tetratricopeptide Repeat Family Protein; Caspase-Like Domain-Containing Protein; Peptidase Protein; Tetratricopeptide Tpr_1 Repeat-Containing Protein; Caspase Domain-Containing Protein; TPR Domain Protein; Tetratricopeptide TPR_1 Repeat-Containing Protein; Peptidase S1 And S6 Chymotrypsin/Hap; Peptidase C; GUN4-Like Family; TPR Repeats Containing Protein; Caspase; TPR Repeat- And Protein Kinase Domain-Containing Protein; Pentapeptide Repeat-Containing Serine/Threonine Kinase; TPR Repeat-Containing Protein Kinase; Peptidylprolyl Isomerase; Caspase-1 P; ICE-Like Protease

Number of amino acids: Translated: 563; Mature: 562

Protein sequence:

>563_residues
MSRLASLLLLLVFAAPVAAGPLLSDDLVNCRNRSADGNSPAPPRLDACERLLASGNLTGKDLAAALEVRGNAAISRRDYD
KAISALTGAIAADPDNAGLYDRRGVAYWRKGQDDLAIADYAVALQKRSNYAAPYNNRGVIFLRRGALQSALDDFNAAVRI
SPTMYVARANRGRVRAMMKDFEGALADFAEADKIDPESTQAATYRCDAFNAMGKFDDAIANCNAVLASLSKSQYALNSRA
EAFIAKGDLGAALKDLNTVLALNPNNVRAHADRGQLFERRRDLAQARADYRSAAFALTPFDEIDVVNARKLAQDRLAALS
PQGPGAAATVRRVALVVGNGDYKSVPALPNPPRDAKLIADTLRGLGFQSVTLANDLSRDKFFEALRSFEAEAEKADWAVV
YYAGHGFEIGGVNYLVPVDARLAADKDAETEAVALEQVLAAVGAARKLRLVILDACRDNPFAPTMKQTLALKLVDKGFSN
IEPGAGMMVVYAAKHGATALDGNGTNSPFATALAKDIKEPRVEIRKLFDIVRDDVWAATQHQQQPFTYGSPPGREDFFFV
AAK
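
The 563-residue protein follows from the 1692-base gene: 1692 / 3 = 564 codons, the last of which (TGA) is a stop codon. The sketch below translates a coding sequence with the standard genetic code, built from the conventional TCAG-ordered table. It is an illustration only; the tool actually used by the database is not stated, and the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read 5' to 3' in frame 1.

    Unknown codons become 'X'. With to_stop=True, translation ends at the
    first stop codon, which is not included in the output.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First five and last four codons of the gene sequence in this record.
print(translate("ATGTCCCGCCTTGCC"))  # start of the protein
print(translate("GCGGCGAAGTGA"))     # end of the protein, stop codon dropped
```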

Sequences:

>Translated_563_residues
MSRLASLLLLLVFAAPVAAGPLLSDDLVNCRNRSADGNSPAPPRLDACERLLASGNLTGKDLAAALEVRGNAAISRRDYD
KAISALTGAIAADPDNAGLYDRRGVAYWRKGQDDLAIADYAVALQKRSNYAAPYNNRGVIFLRRGALQSALDDFNAAVRI
SPTMYVARANRGRVRAMMKDFEGALADFAEADKIDPESTQAATYRCDAFNAMGKFDDAIANCNAVLASLSKSQYALNSRA
EAFIAKGDLGAALKDLNTVLALNPNNVRAHADRGQLFERRRDLAQARADYRSAAFALTPFDEIDVVNARKLAQDRLAALS
PQGPGAAATVRRVALVVGNGDYKSVPALPNPPRDAKLIADTLRGLGFQSVTLANDLSRDKFFEALRSFEAEAEKADWAVV
YYAGHGFEIGGVNYLVPVDARLAADKDAETEAVALEQVLAAVGAARKLRLVILDACRDNPFAPTMKQTLALKLVDKGFSN
IEPGAGMMVVYAAKHGATALDGNGTNSPFATALAKDIKEPRVEIRKLFDIVRDDVWAATQHQQQPFTYGSPPGREDFFFV
AAK
>Mature_562_residues
SRLASLLLLLVFAAPVAAGPLLSDDLVNCRNRSADGNSPAPPRLDACERLLASGNLTGKDLAAALEVRGNAAISRRDYDK
AISALTGAIAADPDNAGLYDRRGVAYWRKGQDDLAIADYAVALQKRSNYAAPYNNRGVIFLRRGALQSALDDFNAAVRIS
PTMYVARANRGRVRAMMKDFEGALADFAEADKIDPESTQAATYRCDAFNAMGKFDDAIANCNAVLASLSKSQYALNSRAE
AFIAKGDLGAALKDLNTVLALNPNNVRAHADRGQLFERRRDLAQARADYRSAAFALTPFDEIDVVNARKLAQDRLAALSP
QGPGAAATVRRVALVVGNGDYKSVPALPNPPRDAKLIADTLRGLGFQSVTLANDLSRDKFFEALRSFEAEAEKADWAVVY
YAGHGFEIGGVNYLVPVDARLAADKDAETEAVALEQVLAAVGAARKLRLVILDACRDNPFAPTMKQTLALKLVDKGFSNI
EPGAGMMVVYAAKHGATALDGNGTNSPFATALAKDIKEPRVEIRKLFDIVRDDVWAATQHQQQPFTYGSPPGREDFFFVA
AK

Specific function: Unknown

COG id: COG4249

COG function: function code R; Uncharacterized protein containing caspase domain

Gene ontology: NA

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI170784867, Length=251, Percent_Identity=24.3027888446215, Blast_Score=76, Evalue=8e-14
Organism=Homo sapiens, GI310123097, Length=215, Percent_Identity=28.3720930232558, Blast_Score=75, Evalue=2e-13
Organism=Homo sapiens, GI310131789, Length=215, Percent_Identity=28.3720930232558, Blast_Score=75, Evalue=2e-13
Organism=Homo sapiens, GI310110582, Length=215, Percent_Identity=28.3720930232558, Blast_Score=75, Evalue=2e-13
Organism=Homo sapiens, GI32307148, Length=338, Percent_Identity=22.4852071005917, Blast_Score=72, Evalue=1e-12
Organism=Homo sapiens, GI32307150, Length=338, Percent_Identity=22.4852071005917, Blast_Score=72, Evalue=1e-12
Organism=Caenorhabditis elegans, GI25149817, Length=149, Percent_Identity=34.2281879194631, Blast_Score=82, Evalue=5e-16
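
Each homologue entry is a comma-separated list of key=value fields plus a bare GI identifier. The following sketch shows one way to parse such a line into a dictionary; the function name parse_homologue and the choice of numeric types are assumptions made for illustration, not part of the database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields.

    Tolerates an optional trailing comma. The bare 'GI<number>' token is
    stored under the key 'GI' (as a string, without the prefix).
    """
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields; other fields stay as strings.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", float), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Caenorhabditis elegans, GI25149817, Length=149, "
    "Percent_Identity=34.2281879194631, Blast_Score=82, Evalue=5e-16,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed this way, the hits can be sorted by E-value or filtered by percent identity with ordinary list operations.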

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 60541; Mature: 60410
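
The two values differ by 131 Da (60541 - 60410), which corresponds to one methionine residue and is consistent with the mature form lacking the N-terminal Met. The sketch below estimates a protein's average molecular weight by summing average residue masses and adding one water. The database's own method is not stated, so small differences from the listed values are possible; the names RESIDUE_MASS, WATER and molecular_weight are hypothetical.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide, in Da.

    Raises KeyError for characters outside the 20 standard amino acids.
    """
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Removing an N-terminal Met lowers the mass by one Met residue (~131 Da).
print(round(molecular_weight("MA") - molecular_weight("A"), 2))
```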

Theoretical pI: Translated: 8.16; Mature: 8.16

Prosite motif: PS50005 TPR ; PS50293 TPR_REGION ; PS50208 CASPASE_P20

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
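
These percentages are consistent with 5 Cys and 8 Met in the 563-residue translated protein, and 5 Cys and 7 Met in the 562-residue mature protein. A minimal sketch of the calculation, assuming simple residue counts rounded to one decimal place (the function name residue_percent is hypothetical):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        return 0.0
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(protein), 1)


# Toy peptide: 1 Cys and 1 Met in 10 residues.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```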

Secondary structure:

>Translated Secondary Structure
MSRLASLLLLLVFAAPVAAGPLLSDDLVNCRNRSADGNSPAPPRLDACERLLASGNLTGK
CHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCCHH
DLAAALEVRGNAAISRRDYDKAISALTGAIAADPDNAGLYDRRGVAYWRKGQDDLAIADY
HHHHHHHHCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCCCHHHHH
AVALQKRSNYAAPYNNRGVIFLRRGALQSALDDFNAAVRISPTMYVARANRGRVRAMMKD
HHHHHHCCCCCCCCCCCCEEEEECCHHHHHHHHCCCEEEECCEEEEEECCCCHHHHHHHH
FEGALADFAEADKIDPESTQAATYRCDAFNAMGKFDDAIANCNAVLASLSKSQYALNSRA
HHHHHHHHHHHCCCCCCCCCHHEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
EAFIAKGDLGAALKDLNTVLALNPNNVRAHADRGQLFERRRDLAQARADYRSAAFALTPF
CEEEECCCHHHHHHHCCEEEEECCCCCEEECCHHHHHHHHHHHHHHHHHHHHCEEEECCC
DEIDVVNARKLAQDRLAALSPQGPGAAATVRRVALVVGNGDYKSVPALPNPPRDAKLIAD
CCCCCHHHHHHHHHHHHHCCCCCCCHHHHEEEEEEEEECCCCCCCCCCCCCCCHHHHHHH
TLRGLGFQSVTLANDLSRDKFFEALRSFEAEAEKADWAVVYYAGHGFEIGGVNYLVPVDA
HHHCCCCCEEHHHHHCCHHHHHHHHHHHHHHCCCCCEEEEEEECCCEEECCEEEEEECCC
RLAADKDAETEAVALEQVLAAVGAARKLRLVILDACRDNPFAPTMKQTLALKLVDKGFSN
HHCCCCCCCHHHHHHHHHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCC
IEPGAGMMVVYAAKHGATALDGNGTNSPFATALAKDIKEPRVEIRKLFDIVRDDVWAATQ
CCCCCCEEEEEEECCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
HQQQPFTYGSPPGREDFFFVAAK
CCCCCCCCCCCCCCCCEEEEEEC
>Mature Secondary Structure 
SRLASLLLLLVFAAPVAAGPLLSDDLVNCRNRSADGNSPAPPRLDACERLLASGNLTGK
HHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHCCCCCHH
DLAAALEVRGNAAISRRDYDKAISALTGAIAADPDNAGLYDRRGVAYWRKGQDDLAIADY
HHHHHHHHCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCCCHHHHH
AVALQKRSNYAAPYNNRGVIFLRRGALQSALDDFNAAVRISPTMYVARANRGRVRAMMKD
HHHHHHCCCCCCCCCCCCEEEEECCHHHHHHHHCCCEEEECCEEEEEECCCCHHHHHHHH
FEGALADFAEADKIDPESTQAATYRCDAFNAMGKFDDAIANCNAVLASLSKSQYALNSRA
HHHHHHHHHHHCCCCCCCCCHHEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
EAFIAKGDLGAALKDLNTVLALNPNNVRAHADRGQLFERRRDLAQARADYRSAAFALTPF
CEEEECCCHHHHHHHCCEEEEECCCCCEEECCHHHHHHHHHHHHHHHHHHHHCEEEECCC
DEIDVVNARKLAQDRLAALSPQGPGAAATVRRVALVVGNGDYKSVPALPNPPRDAKLIAD
CCCCCHHHHHHHHHHHHHCCCCCCCHHHHEEEEEEEEECCCCCCCCCCCCCCCHHHHHHH
TLRGLGFQSVTLANDLSRDKFFEALRSFEAEAEKADWAVVYYAGHGFEIGGVNYLVPVDA
HHHCCCCCEEHHHHHCCHHHHHHHHHHHHHHCCCCCEEEEEEECCCEEECCEEEEEECCC
RLAADKDAETEAVALEQVLAAVGAARKLRLVILDACRDNPFAPTMKQTLALKLVDKGFSN
HHCCCCCCCHHHHHHHHHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCC
IEPGAGMMVVYAAKHGATALDGNGTNSPFATALAKDIKEPRVEIRKLFDIVRDDVWAATQ
CCCCCCEEEEEEECCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
HQQQPFTYGSPPGREDFFFVAAK
CCCCCCCCCCCCCCCCEEEEEEC
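
The prediction strings above can be summarised as the share of each state. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out; the function name ss_composition is hypothetical.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string.

    Whitespace and newlines are ignored so multi-line strings can be pasted in.
    """
    states = "".join(prediction.split()).upper()
    if not states:
        return {}
    counts = Counter(states)
    return {state: round(100.0 * n / len(states), 1)
            for state, n in sorted(counts.items())}


# Toy prediction string; concatenate the H/E/C lines of the record to use it.
print(ss_composition("HHHHCCEE"))
```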

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA