Definition Orientia tsutsugamushi Boryong, complete genome.
Accession NC_009488
Length 2,127,051

The map label for this gene is sca4 [H]

Identifier: 148284453

GI number: 148284453

Start: 767525

End: 769804

Strand: Direct

Name: sca4 [H]

Synonym: OTBS_0790

Alternate gene names: 148284453

Gene position: 767525-769804 (Clockwise)

Preceding gene: 148284452

Following gene: 148284454

Centisome position: 36.08
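
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length; neither convention is stated in this record, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_127_051
START, END = 767_525, 769_804

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)

print(gene_length(START, END))           # 2280
print(centisome(START, GENOME_LENGTH))   # 36.08
```

Under these assumptions both results agree with the 2280-base gene sequence and the centisome value listed in the record.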

GC content: 35.22

Gene sequence:

>2280_bases
ATGGGGTTAGTATCTAAATTAGGCGAGGAAATGAAGCTAGTAGGCAAAACAGGTGGACAGATGTCTGAGATAAAAGATAC
TTTAGATAAGCTTCATGCAATAGTTAGTAAGTCAGCACAGCATCAACTGCAAAATACTCTTGGTGAGCTGCTTCAGGCAA
TGATTAATGAACATAAACAGAACCAGCTACAAGATGCTCTAGACGATCTGGAAAATATAATCAATGATCATAAACAGAAT
CAAAAAGAACAGAAAGCTACTATTCCACCTGAAAAACATACACATGATGTAACATCTGTTAAAAACCAAGCTCAACAAAA
TACTGGTATTAGTCAACCAGATGCACCTAAATCTGCTATCAAATCAGCGGCAGATATATTACAATCTCCTCAGCATTCTT
CTCCAGCAACACCAACTGCTACAGTACAACAACAAGAACAAAAAAAAACACCTCCTCCTGTTCCGCCTAAACCTAGTAAA
GATACAATCGAAGCTATAAAAGCTAAAATAGCTCAAGCTCAACAAAATGCTGGTATTAATAAACCAGATGCACCTAAATC
TGCTATCAAATCAGCAGCAGATATATCACAATCTCCTCAGTATTCTTCTCCAGCAACACCAACTGCTACAGTGCAACAAC
ATGAGCCACAAAAAACGCCTCCTCCTGTTCCACCTAAACCTAGTAAAAATATAATCGAAGAATTAAAAGCTAAGATATCA
CAAACTCAACAACAGGTTAATCAGCAATCATATATTAATCCAAGTAGTAGTCCCCAACCTCTTAGTTCAACTATAGAGCA
TGCTAAAGACAGAGTATTAACATTAGATCCACAATATAGACACGCTCAAGCTGCTCAAAATATGAGTGGACCAGAGGCTG
AAACAAATCAGATGCCAGTAGATCAAATTCTACAAGCATTTAAAGATTTAAAAGCACTTATAAATAGCGTGATTGCTGAA
GATGATAAGTTTAAAGTATGGCAACAGCAAAATCCAAGCAAAAGCCTAGATGATTTTAAACGTGATACAACACAAATAGA
TAGCTTATCTCAAGAGACTAAAGAGTTACTATCTGGCATTGGATCTGCAGGATATGCTAACATTATGGGCTCAACAGCAA
ATATTGAACAAGCTCAGCAAATGTCATTTGCTGCATCTTTTAGTACATTAGATTGGGCTACTCATGCTAATTCTGTTGGC
AATACTACTCAAAAAACTATTACCAATGATGCTGGTGAAAAAGTTACAGACCTTATAAGCCATAGCCATAAAACTCAGCT
CAGTGCTAGTGTTAATGGTGTTACAAAAATTGTTACTAAACACCGTACCATAGATATTCCAAGGGCAGTTGAAGAAAACA
AAGGACCTTTAGATCTTGCCTTAGTAGCACAAGATACAACTGGAAAAAATATGCCTGAGTCAAAAGCAGTATATCTAACT
GCTCATTATAACCAAGAAGGAAAATTAGTAGAAATGACTCATCCGGAACCTCTGAGATTCTTTAGTGATGAACCTGGCTC
TCCAGCTTATACAGTTATAAATAATGAAGTTTATACTTTGCCAATTACTAGAGAAAAATATGACCAGCTAACCAAAGAAA
TATCACAAAATATACAGGAACAAGATAAAGATAAAGAGAGAGAACAGGAAGCTGTGGATAAGTTTACAGTAGGTTCTCGT
CAAACTGATATTCATAAAGAAAAAAGTATTCAACAGGCTGATGAAGTAAGTAACGATGCTCCTAAATCCTTAAAGTCTAT
GAATGAGTTTACAAGAGACTCTCGTCAAACTGATGATATCTATAACGAGAAAAGTACTCGAAATCCTGAAGAAATAAGTA
GCAATGCTCCTAAATCTTTAAAATCTATTGCTCATGATGAAAACAAAAATAAAATTCAAAGCACTGATCATAAATCGAAA
AACAGCAATGAATATGTTAAACAGATGATTAAATTGCTTAATCAAAACTATAACAAAATTGATTCTAACGAACAAAATCG
TACTGAACAGGTTAAACTTAAGCCTGTAGTCAAATCAATACCAGAATCTCCAAAAAACTCTACTCAGATTGATCCTAATG
AGGAAGGTAGTATTGGATACGTAAAACGTGTAGTTGAATCAATGGAACAAACATCACCAAATCCTAGTGAAATAGCACAA
AGATTACAGGTAAATCTAGCTAATAGTTCTCAGCGTAGTTCTAGTATGTCTATTAATACTCCTACTAATACTCCTCGCAA
TAATAACCAAAGCAAATCTCAGACAAAAGGTATAGTGTAG
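
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the treatment of whitespace and lower-case input is an assumption rather than something this record specifies.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (for example FASTA line breaks) is ignored and the
    input is upper-cased before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100 * gc / len(bases)

# Toy input, not the gene itself.
print(gc_content("ATGC"))  # 50.0
```

To apply it to this gene, pass the 2280-base sequence with the `>2280_bases` header line removed.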

Upstream 100 bases:

>100_bases
GTCAACAACCTCAAAAAGGTAAATATAATGATGCCTTTCGTAAGAAAAGAGCTAAGGCTAAAAAAAAAAAGAACACCTTG
AAAAGCAAAAGCAAGAATTC

Downstream 100 bases:

>100_bases
ATAAATTAAGAAACAATTAAGAAATAAGCTCTTGAAAATTAGGTTGTATAAAGAAAAATTTTAATACAACCTATATTTAG
TCTGATAACAAAGAAATGTT

Product: hypothetical protein

Products: NA

Alternate protein names: 120 kDa antigen; Protein PS 120; PS120 [H]

Number of amino acids: Translated: 759; Mature: 758

Protein sequence:

>759_residues
MGLVSKLGEEMKLVGKTGGQMSEIKDTLDKLHAIVSKSAQHQLQNTLGELLQAMINEHKQNQLQDALDDLENIINDHKQN
QKEQKATIPPEKHTHDVTSVKNQAQQNTGISQPDAPKSAIKSAADILQSPQHSSPATPTATVQQQEQKKTPPPVPPKPSK
DTIEAIKAKIAQAQQNAGINKPDAPKSAIKSAADISQSPQYSSPATPTATVQQHEPQKTPPPVPPKPSKNIIEELKAKIS
QTQQQVNQQSYINPSSSPQPLSSTIEHAKDRVLTLDPQYRHAQAAQNMSGPEAETNQMPVDQILQAFKDLKALINSVIAE
DDKFKVWQQQNPSKSLDDFKRDTTQIDSLSQETKELLSGIGSAGYANIMGSTANIEQAQQMSFAASFSTLDWATHANSVG
NTTQKTITNDAGEKVTDLISHSHKTQLSASVNGVTKIVTKHRTIDIPRAVEENKGPLDLALVAQDTTGKNMPESKAVYLT
AHYNQEGKLVEMTHPEPLRFFSDEPGSPAYTVINNEVYTLPITREKYDQLTKEISQNIQEQDKDKEREQEAVDKFTVGSR
QTDIHKEKSIQQADEVSNDAPKSLKSMNEFTRDSRQTDDIYNEKSTRNPEEISSNAPKSLKSIAHDENKNKIQSTDHKSK
NSNEYVKQMIKLLNQNYNKIDSNEQNRTEQVKLKPVVKSIPESPKNSTQIDPNEEGSIGYVKRVVESMEQTSPNPSEIAQ
RLQVNLANSSQRSSSMSINTPTNTPRNNNQSKSQTKGIV
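
The residue counts follow from the gene length: 2280 bases give 760 codons, one of which is the stop codon, leaving 759 translated residues; the mature count of 758 appears to assume removal of the initiator methionine. The sketch below translates short fragments with the standard codon assignments, which is an assumption about how the record's translation was produced.

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGGTTAGTATCTAAA"))  # MGLVSK (first six codons of the gene)
print(translate("GGTATAGTGTAG"))        # GIV (last three codons plus stop)
print(2280 // 3 - 1)                    # 759
```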

Sequences:

>Translated_759_residues
MGLVSKLGEEMKLVGKTGGQMSEIKDTLDKLHAIVSKSAQHQLQNTLGELLQAMINEHKQNQLQDALDDLENIINDHKQN
QKEQKATIPPEKHTHDVTSVKNQAQQNTGISQPDAPKSAIKSAADILQSPQHSSPATPTATVQQQEQKKTPPPVPPKPSK
DTIEAIKAKIAQAQQNAGINKPDAPKSAIKSAADISQSPQYSSPATPTATVQQHEPQKTPPPVPPKPSKNIIEELKAKIS
QTQQQVNQQSYINPSSSPQPLSSTIEHAKDRVLTLDPQYRHAQAAQNMSGPEAETNQMPVDQILQAFKDLKALINSVIAE
DDKFKVWQQQNPSKSLDDFKRDTTQIDSLSQETKELLSGIGSAGYANIMGSTANIEQAQQMSFAASFSTLDWATHANSVG
NTTQKTITNDAGEKVTDLISHSHKTQLSASVNGVTKIVTKHRTIDIPRAVEENKGPLDLALVAQDTTGKNMPESKAVYLT
AHYNQEGKLVEMTHPEPLRFFSDEPGSPAYTVINNEVYTLPITREKYDQLTKEISQNIQEQDKDKEREQEAVDKFTVGSR
QTDIHKEKSIQQADEVSNDAPKSLKSMNEFTRDSRQTDDIYNEKSTRNPEEISSNAPKSLKSIAHDENKNKIQSTDHKSK
NSNEYVKQMIKLLNQNYNKIDSNEQNRTEQVKLKPVVKSIPESPKNSTQIDPNEEGSIGYVKRVVESMEQTSPNPSEIAQ
RLQVNLANSSQRSSSMSINTPTNTPRNNNQSKSQTKGIV
>Mature_758_residues
GLVSKLGEEMKLVGKTGGQMSEIKDTLDKLHAIVSKSAQHQLQNTLGELLQAMINEHKQNQLQDALDDLENIINDHKQNQ
KEQKATIPPEKHTHDVTSVKNQAQQNTGISQPDAPKSAIKSAADILQSPQHSSPATPTATVQQQEQKKTPPPVPPKPSKD
TIEAIKAKIAQAQQNAGINKPDAPKSAIKSAADISQSPQYSSPATPTATVQQHEPQKTPPPVPPKPSKNIIEELKAKISQ
TQQQVNQQSYINPSSSPQPLSSTIEHAKDRVLTLDPQYRHAQAAQNMSGPEAETNQMPVDQILQAFKDLKALINSVIAED
DKFKVWQQQNPSKSLDDFKRDTTQIDSLSQETKELLSGIGSAGYANIMGSTANIEQAQQMSFAASFSTLDWATHANSVGN
TTQKTITNDAGEKVTDLISHSHKTQLSASVNGVTKIVTKHRTIDIPRAVEENKGPLDLALVAQDTTGKNMPESKAVYLTA
HYNQEGKLVEMTHPEPLRFFSDEPGSPAYTVINNEVYTLPITREKYDQLTKEISQNIQEQDKDKEREQEAVDKFTVGSRQ
TDIHKEKSIQQADEVSNDAPKSLKSMNEFTRDSRQTDDIYNEKSTRNPEEISSNAPKSLKSIAHDENKNKIQSTDHKSKN
SNEYVKQMIKLLNQNYNKIDSNEQNRTEQVKLKPVVKSIPESPKNSTQIDPNEEGSIGYVKRVVESMEQTSPNPSEIAQR
LQVNLANSSQRSSSMSINTPTNTPRNNNQSKSQTKGIV

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm (Probable) [H]

Metabolic importance: NA


Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR020954 [H]

Pfam domain/function: PF12574 120_Rick_ant [H]

EC number: NA

Molecular weight: Translated: 83903; Mature: 83772
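
The two molecular weights differ by 131, which is close to the mass of one methionine residue and is consistent with the mature form lacking the initiator Met. A rough sketch of how such a weight could be estimated is shown below. It assumes average residue masses plus one water molecule per chain; the record does not say which mass table or method was actually used, so small differences from the listed values are to be expected.

```python
# Approximate average residue masses in daltons (assumed table).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.0153

def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER_MASS

# Removing a leading Met lowers the estimate by one Met residue mass.
difference = molecular_weight("MGLV") - molecular_weight("GLV")
print(round(difference, 2))  # 131.19
```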

Theoretical pI: Translated: 6.85; Mature: 6.85

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
1.7 %Cys+Met (Mature Protein)
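
The percentages above are presumably residue counts divided by chain length. A minimal sketch under that assumption follows; `residue_percent` is a hypothetical helper, not part of any tool named in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)

# Toy peptide, not the sca4 protein itself.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the translated and mature sequences separately, the same function would be expected to reproduce the small drop in %Met once the initiator methionine is removed.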

Secondary structure:

>Translated Secondary Structure
MGLVSKLGEEMKLVGKTGGQMSEIKDTLDKLHAIVSKSAQHQLQNTLGELLQAMINEHKQ
CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NQLQDALDDLENIINDHKQNQKEQKATIPPEKHTHDVTSVKNQAQQNTGISQPDAPKSAI
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHH
KSAADILQSPQHSSPATPTATVQQQEQKKTPPPVPPKPSKDTIEAIKAKIAQAQQNAGIN
HHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC
KPDAPKSAIKSAADISQSPQYSSPATPTATVQQHEPQKTPPPVPPKPSKNIIEELKAKIS
CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCHHHHHHHHHHHH
QTQQQVNQQSYINPSSSPQPLSSTIEHAKDRVLTLDPQYRHAQAAQNMSGPEAETNQMPV
HHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCEEEECCCHHHHHHHHCCCCCCCCCCCCCH
DQILQAFKDLKALINSVIAEDDKFKVWQQQNPSKSLDDFKRDTTQIDSLSQETKELLSGI
HHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
GSAGYANIMGSTANIEQAQQMSFAASFSTLDWATHANSVGNTTQKTITNDAGEKVTDLIS
CCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCHHHHHHHHHH
HSHKTQLSASVNGVTKIVTKHRTIDIPRAVEENKGPLDLALVAQDTTGKNMPESKAVYLT
HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCCEEEEE
AHYNQEGKLVEMTHPEPLRFFSDEPGSPAYTVINNEVYTLPITREKYDQLTKEISQNIQE
EEECCCCCEEECCCCCCHHHCCCCCCCCEEEEECCEEEEECCCHHHHHHHHHHHHHHHHH
QDKDKEREQEAVDKFTVGSRQTDIHKEKSIQQADEVSNDAPKSLKSMNEFTRDSRQTDDI
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHH
YNEKSTRNPEEISSNAPKSLKSIAHDENKNKIQSTDHKSKNSNEYVKQMIKLLNQNYNKI
HCCCCCCCHHHHHCCCHHHHHHHHHCCCCCHHHCCCCCCCCCHHHHHHHHHHHHCCCCCC
DSNEQNRTEQVKLKPVVKSIPESPKNSTQIDPNEEGSIGYVKRVVESMEQTSPNPSEIAQ
CCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHH
RLQVNLANSSQRSSSMSINTPTNTPRNNNQSKSQTKGIV
HHHHHHCCCCCCCCCCEECCCCCCCCCCCCCHHHHCCCC
>Mature Secondary Structure 
GLVSKLGEEMKLVGKTGGQMSEIKDTLDKLHAIVSKSAQHQLQNTLGELLQAMINEHKQ
CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NQLQDALDDLENIINDHKQNQKEQKATIPPEKHTHDVTSVKNQAQQNTGISQPDAPKSAI
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHH
KSAADILQSPQHSSPATPTATVQQQEQKKTPPPVPPKPSKDTIEAIKAKIAQAQQNAGIN
HHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC
KPDAPKSAIKSAADISQSPQYSSPATPTATVQQHEPQKTPPPVPPKPSKNIIEELKAKIS
CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCHHHHHHHHHHHH
QTQQQVNQQSYINPSSSPQPLSSTIEHAKDRVLTLDPQYRHAQAAQNMSGPEAETNQMPV
HHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCEEEECCCHHHHHHHHCCCCCCCCCCCCCH
DQILQAFKDLKALINSVIAEDDKFKVWQQQNPSKSLDDFKRDTTQIDSLSQETKELLSGI
HHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC
GSAGYANIMGSTANIEQAQQMSFAASFSTLDWATHANSVGNTTQKTITNDAGEKVTDLIS
CCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCHHHHHHHHHH
HSHKTQLSASVNGVTKIVTKHRTIDIPRAVEENKGPLDLALVAQDTTGKNMPESKAVYLT
HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCCEEEEE
AHYNQEGKLVEMTHPEPLRFFSDEPGSPAYTVINNEVYTLPITREKYDQLTKEISQNIQE
EEECCCCCEEECCCCCCHHHCCCCCCCCEEEEECCEEEEECCCHHHHHHHHHHHHHHHHH
QDKDKEREQEAVDKFTVGSRQTDIHKEKSIQQADEVSNDAPKSLKSMNEFTRDSRQTDDI
HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHH
YNEKSTRNPEEISSNAPKSLKSIAHDENKNKIQSTDHKSKNSNEYVKQMIKLLNQNYNKI
HCCCCCCCHHHHHCCCHHHHHHHHHCCCCCHHHCCCCCCCCCHHHHHHHHHHHHCCCCCC
DSNEQNRTEQVKLKPVVKSIPESPKNSTQIDPNEEGSIGYVKRVVESMEQTSPNPSEIAQ
CCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHH
RLQVNLANSSQRSSSMSINTPTNTPRNNNQSKSQTKGIV
HHHHHHCCCCCCCCCCEECCCCCCCCCCCCCHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha
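
A structure class such as "Alpha" is typically assigned from the balance of states in the predicted secondary structure. The sketch below tallies the fraction of each state in a prediction string, assuming the usual convention that H is helix, E is strand and C is coil; the record does not state its classification thresholds, so none are applied here.

```python
from collections import Counter

def state_fractions(prediction: str) -> dict[str, float]:
    """Percentage of helix (H), strand (E) and coil (C) states in a prediction."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {s: round(100 * counts.get(s, 0) / len(states), 1) for s in "HEC"}

# Toy prediction string, not the full sca4 prediction.
print(state_fractions("HHHHCCEE"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

To use it on this record, concatenate only the H/E/C lines of the secondary-structure block, skipping the interleaved amino-acid lines.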

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11491333 [H]