Definition Chlamydophila abortus S26/3, complete genome.
Accession NC_004552
Length 1,144,377

The map label for this gene is 62185379

Identifier: 62185379

GI number: 62185379

Start: 894981

End: 897803

Strand: Direct

Name: 62185379

Synonym: CAB774

Alternate gene names: NA

Gene position: 894981-897803 (Clockwise)

Preceding gene: 62185378

Following gene: 62185380

Centisome position: 78.21
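
The coordinate fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state the formula, and the function names are illustrative only.

```python
GENOME_LENGTH = 1_144_377          # from the Length field
START, END = 894_981, 897_803      # from the Start and End fields


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


length = gene_length(START, END)
codons = length // 3               # codon count, including the stop codon
residues = codons - 1              # residues in the translated protein
position = centisome(START, GENOME_LENGTH)
```

Under these assumptions the interval spans 2823 bases, which corresponds to 940 residues plus a stop codon and agrees with the sequence headers in this record.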

GC content: 42.72

Gene sequence:

>2823_bases
ATGTCACTAGACAACAATAATTTCCGGGCAGCTTTTGCATACCCACAACCTGCTTCAGCACTGCACGGAACCTCTCTAAT
AAAAACCGTTAATCAAAAGATTTCTTTCCTATCCATATTTAATGCGTTAGGAAATAAAATCGGTTCTTGTCTTTGTTTGC
ATCCAGAGCCTGATTCTAAAGCCGGATGGGTCTTTACCTTTGTTTTATCTGCTATTATTACAGTTCTGCTCTGTATTATT
CTTCTCCCTGTGAAGTTAATCCTTCTAGGATTAAGTTGCTGCCCCTGCTTATCTAAACCTACCACAGGGGTGGAGGCACC
TGAAGTGCCATCTTCTTCAAGACCTCCAATTCCCCCAGCAGGAGAGGCGGGTGCTTTTTCTCAACCTCCTGTAGGATTAG
ATCCATCTAGATTTTCGCCGGATTCGTTTATTCCTGCGCCTCCACTCAGTCCAACCTCAATGCCATCCGCAGGAGGCGTA
GTGTCTCCAGGAATGACCCTTAGAGAGTTCTTGCAAACAAACTACCCTACAGTCGACTTAAACACCGTTACCCTAGACAG
TTTAGGAATTCCCCTTTTATTAACATTAGACGATCTCCCTGAAGGAACTACTCTTCTTGATCTTCCCATGTCTCTACTTT
TCGAAGAAGGTAATCGCGACCTATCTCAACTCCCCCTATTCCAAAGCCATACGGCTGACTCGTCTCCTATATCTTTAACT
GGTTCTTTATCTTCACTTCTAGCACCCTTAGAAGAAGATCTAGAAGATTCTCAAGATCAGGGTGGCAGAACTACTGCACC
AACTTCTCTAATTGTGGATACACCTGCAGCCATTCCTGCTGTAGAAGTTAACCAACAACTCTCTAGTAGAGAATTGTTAA
ATAGTTTGTATCCCAATATGGATCACACAAGATTCATGAACAGTGCGCGTGTAAACCTAAGACTTCAAGGCATTCCTGGA
CCTCTCAGTGACGATGATGTTCTTAATCTTCCTGCAATCATTGCCTTCCCCGATCTAGTTGCTGGACAGCCCGCGCGTCC
TACCTCCTTAACTCTCACTGACACACCAGCATCCCTGGCTTCTGTACAAGAAGAACCTACTGCGCCCCCGCCTAGTGAAG
AATTAATTTCTCCTAGTGACCCCCGATATACTTTCCTACAGAACCACTTTCCTGAACTAGAACCTGAATACTACAGCAGA
CACATTAGTTTACTAGCTTCACTTTCTGGTGTGGACGAAGGAAGCTTCAATCTTCTTGAATTACCTTTGGAAGCATTTAT
TTATACGCAACCTATTCTAGATTACGAGCCGATTCCTTCAGAGCATTTGCAAGAAAGATTAGGAGAGGTCTCTCCCGAAG
AAGATGTACGAAGAAATAACGAGTTTATTGATAATCTCCTAGAAAATACACCCTATCGCTGGACTTTTCTAAATCGACTA
AGAAGCAATATTACTAACTCTACTCAAAGTGCAGACTTGCGTAGACAGTGGTTCTCAATAATAGACATGATCGTTAATAA
GAGCAGTCCGGAACTTGAAATCGAAGATATCAGCAATACTGCTCGTGCATACCTGTTCAGAATTCATAATATTTTAAAAA
ATCCTGAGATTCCTACTGAAAGAAAATCAGAGATGTTAAAATACATAGCCTCTCATTATGATCCGAATTCTGTGGCAATG
TGTTTAGCAGCCATGCAACAAGAAATCGCTTTACAAAATGAGATAACCCCTGAGTTAGCTAGTGTCGAGGCAGAGATGGG
AGCCAATGGCGTCAGCTCATCTATTAGTCAAATTCTTCCTCCTCTAGCCTCTCAAGCCACTCCTCAAGAAGTAGACGGAT
ACATCCAACTGTTAAAAAGCCTTCTATCAGGCCCTATGCTCACAAATGAGGATAACATCCACTTGGCTCCAGCTAATGAT
ATCTATCTAGAGTCATTGATGAGAGATGTACCCAACAGTTGGGGACCCATCCATCGACCACTACAAAATCGCATCAGACG
ACTTCTAGAGGCTGAGGACAACCGTATTCTTCAACAAGTACAAAACCGCGCAACTCAAACTGCACGATTGGCACAGAACC
AACGGATACGCGATAATTGGAATAGCATATTGCTTGCTCTATCCGATGGTAGAGAGGGATCAGTTGCTTCCGACGAGGCC
CAAGCTCTTTCCCGCTCTACAATGTATCAAGTGCTTCAACTTATCGATAATCCTAACATACCGCACGACAAAAAATTCTC
AGTTATCAGCAACGTAGCCTCATACAGTGATCGGTGTCCTCCCACTTGGGTCCGAGTTGCCGGCCAGGAGTTACAAGCTA
TCTTTAATACTAACGATGAGACAGCAAATATTGTGCTTGTTTGGGCGCAAATATTTAAGGAAGGGCTTTTATCAGAAATT
TTCAGAAACCAACGAGAATGGCATATGATGACAGCCTTTAAGATCATTCGCGGTTCTGAATTGGGATTAGACAATGTGGG
TATTATTCTAGACCCGTATACAACCGCGCTAACTGGTCGTCACTACACTAATCAGCATAACCAATATTTCGCACAATTCC
TAAATGTTTACCGAAATAGTGGTAACAACTTGATTAATTCTGCTCTAGAACAGTCTCTTGGAGGTTCTGAAGATCAGATA
CAAGCTCTAACCAATACGATCTTAGCAGACTTAACAGCTGCAGGTATTCCTGAAGCACATCGCGCTCAAATTATGGAGGA
AATCTTCTTCCCGGAAGAAAATGACTACAAACCTTCGAGAGAAGCTATCTGTTATTTACTACTTAAAGAAGGTGTGATTA
TGACTCAAGACCACAACCAGTAA
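
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch under that assumption follows; `gc_content` is an illustrative name, and the example input is only the first 12 bases of the gene, not the full sequence.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    seq = dna.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence shown above.
prefix = "ATGTCACTAGAC"
prefix_gc = round(gc_content(prefix), 2)
```

To check the reported value, the full sequence would be passed in as one string with line breaks removed.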

Upstream 100 bases:

>100_bases
CACAGGTTCTTCTAAAATGTTTTAGCTTTTCATAAAAAAGCACGAAAATAGACCCTTTTAGGGAGAATTTTTTGGCTTAT
TTTCATTCAGGTAAATTTTT

Downstream 100 bases:

>100_bases
ATATGAAAATCGTTCAATACATATCTTCTTGAAGCTAAGGCAACATAGCCCCACAGTTGTGTTGCCTCTATCTTTTAACC
GAAACAAAAAATAACACTAT

Product: hypothetical protein

Products: NA

Alternate protein names: None

Number of amino acids: Translated: 940; Mature: 939

Protein sequence:

>940_residues
MSLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSKAGWVFTFVLSAIITVLLCII
LLPVKLILLGLSCCPCLSKPTTGVEAPEVPSSSRPPIPPAGEAGAFSQPPVGLDPSRFSPDSFIPAPPLSPTSMPSAGGV
VSPGMTLREFLQTNYPTVDLNTVTLDSLGIPLLLTLDDLPEGTTLLDLPMSLLFEEGNRDLSQLPLFQSHTADSSPISLT
GSLSSLLAPLEEDLEDSQDQGGRTTAPTSLIVDTPAAIPAVEVNQQLSSRELLNSLYPNMDHTRFMNSARVNLRLQGIPG
PLSDDDVLNLPAIIAFPDLVAGQPARPTSLTLTDTPASLASVQEEPTAPPPSEELISPSDPRYTFLQNHFPELEPEYYSR
HISLLASLSGVDEGSFNLLELPLEAFIYTQPILDYEPIPSEHLQERLGEVSPEEDVRRNNEFIDNLLENTPYRWTFLNRL
RSNITNSTQSADLRRQWFSIIDMIVNKSSPELEIEDISNTARAYLFRIHNILKNPEIPTERKSEMLKYIASHYDPNSVAM
CLAAMQQEIALQNEITPELASVEAEMGANGVSSSISQILPPLASQATPQEVDGYIQLLKSLLSGPMLTNEDNIHLAPAND
IYLESLMRDVPNSWGPIHRPLQNRIRRLLEAEDNRILQQVQNRATQTARLAQNQRIRDNWNSILLALSDGREGSVASDEA
QALSRSTMYQVLQLIDNPNIPHDKKFSVISNVASYSDRCPPTWVRVAGQELQAIFNTNDETANIVLVWAQIFKEGLLSEI
FRNQREWHMMTAFKIIRGSELGLDNVGIILDPYTTALTGRHYTNQHNQYFAQFLNVYRNSGNNLINSALEQSLGGSEDQI
QALTNTILADLTAAGIPEAHRAQIMEEIFFPEENDYKPSREAICYLLLKEGVIMTQDHNQ
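
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments (which the bacterial translation table shares) and translation up to the first stop codon. The `translate` helper is illustrative and is not the tool that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = dna.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and final 21 bases of the gene sequence in this record.
head = translate("ATGTCACTAGACAACAATAATTTCCGG")
tail = translate("ACTCAAGACCACAACCAGTAA")
```

The two fragments translate to the first nine and last six residues of the protein sequence above.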

Sequences:

>Translated_940_residues
MSLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSKAGWVFTFVLSAIITVLLCII
LLPVKLILLGLSCCPCLSKPTTGVEAPEVPSSSRPPIPPAGEAGAFSQPPVGLDPSRFSPDSFIPAPPLSPTSMPSAGGV
VSPGMTLREFLQTNYPTVDLNTVTLDSLGIPLLLTLDDLPEGTTLLDLPMSLLFEEGNRDLSQLPLFQSHTADSSPISLT
GSLSSLLAPLEEDLEDSQDQGGRTTAPTSLIVDTPAAIPAVEVNQQLSSRELLNSLYPNMDHTRFMNSARVNLRLQGIPG
PLSDDDVLNLPAIIAFPDLVAGQPARPTSLTLTDTPASLASVQEEPTAPPPSEELISPSDPRYTFLQNHFPELEPEYYSR
HISLLASLSGVDEGSFNLLELPLEAFIYTQPILDYEPIPSEHLQERLGEVSPEEDVRRNNEFIDNLLENTPYRWTFLNRL
RSNITNSTQSADLRRQWFSIIDMIVNKSSPELEIEDISNTARAYLFRIHNILKNPEIPTERKSEMLKYIASHYDPNSVAM
CLAAMQQEIALQNEITPELASVEAEMGANGVSSSISQILPPLASQATPQEVDGYIQLLKSLLSGPMLTNEDNIHLAPAND
IYLESLMRDVPNSWGPIHRPLQNRIRRLLEAEDNRILQQVQNRATQTARLAQNQRIRDNWNSILLALSDGREGSVASDEA
QALSRSTMYQVLQLIDNPNIPHDKKFSVISNVASYSDRCPPTWVRVAGQELQAIFNTNDETANIVLVWAQIFKEGLLSEI
FRNQREWHMMTAFKIIRGSELGLDNVGIILDPYTTALTGRHYTNQHNQYFAQFLNVYRNSGNNLINSALEQSLGGSEDQI
QALTNTILADLTAAGIPEAHRAQIMEEIFFPEENDYKPSREAICYLLLKEGVIMTQDHNQ
>Mature_939_residues
SLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSKAGWVFTFVLSAIITVLLCIIL
LPVKLILLGLSCCPCLSKPTTGVEAPEVPSSSRPPIPPAGEAGAFSQPPVGLDPSRFSPDSFIPAPPLSPTSMPSAGGVV
SPGMTLREFLQTNYPTVDLNTVTLDSLGIPLLLTLDDLPEGTTLLDLPMSLLFEEGNRDLSQLPLFQSHTADSSPISLTG
SLSSLLAPLEEDLEDSQDQGGRTTAPTSLIVDTPAAIPAVEVNQQLSSRELLNSLYPNMDHTRFMNSARVNLRLQGIPGP
LSDDDVLNLPAIIAFPDLVAGQPARPTSLTLTDTPASLASVQEEPTAPPPSEELISPSDPRYTFLQNHFPELEPEYYSRH
ISLLASLSGVDEGSFNLLELPLEAFIYTQPILDYEPIPSEHLQERLGEVSPEEDVRRNNEFIDNLLENTPYRWTFLNRLR
SNITNSTQSADLRRQWFSIIDMIVNKSSPELEIEDISNTARAYLFRIHNILKNPEIPTERKSEMLKYIASHYDPNSVAMC
LAAMQQEIALQNEITPELASVEAEMGANGVSSSISQILPPLASQATPQEVDGYIQLLKSLLSGPMLTNEDNIHLAPANDI
YLESLMRDVPNSWGPIHRPLQNRIRRLLEAEDNRILQQVQNRATQTARLAQNQRIRDNWNSILLALSDGREGSVASDEAQ
ALSRSTMYQVLQLIDNPNIPHDKKFSVISNVASYSDRCPPTWVRVAGQELQAIFNTNDETANIVLVWAQIFKEGLLSEIF
RNQREWHMMTAFKIIRGSELGLDNVGIILDPYTTALTGRHYTNQHNQYFAQFLNVYRNSGNNLINSALEQSLGGSEDQIQ
ALTNTILADLTAAGIPEAHRAQIMEEIFFPEENDYKPSREAICYLLLKEGVIMTQDHNQ
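
The Mature entry appears to be the Translated entry with the initiator methionine removed, which would account for the 940 versus 939 residue counts. A small sketch under that assumption follows; `mature_form` is a hypothetical helper, and no other processing (such as signal peptide cleavage) is modelled.

```python
def mature_form(translated: str) -> str:
    """Drop a leading initiator Met, if present; otherwise return unchanged."""
    return translated[1:] if translated.startswith("M") else translated


# First ten residues of the Translated entry above.
translated_prefix = "MSLDNNNFRA"
mature_prefix = mature_form(translated_prefix)
```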

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 103876; Mature: 103744

Theoretical pI: Translated: 4.40; Mature: 4.40

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
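
The percentages above are presumably residue counts divided by sequence length. A minimal sketch under that assumption follows; `cys_met_content` is an illustrative name, and the example input is only the first 60 residues of the translated protein.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M), and their sum in a protein string."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# First 60 residues of the translated protein in this record.
fragment = "MSLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSK"
content = cys_met_content(fragment)
```

Removing the initiator methionine lowers the Met count by one, which is consistent with the small difference between the Translated and Mature Met values.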

Secondary structure:

>Translated Secondary Structure
MSLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSK
CCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCC
AGWVFTFVLSAIITVLLCIILLPVKLILLGLSCCPCLSKPTTGVEAPEVPSSSRPPIPPA
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC
GEAGAFSQPPVGLDPSRFSPDSFIPAPPLSPTSMPSAGGVVSPGMTLREFLQTNYPTVDL
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEEE
NTVTLDSLGIPLLLTLDDLPEGTTLLDLPMSLLFEEGNRDLSQLPLFQSHTADSSPISLT
EEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHCCCCCCCCEEEC
GSLSSLLAPLEEDLEDSQDQGGRTTAPTSLIVDTPAAIPAVEVNQQLSSRELLNSLYPNM
CCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCC
DHTRFMNSARVNLRLQGIPGPLSDDDVLNLPAIIAFPDLVAGQPARPTSLTLTDTPASLA
CHHHHHCCCEEEEEEECCCCCCCCCCCCCCCHHHCCCHHHCCCCCCCCEEEEECCCHHHH
SVQEEPTAPPPSEELISPSDPRYTFLQNHFPELEPEYYSRHISLLASLSGVDEGSFNLLE
HHHHCCCCCCCHHHCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHH
LPLEAFIYTQPILDYEPIPSEHLQERLGEVSPEEDVRRNNEFIDNLLENTPYRWTFLNRL
HHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHH
RSNITNSTQSADLRRQWFSIIDMIVNKSSPELEIEDISNTARAYLFRIHNILKNPEIPTE
HHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCH
RKSEMLKYIASHYDPNSVAMCLAAMQQEIALQNEITPELASVEAEMGANGVSSSISQILP
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHC
PLASQATPQEVDGYIQLLKSLLSGPMLTNEDNIHLAPANDIYLESLMRDVPNSWGPIHRP
HHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCHHH
LQNRIRRLLEAEDNRILQQVQNRATQTARLAQNQRIRDNWNSILLALSDGREGSVASDEA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCEEEEEECCCCCCCCCCHHH
QALSRSTMYQVLQLIDNPNIPHDKKFSVISNVASYSDRCPPTWVRVAGQELQAIFNTNDE
HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCC
TANIVLVWAQIFKEGLLSEIFRNQREWHMMTAFKIIRGSELGLDNVGIILDPYTTALTGR
CCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCEEECCHHHHHCCC
HYTNQHNQYFAQFLNVYRNSGNNLINSALEQSLGGSEDQIQALTNTILADLTAAGIPEAH
CCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHH
RAQIMEEIFFPEENDYKPSREAICYLLLKEGVIMTQDHNQ
HHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEECCCCC
>Mature Secondary Structure
SLDNNNFRAAFAYPQPASALHGTSLIKTVNQKISFLSIFNALGNKIGSCLCLHPEPDSK
CCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCC
AGWVFTFVLSAIITVLLCIILLPVKLILLGLSCCPCLSKPTTGVEAPEVPSSSRPPIPPA
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC
GEAGAFSQPPVGLDPSRFSPDSFIPAPPLSPTSMPSAGGVVSPGMTLREFLQTNYPTVDL
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEEE
NTVTLDSLGIPLLLTLDDLPEGTTLLDLPMSLLFEEGNRDLSQLPLFQSHTADSSPISLT
EEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHCCCCCCCCEEEC
GSLSSLLAPLEEDLEDSQDQGGRTTAPTSLIVDTPAAIPAVEVNQQLSSRELLNSLYPNM
CCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCC
DHTRFMNSARVNLRLQGIPGPLSDDDVLNLPAIIAFPDLVAGQPARPTSLTLTDTPASLA
CHHHHHCCCEEEEEEECCCCCCCCCCCCCCCHHHCCCHHHCCCCCCCCEEEEECCCHHHH
SVQEEPTAPPPSEELISPSDPRYTFLQNHFPELEPEYYSRHISLLASLSGVDEGSFNLLE
HHHHCCCCCCCHHHCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHH
LPLEAFIYTQPILDYEPIPSEHLQERLGEVSPEEDVRRNNEFIDNLLENTPYRWTFLNRL
HHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHH
RSNITNSTQSADLRRQWFSIIDMIVNKSSPELEIEDISNTARAYLFRIHNILKNPEIPTE
HHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCH
RKSEMLKYIASHYDPNSVAMCLAAMQQEIALQNEITPELASVEAEMGANGVSSSISQILP
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHC
PLASQATPQEVDGYIQLLKSLLSGPMLTNEDNIHLAPANDIYLESLMRDVPNSWGPIHRP
HHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCHHH
LQNRIRRLLEAEDNRILQQVQNRATQTARLAQNQRIRDNWNSILLALSDGREGSVASDEA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCEEEEEECCCCCCCCCCHHH
QALSRSTMYQVLQLIDNPNIPHDKKFSVISNVASYSDRCPPTWVRVAGQELQAIFNTNDE
HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCC
TANIVLVWAQIFKEGLLSEIFRNQREWHMMTAFKIIRGSELGLDNVGIILDPYTTALTGR
CCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCEEECCHHHHHCCC
HYTNQHNQYFAQFLNVYRNSGNNLINSALEQSLGGSEDQIQALTNTILADLTAAGIPEAH
CCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHH
RAQIMEEIFFPEENDYKPSREAICYLLLKEGVIMTQDHNQ
HHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEECCCCC
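
The prediction strings above can be summarised as per-state fractions. The sketch below assumes the common three-state convention (H for helix, E for strand, C for coil), which this record does not define explicitly; `state_fractions` is an illustrative name and the example input is a toy string.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of each secondary-structure state in a prediction string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: counts.get(state, 0) / len(states) for state in "HEC"}


# Toy example: 4 helix, 2 strand, and 2 coil positions.
fractions = state_fractions("CCHHHHEE")
```

For the full protein, the prediction lines would be concatenated into one string (skipping the interleaved residue lines) before calling the function.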

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA