Definition Mycobacterium avium subsp. paratuberculosis K-10, complete genome.
Accession NC_002944
Length 4,829,781


The map label for this gene is 41410096

Identifier: 41410096

GI number: 41410096

Start: 4458338

End: 4461004

Strand: Reverse

Name: 41410096

Synonym: MAP3998c

Alternate gene names: NA

Gene position: 4461004-4458338 (Counterclockwise)

Preceding gene: 41410097

Following gene: 41410095

Centisome position: 92.36
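
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, not the database's own procedure; the function names are illustrative. The centisome convention is an assumption: the reported 92.36 is reproduced when coordinate 4461004 (the first base of this reverse-strand gene) is expressed as a percentage of the genome length.

```python
GENOME_LENGTH = 4_829_781            # Length field of NC_002944
START, END = 4_458_338, 4_461_004    # Start and End fields of this record


def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed convention)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
codons = length // 3                 # includes the terminal stop codon

print(length)                        # 2667 bases
print(codons - 1)                    # 888 residues once the stop codon is excluded
print(round(centisome(END, GENOME_LENGTH), 2))  # 92.36
```

Using the Start coordinate (4458338) instead gives about 92.31, so the record appears to measure from the 5' end of the gene on its own strand.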

GC content: 68.58

Gene sequence:

>2667_bases
ATGGCGCCGCTGGCGTGTGATCCCACCGCCCTAGACCACGCCGGCGCCACCGTGGTGGCCGCCGGTGAGTCGTTGGGTTC
GGTGATCTCGACCTTGACGGCGGCGCTGGCCGGCACCTCCGGGATGGCCGGTGACGATCCGGTGGGCGCCGCGCTTGGCC
GCCGCTACGACGGCGCGGCGGCCAAGTTGATCCAGGCGATGGCCGACACCCGGAATGGGTTGTGCAGCATCGGCGATGGG
GTGCGGATGTCGGCGCACAACTACGCGGTGGCCGAGGCGATGTCGGACCTGGCGGGCCGGGCCTCCGCCCTGCCGGCACC
CCAGGTGACCGGACCGTTGACGGTCGGGGCGCCGCCGTCGGCGGTGGGTCACGGCAGCGGCGCCCCGGCCGGCTGGGGCT
GGGTGGCCCCGTATATCGGGATGATCTGGCCCACCGGCGATTCGGCGAAGTTGCGGGCCGCCGCGGCGGCCTGGGCCACC
GCCGGTGCCAACTTCATGGCCGCCGAGACCGCGGCCGGGGGCGGAACGATGGCAGCCATTGGCGCACAACAGATTCCGGA
GGGCGCCGCGATCAACAAGGCGCTGGCCGACGCCTCCAGCGCCACGGCCGACGTGGCGCGGCAATGCCAGACGATCGCCG
CGCAGCTCAACAGCTATGCGGCCAAGGTCGACCAGGTGCACGCGGCGATCCTGGATCTGTTGTCGCGCATCTGCGATCCG
CTGACCGGGATCAAAGAGGTCTGGGATCTGCTGACCGACGAAGACGAAGACGAGATCAAGAAGATCGCCGACGACATTCG
CACGGTGGTCGACAACTTCGGCCGGGAAGCCGACACGCTGGGCGGCCAGATCGAGGCCACGGTGTCCGCGGTTGCCGCGG
CCACAGAGAACATGAGCCATTGGGCCGGTAAGGAGTGGGACCACTTCTTGCACGGGACGCCGGTCGGCAGGGCGCTCAAC
CAGGTGGGCCAGGCCTTCAAAGGCGTCGGTGAGGAAGGCTGGGGATTTCTCAAAGGGCTGTATGAGGTCAGCCCCAACCG
GATGCTGCTGGATCCCGTGGGTTACGGCAAAACCATGGCCGGCATGGTTGAAGGGGCCGGCACGCTCGTCGGTCTGGGGC
CCGACGGCGTGCCGGGAGCGTTCGACGCGTGGAAGGCGCTGGGCAAGGACGTCACGCACTGGGACGAGTGGGGCTCGAAT
CCGGCCGAGGCCCTCGGGAAGTCCACTTTCGACGTGGCGACGTTGGCCTTGCCCGGCGGGCCGCTGTCCAAGCTGGGCAA
ATTCGGGCACACCGCCGCCGATGCGCTGAAAGGGCTGAAGAAACCGCCCGGGGTTCCCAAGCCTCCGGAGGTCAAGCCCC
CGGCCGCGCCCAAAGCACCGGACTCGGGCCAGCCGGCGCCTTCGGGAAAGCCGGGGCCGGTCGCGCCCTCGGGGAAGCCG
GCGCCCGGGCCGGCCGACGGTCCGCTGCCGCACAGCCCGACCGAGTCCAAGCCGCCGGCTGGCGGGACACCCCCGGCCGC
TGAGCCGCCCAAGCCGACCGCCGCGCCGCACAGCGGCGAACCGAAACCCATTGCGACGCCGCCGGAATCGGTCGGCAAGC
CAGTGACGCCCGCGCCGGCCGAGGGCGCGCCAGCTCAACCGCACGAGCCCGTTCCCGCCCACGCGCCGGCGTCCCCCGGC
GAACCTCTGGCCACGCCGGCTCCCGCTGCGGCGGTACCCGCGGCTGCGGCGGCGCCTGCGTCCGCTCCAGCACCCGCGGC
CGCGGCGGCGGCGCCCGTTCCATCGGCGTCCTCGGTCCCGATGGGCGGCGGCGCGCCTGCCGAAACACCTTCCGGTCTAG
GTGATGTGCCTCATGGCGGGGAGCCGGGCGCTCATCCCGTCGAGCCGCCGCATGATGGCGCTCCCCACGTGCCCGGCGGC
GGAGATGGGCCATACCATCCTGGTGATGGCGGTCCCCATGGACCCGGCGACGGACATCCCCTGGCCGATGGCAGCGGGCC
GCATCAACCTGGCGGTCATCACCCGCCTGGCGACGGCGATCGACCGCCCGGATCGCACCCGCCACACGACGGCGCGCCTC
CCGATGAACCCGCCGACGGACATCCGCCAGAGATCCCCACCCCATCGGATTTACCGCCTTGGCACCAGGCGCAACTTGCG
CTCGCGGAATCACCTGAGAAACTCGTCAAGGATCTCATAGAGCACGGCTGCCCTCGAGAACTTGCCGAATCAGCCGGGGC
GAATAGCCCTTATGCGGGAATGACCGCGCAAGAAATTTTGAATAAATGGTGGGACCCTGCGACTGGGACGTGGGACTGGC
CAAAAGTGGAAGGGTTCGCCGACGGCATATACAAAACTGCTCGCAGCATTCCCAAAGATGCGTGGCTGGACAGGATTGGA
GAGGTCAGCGACGCTAAAGGTGATTTCATGGGCGCTGTCGGTGATAGCTATCCGCACCGCGGCCTGGCACCTGGTTCATC
TGGCGATTACAATCGGTTCCACGGCACAGGTAAAGAATTGCCCGAAGGTTGGGAAGTCAGATACGGCGAAGTTGGCGACG
CATTTGGCCAGCCAGGGGGCGGCACACAATGGGTAGTAATTGACAAAAACAAGAAGACTGTGCTGATAAAGTGGTTAATC
GAGAACGGCTACCTGGATTGGGGATAG
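
The GC content field reported for this gene is the percentage of G and C bases in the sequence above. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, and ignoring any character other than A, C, G, or T (line breaks, ambiguity codes) is an assumption of this sketch rather than a documented rule of the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a nucleotide sequence."""
    # Keep only unambiguous bases; whitespace and other symbols are skipped.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 12 bases of the gene sequence, used here only as a short example.
print(gc_percent("ATGGCGCCGCTG"))  # 75.0
```

Passing the full 2667-base sequence (with or without line breaks) to the same function would give the gene-wide value for comparison with the reported 68.58.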

Upstream 100 bases:

>100_bases
ATGCGCGAGGCGCTGGCGCAGCTCGGCAAGGCGGCCTCGACCGCGCACGGCAACTACACCGGGGCGATGTCGAAGAATCT
CGGCATGTGGTCGTGACCCG

Downstream 100 bases:

>100_bases
TGGGCGAGCACAGAGAGGTCCTGCAGAGTTGGCTATGAATCTCGATGCAGAAGTTGGATTCGCTGATGCAATCAAGCATT
ACAGCTTCTGGCGCAAAATT

Product: hypothetical protein

Products: NA

Alternate protein names: None

Number of amino acids: Translated: 888; Mature: 887

Protein sequence:

>888_residues
MAPLACDPTALDHAGATVVAAGESLGSVISTLTAALAGTSGMAGDDPVGAALGRRYDGAAAKLIQAMADTRNGLCSIGDG
VRMSAHNYAVAEAMSDLAGRASALPAPQVTGPLTVGAPPSAVGHGSGAPAGWGWVAPYIGMIWPTGDSAKLRAAAAAWAT
AGANFMAAETAAGGGTMAAIGAQQIPEGAAINKALADASSATADVARQCQTIAAQLNSYAAKVDQVHAAILDLLSRICDP
LTGIKEVWDLLTDEDEDEIKKIADDIRTVVDNFGREADTLGGQIEATVSAVAAATENMSHWAGKEWDHFLHGTPVGRALN
QVGQAFKGVGEEGWGFLKGLYEVSPNRMLLDPVGYGKTMAGMVEGAGTLVGLGPDGVPGAFDAWKALGKDVTHWDEWGSN
PAEALGKSTFDVATLALPGGPLSKLGKFGHTAADALKGLKKPPGVPKPPEVKPPAAPKAPDSGQPAPSGKPGPVAPSGKP
APGPADGPLPHSPTESKPPAGGTPPAAEPPKPTAAPHSGEPKPIATPPESVGKPVTPAPAEGAPAQPHEPVPAHAPASPG
EPLATPAPAAAVPAAAAAPASAPAPAAAAAAPVPSASSVPMGGGAPAETPSGLGDVPHGGEPGAHPVEPPHDGAPHVPGG
GDGPYHPGDGGPHGPGDGHPLADGSGPHQPGGHHPPGDGDRPPGSHPPHDGAPPDEPADGHPPEIPTPSDLPPWHQAQLA
LAESPEKLVKDLIEHGCPRELAESAGANSPYAGMTAQEILNKWWDPATGTWDWPKVEGFADGIYKTARSIPKDAWLDRIG
EVSDAKGDFMGAVGDSYPHRGLAPGSSGDYNRFHGTGKELPEGWEVRYGEVGDAFGQPGGGTQWVVIDKNKKTVLIKWLI
ENGYLDWG

Sequences:

>Translated_888_residues
MAPLACDPTALDHAGATVVAAGESLGSVISTLTAALAGTSGMAGDDPVGAALGRRYDGAAAKLIQAMADTRNGLCSIGDG
VRMSAHNYAVAEAMSDLAGRASALPAPQVTGPLTVGAPPSAVGHGSGAPAGWGWVAPYIGMIWPTGDSAKLRAAAAAWAT
AGANFMAAETAAGGGTMAAIGAQQIPEGAAINKALADASSATADVARQCQTIAAQLNSYAAKVDQVHAAILDLLSRICDP
LTGIKEVWDLLTDEDEDEIKKIADDIRTVVDNFGREADTLGGQIEATVSAVAAATENMSHWAGKEWDHFLHGTPVGRALN
QVGQAFKGVGEEGWGFLKGLYEVSPNRMLLDPVGYGKTMAGMVEGAGTLVGLGPDGVPGAFDAWKALGKDVTHWDEWGSN
PAEALGKSTFDVATLALPGGPLSKLGKFGHTAADALKGLKKPPGVPKPPEVKPPAAPKAPDSGQPAPSGKPGPVAPSGKP
APGPADGPLPHSPTESKPPAGGTPPAAEPPKPTAAPHSGEPKPIATPPESVGKPVTPAPAEGAPAQPHEPVPAHAPASPG
EPLATPAPAAAVPAAAAAPASAPAPAAAAAAPVPSASSVPMGGGAPAETPSGLGDVPHGGEPGAHPVEPPHDGAPHVPGG
GDGPYHPGDGGPHGPGDGHPLADGSGPHQPGGHHPPGDGDRPPGSHPPHDGAPPDEPADGHPPEIPTPSDLPPWHQAQLA
LAESPEKLVKDLIEHGCPRELAESAGANSPYAGMTAQEILNKWWDPATGTWDWPKVEGFADGIYKTARSIPKDAWLDRIG
EVSDAKGDFMGAVGDSYPHRGLAPGSSGDYNRFHGTGKELPEGWEVRYGEVGDAFGQPGGGTQWVVIDKNKKTVLIKWLI
ENGYLDWG
>Mature_887_residues
APLACDPTALDHAGATVVAAGESLGSVISTLTAALAGTSGMAGDDPVGAALGRRYDGAAAKLIQAMADTRNGLCSIGDGV
RMSAHNYAVAEAMSDLAGRASALPAPQVTGPLTVGAPPSAVGHGSGAPAGWGWVAPYIGMIWPTGDSAKLRAAAAAWATA
GANFMAAETAAGGGTMAAIGAQQIPEGAAINKALADASSATADVARQCQTIAAQLNSYAAKVDQVHAAILDLLSRICDPL
TGIKEVWDLLTDEDEDEIKKIADDIRTVVDNFGREADTLGGQIEATVSAVAAATENMSHWAGKEWDHFLHGTPVGRALNQ
VGQAFKGVGEEGWGFLKGLYEVSPNRMLLDPVGYGKTMAGMVEGAGTLVGLGPDGVPGAFDAWKALGKDVTHWDEWGSNP
AEALGKSTFDVATLALPGGPLSKLGKFGHTAADALKGLKKPPGVPKPPEVKPPAAPKAPDSGQPAPSGKPGPVAPSGKPA
PGPADGPLPHSPTESKPPAGGTPPAAEPPKPTAAPHSGEPKPIATPPESVGKPVTPAPAEGAPAQPHEPVPAHAPASPGE
PLATPAPAAAVPAAAAAPASAPAPAAAAAAPVPSASSVPMGGGAPAETPSGLGDVPHGGEPGAHPVEPPHDGAPHVPGGG
DGPYHPGDGGPHGPGDGHPLADGSGPHQPGGHHPPGDGDRPPGSHPPHDGAPPDEPADGHPPEIPTPSDLPPWHQAQLAL
AESPEKLVKDLIEHGCPRELAESAGANSPYAGMTAQEILNKWWDPATGTWDWPKVEGFADGIYKTARSIPKDAWLDRIGE
VSDAKGDFMGAVGDSYPHRGLAPGSSGDYNRFHGTGKELPEGWEVRYGEVGDAFGQPGGGTQWVVIDKNKKTVLIKWLIE
NGYLDWG

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 89340; Mature: 89209

Theoretical pI: Translated: 4.84; Mature: 4.84

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
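
The percentages above can be recomputed from the translated and mature sequences. The sketch below is illustrative only: `mature_form` and `cys_met_percent` are hypothetical helper names, and treating the mature protein as the translated protein minus its initiator methionine is taken from the Sequences section of this record, where the mature sequence is the translated sequence without the leading M. The combined value is rounded from the unrounded sum, which may be why the mature row shows 2.1 rather than 0.6 + 1.6 = 2.2.

```python
def mature_form(protein: str) -> str:
    """Drop the initiator methionine, if present (assumed maturation rule)."""
    seq = "".join(protein.split()).upper()
    return seq[1:] if seq.startswith("M") else seq


def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met of a protein sequence, to one decimal."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    # Round the combined value from the unrounded sum, not from rounded parts.
    return {"Cys": round(cys, 1), "Met": round(met, 1), "Cys+Met": round(cys + met, 1)}


# First 20 residues of the translated protein, used only as a short example.
fragment = "MAPLACDPTALDHAGATVVA"
print(cys_met_percent(fragment))               # {'Cys': 5.0, 'Met': 5.0, 'Cys+Met': 10.0}
print(cys_met_percent(mature_form(fragment)))  # {'Cys': 5.3, 'Met': 0.0, 'Cys+Met': 5.3}
```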

Secondary structure:

>Translated Secondary Structure
MAPLACDPTALDHAGATVVAAGESLGSVISTLTAALAGTSGMAGDDPVGAALGRRYDGAA
CCCCCCCCCHHHHCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCHHH
AKLIQAMADTRNGLCSIGDGVRMSAHNYAVAEAMSDLAGRASALPAPQVTGPLTVGAPPS
HHHHHHHHHHCCCCEECCCCCEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCEECCCCCC
AVGHGSGAPAGWGWVAPYIGMIWPTGDSAKLRAAAAAWATAGANFMAAETAAGGGTMAAI
CCCCCCCCCCCCHHHHHHHHEECCCCCCHHHHHHHHHHHHCCCCCEEHHCCCCCCCEEHH
GAQQIPEGAAINKALADASSATADVARQCQTIAAQLNSYAAKVDQVHAAILDLLSRICDP
HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LTGIKEVWDLLTDEDEDEIKKIADDIRTVVDNFGREADTLGGQIEATVSAVAAATENMSH
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHH
WAGKEWDHFLHGTPVGRALNQVGQAFKGVGEEGWGFLKGLYEVSPNRMLLDPVGYGKTMA
HCCCCHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEECCCCCCHHHH
GMVEGAGTLVGLGPDGVPGAFDAWKALGKDVTHWDEWGSNPAEALGKSTFDVATLALPGG
HHHHCCCEEEECCCCCCCCHHHHHHHHCCCCHHHHHHCCCHHHHHCCCHHCEEEEECCCC
PLSKLGKFGHTAADALKGLKKPPGVPKPPEVKPPAAPKAPDSGQPAPSGKPGPVAPSGKP
CHHHHHHHCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
APGPADGPLPHSPTESKPPAGGTPPAAEPPKPTAAPHSGEPKPIATPPESVGKPVTPAPA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCC
EGAPAQPHEPVPAHAPASPGEPLATPAPAAAVPAAAAAPASAPAPAAAAAAPVPSASSVP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCHHHCCCCCCCCCHHHHHCCCCCCCCCC
MGGGAPAETPSGLGDVPHGGEPGAHPVEPPHDGAPHVPGGGDGPYHPGDGGPHGPGDGHP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
LADGSGPHQPGGHHPPGDGDRPPGSHPPHDGAPPDEPADGHPPEIPTPSDLPPWHQAQLA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
LAESPEKLVKDLIEHGCPRELAESAGANSPYAGMTAQEILNKWWDPATGTWDWPKVEGFA
HHCCHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHH
DGIYKTARSIPKDAWLDRIGEVSDAKGDFMGAVGDSYPHRGLAPGSSGDYNRFHGTGKEL
HHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHCCCCCCC
PEGWEVRYGEVGDAFGQPGGGTQWVVIDKNKKTVLIKWLIENGYLDWG
CCCCCEEECCCCHHCCCCCCCCEEEEEECCCCEEEEEEEECCCCCCCC
>Mature Secondary Structure
APLACDPTALDHAGATVVAAGESLGSVISTLTAALAGTSGMAGDDPVGAALGRRYDGAA
CCCCCCCCHHHHCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCHHH
AKLIQAMADTRNGLCSIGDGVRMSAHNYAVAEAMSDLAGRASALPAPQVTGPLTVGAPPS
HHHHHHHHHHCCCCEECCCCCEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCEECCCCCC
AVGHGSGAPAGWGWVAPYIGMIWPTGDSAKLRAAAAAWATAGANFMAAETAAGGGTMAAI
CCCCCCCCCCCCHHHHHHHHEECCCCCCHHHHHHHHHHHHCCCCCEEHHCCCCCCCEEHH
GAQQIPEGAAINKALADASSATADVARQCQTIAAQLNSYAAKVDQVHAAILDLLSRICDP
HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LTGIKEVWDLLTDEDEDEIKKIADDIRTVVDNFGREADTLGGQIEATVSAVAAATENMSH
HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHH
WAGKEWDHFLHGTPVGRALNQVGQAFKGVGEEGWGFLKGLYEVSPNRMLLDPVGYGKTMA
HCCCCHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEECCCCCCHHHH
GMVEGAGTLVGLGPDGVPGAFDAWKALGKDVTHWDEWGSNPAEALGKSTFDVATLALPGG
HHHHCCCEEEECCCCCCCCHHHHHHHHCCCCHHHHHHCCCHHHHHCCCHHCEEEEECCCC
PLSKLGKFGHTAADALKGLKKPPGVPKPPEVKPPAAPKAPDSGQPAPSGKPGPVAPSGKP
CHHHHHHHCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
APGPADGPLPHSPTESKPPAGGTPPAAEPPKPTAAPHSGEPKPIATPPESVGKPVTPAPA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCC
EGAPAQPHEPVPAHAPASPGEPLATPAPAAAVPAAAAAPASAPAPAAAAAAPVPSASSVP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCHHHCCCCCCCCCHHHHHCCCCCCCCCC
MGGGAPAETPSGLGDVPHGGEPGAHPVEPPHDGAPHVPGGGDGPYHPGDGGPHGPGDGHP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
LADGSGPHQPGGHHPPGDGDRPPGSHPPHDGAPPDEPADGHPPEIPTPSDLPPWHQAQLA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
LAESPEKLVKDLIEHGCPRELAESAGANSPYAGMTAQEILNKWWDPATGTWDWPKVEGFA
HHCCHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHH
DGIYKTARSIPKDAWLDRIGEVSDAKGDFMGAVGDSYPHRGLAPGSSGDYNRFHGTGKEL
HHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHCCCCCCC
PEGWEVRYGEVGDAFGQPGGGTQWVVIDKNKKTVLIKWLIENGYLDWG
CCCCCEEECCCCHHCCCCCCCCEEEEEECCCCEEEEEEEECCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA