Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is 222526088

Identifier: 222526088

GI number: 222526088

Start: 3510690

End: 3511757

Strand: Direct

Name: 222526088

Synonym: Chy400_2845

Alternate gene names: NA

Gene position: 3510690-3511757 (Clockwise)

Preceding gene: 222526087

Following gene: 222526089

Centisome position: 66.63
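
Note on the coordinate fields: the gene length and centisome position follow from the start, end, and chromosome length by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state either convention, but both reproduce the listed values. The function names are illustrative.

```python
# Values copied from this record.
GENOME_LENGTH = 5_268_950          # "Length" in the record header
START, END = 3_510_690, 3_511_757  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 1068
print(round(centisome(START, GENOME_LENGTH), 2))   # 66.63
```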

GC content: 57.12

Gene sequence:

>1068_bases
ATGACAACTCCGGTCTGGCCACCTGTATTCGACGGACATAATGACGTTATTCTCGACCTTTACCGCCCTAAACCCGGTGA
AGAACGCGACTTCTTTCACGCCAGCCCCTACGGTCATCTCGATCTGCCTCGTGCGCGGGCTGGTGGGTTTGGTGGCGGCT
TTTTTGCCATCTACGTCCCACCACCACCTGCGCCCAAACCACCACCAGAACACATTCCCTCGCCACCGTACTATATGCCA
TTGCCGCCACCACTCGAACATAGCTACGCCTTACACACGACGATGGCAATGGCCGCACGCCTCTTCCGCATCGAAGCCGA
GTCCAACGGCCAGTTTCGCATCTGTCGTACTGCTGATGACATTCATGCCTGCCTGACGAATGGTATTGTGGCAGTCGTTT
TCCACATTGAAGGTGCTGAAGCGATTGGCCCCGATCTCGACGAACTAGAGGTACTCTATCAGGCGGGCTTGCGCTCGCTA
GGGCCGGTCTGGAGCCGTCCCAATATCTTTGGGCACGGCGTGCCGTTTGCGTTTCCCGCTTCACCCGATACCGGACCCGG
TCTGACCGACGCCGGTAAAGCGCTGGTCAAGGCGTGCAATCAACTACGCATCCTGATCGATCTCTCGCATTTGAACGAAG
CCGGTTTTTGGGACGTTGCCCGGCTAAGCACGGCACCGTTGGTGGCTACGCATTCCAATGCGCATGCTATCTGCCCCAGT
AGTCGGAATCTTACTGATCGTCAGCTCGATGCCATTCGCGACTCGGATGGCATGGTCGGACTGAATTTTGCCGTCACGTT
TTTGCGCCCTGACGGGCACCGTGATGCTGATACGCCGGTTTCGATTATGGTGCGGCATGTGCAGTATCTGGTTGAACGCC
TGGGTATTGACCGGGTAGGTTTTGGTTCAGATTTCGACGGTGCCTTGATCCCGCAGGCCATTCGCGACGTAAGCGGTTTA
CCGGTACTCTTGCAGGCGCTGGCCGATGCCGGCTTTACACCGGCAGAACTACGCAAGCTGGCCTACGAAAACTGGTTGCG
CGTATTGCGGTTGACCTGGGGCGCATAA
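
Note on GC content: the GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper name, and the short input is just the first four codons of the gene. For a 1,068-base gene, the listed 57.12 would correspond to 610 G or C bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First four codons of the gene sequence in this record.
print(gc_percent("ATGACAACTCCG"))      # 50.0

# Arithmetic check only: 610 G/C bases out of 1068 rounds to the listed value.
print(round(100.0 * 610 / 1068, 2))    # 57.12
```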

Upstream 100 bases:

>100_bases
GGTGATATGCTGTGCGGTCAGGGGCGTGTCCATCTCGCTATCGAGAGCGGTTGGCGCGCTGCGACGCGCATCCGCGAAGT
TGTATCGTGAGGATACCTGT

Downstream 100 bases:

>100_bases
CGTGATCACATCGTGAGTAGACATCAATACCAGACAATCACAGCTAAGGAGAGTACATTCCCGCTATGCAACATCTGCAT
CCGATTGAGCAACGAGTGCT

Product: Membrane dipeptidase

Products: NA

Alternate protein names: ORF X [H]

Number of amino acids: Translated: 355; Mature: 354

Protein sequence:

>355_residues
MTTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVPPPPAPKPPPEHIPSPPYYMP
LPPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADDIHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSL
GPVWSRPNIFGHGVPFAFPASPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPS
SRNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVGFGSDFDGALIPQAIRDVSGL
PVLLQALADAGFTPAELRKLAYENWLRVLRLTWGA
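
Note on translation: the protein sequence corresponds to the gene sequence read codon by codon. The 1,068 bases form 356 codons; dropping the stop codon gives 355 residues, and dropping the initiator methionine gives the 354-residue mature form. The sketch below assumes the standard genetic code; `translate` is an illustrative helper, and the two inputs are the first 30 and last 15 bases of the gene sequence in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACAACTCCGGTCTGGCCACCTGTATTC"))  # MTTPVWPPVF
print(translate("ACCTGGGGCGCATAA"))                 # TWGA

codons = 1068 // 3
print(codons, codons - 1, codons - 2)               # 356 355 354
```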

Sequences:

>Translated_355_residues
MTTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVPPPPAPKPPPEHIPSPPYYMP
LPPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADDIHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSL
GPVWSRPNIFGHGVPFAFPASPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPS
SRNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVGFGSDFDGALIPQAIRDVSGL
PVLLQALADAGFTPAELRKLAYENWLRVLRLTWGA
>Mature_354_residues
TTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVPPPPAPKPPPEHIPSPPYYMPL
PPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADDIHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSLG
PVWSRPNIFGHGVPFAFPASPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPSS
RNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVGFGSDFDGALIPQAIRDVSGLP
VLLQALADAGFTPAELRKLAYENWLRVLRLTWGA

Specific function: Unknown

COG id: COG2355

COG function: function code E; Zn-dependent dipeptidase, microsomal dipeptidase homolog

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M19 family [H]

Homologues:

Organism=Homo sapiens, GI189458885, Length=352, Percent_Identity=30.1136363636364, Blast_Score=145, Evalue=7e-35,
Organism=Homo sapiens, GI4758190, Length=352, Percent_Identity=30.1136363636364, Blast_Score=145, Evalue=7e-35,
Organism=Homo sapiens, GI11641269, Length=347, Percent_Identity=27.0893371757925, Blast_Score=117, Evalue=1e-26,
Organism=Homo sapiens, GI193211608, Length=355, Percent_Identity=24.5070422535211, Blast_Score=94, Evalue=2e-19,
Organism=Homo sapiens, GI193211610, Length=355, Percent_Identity=24.5070422535211, Blast_Score=89, Evalue=5e-18,
Organism=Drosophila melanogaster, GI221475880, Length=355, Percent_Identity=27.887323943662, Blast_Score=141, Evalue=7e-34,
Organism=Drosophila melanogaster, GI281362638, Length=356, Percent_Identity=28.3707865168539, Blast_Score=135, Evalue=6e-32,
Organism=Drosophila melanogaster, GI281362636, Length=356, Percent_Identity=28.3707865168539, Blast_Score=135, Evalue=6e-32,
Organism=Drosophila melanogaster, GI161083233, Length=355, Percent_Identity=25.6338028169014, Blast_Score=105, Evalue=6e-23,
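
Note on the homologue lines: each line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows; `parse_homologue` is a hypothetical helper, and the field names are taken from the lines above. The long Percent_Identity values look like exact ratios: for the first line, 30.1136...% of 352 positions is 106 identical positions.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # Tolerate the trailing comma that ends each line in this record.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Homo sapiens, GI189458885, Length=352, "
        "Percent_Identity=30.1136363636364, Blast_Score=145, Evalue=7e-35,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"])
print(round(hit["Percent_Identity"] * hit["Length"] / 100))  # 106
```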

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008257 [H]

Pfam domain/function: PF01244 Peptidase_M19 [H]

EC number: NA

Molecular weight: Translated: 38735; Mature: 38603
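
Note on molecular weight: the record does not say how these values were computed. A common approach, assumed here, sums average residue masses and adds one water molecule for the free termini. The mass table and the `molecular_weight` helper below are illustrative and have not been checked to reproduce the listed values exactly; the 132 Da difference between the translated and mature forms is consistent with removal of one methionine residue (about 131.19 Da) after rounding.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01524  # added once per chain for the N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Toy inputs: a Gly-Gly dipeptide, and the effect of removing a leading Met.
print(round(molecular_weight("GG"), 2))                            # 132.12
print(round(molecular_weight("MG") - molecular_weight("G"), 2))    # 131.19
```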

Theoretical pI: Translated: 6.14; Mature: 6.14

Prosite motif: PS00869 RENAL_DIPEPTIDASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
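
Note on Cys/Met content: these percentages can be recomputed by counting residues in the translated and mature sequences listed earlier in this record. The sketch below assumes the mature form is the translated sequence without its first residue, as the sequence listing shows; `percent` is an illustrative helper.

```python
# Translated protein sequence copied from this record (355 residues).
TRANSLATED = (
    "MTTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVPPPPAPKPPPEHIPSPPYYMP"
    "LPPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADDIHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSL"
    "GPVWSRPNIFGHGVPFAFPASPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPS"
    "SRNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVGFGSDFDGALIPQAIRDVSGL"
    "PVLLQALADAGFTPAELRKLAYENWLRVLRLTWGA"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed


def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 1.1 1.7 2.8
# Mature 1.1 1.4 2.5
```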

Secondary structure:

>Translated Secondary Structure
MTTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVP
CCCCCCCCCCCCCCCEEEEEECCCCCCCCCCEECCCCCCCCCCHHHCCCCCCCEEEEEEC
PPPAPKPPPEHIPSPPYYMPLPPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADD
CCCCCCCCCCCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHEEEECCCCCEEEEECHHH
IHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSLGPVWSRPNIFGHGVPFAFPA
HHHHHHCCEEEEEEEECCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEECC
SPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPS
CCCCCCCCCHHHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHCCCCEEEECCCCEEECCC
SRNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVG
CCCCCHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC
FGSDFDGALIPQAIRDVSGLPVLLQALADAGFTPAELRKLAYENWLRVLRLTWGA
CCCCCCCCHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
TTPVWPPVFDGHNDVILDLYRPKPGEERDFFHASPYGHLDLPRARAGGFGGGFFAIYVP
CCCCCCCCCCCCCCEEEEEECCCCCCCCCCEECCCCCCCCCCHHHCCCCCCCEEEEEEC
PPPAPKPPPEHIPSPPYYMPLPPPLEHSYALHTTMAMAARLFRIEAESNGQFRICRTADD
CCCCCCCCCCCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHEEEECCCCCEEEEECHHH
IHACLTNGIVAVVFHIEGAEAIGPDLDELEVLYQAGLRSLGPVWSRPNIFGHGVPFAFPA
HHHHHHCCEEEEEEEECCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEECC
SPDTGPGLTDAGKALVKACNQLRILIDLSHLNEAGFWDVARLSTAPLVATHSNAHAICPS
CCCCCCCCCHHHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHCCCCEEEECCCCEEECCC
SRNLTDRQLDAIRDSDGMVGLNFAVTFLRPDGHRDADTPVSIMVRHVQYLVERLGIDRVG
CCCCCHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC
FGSDFDGALIPQAIRDVSGLPVLLQALADAGFTPAELRKLAYENWLRVLRLTWGA
CCCCCCCCHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 1313537 [H]