Definition Chlorobium phaeobacteroides DSM 266 chromosome, complete genome.
Accession NC_008639
Length 3,133,902

The map label for this gene is appC [H]

Identifier: 119357477

GI number: 119357477

Start: 1904027

End: 1905511

Strand: Direct

Name: appC [H]

Synonym: Cpha266_1676

Alternate gene names: 119357477

Gene position: 1904027-1905511 (Clockwise)

Preceding gene: 119357476

Following gene: 119357478

Centisome position: 60.76
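
The gene length and centisome position can be recomputed from the coordinates in this record. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state this definition, although it reproduces the listed value. The function and variable names are illustrative.

```python
# Coordinates copied from this record; names are illustrative, not part of the record.
GENOME_LENGTH = 3_133_902
START, END = 1_904_027, 1_905_511


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 1485
print(round(centisome(START, GENOME_LENGTH), 2))   # 60.76
```

Both results agree with the 1485-base gene sequence and the centisome position listed in this record.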

GC content: 50.03

Gene sequence:

>1485_bases
ATGAACAACGGAAAAAGCCATACTCAACCCGGTAATAACATGCCCCTCTTTTGCCTTATCCCCGGGCTGGGACTTCTCCG
CCTGAAAAAAACAGCTCAAGGTTTGTTCCTGCTCTTCTCTTTCCTCGCCTATCTCTTTATTCTCGTCATGCGGTTTGACC
TGGTTCTGTTCAGCTTCCGGTCGATGCTCACCGCACTGACCATGCTCATCATAACACCGGATACACTCCGGGAAATCTTC
AGTCCGGAGATCCTTGAATTCTGGATTGGCTCCTGCTGTCTTGTCGCCGTTCCCATGCTTCTTTTTTTCATCTCACTGCG
ATCCTGCAAAAAAACGGTTTCAGAAAAAAACAAGCCGACCAGAGAAGAATCAAGCCTCGGCAAAATAAGTCTTCAGGCTT
TCATGCGTCAATCAATCGCGCTGTATGCATCGGCCATCATTTTTGTCCTCTACTCGGTAGCATTCCTTGCGCCGTTCATT
GCACCATTCAGCCCCTATGATCAGCAGGACTTTCTTGTCACTGCCTATCAACCACCCATGACCCGTCTGCAGGCGCTGAT
ACTTCAGCAGCCGAAACACCTCGTCATTCCGATACAAAAGGGCTCAGACAAAGCAACAGAACTGAGTAATTCCTTTATCA
GCGACTATCAGAAACTCACCTCACGAAACGAACCCCACGCGCTGAAATTCGTCAACAGCTATAACGTTCAGTGCGAAACG
GTGACATACGTCCAGGGAATACGGACAAAAACCATTCCGATTGCCGAACTCGCAGCCGGAAAAGATGCTACGGCAAAACT
TGCCGTTACCCGAACGTTCATACTCGGTACCGACCAGTACGGACGCGACATTCTCAGCCGAGTTATCTATGGCTCGAGAA
TATCACTCTCCATCGGGTTTCTTGTCGTCCTTATCTCCGTAACGCTCGGAACCATTATCGGTGTCTCATCGGGCTATTTC
GGCGGCTGGATCGATGCCATACTGATGCGAATTGTCGATGTACTTATAGCCTTCCCGGCACTTTTTCTCATACTTATCAT
CATCGCCGCATTCGGAAACTCCATCTACCTTATTGTGATTACGCTTTCCTTCACCGGATGGATGGGTGTGGCAAGAATTG
TCAGAAGCCAGGTGCTCTCGCTCAAAGAACAGGAGTTCATTCTGGCCGCAAAATCGCTCGGGCTTTCTAATATGAGAATC
ATTTTCCGCCACCTTGCGCCCAACACGCTGACGCCGGTCATTATCGCGGCAACACTCCGTATAGGCAGCATCATTCTTAC
CGAAGCAGGACTATCGTTTCTCGGGCTCGGCGTTCAGCCGCCTACAGCAAGCTGGGGCAACATCATCAACGAAGGACGCG
ACAGCCTTTTGAACCACTGGTGGATATCAACATTTCCAGGCATCGCCATTCTCACCACGGTGGTATGCTTTAACCTGATC
GGTGACGGCGTGCGTGACGCTCTCGATCCGAGAATGAGAGGATAA
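
A GC content figure like the one listed for this gene can be recomputed from a nucleotide sequence. The sketch below assumes the value is simply the percentage of G and C bases over the full sequence; the function name is hypothetical, and the demo input is only the first 20 bases of the gene, not the whole sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 20 bases of the gene sequence, used only as a short demo input.
print(gc_content("ATGAACAACGGAAAAAGCCA"))  # 40.0
```

Pasting the full 1485-base sequence into the same function would be the way to check the reported 50.03.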

Upstream 100 bases:

>100_bases
AAAACAGCTTTTTCTCCATGAGAAAGCTGTTTTTTTTTATCCGAATTGTTTGATTTTGTGCTGCATCTGTCGTTTTTTCC
AGTAATTGATCGATTATTTC

Downstream 100 bases:

>100_bases
CCATGACTGAAACCCGGATACATGAGGAACCGAAACCATGGACAACGGTATCCTCCCGATACCTCTACACCGAACCATGG
CTGACGCTCAGAAAAGACAA

Product: binding-protein-dependent transport systems inner membrane component

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 494; Mature: 494

Protein sequence:

>494_residues
MNNGKSHTQPGNNMPLFCLIPGLGLLRLKKTAQGLFLLFSFLAYLFILVMRFDLVLFSFRSMLTALTMLIITPDTLREIF
SPEILEFWIGSCCLVAVPMLLFFISLRSCKKTVSEKNKPTREESSLGKISLQAFMRQSIALYASAIIFVLYSVAFLAPFI
APFSPYDQQDFLVTAYQPPMTRLQALILQQPKHLVIPIQKGSDKATELSNSFISDYQKLTSRNEPHALKFVNSYNVQCET
VTYVQGIRTKTIPIAELAAGKDATAKLAVTRTFILGTDQYGRDILSRVIYGSRISLSIGFLVVLISVTLGTIIGVSSGYF
GGWIDAILMRIVDVLIAFPALFLILIIIAAFGNSIYLIVITLSFTGWMGVARIVRSQVLSLKEQEFILAAKSLGLSNMRI
IFRHLAPNTLTPVIIAATLRIGSIILTEAGLSFLGLGVQPPTASWGNIINEGRDSLLNHWWISTFPGIAILTTVVCFNLI
GDGVRDALDPRMRG
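
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments (the bacterial table uses the same codon-to-amino-acid mapping); the helper names are hypothetical. It translates only the first and last 18 bases of the gene as a spot check.

```python
# Standard genetic code in compact form, codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0; optionally stop at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACAACGGAAAAAGC"))   # MNNGKS  (first 18 bases of the gene)
print(translate("CCGAGAATGAGAGGATAA"))   # PRMRG   (last 18 bases, ending in the TAA stop)
print(1485 // 3 - 1)                     # 494 residues once the stop codon is excluded
```

The two fragments match the start (MNNGKS) and end (PRMRG) of the 494-residue sequence listed here.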

Sequences:

>Translated_494_residues
MNNGKSHTQPGNNMPLFCLIPGLGLLRLKKTAQGLFLLFSFLAYLFILVMRFDLVLFSFRSMLTALTMLIITPDTLREIF
SPEILEFWIGSCCLVAVPMLLFFISLRSCKKTVSEKNKPTREESSLGKISLQAFMRQSIALYASAIIFVLYSVAFLAPFI
APFSPYDQQDFLVTAYQPPMTRLQALILQQPKHLVIPIQKGSDKATELSNSFISDYQKLTSRNEPHALKFVNSYNVQCET
VTYVQGIRTKTIPIAELAAGKDATAKLAVTRTFILGTDQYGRDILSRVIYGSRISLSIGFLVVLISVTLGTIIGVSSGYF
GGWIDAILMRIVDVLIAFPALFLILIIIAAFGNSIYLIVITLSFTGWMGVARIVRSQVLSLKEQEFILAAKSLGLSNMRI
IFRHLAPNTLTPVIIAATLRIGSIILTEAGLSFLGLGVQPPTASWGNIINEGRDSLLNHWWISTFPGIAILTTVVCFNLI
GDGVRDALDPRMRG
>Mature_494_residues
MNNGKSHTQPGNNMPLFCLIPGLGLLRLKKTAQGLFLLFSFLAYLFILVMRFDLVLFSFRSMLTALTMLIITPDTLREIF
SPEILEFWIGSCCLVAVPMLLFFISLRSCKKTVSEKNKPTREESSLGKISLQAFMRQSIALYASAIIFVLYSVAFLAPFI
APFSPYDQQDFLVTAYQPPMTRLQALILQQPKHLVIPIQKGSDKATELSNSFISDYQKLTSRNEPHALKFVNSYNVQCET
VTYVQGIRTKTIPIAELAAGKDATAKLAVTRTFILGTDQYGRDILSRVIYGSRISLSIGFLVVLISVTLGTIIGVSSGYF
GGWIDAILMRIVDVLIAFPALFLILIIIAAFGNSIYLIVITLSFTGWMGVARIVRSQVLSLKEQEFILAAKSLGLSNMRI
IFRHLAPNTLTPVIIAATLRIGSIILTEAGLSFLGLGVQPPTASWGNIINEGRDSLLNHWWISTFPGIAILTTVVCFNLI
GDGVRDALDPRMRG

Specific function: This protein is a component of an oligopeptide permease, a binding protein-dependent transport system. This APP system can completely substitute for the OPP system in both sporulation and genetic competence, though, unlike OPP, is incapable of transportin


COG id: COG1173

COG function: function code EP; ABC-type dipeptide/oligopeptide/nickel transport systems, permease components

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 ABC transmembrane type-1 domain [H]

Homologues:

Organism=Escherichia coli, GI1787054, Length=219, Percent_Identity=44.75, Blast_Score=208, Evalue=6e-55
Organism=Escherichia coli, GI1787760, Length=234, Percent_Identity=41.88, Blast_Score=190, Evalue=2e-49
Organism=Escherichia coli, GI1787498, Length=218, Percent_Identity=45.87, Blast_Score=181, Evalue=7e-47
Organism=Escherichia coli, GI1789964, Length=220, Percent_Identity=45, Blast_Score=174, Evalue=9e-45
Organism=Escherichia coli, GI1789889, Length=218, Percent_Identity=39.91, Blast_Score=163, Evalue=2e-41
Organism=Escherichia coli, GI1788505, Length=223, Percent_Identity=36.32, Blast_Score=135, Evalue=4e-33
Organism=Escherichia coli, GI1787549, Length=221, Percent_Identity=33.94, Blast_Score=116, Evalue=3e-27
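
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be read into a dictionary for sorting or filtering. The parser below is a minimal sketch with a hypothetical function name; it assumes the field names shown in this record and tolerates a trailing comma.

```python
def parse_hit(line: str) -> dict:
    """Parse one 'key=value, ...' homologue line into a dict with typed numeric fields."""
    hit = {}
    for field in (f.strip() for f in line.split(",")):
        if not field:
            continue  # skip the empty piece left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare identifier such as GI1789964
    for key in ("Length", "Blast_Score"):
        hit[key] = int(hit[key])
    for key in ("Percent_Identity", "Evalue"):
        hit[key] = float(hit[key])
    return hit


line = "Organism=Escherichia coli, GI1789964, Length=220, Percent_Identity=45, Blast_Score=174, Evalue=9e-45,"
print(parse_hit(line))
```

With the hits parsed this way, ordering them by `Evalue` or `Blast_Score` is a one-line `sorted` call.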

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000515 [H]

Pfam domain/function: PF00528 BPD_transp_1 [H]

EC number: NA

Molecular weight: Translated: 54577; Mature: 54577

Theoretical pI: Translated: 9.86; Mature: 9.86

Prosite motif: PS50928 ABC_TM1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
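
The Cys/Met percentages are residue counts divided by the protein length. The sketch below assumes exactly that definition, with one decimal of rounding; the function name is hypothetical. The demo input is only the last 14 residues of the protein, so its output differs from the whole-protein values in this record.

```python
def cys_met_percent(protein: str) -> dict:
    """Percentage of Cys (C) and Met (M) residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Last 14 residues of the protein sequence, used only as a short demo input.
print(cys_met_percent("GDGVRDALDPRMRG"))
```

Running the same function on the full 494-residue sequence would be the way to check the figures reported in this record.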

Secondary structure:

>Translated Secondary Structure
MNNGKSHTQPGNNMPLFCLIPGLGLLRLKKTAQGLFLLFSFLAYLFILVMRFDLVLFSFR
CCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SMLTALTMLIITPDTLREIFSPEILEFWIGSCCLVAVPMLLFFISLRSCKKTVSEKNKPT
HHHHHHHHHHCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
REESSLGKISLQAFMRQSIALYASAIIFVLYSVAFLAPFIAPFSPYDQQDFLVTAYQPPM
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCH
TRLQALILQQPKHLVIPIQKGSDKATELSNSFISDYQKLTSRNEPHALKFVNSYNVQCET
HHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCEEEE
VTYVQGIRTKTIPIAELAAGKDATAKLAVTRTFILGTDQYGRDILSRVIYGSRISLSIGF
EHHHHCCCCCCCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHH
LVVLISVTLGTIIGVSSGYFGGWIDAILMRIVDVLIAFPALFLILIIIAAFGNSIYLIVI
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE
TLSFTGWMGVARIVRSQVLSLKEQEFILAAKSLGLSNMRIIFRHLAPNTLTPVIIAATLR
EEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHH
IGSIILTEAGLSFLGLGVQPPTASWGNIINEGRDSLLNHWWISTFPGIAILTTVVCFNLI
HHHHHHHHHCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
GDGVRDALDPRMRG
HHHHHHHCCCCCCC
>Mature Secondary Structure
MNNGKSHTQPGNNMPLFCLIPGLGLLRLKKTAQGLFLLFSFLAYLFILVMRFDLVLFSFR
CCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SMLTALTMLIITPDTLREIFSPEILEFWIGSCCLVAVPMLLFFISLRSCKKTVSEKNKPT
HHHHHHHHHHCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
REESSLGKISLQAFMRQSIALYASAIIFVLYSVAFLAPFIAPFSPYDQQDFLVTAYQPPM
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCH
TRLQALILQQPKHLVIPIQKGSDKATELSNSFISDYQKLTSRNEPHALKFVNSYNVQCET
HHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCEEEE
VTYVQGIRTKTIPIAELAAGKDATAKLAVTRTFILGTDQYGRDILSRVIYGSRISLSIGF
EHHHHCCCCCCCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHH
LVVLISVTLGTIIGVSSGYFGGWIDAILMRIVDVLIAFPALFLILIIIAAFGNSIYLIVI
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE
TLSFTGWMGVARIVRSQVLSLKEQEFILAAKSLGLSNMRIIFRHLAPNTLTPVIIAATLR
EEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHH
IGSIILTEAGLSFLGLGVQPPTASWGNIINEGRDSLLNHWWISTFPGIAILTTVVCFNLI
HHHHHHHHHCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH
GDGVRDALDPRMRG
HHHHHHHCCCCCCC
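
The per-residue state strings can be summarised as a composition. The sketch below assumes the conventional three-state reading of the letters (H = helix, E = strand, C = coil), which this record does not define. The function name is hypothetical and the demo input is only the final 14-state row.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Final row of the prediction (states for residues GDGVRDALDPRMRG).
print(ss_composition("HHHHHHHCCCCCCC"))  # {'H': 50.0, 'E': 0.0, 'C': 50.0}
```

Concatenating all state rows before calling the function gives the composition over the whole protein.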

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7997159; 9384377 [H]