Definition Prochlorococcus marinus str. MIT 9313 chromosome, complete genome.
Accession NC_005071
Length 2,410,873 bp

The map label for this gene is thf1

Identifier: 33862947

GI number: 33862947

Start: 730396

End: 731127

Strand: Reverse

Name: thf1

Synonym: PMT0675

Alternate gene names: 33862947

Gene position: 731127-730396 (Counterclockwise)

Preceding gene: 33862949

Following gene: 33862944

Centisome position: 30.33
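
The coordinate fields above are related by simple arithmetic. The following Python sketch derives the gene length from Start and End and expresses a coordinate as a percentage of the chromosome length. It assumes 1-based inclusive coordinates, and it assumes the centisome value is taken at the gene's 5' end (731127 on the reverse strand), which is the choice that reproduces the listed 30.33. The function names are illustrative and are not part of the record.

```python
GENOME_LENGTH = 2_410_873  # chromosome length from the record header


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


length = gene_length(730396, 731127)         # bases spanned by the gene
five_prime = round(centisome(731127), 2)     # 5' end on the reverse strand
three_prime = round(centisome(730396), 2)    # 3' end on the reverse strand
```

Under these assumptions the length matches the 732-base sequence header, and the 5' end gives the centisome value shown in the record.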

GC content: 48.77%

Gene sequence:

>732_bases
TTGAGTGACCGAAAGACCATTGCTGACAGCAAACGTGCGTTTAATCATGATTTCCCTCATGTCATCCCCTCGCTGTACAG
ACGTACAACCGATGAACTTCTTGTCGAGCTGCATCTCCTGAGCCATCAGAAGCATTTCCACCCAGACGCACTCTTTGCTA
TTGGCCTTAGCCAAGTTTTTGATGTCTTCACGAGTGGTTACCGACCAGAAGCTCACGTCAAGACGTTGTTCGATGCCCTA
TGTCGAAGCTGCGGCTTCGACCCAAATGCTTTGCGCAAGCAGGCCCAACAAACTCTTGAGTCGGTACGGGGTCACGATCT
TGAGGAGGTCCAAGGCTGGATCCAACAACAAGGCAAAGGGGCTCCTGAAGCTCTCGCCAAGGCATTGCGTAATACTGCTG
GTAGCACCACTTTTCATTACTCACGCCTAATGGCCGTGGGGTTGCTCAGCCTTCTCGCATCTGCCCAAGGTGATGAATCA
TCTGATCCCGAAAAACTAAGCCAAATCGCCCATGAACTAAGTGAATCAGTTGGATTCTCCAAAGCCAGAGTTGAAAAAGA
TCTGAACCTCTACAAGTCCAACCTCGAGAAGATGGCCCAGGCTGTAGAACTAACTGAACAAATTCTAGAATCTGAACGTC
GCAAACGGGAACAAAATGAATCTGCAAAACTCAACACTGGATCATCAGAGCAAATGTCTCAGGGGGTTGAAGCTTGTTCC
AATATCAGCTGA
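
The GC content field can be checked against the gene sequence with a short calculation. The Python sketch below counts G and C bases as a percentage of sequence length; it assumes the record's figure is computed over the whole 732-base gene, and the function name is illustrative. The demonstration uses only the first 12 bases of the sequence, so its result is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence, used only as a short demonstration.
prefix_gc = gc_content("TTGAGTGACCGA")
```

Passing the full sequence, with line breaks removed, gives a value that can be compared with the listed figure.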

Upstream 100 bases:

>100_bases
TTCAATCACAATCGGGATCATGCTGCTCTGCTCGGTTTTCACGATCTTAACGGCGCCGTAGCAGCGCTATGGTCAGGTGC
GAAAAGGTCGGGCCAAGCAC

Downstream 100 bases:

>100_bases
CCTGACATCCACCACCAGTCTGTTTTTTTGAGTTTGGCACCCCGCAAATAACTTAGGTTCATCGTTAAAAACTTAGCCTG
TTTAACTGGGGCTGGCCAGG

Product: Thf1-like protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 243; Mature: 242

Protein sequence:

>243_residues
MSDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTSGYRPEAHVKTLFDAL
CRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDES
SDPEKLSQIAHELSESVGFSKARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACS
NIS
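
The protein sequence follows from the gene sequence by codon translation. The gene starts with TTG, which is read as methionine when it serves as the initiation codon, and ends with the stop codon TGA, so 732 bases give 244 codons and 243 residues. The Python sketch below illustrates this under the assumption of the standard codon assignments with ATG, GTG and TTG accepted as start codons; the names are illustrative, and only the two ends of the gene are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (TTT, TTC, TTA, TTG, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons; each is read as Met at the first position only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_terminus = translate("TTGAGTGACCGAAAGACC")  # first 18 bases of the gene
c_terminus = translate("AATATCAGCTGA")        # last 12 bases, including the stop
```

The two fragments correspond to the first six and last three residues of the protein sequence above. The 242-residue mature form is the same sequence without the initiator methionine.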

Sequences:

>Translated_243_residues
MSDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTSGYRPEAHVKTLFDAL
CRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDES
SDPEKLSQIAHELSESVGFSKARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACS
NIS
>Mature_242_residues
SDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTSGYRPEAHVKTLFDALC
RSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDESS
DPEKLSQIAHELSESVGFSKARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACSN
IS

Specific function: May be involved in photosynthetic membrane biogenesis

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the THF1 family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): THF1_PROMM (Q7V7R3)

Other databases:

- EMBL:   BX548175
- RefSeq:   NP_894507.1
- STRING:   Q7V7R3
- GeneID:   1727933
- GenomeReviews:   BX548175_GR
- KEGG:   pmt:PMT0675
- NMPDR:   fig|74547.1.peg.674
- eggNOG:   NOG08111
- HOGENOM:   HBG634536
- OMA:   RTVSDTK
- ProtClustDB:   PRK13266
- BioCyc:   PMAR74547:PMT0675-MONOMER
- HAMAP:   MF_01843
- InterPro:   IPR017499
- TIGRFAMs:   TIGR03060

Pfam domain/function: PF11264 ThylakoidFormat

EC number: NA

Molecular weight (Da): Translated: 27083; Mature: 26951

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
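
The Cys/Met percentages above can be recomputed from the protein sequence. The Python sketch below counts the residues and divides by the sequence length, rounding to one decimal place. It assumes the mature protein is the translated sequence without its initiator methionine, which is consistent with the 243 and 242 residue counts in this record; the function name is illustrative.

```python
# Translated protein sequence copied from the record (243 residues).
PROTEIN = (
    "MSDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVFDVFTSGYRPEAHVKTLFDAL"
    "CRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKGAPEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDES"
    "SDPEKLSQIAHELSESVGFSKARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACS"
    "NIS"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


mature = PROTEIN[1:]  # assumed: initiator Met removed

translated_values = {r: residue_percent(PROTEIN, r) for r in ("C", "M", "CM")}
mature_values = {r: residue_percent(mature, r) for r in ("C", "M", "CM")}
```

The translated protein contains 3 Cys and 4 Met, and the mature form 3 Cys and 3 Met, which gives the percentages listed above.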

Secondary structure:

>Translated Secondary Structure
MSDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVF
CCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
DVFTSGYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKG
HHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC
APEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFS
CHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCHH
KARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
NIS
CCC
>Mature Secondary Structure 
SDRKTIADSKRAFNHDFPHVIPSLYRRTTDELLVELHLLSHQKHFHPDALFAIGLSQVF
CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
DVFTSGYRPEAHVKTLFDALCRSCGFDPNALRKQAQQTLESVRGHDLEEVQGWIQQQGKG
HHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC
APEALAKALRNTAGSTTFHYSRLMAVGLLSLLASAQGDESSDPEKLSQIAHELSESVGFS
CHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCHH
KARVEKDLNLYKSNLEKMAQAVELTEQILESERRKREQNESAKLNTGSSEQMSQGVEACS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
NIS
CCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 12917642