Definition: Nostoc punctiforme PCC 73102, complete genome.
Accession: NC_010628
Length: 8,234,322

The map label for this gene is Not Available

Identifier: 186682911

GI number:

Start: 3244722

End: 3248939

Strand: Reverse
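
The Start and End coordinates above agree with the 4218-base gene sequence listed in this record, provided both coordinates are 1-based and inclusive. That convention is an assumption; the record does not state it. A minimal sketch of the check, using a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from this record.
gene_length = feature_length(3244722, 3248939)

# Split the length into whole codons; a non-zero remainder would
# indicate a partial codon or a different coordinate convention.
codons, remainder = divmod(gene_length, 3)
print(gene_length, codons, remainder)
```

Under that assumption the span is 4218 bases, which divides evenly into 1406 codons: 1405 amino acids plus one stop codon, matching the 1405-residue protein sequence in this record.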

Name: Not Available

Synonym: Npun_R2610

Alternate gene names: NA

Gene position: NA

Preceding gene: 186682919

Following gene: 186682910

Centisome position: NA

GC content: NA

Gene sequence:

>4218_bases
ATGTTTGATTCAGTTTTTAGTGCTTTCGCTCCCCAATTCCCCAACGATAAATTGATTTTGGGAGAAAAGAAAGTATTAGC
TGATTTAGATAATTTAGAAGCCAAAGGTGCTTTGTTGTTAGCAAACATTCCTTTAGATGCAGCAAGAGCATTTATTGATA
AAGATTTGTTGCAAACCACGACCGCAGATGACGGAAATACCCGACTAAAAAAGAACCAGAAAAACCCGCTACAAGTAAAG
GAATATATGCTATCTGCACTCAAGCGATCGCTGTACTCCATCTGGTTATCCGGCTGGAATACGGGAAGTAACCACGCCGA
TACAGAGGTAAAAAACGCAAACAGGAAAGTCAACTTTAACTCATTCTCTAACCTGATTTATTTTGACGATGCACAAAAAC
CAGGTGTACAAATCCGACCAATCCGCAACATCCCCGCCGAAAGCGCAATTAGGGGACGGATTAACCAGTTAGCACAAGAC
GTAACAAACTCGGAGTACAAAAAGATTCAGCAAGACTTACTCGCAGCAGTCACCCCACAGCCCGACACCAAGCAACCCAT
AAGCAGGAATGAACTACTCAAGCGCATTGAGGAAAGGCTAGGCACTAAATCTGGCAGGTATTCAAAACGCGCTGAAACCA
TTGCTCGTACTGAGTTGACTTTTGCTTACAACGCTGGACGATTAGAGACTTACCGCGAGTCTGGATTAGTTGAAGCAGTT
AGATTTTACACAATAATTGACGAAAGACGGTGTAATATTTGCGCTTCTCGTCACGGCTTGGTTATTCCGTTGAATGATTG
GCAAGCGATCGCGGCTAATACTCCAGCAATTCATCCTCGATGTCGCTGCGTGCTAAGTCCAGTACTCAAAAAATCTCCTG
AACTAAAAGATCGCGATCGCTTTGTACAAAACCGCGAACTAGTCCCGCGCCCAATTTTATGGGCGGCTGCTGGAATTCTT
GCATCTGTACTAGTCGGGAAAACAATCACACCTATTCAAACCATTGCACGAGCCGCCGGACAGACCACGACTGCGGCGGC
GGGTATAACCCTTGCTGAAGCAATTGCTCAAATCCGTTCTGAGATTGGTGAGAAAGCCCCTGTAACTGAACCCGTTGCTG
ATAGCGAAGCGGTAGCGAGTCCTGCCTCCGGCACGCTGCGCGAACGAGCGTCGGAGCAACCCCAAGCGCAGCAACAACAG
GAAGTAGGGGAAACAGTAACACCCTTGGTATTACTAGGGCAACTTGACATCAATACCGCGACTCGTGAAGAATTGCAAAA
AATATTACCAGGGCGATCGCTTACTGTCCGCCAAGTAAATGCGATCCTCAAGCGCCGGGAACAGTCAGCAATTACCAAAA
TTGAGGACTTAAAAAACGTCCCCGAAATTGGAGCAAAAACCTTTGAGCGATTAAAGCAACTCTCAGAAGGGTATCAAATT
ATCCCCTTACTTGACCCTAAAAGCATTCGCACGCCTACACAGTTATGGGCGGCTAATCTGGGGTTGACCAAATCGCAATC
CGAAACCATATTCAACGAACTGCAAAAAGGTTCATTCAAAGATATTGAGGATTTAAAAACTCGACTTAAAGGCAAAGGTA
TAGGCGATCGCACCATTGAGAATATGCAGCAACGCGCTGTAGTTATCCCTCGGCAATTTTCGCCAGTTAAGGAAACTGTA
CCAGTAAGAGAGGCAGGGGGGCAGGGGGGCAGGGGGGCAGGGGAGGATTTATTACCACCAGATGTAAGCGCTGGTAATGC
TATTTCACCACGCGCACCGTTACCTTACTCCACTTCTGGATTACCACCGGAGCGAGTAATACCTCAAAAAGTGCCAACAA
TTGTAACCGATCCTTTGAAACAAGAACTGTCAGGGATTCGGAGTGGTGTTGACGACTTCAATCAAGCATCTGTCAAAACT
GCCAACGATCAGTTAAACAGTAAAGAATTATTTGGTAAGTCACTCAAACAGCAAGCCAGTGACACTCAAAATACTGTTGA
TGGTTTTCGTAGTTTCAATCAGCAAGTTTCTACCGTAGAAGACCAATTACTTGAATGGGATAGACGGTTAGCCAGTGTCG
AGGACAATTACCAAAACCTACTTGACCCCACCAAGCCCAACTACTTTGAACAAGCGCCCGCTAGCATCGCCCAAATCCGC
GCTCAACTTCAAAAAGCTACCAAAACTATTGATGGAGCGATTGAAAATAATAATAAGTATGCTGCCAAAATTAATGAGAA
AGTTGACGCAATAGAGCAGCGAGTAACTAAACTCAACAATCAACGCGCTGCTGTCACTTCCCAAAAAACTGCTGATAATC
TCCGCACCAACCTAGAAGAATTTAATACCAAAATCCAAACTCTAGAATCCCAGGTTTCTAGGCTTCCCCCCGGTGCGGAA
CGCTCCCAAGCATGGCTAGAACTCCAACGCCTCAAGCGTCAGCAGTCTGATGCCATCGACCAAATTAGAACCGTATCAGA
CCAGTTAAATGATTACACCAACCCAGTAATTGACACTGCTAGGGGTGCTATTAACACGATTGAACAAACAAATCAGGCGA
TCGCTTCTACTGCCGAAAAGTTAAAGCAATTACAACAACGACTTTCCAAGCTACCAGTCACTAAAAATCAGCTAGAACCA
GAACTTAGAACCCAGTACGACACTGTAAAAGACCTCCGCACTGTACAGCGCCGAATGCAAAGCAGTGCGAAACTTTACCG
CCGCGCCGATGAAGCGGCAACCCGCAACTTAAATAACCGTGCCACCAATCAAGAAAACTTTTACAACCGCTACGATGAGC
AATATCAAGGCTTTGAGGGTAACACTGCCCCAGTAATTGAGCGGCGATTGTCACAAATCCAAGCCGATATGCGAGAACTA
GAATCAATGCCTGAAGGTACGGTGGCATGGTTGCTTGGTTTCAATGGTTGGGATACTGCCAACATCCCCCGCGATTTATC
TTTAGTTCTTAACGGCGCTCCCACAACTTCCAAGTCACTCGCCCAGATCCGCAAAATGGCATCTGATTTGTGGGATAAAA
CGAAACAGATTGAAGCCAACATTGAAGTTGTTAAACGTAATACTGAGTTTAGGTTTATCAACTCAGACCGCAAAGTTCAG
GGTTTTGACGACTTGCTCAAAAATGCTGAAGACAATTTAACCTACTGGCTTTCAACACGAAACGCCGCTCTCAACCCCAG
CAACATCGGATCATCAAACAATCGATTAACCGAACTACAGCGAGAGATCCAAAAAGCTATTGATAAAAAAGATAACGCCG
CACTCACCGAGCTATTCCGCCAAATCTCTCCAGATGATGCTTATGGATATGCAAGGAAAACTTTGCTTGATATGGAGGAA
TGGAGCAAACGGTATCAAGATGTAATTGGTAGCACTGCAAAACCTGGGGCGATCGCCTCTCAACCCATTACAAAACTTTA
CGACCAATACCAACAAGCTAACGAAACCTTCCTGCAACAGTTAGGTAACTCTCGTGTAAACCAATCCCAAACTATTGAGC
AAGCGTATCAAGATGCCAAGGATATTTACGAGCGCCTACGCTATGACGCTAACCGCATACCCACCTTTGATCTCAAGGGT
AAAGCTACCAGTGCAGAGGAATTACAAAAACAACTACAAGCAACCCAAGAACGATTAAAAGGACTTGTTAACAGCAAACA
ATTTACTAACCCTGCGGTATCAACTAACCCCAAAGTAGTTAAGGCTCAACAACAATTAGATGCCCTTGATTCCACATCTC
GCAACACTTTTAAAGAAATCAACGACCTCAATCAAGAAATAGATTTACTCAAACAACAGGGCAAAGGTACAACTAAAAAT
GAGATTGCACTCGCCCAAAAACGAGATTTATTTGAGCAGACAAGACAGCAGATGAAGGGTGTGGGTGAAGATATCAAAGA
CCTTAAGCTAGGTGATGAAGCCTATCAGCAAATTCAACAACTTAAAAGCAGTACTGAAGAATTTAGAGGGGCGATCGCCT
CTTCTCAATCTCAACTTGCAACCGTCAATCAGCAGTTAGTGAGATTGCAAGAACAGCAGCAAATACTAGGTGGGGGATTG
TCGCAACGAGCAATCAAGCTATCTGATGAAATTGATAGTTTGCAAGCCAAAAGGGTAGATTTAGTTAAAAAAATTGGGGA
TGATTACAGGCAAGTTCAAGAGATGCAAGTACAGATGCGATCTCTAAGAAATCCATAA

Upstream 100 bases:

>100_bases
CTTACAAAATCAGTATCAAGCTCATCTAACCACGGCGCTAACTCTGTGGCGTATAAAGGAATTAGCTTACTTGAGTTTTC
AATAGTTGTCATAAAATATT

Downstream 100 bases:

>100_bases
TTATCCTTTAATGTCTGATTTCTTTTTTGACCTCTTTCTACGTTTTTCTATAAAATCCTTCGGGGATATCTTATAAAGAT
AACAAAGTTGCAATAGTTCT

Product: SPP1 family phage head morphogenesis protein

Products: NA

Alternate protein names: NA

Number of amino acids: 1405

Protein sequence:

>1405_residues
MFDSVFSAFAPQFPNDKLILGEKKVLADLDNLEAKGALLLANIPLDAARAFIDKDLLQTTTADDGNTRLKKNQKNPLQVK
EYMLSALKRSLYSIWLSGWNTGSNHADTEVKNANRKVNFNSFSNLIYFDDAQKPGVQIRPIRNIPAESAIRGRINQLAQD
VTNSEYKKIQQDLLAAVTPQPDTKQPISRNELLKRIEERLGTKSGRYSKRAETIARTELTFAYNAGRLETYRESGLVEAV
RFYTIIDERRCNICASRHGLVIPLNDWQAIAANTPAIHPRCRCVLSPVLKKSPELKDRDRFVQNRELVPRPILWAAAGIL
ASVLVGKTITPIQTIARAAGQTTTAAAGITLAEAIAQIRSEIGEKAPVTEPVADSEAVASPASGTLRERASEQPQAQQQQ
EVGETVTPLVLLGQLDINTATREELQKILPGRSLTVRQVNAILKRREQSAITKIEDLKNVPEIGAKTFERLKQLSEGYQI
IPLLDPKSIRTPTQLWAANLGLTKSQSETIFNELQKGSFKDIEDLKTRLKGKGIGDRTIENMQQRAVVIPRQFSPVKETV
PVREAGGQGGRGAGEDLLPPDVSAGNAISPRAPLPYSTSGLPPERVIPQKVPTIVTDPLKQELSGIRSGVDDFNQASVKT
ANDQLNSKELFGKSLKQQASDTQNTVDGFRSFNQQVSTVEDQLLEWDRRLASVEDNYQNLLDPTKPNYFEQAPASIAQIR
AQLQKATKTIDGAIENNNKYAAKINEKVDAIEQRVTKLNNQRAAVTSQKTADNLRTNLEEFNTKIQTLESQVSRLPPGAE
RSQAWLELQRLKRQQSDAIDQIRTVSDQLNDYTNPVIDTARGAINTIEQTNQAIASTAEKLKQLQQRLSKLPVTKNQLEP
ELRTQYDTVKDLRTVQRRMQSSAKLYRRADEAATRNLNNRATNQENFYNRYDEQYQGFEGNTAPVIERRLSQIQADMREL
ESMPEGTVAWLLGFNGWDTANIPRDLSLVLNGAPTTSKSLAQIRKMASDLWDKTKQIEANIEVVKRNTEFRFINSDRKVQ
GFDDLLKNAEDNLTYWLSTRNAALNPSNIGSSNNRLTELQREIQKAIDKKDNAALTELFRQISPDDAYGYARKTLLDMEE
WSKRYQDVIGSTAKPGAIASQPITKLYDQYQQANETFLQQLGNSRVNQSQTIEQAYQDAKDIYERLRYDANRIPTFDLKG
KATSAEELQKQLQATQERLKGLVNSKQFTNPAVSTNPKVVKAQQQLDALDSTSRNTFKEINDLNQEIDLLKQQGKGTTKN
EIALAQKRDLFEQTRQQMKGVGEDIKDLKLGDEAYQQIQQLKSSTEEFRGAIASSQSQLATVNQQLVRLQEQQQILGGGL
SQRAIKLSDEIDSLQAKRVDLVKKIGDDYRQVQEMQVQMRSLRNP
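
The gene sequence in this record is already written in the coding orientation (it begins with ATG and ends with TAA), so it can be translated directly without reverse-complementing. The following is a minimal sketch of that translation, assuming the standard genetic code; the record does not name a translation table, and the `translate` helper is hypothetical. Only the first 30 and last 12 bases of the gene sequence are used here.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position
# (assumed table; the record does not state which one was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 12 bases of the gene sequence in this record.
n_terminus = translate("ATGTTTGATTCAGTTTTTAGTGCTTTCGCT")
c_terminus = translate("AGAAATCCATAA")
print(n_terminus, c_terminus)
```

The two fragments translate to MFDSVFSAFA and RNP followed by a stop, which agrees with the start and end of the protein sequence above.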

Sequences:
NA

Specific function: NA

COG id: NA

COG function: NA

Gene ontology:

Cell location: NA

Metabolic importance: NA

Operon status: NA

Operon components: NA

Similarity: NA

Homologues:

NA

Paralogues:

NA

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: NA

Theoretical pI: NA

Prosite motif: NA

Important sites: NA

Signals:

NA

Transmembrane regions:

NA

Cys/Met content:

NA

Secondary structure: NA

PDB accession: NA

Resolution: NA

Structure class: NA

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: NA

TargetDB status: NA

Availability: NA

References: NA