Definition Nostoc sp. PCC 7120, complete genome.
Accession NC_003272
Length 6,413,771

The map label for this gene is 17228531

Identifier: 17228531

GI number: 17228531

Start: 1206496

End: 1207446

Strand: Direct

Name: 17228531

Synonym: alr1036

Alternate gene names: NA

Gene position: 1206496-1207446 (Clockwise)

Preceding gene: 17228526

Following gene: 17228532

Centisome position: 18.81
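
The position fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome is the gene start expressed as a percentage of the genome length (this definition is not stated in the record, but it reproduces the listed 18.81). The function names are illustrative only.

```python
GENOME_LENGTH = 6_413_771           # "Length" field of NC_003272
START, END = 1_206_496, 1_207_446   # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                     # 951
print(round(centisome(START, GENOME_LENGTH), 2))   # 18.81
```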

GC content: 42.9

Gene sequence:

>951_bases
ATGTTTTGGAAATGGTGCTTTCGACTCTCAATAGTCTTTGTAGGACTTTGGCTACTCTTGGATTTGAGTTCCCGCTTGGG
AGCAGAAGTTTTTTGGTTTCGAGAAGTTGGTTATCTGCAAGTATTTCTCCTACGGCTGGTGAGTCGAGGGGTTTTATGGG
TGGTTGCTGCGGGTGTAACTGCTGTTTATCTGTGGGGAAATTTAGCTTTGGCGCAACGGCTAAAGTATCCCCGGTCTTTG
AAGATTGCGGAGGTTAGGCGAGAAGAAGCAGAGTTGAGTGTGGGACTGAAAAACTTTCTCAGTCCTCAATATTCTCGGCT
GAATGCGCCTAAGATTAATGATGCTGGACACTTAAAACCTTTCAGATTGCGTTGGCTGCTACCCTTGGCTTTTGTCTTCA
GTTTATTGGCAGGGTTAATTTTAGTTCACTATGGAAAGATAGCTCTGGCTTACTGGTATCCGGCTTTTAACAAAAATAGT
TTACCGATAATTACCCCATTTCGCTTAGAAACTATCTGGGAACTGGGCAGGCAAGTTTTTTCCCAAGTTTTATATCTAGG
TTTGATTGTCGGAATAGCGATCGCTATTCTTATTTACTCACAATTTTTCCTCAGGGCGATCGCTGTTGTTCTCAGTGTTG
TGTTTGGGACAATTCTGTTTTACAACTGGGCAAAGGTTTTACAGTATTTCTTTCCTACACCCTTCAACAGCACTGAGCCT
TTATTTGGGAAAGATATCAGCTTTTATATATTTTCCCTGCCATTGTGGGAACTGTTAGAACTCTGGTTGATGGGGATGTT
TTTGTACGGCTTTATTGCTGTGACTCTGACTTATCTCCTCTCAGCCGACAGTCTCAGTCAAGGAATTTTCCCTGGTTTTT
CACCCCAGCAGCAACGCCATCTCTACGGTATGGGTGGTTTATTAATGTTGATGGTGGCTTTTAGCTATTAG
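
A GC content figure such as the one listed for this gene is the percentage of G and C bases in the sequence. The sketch below shows one way to compute it; `gc_content` is a hypothetical helper, and the record does not say which tool produced its value. The example input is only the first 18 bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)

# First 18 bases of the gene sequence above (7 of 18 are G or C).
print(round(gc_content("ATGTTTTGGAAATGGTGC"), 1))  # 38.9
```

To check the whole gene, join the twelve sequence lines into one string and pass it to the same function.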

Upstream 100 bases:

>100_bases
GCCAGCAGCATAATTACCAGGGAAATTCATAATAAAGAAGAAGGTAAAGGAAAAATTCCTGATTACCTATGCCCAGAGTA
ATTAATTTAGCTGCCCAAGA

Downstream 100 bases:

>100_bases
CTGAGTCGTTATGAGTTGGTTTATTCGCCTCGTGGGGTGAGTTATGGCGCTAGTTACACAGATGTGGTCGTACAGTTACC
AATCTATAACATCTTATGTG

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 316; Mature: 316

Protein sequence:

>316_residues
MFWKWCFRLSIVFVGLWLLLDLSSRLGAEVFWFREVGYLQVFLLRLVSRGVLWVVAAGVTAVYLWGNLALAQRLKYPRSL
KIAEVRREEAELSVGLKNFLSPQYSRLNAPKINDAGHLKPFRLRWLLPLAFVFSLLAGLILVHYGKIALAYWYPAFNKNS
LPIITPFRLETIWELGRQVFSQVLYLGLIVGIAIAILIYSQFFLRAIAVVLSVVFGTILFYNWAKVLQYFFPTPFNSTEP
LFGKDISFYIFSLPLWELLELWLMGMFLYGFIAVTLTYLLSADSLSQGIFPGFSPQQQRHLYGMGGLLMLMVAFSY
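
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses the standard genetic code (the bacterial table assigns the same amino acids to the same codons); `translate` and the table-building code are illustrative, not taken from the record. The two example inputs are the first 30 and last 15 bases of the gene.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTTTGGAAATGGTGCTTTCGACTCTCA"))  # MFWKWCFRLS
print(translate("GCTTTTAGCTATTAG"))                 # AFSY
```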

Sequences:

>Translated_316_residues
MFWKWCFRLSIVFVGLWLLLDLSSRLGAEVFWFREVGYLQVFLLRLVSRGVLWVVAAGVTAVYLWGNLALAQRLKYPRSL
KIAEVRREEAELSVGLKNFLSPQYSRLNAPKINDAGHLKPFRLRWLLPLAFVFSLLAGLILVHYGKIALAYWYPAFNKNS
LPIITPFRLETIWELGRQVFSQVLYLGLIVGIAIAILIYSQFFLRAIAVVLSVVFGTILFYNWAKVLQYFFPTPFNSTEP
LFGKDISFYIFSLPLWELLELWLMGMFLYGFIAVTLTYLLSADSLSQGIFPGFSPQQQRHLYGMGGLLMLMVAFSY
>Mature_316_residues
MFWKWCFRLSIVFVGLWLLLDLSSRLGAEVFWFREVGYLQVFLLRLVSRGVLWVVAAGVTAVYLWGNLALAQRLKYPRSL
KIAEVRREEAELSVGLKNFLSPQYSRLNAPKINDAGHLKPFRLRWLLPLAFVFSLLAGLILVHYGKIALAYWYPAFNKNS
LPIITPFRLETIWELGRQVFSQVLYLGLIVGIAIAILIYSQFFLRAIAVVLSVVFGTILFYNWAKVLQYFFPTPFNSTEP
LFGKDISFYIFSLPLWELLELWLMGMFLYGFIAVTLTYLLSADSLSQGIFPGFSPQQQRHLYGMGGLLMLMVAFSY

Specific function: Unknown

COG id: COG1615

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential)

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UPF0182 family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): P58612 (Y1037_NOSS1)

Other databases:

- EMBL:   BA000019
- PIR:   AB1936
- RefSeq:   NP_485080.1
- STRING:   P58612
- GeneID:   1104631
- GenomeReviews:   BA000019_GR
- KEGG:   ana:alr1037
- eggNOG:   COG1615
- HOGENOM:   HBG538831
- OMA:   AHLRYPE
- BioCyc:   NSP103690:ALR1037-MONOMER
- HAMAP:   MF_01600
- InterPro:   IPR005372

Pfam domain/function: PF03699 UPF0182

EC number: NA

Molecular weight: Translated: 36349; Mature: 36349
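
A molecular weight of this kind is usually estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that approach and a commonly used table of average residue masses; the record does not state which tool or mass table produced its value, so a result may differ from it by a few daltons. `molecular_weight` is a hypothetical helper.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added for the N- and C-terminal groups

def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER

# Small sanity check on the dipeptide Gly-Gly.
print(round(molecular_weight("GG"), 2))  # 132.12
```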

Theoretical pI: Translated: 10.05; Mature: 10.05

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

Nine predicted regions; coordinates not available

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
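
These percentages follow from residue counts: the 316-residue sequence contains one Cys and six Met. The sketch below uses a hypothetical helper, `residue_percent`, and a synthetic stand-in string with the same length and the same Cys and Met counts rather than the real sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)

# Synthetic stand-in: 1 Cys, 6 Met, 309 other residues (316 in total).
stand_in = "C" * 1 + "M" * 6 + "A" * 309

print(residue_percent(stand_in, "C"))    # 0.3
print(residue_percent(stand_in, "M"))    # 1.9
print(residue_percent(stand_in, "CM"))   # 2.2
```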

Secondary structure:

>Translated Secondary Structure
MFWKWCFRLSIVFVGLWLLLDLSSRLGAEVFWFREVGYLQVFLLRLVSRGVLWVVAAGVT
CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH
AVYLWGNLALAQRLKYPRSLKIAEVRREEAELSVGLKNFLSPQYSRLNAPKINDAGHLKP
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCH
FRLRWLLPLAFVFSLLAGLILVHYGKIALAYWYPAFNKNSLPIITPFRLETIWELGRQVF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCEECHHHHHHHHHHHHHHH
SQVLYLGLIVGIAIAILIYSQFFLRAIAVVLSVVFGTILFYNWAKVLQYFFPTPFNSTEP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
LFGKDISFYIFSLPLWELLELWLMGMFLYGFIAVTLTYLLSADSLSQGIFPGFSPQQQRH
CCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHH
LYGMGGLLMLMVAFSY
HHHHHHHHHHHHHHCH
>Mature Secondary Structure
MFWKWCFRLSIVFVGLWLLLDLSSRLGAEVFWFREVGYLQVFLLRLVSRGVLWVVAAGVT
CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH
AVYLWGNLALAQRLKYPRSLKIAEVRREEAELSVGLKNFLSPQYSRLNAPKINDAGHLKP
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCCH
FRLRWLLPLAFVFSLLAGLILVHYGKIALAYWYPAFNKNSLPIITPFRLETIWELGRQVF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCEECHHHHHHHHHHHHHHH
SQVLYLGLIVGIAIAILIYSQFFLRAIAVVLSVVFGTILFYNWAKVLQYFFPTPFNSTEP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC
LFGKDISFYIFSLPLWELLELWLMGMFLYGFIAVTLTYLLSADSLSQGIFPGFSPQQQRH
CCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHH
LYGMGGLLMLMVAFSY
HHHHHHHHHHHHHHCH
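
The prediction strings above can be summarised as the share of residues in each state. The sketch below assumes the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out; `ss_composition` is a hypothetical helper, and the example input is a toy string rather than the full prediction.

```python
from collections import Counter

def ss_composition(ss: str) -> dict:
    """Percentage of positions in each assumed state: H, E and C."""
    counts = Counter(ss.upper())
    total = len(ss)
    return {state: round(100.0 * counts[state] / total, 1) for state in "HEC"}

# Toy example: 6 helix, 2 strand and 2 coil positions.
print(ss_composition("CHHHHHHEEC"))  # {'H': 60.0, 'E': 20.0, 'C': 20.0}
```

A predominantly helical composition is consistent with a structure class of Alpha, although the threshold the record uses for that label is not given.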

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 11759840