Definition: Nitrosomonas eutropha C91, complete genome.
Accession: NC_008344
Length: 2,661,057 bp

The map label for this gene is yqhA [C]

Identifier: 114331491

GI number: 114331491

Start: 1574254

End: 1574874

Strand: Direct

Name: yqhA [C]

Synonym: Neut_1503

Alternate gene names: 114331491

Gene position: 1574254-1574874 (Clockwise)

Preceding gene: 114331490

Following gene: 114331493

Centisome position: 59.16
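
The centisome value can be reproduced from the start coordinate and the genome length listed above. The sketch below assumes the position is defined as start / genome length x 100, rounded to two decimals; this definition is an assumption that happens to match the reported figure, and the function name is hypothetical.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    # Round to two decimals, as the record appears to do.
    return round(100.0 * start / genome_length, 2)


# Values taken from this record: Start and genome Length.
position = centisome_position(1574254, 2661057)
print(position)
```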

GC content: 44.12

Gene sequence:

>621_bases
ATGCAACAGGATCATGATACCCCCCCTCCTTCAAAACGAATATCCTCTATCGCTTACTTCCTATTTCTGTCCCGCTGGTT
ACAGCTACCTTTATATTTGGGGCTGGTACTGGCACAGTGTGTCTATGTATATCACTTTTGGATTGAGCTATCAGATCTGA
TTGGCGCTGTATTTTCTAACCAGAATGCTTTACAGCATGTCCTTGATATGGTGGCAGTCAAAGGAGTAGAAAGAACCGAG
AAGGATCTGACCGAAACAGCCATTATGCTGGTAGTACTCGGCTTGATTGATGTGGTAATGATTTCGAATTTACTCATCAT
GGTGATTATCGGCGGATATGAAACCTTTGTTTCACGCATGAATCTTGAAGGTCATCCGGATCAGCCGGAATGGTTGTCAC
ACGTCAACGCTTCAGTATTAAAGGTCAAACTGGCAACTGCTATTATCGGCATTTCATCGATTCACCTGCTAAAGACATTT
ATCAACGCTACTGCTTATGATGAAAAAACATTGATTGCGCAGACTGCTATCCACCTTGCATTTCTGCTCTCCGCGCTCGC
AATAGCATATTGCGACCGGATCATTTCACAAACTATCCAGCATCCCGGCGAGCACGAATGA

Upstream 100 bases:

>100_bases
TAACGTCGCAGTTTTTTTACTTTGATAACTTTAAAATAACAATCTTCGCATTTTCCTGTTGTCATGGTAATCTTTTGCGC
TTTTCAGGAAATAGATCACC

Downstream 100 bases:

>100_bases
GAGGCTAATATATAAAGTTAGCACGATATCAAAGGTTAATGCTGCGTCCGCCATCGACGGCAATAATTTGCCCGGTAATA
AAAGGAGCATTTTCAATCAG

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 206; Mature: 206
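
The residue count follows from the gene coordinates: the span is inclusive, and the final codon (TGA in the gene sequence above) is a stop codon that is not translated. A minimal sketch, with hypothetical function names, assuming a single uninterrupted reading frame:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate span."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence that ends in one stop codon."""
    codons, remainder = divmod(n_bases, 3)
    if remainder:
        raise ValueError("coding sequence length is not a multiple of 3")
    # The terminal stop codon does not contribute a residue.
    return codons - 1


n_bases = gene_length(1574254, 1574874)
n_residues = protein_length(n_bases)
print(n_bases, n_residues)
```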

Protein sequence:

>206_residues
MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSNQNALQHVLDMVAVKGVERTE
KDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTF
INATAYDEKTLIAQTAIHLAFLLSALAIAYCDRIISQTIQHPGEHE

Sequences:

>Translated_206_residues
MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSNQNALQHVLDMVAVKGVERTE
KDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTF
INATAYDEKTLIAQTAIHLAFLLSALAIAYCDRIISQTIQHPGEHE
>Mature_206_residues
MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSNQNALQHVLDMVAVKGVERTE
KDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTF
INATAYDEKTLIAQTAIHLAFLLSALAIAYCDRIISQTIQHPGEHE

Specific function: Unknown

COG id: COG2862

COG function: function code S; Predicted membrane protein

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UPF0114 family [H]

Homologues:

Organism=Escherichia coli, GI1789376, Length=174, Percent_Identity=33.33, Blast_Score=103, Evalue=8e-24

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005134
- InterPro:   IPR020761
- InterPro:   IPR020765 [H]

Pfam domain/function: PF03350 UPF0114 [H]

EC number: NA

Molecular weight: Translated: 23024; Mature: 23024

Theoretical pI: Translated: 6.01; Mature: 6.01

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
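
The percentages above can be checked directly against the protein sequence. The sketch below counts residues and rounds to one decimal place; the helper name is hypothetical and the rounding rule is an assumption inferred from the reported values.

```python
# Protein sequence copied from this record (206 residues).
PROTEIN = (
    "MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSNQNALQHVLDMVAVKGVERTE"
    "KDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTF"
    "INATAYDEKTLIAQTAIHLAFLLSALAIAYCDRIISQTIQHPGEHE"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residue letters."""
    hits = sum(sequence.count(r) for r in residues)
    return round(100.0 * hits / len(sequence), 1)


cys = residue_percent(PROTEIN, "C")       # 2 Cys in 206 residues
met = residue_percent(PROTEIN, "M")       # 6 Met in 206 residues
cys_met = residue_percent(PROTEIN, "CM")  # 8 Cys+Met in 206 residues
print(cys, met, cys_met)
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.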

Secondary structure:

>Translated Secondary Structure
MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSN
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
QNALQHVLDMVAVKGVERTEKDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRM
CHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTFINATAYDEKTLIAQTAIHLA
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH
FLLSALAIAYCDRIISQTIQHPGEHE
HHHHHHHHHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure
MQQDHDTPPPSKRISSIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWIELSDLIGAVFSN
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
QNALQHVLDMVAVKGVERTEKDLTETAIMLVVLGLIDVVMISNLLIMVIIGGYETFVSRM
CHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
NLEGHPDQPEWLSHVNASVLKVKLATAIIGISSIHLLKTFINATAYDEKTLIAQTAIHLA
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH
FLLSALAIAYCDRIISQTIQHPGEHE
HHHHHHHHHHHHHHHHHHHCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 11248100 [H]