Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is nrfD

Identifier: 157163543

GI number: 157163543

Start: 4322242

End: 4323198

Strand: Direct

Name: nrfD

Synonym: EcHS_A4318

Alternate gene names: 157163543

Gene position: 4322242-4323198 (Clockwise)

Preceding gene: 157163542

Following gene: 157163544

Centisome position: 93.08
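
The coordinate fields above appear to be related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene start expressed as a percentage of the genome length. The function names are illustrative and not part of the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and genome Length.
print(gene_length(4322242, 4323198))          # 957
print(round(centisome(4322242, 4643538), 2))  # 93.08
```

Under these assumptions the results agree with the 957-base gene sequence and the centisome position listed in this record.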

GC content: 57.68

Gene sequence:

>957_bases
ATGACGCAGACTTCCGCATTTCATTTTGAATCGCTGGTGTGGGACTGGCCGATTGCCATCTACCTGTTTTTGATTGGTAT
TTCTGCCGGTCTGGTGACGCTGGCCGTGCTGTTACGTCGCTTCTACCCGCAGGCGGGCGGTGCAGACAGTACGTTGCTGC
GCACCACGCTGATTGTCGGGCCGGGCGCGGTGATCCTCGGTCTGTTGATCCTCGTCTTCCACCTGACAAGACCGTGGACC
TTCTGGAAGCTGATGTTCCACTACAGTTTTACCTCGGTGATGTCGATGGGGGTGATGCTGTTTCAGCTCTACATGGTGGT
GCTGGTGCTGTGGCTGGCGAAAATCTTTGAACATGATTTGCTTGCCCTGCAACAACGCTGGTTGCCGAAGCTGGGGATCG
TGCAAAAGGTTCTGAGCCTGCTGACGCCCGTTCATCGCGGACTGGAAACATTGATGCTGGTGTTGGCGGTGTTGTTGGGG
GCTTATACCGGCTTTCTGCTGTCGGCGCTGAAATCGTATCCGTTCCTCAATAACCCGATCCTGCCGGTGCTGTTCCTCTT
CTCCGGCATCTCGTCCGGTGCGGCGGTGGCGCTGATCGCCATGGCGATACGCCAACGCAGTAACCCGCATTCCACGGAAG
CGCAGTTTGTACACCGTATGGAAATCCCCGTGGTATGGGGTGAAATCTTCCTGCTGGTGGCGTTTTTTGTCGGTCTGGCG
CTGGGCGATGACGGTAAAGTGCGTGCGCTGGTGGCGGCATTAGGTGGCGGTTTCTGGACGTGGTGGTTCTGGCTTGGTGT
CGCCGGGCTGGGGCTGATTGTGCCAATGTTGCTCAAACCGTGGGTCAATCGCAGTTCCGGCATTCCTGCCGTGCTGGCGG
CGTGTGGGGCCAGTCTGGTCGGCGTGTTGATGCTGCGCTTTTTCATTCTCTACGCCGGGCAGTTAACGGTGGCGTAA
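
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence. The helper name `gc_content` is hypothetical, and the demo uses only the first 80 bases of the sequence above, so its result is not expected to match the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record (demo input only).
first_line = (
    "ATGACGCAGACTTCCGCATTTCATTTTGAATCGCTGGTGTGGGACTGGCCGATTGCCATCTACCTGTTTTTGATTGGTAT"
)
print(f"{gc_content(first_line):.2f}")
```

Passing the complete 957-base sequence to the same function would give the value to compare against the GC content field.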

Upstream 100 bases:

>100_bases
CGCAACTGCTGCGCCAGAAGCCTACTTACCGCTACAAGCTGGCGCTGGGAACCAAACCGAAGCTGTACCGCGTACCGTTT
AAATACGGGGAGGTGAGCCA

Downstream 100 bases:

>100_bases
GCCAGAAAAGAGGTGGTTTCTGGACGTATTCCTTCCTGAAGTCGGTTTTCTGGCGTTGTTGTTAAGTCTCGGGGTCAACG
TGTTGACCCCGTTGACGGCC

Product: nrfD protein

Products: oxidized cytochrome C552; NH3 [C]

Alternate protein names: NA

Number of amino acids: Translated: 318; Mature: 317

Protein sequence:

>318_residues
MTQTSAFHFESLVWDWPIAIYLFLIGISAGLVTLAVLLRRFYPQAGGADSTLLRTTLIVGPGAVILGLLILVFHLTRPWT
FWKLMFHYSFTSVMSMGVMLFQLYMVVLVLWLAKIFEHDLLALQQRWLPKLGIVQKVLSLLTPVHRGLETLMLVLAVLLG
AYTGFLLSALKSYPFLNNPILPVLFLFSGISSGAAVALIAMAIRQRSNPHSTEAQFVHRMEIPVVWGEIFLLVAFFVGLA
LGDDGKVRALVAALGGGFWTWWFWLGVAGLGLIVPMLLKPWVNRSSGIPAVLAACGASLVGVLMLRFFILYAGQLTVA

Sequences:

>Translated_318_residues
MTQTSAFHFESLVWDWPIAIYLFLIGISAGLVTLAVLLRRFYPQAGGADSTLLRTTLIVGPGAVILGLLILVFHLTRPWT
FWKLMFHYSFTSVMSMGVMLFQLYMVVLVLWLAKIFEHDLLALQQRWLPKLGIVQKVLSLLTPVHRGLETLMLVLAVLLG
AYTGFLLSALKSYPFLNNPILPVLFLFSGISSGAAVALIAMAIRQRSNPHSTEAQFVHRMEIPVVWGEIFLLVAFFVGLA
LGDDGKVRALVAALGGGFWTWWFWLGVAGLGLIVPMLLKPWVNRSSGIPAVLAACGASLVGVLMLRFFILYAGQLTVA
>Mature_317_residues
TQTSAFHFESLVWDWPIAIYLFLIGISAGLVTLAVLLRRFYPQAGGADSTLLRTTLIVGPGAVILGLLILVFHLTRPWTF
WKLMFHYSFTSVMSMGVMLFQLYMVVLVLWLAKIFEHDLLALQQRWLPKLGIVQKVLSLLTPVHRGLETLMLVLAVLLGA
YTGFLLSALKSYPFLNNPILPVLFLFSGISSGAAVALIAMAIRQRSNPHSTEAQFVHRMEIPVVWGEIFLLVAFFVGLAL
GDDGKVRALVAALGGGFWTWWFWLGVAGLGLIVPMLLKPWVNRSSGIPAVLAACGASLVGVLMLRFFILYAGQLTVA
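
The translated and mature sequences above can be related to the gene sequence with a short sketch. It assumes the standard genetic code and assumes that "Mature" here means the translated protein with the initiator methionine removed, which is consistent with the two sequences shown. The names `translate` and `mature` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initiator methionine (assumed meaning of 'Mature' here)."""
    return protein[1:] if protein.startswith("M") else protein


# First 30 bases of the gene sequence in this record.
start = "ATGACGCAGACTTCCGCATTTCATTTTGAA"
print(translate(start))          # MTQTSAFHFE
print(mature(translate(start)))  # TQTSAFHFE
```

The 957-base gene corresponds to 319 codons, that is 318 residues plus the terminal stop codon, matching the translated length given above.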

Specific function: Probably involved in the transfer of electrons from the quinone pool to the type-c cytochromes

COG id: COG3301

COG function: function code P; Formate-dependent nitrite reductase, membrane component

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the nrfD family

Homologues:

Organism=Escherichia coli, GI1790509, Length=318, Percent_Identity=100, Blast_Score=620, Evalue=1e-179

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NRFD_ECOLI (P32709)

Other databases:

- EMBL:   X72298
- EMBL:   U00006
- EMBL:   U00096
- EMBL:   AP009048
- PIR:   H65215
- RefSeq:   AP_004574.1
- RefSeq:   NP_418497.1
- ProteinModelPortal:   P32709
- STRING:   P32709
- EnsemblBacteria:   EBESCT00000004918
- EnsemblBacteria:   EBESCT00000017959
- GeneID:   948580
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW4034
- KEGG:   eco:b4073
- EchoBASE:   EB1890
- EcoGene:   EG11947
- eggNOG:   COG3301
- GeneTree:   EBGT00050000011934
- HOGENOM:   HBG458904
- OMA:   NANPNAK
- ProtClustDB:   CLSK869938
- BioCyc:   EcoCyc:NRFD-MONOMER
- Genevestigator:   P32709
- InterPro:   IPR017566
- InterPro:   IPR005614
- TIGRFAMs:   TIGR03148

Pfam domain/function: PF03916 NrfD

EC number: NA

Molecular weight: Translated: 35043; Mature: 34912

Theoretical pI: Translated: 10.30; Mature: 10.30

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

NA (8 regions listed; coordinates not available)

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
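
The percentages above can be reproduced as residue counts divided by sequence length. This is a minimal sketch, assuming values are rounded to one decimal place; `residue_percent` is a hypothetical helper, and the demo peptide is just the first ten residues of the translated sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return round(100.0 * hits / len(protein), 1)


# Demo on the first ten residues of the translated sequence.
peptide = "MTQTSAFHFE"
print(residue_percent(peptide, "C"))   # 0.0
print(residue_percent(peptide, "M"))   # 10.0
print(residue_percent(peptide, "CM"))  # 10.0
```

For the full translated protein, 1 Cys and 11 Met in 318 residues give 0.3 % and 3.5 %, and removing the initiator Met leaves 10 Met in 317 residues, or 3.2 %, in line with the table.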

Secondary structure:

>Translated Secondary Structure
MTQTSAFHFESLVWDWPIAIYLFLIGISAGLVTLAVLLRRFYPQAGGADSTLLRTTLIVG
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC
PGAVILGLLILVFHLTRPWTFWKLMFHYSFTSVMSMGVMLFQLYMVVLVLWLAKIFEHDL
CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LALQQRWLPKLGIVQKVLSLLTPVHRGLETLMLVLAVLLGAYTGFLLSALKSYPFLNNPI
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
LPVLFLFSGISSGAAVALIAMAIRQRSNPHSTEAQFVHRMEIPVVWGEIFLLVAFFVGLA
HHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHH
LGDDGKVRALVAALGGGFWTWWFWLGVAGLGLIVPMLLKPWVNRSSGIPAVLAACGASLV
CCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
GVLMLRFFILYAGQLTVA
HHHHHHHHHHHHCCCCCH
>Mature Secondary Structure 
TQTSAFHFESLVWDWPIAIYLFLIGISAGLVTLAVLLRRFYPQAGGADSTLLRTTLIVG
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC
PGAVILGLLILVFHLTRPWTFWKLMFHYSFTSVMSMGVMLFQLYMVVLVLWLAKIFEHDL
CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LALQQRWLPKLGIVQKVLSLLTPVHRGLETLMLVLAVLLGAYTGFLLSALKSYPFLNNPI
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH
LPVLFLFSGISSGAAVALIAMAIRQRSNPHSTEAQFVHRMEIPVVWGEIFLLVAFFVGLA
HHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHH
LGDDGKVRALVAALGGGFWTWWFWLGVAGLGLIVPMLLKPWVNRSSGIPAVLAACGASLV
CCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH
GVLMLRFFILYAGQLTVA
HHHHHHHHHHHHCCCCCH
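
The state strings above can be summarised as helix ranges. The sketch below assumes the common convention that `H` marks a helical residue and `C` a coil residue, which the record itself does not state. The function names are illustrative.

```python
from itertools import groupby


def helix_segments(states: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) ranges of consecutive 'H' states."""
    segments = []
    pos = 1
    for state, run in groupby(states):
        length = len(list(run))
        if state == "H":
            segments.append((pos, pos + length - 1))
        pos += length
    return segments


def helix_fraction(states: str) -> float:
    """Fraction of positions predicted as helix."""
    return states.count("H") / len(states)


# First state line of the translated prediction in this record.
line = "CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC"
print(helix_segments(line))
print(f"{helix_fraction(line):.2f}")
```

Joining all state lines of one prediction before calling these functions would give ranges in full-protein coordinates.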

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: reduced cytochrome C552; nitrite [C]

Specific reaction: reduced cytochrome C552 + nitrite = oxidized cytochrome C552 + NH3 [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 8057835; 8265357; 9278503