Definition | Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence. |
---|---|
Accession | NC_000914 |
Length | 536,165 |
The map label for this gene is rhcV
Identifier: 16520050
GI number: 16520050
Start: 69644
End: 71737
Strand: Reverse
Name: rhcV
Synonym: NGR_a00530
Alternate gene names: 16520050
Gene position: 71737-69644 (Counterclockwise)
Preceding gene: 16520049
Following gene: 16520051
Centisome position: 13.38
GC content: 57.07
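The length and centisome values in this record can be reproduced from the coordinates. The sketch below is illustrative and not part of the source record; the function names are hypothetical. It assumes the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the replicon length, which is the reading consistent with the figures listed here.

```python
# Values copied from the record above.
PLASMID_LENGTH = 536_165
START, END = 69_644, 71_737
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, replicon_length: int) -> float:
    """Position of the gene's first base as a percentage of the replicon.

    Assumption: for a reverse-strand gene the first base is the larger
    coordinate (End); for a forward-strand gene it is Start.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / replicon_length


length = gene_length(START, END)
position = centisome(START, END, STRAND, PLASMID_LENGTH)
print(length, round(position, 2))  # prints: 2094 13.38
```

The 2094 bases correspond to 698 codons, that is, 697 amino acids plus the stop codon, which agrees with the "Number of amino acids" field.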
Gene sequence:
>2094_bases ATGGCCAACGCCCTGCGTAGATTCACCGAGTATGCGCCGGCCAATCCGGACTTGATGGTCGCGTTGATGCTGCTTCTGGC CGTCAGCATGATGGTCATGCCAATACCGGTCATGGCGGTCGATGCGCTGATAGGCTTCAACATGGGTTTGGCTGTACTGC TGCTGATGGCGGCCCTGTATGTCAGCACGCCACTTGATTTTTCCTCTTTGCCCGGCGTCATTTTGCTTTCCACGGTATTC CGTCTGGCGCTCACCGTCGCGACGACGCGACTAATTCTGGCCGAGGGCGAGGCAGGCAGCATCATCCACACGTTCGGCAG TTTTGTGATCTCAGGCAATATTGTCGTGGGTTTCGTTATATTTCTGGTAGTGACCATGGTGCAGTTCATGGTTCTCGCGA AAGGCGCCGAACGCGTGGCAGAAGTGGCGGCGCGCTTCACCCTCGATGCTTTGCCAGGCAAGCAAATGGCGATCGACGCA GAGTTGCGGAACGGTCACATCGATGCCGACGAATCCCGCAGGCGGCGCGCCGCATTAGAAAAAGAAAGCAAACTTTATGG GGCGATGGACGGCGCGATGAAGTTTGTGAAGGGTGATTCCATCGCCGGGCTAGTGGTTATCTGCATCAACATGCTGGGTG GAATTTCCATCGGCCTGCTCTCGAAGGGCATGTCGTTCGCCCAGGTGCTGCATCACTACACTCTGCTGACGATAGGTGAT GCGTTAATCTCGCAGATTCCCGCCCTGCTGCTCTCAATTACAGCGGCAACCATGGTTACTCGTGTAACTGGGGCTTCGAA ACTCAACCTCGGTGAGGACATAGCCAATCAACTCACCGCCAGTACACGAGCATTGCGGTTGGCGGCCTGCGTCCTGGTGG CCATGGGCTTCGTTCCTGGTTTCCCTCTGCCTGTCTTTTTTATGTTGGCCGCAGTCTTCGCGGCGGCAAGCTTCGTCAAA GGTGACGTCCTAGATGCCGACAAAGTCGATGCTACAACTGTAACTCCGGCGGAGTCTCAAACGCCAAACGTGGCTGCGCA GCCAAATCCCATTGGCGTCTTCCTCGCGCCGAGCCTTACGAATGCGATCGACCAGGTCGAATTGCGGCAGCACATTGCGC GTATTTCCCAACTAGTCTCGGCCGATCTCGGCATTATCGTTCCTCCGATCCCAGTCGATGTCGACCAGCAGCTGCCCGAG TCGCAATTCAGGATAGATGTCGAAGGCGTGCCAGTCGAACAGGATTTGATTAATCCGGCGCAGCTGTCCCTCGCAGACGA TCTGAAGAAGATTGAGTCAAGCGGCATCCCTTTTCGGCATGATCCTGAAACCCACAGAATTTGGGTTGAACAAAGCCACG AGCCGGCGCTCAAAGCCGCCGGTATCCGGCATCATAGTCCCAGCGAACTCCTTGCGATGCGTGTCCATGCGACGTTGACT TGCCATGCGCCGCGCTTGGTGGGTATCCAAGAGACCCGCCAACTACTGGGCCGGATGGAGCAGGAATACTCTGATCTGGT GAAGGAGGTGCTGCGTACCACGCCGATCCCCCGGATTGCAGATGTGCTGCGCCGCCTCTTGGGCGAAGGTATACCAATCC GAAATACCCGGCTCGTCTTGGAGGCATTGGCCGAATGGAGCGAACGTGAGCAAAACGTCGCCCTGCTCACGGAACACGTT CGTTCTGGAATGAAGCGGCAGATCTGTCACCGCTATGGCAGACACGGTGTCCTACCTGCCTTCGTCATGGAACGTGAGAC TGAGGATGTGGTGCGCTGCGCGGTTCGGGAAACGGCTGCAGGCCCCTACCTCGCACTAGAGGATCGGCAAAGCGAGGCGC TGCTGTCACAGATGCGGCAGGTCTTTTCGAGCACGGCACCGGGCCAGACGCGCCCGATCGTCTTAACTTCAATGGATGTC 
CGACGCTTCGTCCGCGGTTTTCTTACCCGAAACGGTATCGAGCTTGCCGTACTGTCTTATCAGGACCTCGCCTCCGATTT TAAAATTCAACCCGTCGGATCCATCAGGCTCCCGCCCAGTAATGGAACGTCAGGGGAACCTCGCAGTATCCGCCCTTCTG CCACTACTGGATGA
Upstream 100 bases:
>100_bases CCGCAGGGGTCGGCCTCACGGACGACGCGCCTGCGGTCTTAGAGCTACGTTGCACCTGTGTAGCAGCAGAGTGGTTCAGA AACGATCGTGAAGGGACCTC
Downstream 100 bases:
>100_bases TAGGTGAGAACCAAACATGAATTCACATGAGAATAGAGTCGCCGCGCCGTTGCTTTCATTTCGTTTGAGCTTAGTTCTCT TCGCAGTACTCTCCGTTCTG
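Sequence lines in this record carry a `>N_bases` header followed by space-separated chunks. A minimal sketch for reading such a line, checking the declared length, and computing GC percentage is given below. The helper names are hypothetical, and it is an assumption that the record's "GC content" field was computed in the same way over the full gene sequence.

```python
def parse_record_sequence(line: str) -> tuple[int, str]:
    """Split a '>N_bases SEQ SEQ ...' line into (declared_length, sequence)."""
    header, *chunks = line.split()
    declared = int(header.lstrip(">").split("_")[0])
    sequence = "".join(chunks).upper()
    if len(sequence) != declared:
        raise ValueError(
            f"header declares {declared} bases, found {len(sequence)}"
        )
    return declared, sequence


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / len(sequence)


# The upstream 100 bases, copied from the record above.
upstream = (
    ">100_bases "
    "CCGCAGGGGTCGGCCTCACGGACGACGCGCCTGCGGTCTTAGAGCTACGTTGCACCTGTGTAGCAGCAGAGTGGTTCAGA "
    "AACGATCGTGAAGGGACCTC"
)
declared, seq = parse_record_sequence(upstream)
print(declared, f"{gc_percent(seq):.2f}")
```

The length check is useful here because the longer sequences in this record are wrapped across lines, and a dropped chunk would otherwise go unnoticed.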
Product: inner membrane protein RhcV; component of type III secretion apparatus
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 697; Mature: 696
Protein sequence:
>697_residues MANALRRFTEYAPANPDLMVALMLLLAVSMMVMPIPVMAVDALIGFNMGLAVLLLMAALYVSTPLDFSSLPGVILLSTVF RLALTVATTRLILAEGEAGSIIHTFGSFVISGNIVVGFVIFLVVTMVQFMVLAKGAERVAEVAARFTLDALPGKQMAIDA ELRNGHIDADESRRRRAALEKESKLYGAMDGAMKFVKGDSIAGLVVICINMLGGISIGLLSKGMSFAQVLHHYTLLTIGD ALISQIPALLLSITAATMVTRVTGASKLNLGEDIANQLTASTRALRLAACVLVAMGFVPGFPLPVFFMLAAVFAAASFVK GDVLDADKVDATTVTPAESQTPNVAAQPNPIGVFLAPSLTNAIDQVELRQHIARISQLVSADLGIIVPPIPVDVDQQLPE SQFRIDVEGVPVEQDLINPAQLSLADDLKKIESSGIPFRHDPETHRIWVEQSHEPALKAAGIRHHSPSELLAMRVHATLT CHAPRLVGIQETRQLLGRMEQEYSDLVKEVLRTTPIPRIADVLRRLLGEGIPIRNTRLVLEALAEWSEREQNVALLTEHV RSGMKRQICHRYGRHGVLPAFVMERETEDVVRCAVRETAAGPYLALEDRQSEALLSQMRQVFSSTAPGQTRPIVLTSMDV RRFVRGFLTRNGIELAVLSYQDLASDFKIQPVGSIRLPPSNGTSGEPRSIRPSATTG
Sequences:
>Translated_697_residues MANALRRFTEYAPANPDLMVALMLLLAVSMMVMPIPVMAVDALIGFNMGLAVLLLMAALYVSTPLDFSSLPGVILLSTVF RLALTVATTRLILAEGEAGSIIHTFGSFVISGNIVVGFVIFLVVTMVQFMVLAKGAERVAEVAARFTLDALPGKQMAIDA ELRNGHIDADESRRRRAALEKESKLYGAMDGAMKFVKGDSIAGLVVICINMLGGISIGLLSKGMSFAQVLHHYTLLTIGD ALISQIPALLLSITAATMVTRVTGASKLNLGEDIANQLTASTRALRLAACVLVAMGFVPGFPLPVFFMLAAVFAAASFVK GDVLDADKVDATTVTPAESQTPNVAAQPNPIGVFLAPSLTNAIDQVELRQHIARISQLVSADLGIIVPPIPVDVDQQLPE SQFRIDVEGVPVEQDLINPAQLSLADDLKKIESSGIPFRHDPETHRIWVEQSHEPALKAAGIRHHSPSELLAMRVHATLT CHAPRLVGIQETRQLLGRMEQEYSDLVKEVLRTTPIPRIADVLRRLLGEGIPIRNTRLVLEALAEWSEREQNVALLTEHV RSGMKRQICHRYGRHGVLPAFVMERETEDVVRCAVRETAAGPYLALEDRQSEALLSQMRQVFSSTAPGQTRPIVLTSMDV RRFVRGFLTRNGIELAVLSYQDLASDFKIQPVGSIRLPPSNGTSGEPRSIRPSATTG >Mature_696_residues ANALRRFTEYAPANPDLMVALMLLLAVSMMVMPIPVMAVDALIGFNMGLAVLLLMAALYVSTPLDFSSLPGVILLSTVFR LALTVATTRLILAEGEAGSIIHTFGSFVISGNIVVGFVIFLVVTMVQFMVLAKGAERVAEVAARFTLDALPGKQMAIDAE LRNGHIDADESRRRRAALEKESKLYGAMDGAMKFVKGDSIAGLVVICINMLGGISIGLLSKGMSFAQVLHHYTLLTIGDA LISQIPALLLSITAATMVTRVTGASKLNLGEDIANQLTASTRALRLAACVLVAMGFVPGFPLPVFFMLAAVFAAASFVKG DVLDADKVDATTVTPAESQTPNVAAQPNPIGVFLAPSLTNAIDQVELRQHIARISQLVSADLGIIVPPIPVDVDQQLPES QFRIDVEGVPVEQDLINPAQLSLADDLKKIESSGIPFRHDPETHRIWVEQSHEPALKAAGIRHHSPSELLAMRVHATLTC HAPRLVGIQETRQLLGRMEQEYSDLVKEVLRTTPIPRIADVLRRLLGEGIPIRNTRLVLEALAEWSEREQNVALLTEHVR SGMKRQICHRYGRHGVLPAFVMERETEDVVRCAVRETAAGPYLALEDRQSEALLSQMRQVFSSTAPGQTRPIVLTSMDVR RFVRGFLTRNGIELAVLSYQDLASDFKIQPVGSIRLPPSNGTSGEPRSIRPSATTG
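The two sequences above differ only at the N-terminus: the mature form lacks the initiator methionine. A minimal sketch of that relationship follows, using the first 20 residues of the translated sequence. The function name is hypothetical, and it assumes that initiator Met removal is the only processing step reflected in the record.

```python
def mature_from_translated(translated: str) -> str:
    """Drop a leading initiator Met, if present.

    Assumption: no other processing (e.g. signal peptide cleavage) applies;
    the record lists 'Signals: None'.
    """
    return translated[1:] if translated.startswith("M") else translated


# First 20 residues of the translated sequence in the record.
n_terminal = "MANALRRFTEYAPANPDLMV"
mature_n_terminal = mature_from_translated(n_terminal)
print(mature_n_terminal)  # prints: ANALRRFTEYAPANPDLMV

# Residue counts from the record: 697 translated, 696 mature.
TRANSLATED_LENGTH = 697
print(TRANSLATED_LENGTH - 1)  # prints: 696
```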
Specific function: Could be involved in the secretion of an unknown factor
COG id: COG4789
COG function: function code U; Type III secretory pathway, component EscV
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential)
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the FHIPEP (flagella/HR/invasion proteins export pore) family
Homologues:
Organism=Escherichia coli, GI1788187, Length=678, Percent_Identity=33.6283185840708, Blast_Score=327, Evalue=1e-90
Paralogues:
None
Copy number: 10-20 (rich media) [C]
Swissprot (AC and ID): Y4YR_RHISN (P55726)
Other databases:
- EMBL: U00090
- RefSeq: NP_444170.1
- GeneID: 962238
- GenomeReviews: U00090_GR
- KEGG: rhi:NGR_a00530
- HOGENOM: HBG595253
- ProtClustDB: CLSK506585
- InterPro: IPR001712
- InterPro: IPR006302
- PRINTS: PR00949
- TIGRFAMs: TIGR01399
Pfam domain/function: PF00771 FHIPEP
EC number: NA
Molecular weight: Translated: 75527; Mature: 75396
Theoretical pI: Translated: 6.80; Mature: 6.80
Prosite motif: PS00994 FHIPEP
Important sites: NA
Signals:
None
Transmembrane regions:
Cys/Met content:
Translated protein: 0.7 %Cys; 3.6 %Met; 4.3 %Cys+Met
Mature protein: 0.7 %Cys; 3.4 %Met; 4.2 %Cys+Met
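Percentages like these are residue counts divided by sequence length. The sketch below shows the calculation on a short hypothetical peptide, not on the RhcV sequence. The function name and the rounding to one decimal place are assumptions chosen to match how the values are displayed in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# Hypothetical 10-residue peptide used only for illustration.
toy_translated = "MKCAMLGTWC"
toy_mature = toy_translated[1:]  # initiator Met removed

for label, seq in (("Translated", toy_translated), ("Mature", toy_mature)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
```

Removing the initiator Met lowers the Met count by one and the length by one, which is why the Met and Cys+Met percentages in the record differ slightly between the translated and mature forms while the Cys percentage does not.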
Secondary structure:
>Translated Secondary Structure MANALRRFTEYAPANPDLMVALMLLLAVSMMVMPIPVMAVDALIGFNMGLAVLLLMAALY CCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHH VSTPLDFSSLPGVILLSTVFRLALTVATTRLILAEGEAGSIIHTFGSFVISGNIVVGFVI HCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHEEECCCCCCHHHHHHHHHHCCCHHHHHHH FLVVTMVQFMVLAKGAERVAEVAARFTLDALPGKQMAIDAELRNGHIDADESRRRRAALE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHCCCCCCCCHHHHHHHHHH KESKLYGAMDGAMKFVKGDSIAGLVVICINMLGGISIGLLSKGMSFAQVLHHYTLLTIGD HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHHHHHHHH ALISQIPALLLSITAATMVTRVTGASKLNLGEDIANQLTASTRALRLAACVLVAMGFVPG HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC FPLPVFFMLAAVFAAASFVKGDVLDADKVDATTVTPAESQTPNVAAQPNPIGVFLAPSLT CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCEEEEECCHHH NAIDQVELRQHIARISQLVSADLGIIVPPIPVDVDQQLPESQFRIDVEGVPVEQDLINPA HHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCHHCCCCCCEEEEECCCCCHHHHCCHH QLSLADDLKKIESSGIPFRHDPETHRIWVEQSHEPALKAAGIRHHSPSELLAMRVHATLT HHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCHHHHCCCCCCCHHHHHHHHHHHEEE CHAPRLVGIQETRQLLGRMEQEYSDLVKEVLRTTPIPRIADVLRRLLGEGIPIRNTRLVL ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHH EALAEWSEREQNVALLTEHVRSGMKRQICHRYGRHGVLPAFVMERETEDVVRCAVRETAA HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCHHHHHHHHHHHHCC GPYLALEDRQSEALLSQMRQVFSSTAPGQTRPIVLTSMDVRRFVRGFLTRNGIELAVLSY CCEEEECCCHHHHHHHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHHCCCCEEEEEEH QDLASDFKIQPVGSIRLPPSNGTSGEPRSIRPSATTG HHHHCCCEECCCCCEECCCCCCCCCCCCCCCCCCCCH >Mature Secondary Structure ANALRRFTEYAPANPDLMVALMLLLAVSMMVMPIPVMAVDALIGFNMGLAVLLLMAALY CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHH VSTPLDFSSLPGVILLSTVFRLALTVATTRLILAEGEAGSIIHTFGSFVISGNIVVGFVI HCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHEEECCCCCCHHHHHHHHHHCCCHHHHHHH FLVVTMVQFMVLAKGAERVAEVAARFTLDALPGKQMAIDAELRNGHIDADESRRRRAALE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHCCCCCCCCHHHHHHHHHH KESKLYGAMDGAMKFVKGDSIAGLVVICINMLGGISIGLLSKGMSFAQVLHHYTLLTIGD HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHHHHHHHH 
ALISQIPALLLSITAATMVTRVTGASKLNLGEDIANQLTASTRALRLAACVLVAMGFVPG HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC FPLPVFFMLAAVFAAASFVKGDVLDADKVDATTVTPAESQTPNVAAQPNPIGVFLAPSLT CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCEEEEECCHHH NAIDQVELRQHIARISQLVSADLGIIVPPIPVDVDQQLPESQFRIDVEGVPVEQDLINPA HHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCHHCCCCCCEEEEECCCCCHHHHCCHH QLSLADDLKKIESSGIPFRHDPETHRIWVEQSHEPALKAAGIRHHSPSELLAMRVHATLT HHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCHHHHCCCCCCCHHHHHHHHHHHEEE CHAPRLVGIQETRQLLGRMEQEYSDLVKEVLRTTPIPRIADVLRRLLGEGIPIRNTRLVL ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHH EALAEWSEREQNVALLTEHVRSGMKRQICHRYGRHGVLPAFVMERETEDVVRCAVRETAA HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCHHHHHHHHHHHHCC GPYLALEDRQSEALLSQMRQVFSSTAPGQTRPIVLTSMDVRRFVRGFLTRNGIELAVLSY CCEEEECCCHHHHHHHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHHCCCCEEEEEEH QDLASDFKIQPVGSIRLPPSNGTSGEPRSIRPSATTG HHHHCCCEECCCCCEECCCCCCCCCCCCCCCCCCCCH
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9163424