| Definition | Nitrosospira multiformis ATCC 25196 chromosome, complete genome. |
|---|---|
| Accession | NC_007614 |
| Length | 3,184,243 |
The map label for this gene is gspD [H]
Identifier: 82703023
GI number: 82703023
Start: 2189733
End: 2192102
Strand: Direct
Name: gspD [H]
Synonym: Nmul_A1902
Alternate gene names: 82703023
Gene position: 2189733-2192102 (Clockwise)
Preceding gene: 82703022
Following gene: 82703024
Centisome position: 68.77
GC content: 56.84
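The positional fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state either definition, and the function and constant names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 3_184_243
GENE_START = 2_189_733
GENE_END = 2_192_102


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def centisome_position(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
position = round(centisome_position(GENE_START, GENOME_LENGTH), 2)
print(length)    # 2370
print(position)  # 68.77
```

Under these assumptions, the results agree with the 2370-base gene sequence header and the reported centisome position of 68.77.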
Gene sequence:
>2370_bases ATGGGCTACAGAGGTAAAATCGTGCGGCATCGCTCGCTATGTGCAGTTTTTTTATGGATGGCAGTGGCAGGCTGGGTTAC CGAGGCTTGGGCCGTAAATCCCTCTGCCACCCTGCCCGTCGAGGAAATTGGCGTCCCGCCGAGCGTCCTGCCGGAAGCGG ATTCCCTGCGCGCCAGTAAATCCAGGTCTTCTGCCGGTCAGGATCTCGTTACACTCAATTTCGTCAATGCCGATATTGAA GGGGTAGTGAAAGCAGTCAGCGAAATTACCCGAAAAAATTTCATGCTCGACCCCCGCGTCAAGGGTACGATCAACATCGT TTCAGCCAAGCCAGTGCCGAGGTCGTCGGTCTACGAAGTATTCCTGTCGGCACTGCGATTGCATGGATATGCAGTCGTTG AGGATTACGGCATCATCAGGATCGTTCCGGAAAGTGATGCCAAGCTATACCAGGGCCCGACACTTGGTCCCACGAACAAG CGGCAGCTCGCGGGCGACCGTATCCAGACACAGGTATTCACGCTGCAGTACGAATCCGCGGTGCAGATGGTGCCGATCCT GCGTCCGCTGATCGCTCCAAACAACAGTATCACTGCAAATCCCAACAGTAACACCCTGGTTATTACAGACTACGCGAGCA ATCTCCAACGCCTGGCGAAAATAATTGATTCGGTGGATCAGCCAAGTGGAACCGAGCCTGTCTCGATACCCCTTCAGCAC GCCTCGGCGATCGATGTCGCGCAAACCGTGAACCGGCTGTTTTCAGAATCGACGCAGTCCCAGGCCGAGGGCGCCGCGGA CCCCACCCAGCAGCGTTTCACAGTCGTCGCCGACGCCCGCTCGAATACCCTCCTTGCACGCTCCGGAAACCGGGCAGCGC TTGCGCGTCTGCGCCAGCTGGTAACAGTGCTCGATTCTCCCACCAGCGCTGCCGGCAACATGCACGTCGTCTTTCTCAAG AATGCCGATGCAGTCAGGCTTGCCGAAACCCTCAGGGCGATCTATCACAACATGGCGTCCCCGGTTTCCTCATCTTCGGG ACTGAGCCAGGGCACCGGCACAGCTTTTGGAACATCTTCCCTGGGTACATCCACCGGCGGGGGGATGGGTGCCTCGTCAG GCACCTCAACAGGGGGGTCGATGGGCACTTCCATGCCCGGTTCCAGCCTTGGTGCGGGGACTGTTCCCGCTGCTTCCACC GTCACCCCGGCTCCGATGCAAACTGGCGCAACTTCCGCCACCCCCGGCATCATTCAGGCGGATGCAGCCACCAACTCGAT CATTATTACTGCCCCGGATGCTATTTATAATAATTTGCGCGCGGTGGTGGAGAAGCTCGACGTGCGCCGCGTGCAGGTTT ATATTGAAGCGCTGATTGCCGAAATCACTGCCGACAGAGCCGCGGAATTCGGCATCCAGTGGCAGAATCTGAGCAATGCC GCGCAAGGTGGCACCCAGGTTTTCGGTGGCACCAACTTCAATGCCGGCACTGCCGGAGGCGGCAGTATCATCTCCACCGC CCAGAATCCGATAGCGAATGCAGCCTCCGGTCTGACTATCGGCATCATGAATGGCCTTGTCACGGCTATTCCCGGCATCG GCCCTGTTCTCAACATTCATACGCTCATCCGCGCGCTGGAAACGGATGCCAATGCCAACATTCTTTCCACCCCCACCCTG CTGACACTGAATAACGAAGAAGCCAGGATCATCATCGGGCAGAACGTTCCGATTCCCACCGGCCAATTCATTCCGCCAGT AGGAGGCGCCGTTACCTCCCCGTTTCAAACCGTTTCACGCCAGGACGTGGGACTATCATTGAAGATCAAGCCCCTTATCT CGGAAGGCAATACTGTCCGTGTGCAGATTTTTCAGGAAGTCTCGAGCGTCGTTCCTGGCACGGTCAACGCCACCAACGGG 
TTGATTACCAACAAACGCTCGATAGAATCGACAGTGCTGGTTGACGACGGGCAGATTCTCGTGCTCGGCGGTCTGATGCA GGATTCGGTAAATGACTCGGTTGAAAGAATTCCACTGGTCGGGGCGATTCCGCTGTTCGGACAATTGTTCAGTTACAACA AGCGCTCACGCAACAAAACCAATCTGATGGTATTCCTGCGGCCGACGCTGATGCGCGCGGGCGACGCCGCCGATCCGCTT TCTGACGCACAGTACGATCGGGTGCTGGGCGAACAGAAAAAAGTGAGACCCAAGTTTAATCTTGTGCTTCCGGATATGGA ATCGCCTACTTTGCCGCCGCGTCAACCACCTCCTGTCATCCTTGATGACAGCATCACTCCCGATGATCCCGGAATTTCCA ATGTTCAAGGCAACTGGGATACCGGGGGAGTGATGGATAATACACCCTGA
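A GC content figure like the one reported above can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is supplied in the same layout as this record: a header token beginning with ">" followed by whitespace-separated blocks of bases. The function name is hypothetical, and the example runs on a toy sequence rather than on the gene itself.

```python
def gc_content(text: str) -> float:
    """Percentage of G and C bases in a whitespace-separated sequence.

    Tokens starting with '>' are treated as header labels and skipped
    (an assumption based on the layout of this record).
    """
    tokens = [t for t in text.split() if not t.startswith(">")]
    seq = "".join(tokens).upper()
    if not seq:
        raise ValueError("no sequence found")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: 8 bases, 6 of which are G or C.
example = ">8_bases ATGC GGCC"
print(round(gc_content(example), 2))  # 75.0
```

Passing the full gene sequence text to `gc_content` should give a value comparable to the reported 56.84, although that result has not been verified here.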
Upstream 100 bases:
>100_bases GCAGCGTCAGAATCCGCCTGCAGACCCTGCTCGTCCGGTGGAGACAGACCTGCCTGTCGAGCTTCACGCGCCGGTCATAA CTGAGTGATGTCGTCATCTC
Downstream 100 bases:
>100_bases AACCGGGCCGCAATCATGGCGTCTGACAGGATTCCTTATGTCTTTGTCAAGACAAATGGAGTGGCCGTGACGAGCGTCAC GAGCGATCACGCAGAGGTGG
Product: type II and III secretion system protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 789; Mature: 788
Protein sequence:
>789_residues MGYRGKIVRHRSLCAVFLWMAVAGWVTEAWAVNPSATLPVEEIGVPPSVLPEADSLRASKSRSSAGQDLVTLNFVNADIE GVVKAVSEITRKNFMLDPRVKGTINIVSAKPVPRSSVYEVFLSALRLHGYAVVEDYGIIRIVPESDAKLYQGPTLGPTNK RQLAGDRIQTQVFTLQYESAVQMVPILRPLIAPNNSITANPNSNTLVITDYASNLQRLAKIIDSVDQPSGTEPVSIPLQH ASAIDVAQTVNRLFSESTQSQAEGAADPTQQRFTVVADARSNTLLARSGNRAALARLRQLVTVLDSPTSAAGNMHVVFLK NADAVRLAETLRAIYHNMASPVSSSSGLSQGTGTAFGTSSLGTSTGGGMGASSGTSTGGSMGTSMPGSSLGAGTVPAAST VTPAPMQTGATSATPGIIQADAATNSIIITAPDAIYNNLRAVVEKLDVRRVQVYIEALIAEITADRAAEFGIQWQNLSNA AQGGTQVFGGTNFNAGTAGGGSIISTAQNPIANAASGLTIGIMNGLVTAIPGIGPVLNIHTLIRALETDANANILSTPTL LTLNNEEARIIIGQNVPIPTGQFIPPVGGAVTSPFQTVSRQDVGLSLKIKPLISEGNTVRVQIFQEVSSVVPGTVNATNG LITNKRSIESTVLVDDGQILVLGGLMQDSVNDSVERIPLVGAIPLFGQLFSYNKRSRNKTNLMVFLRPTLMRAGDAADPL SDAQYDRVLGEQKKVRPKFNLVLPDMESPTLPPRQPPPVILDDSITPDDPGISNVQGNWDTGGVMDNTP
Sequences:
>Translated_789_residues MGYRGKIVRHRSLCAVFLWMAVAGWVTEAWAVNPSATLPVEEIGVPPSVLPEADSLRASKSRSSAGQDLVTLNFVNADIE GVVKAVSEITRKNFMLDPRVKGTINIVSAKPVPRSSVYEVFLSALRLHGYAVVEDYGIIRIVPESDAKLYQGPTLGPTNK RQLAGDRIQTQVFTLQYESAVQMVPILRPLIAPNNSITANPNSNTLVITDYASNLQRLAKIIDSVDQPSGTEPVSIPLQH ASAIDVAQTVNRLFSESTQSQAEGAADPTQQRFTVVADARSNTLLARSGNRAALARLRQLVTVLDSPTSAAGNMHVVFLK NADAVRLAETLRAIYHNMASPVSSSSGLSQGTGTAFGTSSLGTSTGGGMGASSGTSTGGSMGTSMPGSSLGAGTVPAAST VTPAPMQTGATSATPGIIQADAATNSIIITAPDAIYNNLRAVVEKLDVRRVQVYIEALIAEITADRAAEFGIQWQNLSNA AQGGTQVFGGTNFNAGTAGGGSIISTAQNPIANAASGLTIGIMNGLVTAIPGIGPVLNIHTLIRALETDANANILSTPTL LTLNNEEARIIIGQNVPIPTGQFIPPVGGAVTSPFQTVSRQDVGLSLKIKPLISEGNTVRVQIFQEVSSVVPGTVNATNG LITNKRSIESTVLVDDGQILVLGGLMQDSVNDSVERIPLVGAIPLFGQLFSYNKRSRNKTNLMVFLRPTLMRAGDAADPL SDAQYDRVLGEQKKVRPKFNLVLPDMESPTLPPRQPPPVILDDSITPDDPGISNVQGNWDTGGVMDNTP >Mature_788_residues GYRGKIVRHRSLCAVFLWMAVAGWVTEAWAVNPSATLPVEEIGVPPSVLPEADSLRASKSRSSAGQDLVTLNFVNADIEG VVKAVSEITRKNFMLDPRVKGTINIVSAKPVPRSSVYEVFLSALRLHGYAVVEDYGIIRIVPESDAKLYQGPTLGPTNKR QLAGDRIQTQVFTLQYESAVQMVPILRPLIAPNNSITANPNSNTLVITDYASNLQRLAKIIDSVDQPSGTEPVSIPLQHA SAIDVAQTVNRLFSESTQSQAEGAADPTQQRFTVVADARSNTLLARSGNRAALARLRQLVTVLDSPTSAAGNMHVVFLKN ADAVRLAETLRAIYHNMASPVSSSSGLSQGTGTAFGTSSLGTSTGGGMGASSGTSTGGSMGTSMPGSSLGAGTVPAASTV TPAPMQTGATSATPGIIQADAATNSIIITAPDAIYNNLRAVVEKLDVRRVQVYIEALIAEITADRAAEFGIQWQNLSNAA QGGTQVFGGTNFNAGTAGGGSIISTAQNPIANAASGLTIGIMNGLVTAIPGIGPVLNIHTLIRALETDANANILSTPTLL TLNNEEARIIIGQNVPIPTGQFIPPVGGAVTSPFQTVSRQDVGLSLKIKPLISEGNTVRVQIFQEVSSVVPGTVNATNGL ITNKRSIESTVLVDDGQILVLGGLMQDSVNDSVERIPLVGAIPLFGQLFSYNKRSRNKTNLMVFLRPTLMRAGDAADPLS DAQYDRVLGEQKKVRPKFNLVLPDMESPTLPPRQPPPVILDDSITPDDPGISNVQGNWDTGGVMDNTP
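The translated and mature sequences above differ only by the leading methionine, and the residue count is consistent with the gene length. The sketch below illustrates both relationships. It assumes that the mature form is obtained by removing the initiator methionine and that the coding sequence includes a single stop codon. The function names are hypothetical.

```python
def mature_sequence(translated: str) -> str:
    """Drop the initiator methionine if present (assumed processing rule)."""
    return translated[1:] if translated.startswith("M") else translated


def expected_cds_length(n_residues: int) -> int:
    """Coding bases for n residues plus one stop codon."""
    return 3 * (n_residues + 1)


# First eight residues of the translated protein in this record.
print(mature_sequence("MGYRGKIV"))  # GYRGKIV
print(expected_cds_length(789))     # 2370
```

The second result matches the 2370-base gene sequence, which ends in the stop codon TGA.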
Specific function: Involved in a general secretion pathway (GSP) for the export of proteins [H]
COG id: COG1450
COG function: function code NU; Type II secretory pathway, component PulD
Gene ontology:
Cell location: Cell outer membrane (Probable) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the GSP D family [H]
Homologues:
Organism=Escherichia coli, GI87082242, Length=674, Percent_Identity=32.0474777448071, Blast_Score=268, Evalue=8e-73
Organism=Escherichia coli, GI1789793, Length=304, Percent_Identity=25.9868421052632, Blast_Score=86, Evalue=1e-17
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001775
- InterPro: IPR005644
- InterPro: IPR004846
- InterPro: IPR013356
- InterPro: IPR004845 [H]
Pfam domain/function: PF00263 Secretin; PF03958 Secretin_N [H]
EC number: NA
Molecular weight: Translated: 82966; Mature: 82835
Theoretical pI: Translated: 6.30; Mature: 6.30
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.1 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.1 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
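Percentages such as these can be derived by counting residues in the protein sequence. The following is a minimal sketch, assuming one-letter amino acid codes with optional whitespace between blocks, as in this record. The function name is hypothetical, and the example uses only the first 20 residues of the translated protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return 100.0 * count / len(seq)


# First 20 residues of the translated protein: one Cys and two Met.
toy = "MGYRGKIVRH RSLCAVFLWM"
print(residue_percent(toy, "C"))   # 5.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 15.0
```

Because each reported figure is rounded to one decimal place, the combined Cys+Met value need not equal the sum of the two rounded values.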
Secondary structure:
>Translated Secondary Structure MGYRGKIVRHRSLCAVFLWMAVAGWVTEAWAVNPSATLPVEEIGVPPSVLPEADSLRASK CCCCCCCHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCHHHHCCCCCCCCCCHHHHHHC SRSSAGQDLVTLNFVNADIEGVVKAVSEITRKNFMLDPRVKGTINIVSAKPVPRSSVYEV CCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHCCEECCCCCCEEEEEECCCCCHHHHHHH FLSALRLHGYAVVEDYGIIRIVPESDAKLYQGPTLGPTNKRQLAGDRIQTQVFTLQYESA HHHHHHHCCEEEEECCCEEEEEECCCCCEECCCCCCCCCCHHHCCCCEEEEEEEEEHHHH VQMVPILRPLIAPNNSITANPNSNTLVITDYASNLQRLAKIIDSVDQPSGTEPVSIPLQH HHHHHHHHHHHCCCCCEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCEEEECCC ASAIDVAQTVNRLFSESTQSQAEGAADPTQQRFTVVADARSNTLLARSGNRAALARLRQL CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCEEEEECCCHHHHHHHHHH VTVLDSPTSAAGNMHVVFLKNADAVRLAETLRAIYHNMASPVSSSSGLSQGTGTAFGTSS HHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC LGTSTGGGMGASSGTSTGGSMGTSMPGSSLGAGTVPAASTVTPAPMQTGATSATPGIIQA CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE DAATNSIIITAPDAIYNNLRAVVEKLDVRRVQVYIEALIAEITADRAAEFGIQWQNLSNA CCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCHH AQGGTQVFGGTNFNAGTAGGGSIISTAQNPIANAASGLTIGIMNGLVTAIPGIGPVLNIH HCCCCEEECCCCCCCCCCCCCCCHHCCCCCHHHHHCCCEEHHHHHHHHHCCCCCHHHHHH TLIRALETDANANILSTPTLLTLNNEEARIIIGQNVPIPTGQFIPPVGGAVTSPFQTVSR HHHHHHHCCCCCCEECCCEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHH QDVGLSLKIKPLISEGNTVRVQIFQEVSSVVPGTVNATNGLITNKRSIESTVLVDDGQIL CCCCEEEEEEEEECCCCEEEEEEEHHHHHHCCCCCCCCCCEEECCCCCCEEEEEECCCEE VLGGLMQDSVNDSVERIPLVGAIPLFGQLFSYNKRSRNKTNLMVFLRPTLMRAGDAADPL EECCHHHHHHCCHHHHCCEEECHHHHHHHHHCCCCCCCCCEEEEEEEHHHHHCCCCCCCC SDAQYDRVLGEQKKVRPKFNLVLPDMESPTLPPRQPPPVILDDSITPDDPGISNVQGNWD CHHHHHHHHCHHHHCCCCEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCC TGGVMDNTP CCCCCCCCC >Mature Secondary Structure GYRGKIVRHRSLCAVFLWMAVAGWVTEAWAVNPSATLPVEEIGVPPSVLPEADSLRASK CCCCCCHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCHHHHCCCCCCCCCCHHHHHHC SRSSAGQDLVTLNFVNADIEGVVKAVSEITRKNFMLDPRVKGTINIVSAKPVPRSSVYEV CCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHCCEECCCCCCEEEEEECCCCCHHHHHHH FLSALRLHGYAVVEDYGIIRIVPESDAKLYQGPTLGPTNKRQLAGDRIQTQVFTLQYESA 
HHHHHHHCCEEEEECCCEEEEEECCCCCEECCCCCCCCCCHHHCCCCEEEEEEEEEHHHH VQMVPILRPLIAPNNSITANPNSNTLVITDYASNLQRLAKIIDSVDQPSGTEPVSIPLQH HHHHHHHHHHHCCCCCEEECCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCEEEECCC ASAIDVAQTVNRLFSESTQSQAEGAADPTQQRFTVVADARSNTLLARSGNRAALARLRQL CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCEEEEECCCHHHHHHHHHH VTVLDSPTSAAGNMHVVFLKNADAVRLAETLRAIYHNMASPVSSSSGLSQGTGTAFGTSS HHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC LGTSTGGGMGASSGTSTGGSMGTSMPGSSLGAGTVPAASTVTPAPMQTGATSATPGIIQA CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE DAATNSIIITAPDAIYNNLRAVVEKLDVRRVQVYIEALIAEITADRAAEFGIQWQNLSNA CCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCHH AQGGTQVFGGTNFNAGTAGGGSIISTAQNPIANAASGLTIGIMNGLVTAIPGIGPVLNIH HCCCCEEECCCCCCCCCCCCCCCHHCCCCCHHHHHCCCEEHHHHHHHHHCCCCCHHHHHH TLIRALETDANANILSTPTLLTLNNEEARIIIGQNVPIPTGQFIPPVGGAVTSPFQTVSR HHHHHHHCCCCCCEECCCEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHH QDVGLSLKIKPLISEGNTVRVQIFQEVSSVVPGTVNATNGLITNKRSIESTVLVDDGQIL CCCCEEEEEEEEECCCCEEEEEEEHHHHHHCCCCCCCCCCEEECCCCCCEEEEEECCCEE VLGGLMQDSVNDSVERIPLVGAIPLFGQLFSYNKRSRNKTNLMVFLRPTLMRAGDAADPL EECCHHHHHHCCHHHHCCEEECHHHHHHHHHCCCCCCCCCEEEEEEEHHHHHCCCCCCCC SDAQYDRVLGEQKKVRPKFNLVLPDMESPTLPPRQPPPVILDDSITPDDPGISNVQGNWD CHHHHHHHHCHHHHCCCCEEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCC TGGVMDNTP CCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]