Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is yicO
Identifier: 30064977
GI number: 30064977
Start: 3864319
End: 3865731
Strand: Reverse
Name: yicO
Synonym: S3971
Alternate gene names: 30064977
Gene position: 3865731-3864319 (Counterclockwise)
Preceding gene: 30064979
Following gene: 30064976
Centisome position: 84.05
GC content: 47.2
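The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is not taken from the database's own tooling: the function names are hypothetical, and the centisome formula (position as a percentage of genome length, using the first coordinate listed under "Gene position") is an assumption that happens to reproduce the reported 84.05.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_599_354   # "Length" of NC_004741
START = 3_864_319           # "Start"
END = 3_865_731             # "End"


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    # Matches the ">1413_bases" header of the gene sequence.
    print(gene_length(START, END))
    # For this reverse-strand gene, the higher coordinate is listed first
    # under "Gene position", so it is the one used here.
    print(centisome(END, GENOME_LENGTH))
```

Using the lower coordinate (3864319) instead gives roughly 84.02, which suggests, but does not prove, that the record measures from the first coordinate of the gene position.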
Gene sequence:
>1413_bases GTGATAATTATCACTGAACCGTTGTTGTCATTTGTTTTACAAAAGCAAGGGATTAAATCTCCTCCAATGGACAAAAAAAT GAATAATGACAATACCGATTACGTGAGTAATGAATCAGGGACGCTTTCGCGATTATTTAAACTACCTCAGCATGGGACCA CCGTCCGCACAGAATTGATTGCGGGGATGACCACTTTTTTAACCATGGTGTACATTGTTTTTGTGAACCCGCAAATCCTC GGCGCGGCACAAATGGACCCGAAAGTGGTGTTTGTTACCACCTGTTTGATTGCCGGTATCGGCAGTATTGCGATGGGGAT ATTTGCTAACTTACCCGTGGCGCTGGCTCCGGCAATGGGGCTGAACGCCTTCTTTGCGTTCGTGGTCGTGGGGGCGATGG GGATCTCCTGGCAGACCGGGATGGGCGCGATATTCTGGGGCGCAATTGGGCTATTTTTGCTCACGCTGTTTCGTATCCGG TACTGGATGATCTCCAACATTCCATTAAGTTTACGTATTGGTATCACCAGTGGAATCGGATTATTTATTGCCTTAATGGG ATTAAAAAACACTGGTGTTATTGTCGCCAATAAAGACACGCTGGTGATGATTGGCGATTTAAGTTCTCACGGCGTGTTGT TAGGTATTTTAGGGTTTTTTATTATAACCGTGTTGTCATCACGTCATTTTCATGCCGCGGTGCTGGTTTCTATTGTGGTG ACGTCTTGCTGTGGATTATTTTTCGGTGATGTTCATTTTAGCGGCGTCTATTCCATTCCGCCTGATATTAGCGGCGTCAT TGGTGAAGTAGATTTGAGCGGCGCGTTAACACTTGAACTCGCCGGTATCATTTTCTCCTTTATGCTGATCAACCTATTTG ATTCATCAGGAACATTAATTGGTGTAACTGATAAAGCGGGCTTAATAGATAGCAACGGTAAATTCCCCAATATGAATAAG GCGCTGTATGTTGATAGCGTCAGTTCGGTGGCAGGCGCATTTATCGGTACCTCGTCTGTTACTGCCTATATTGAAAGTAC TTCTGGTGTGGCAGTCGGTGGTCGCACGGGGCTAACGGCTGTCGTGGTCGGCGTTATGTTCCTGTTAGTTATGTTCTTCT CACCGCTGGTGGCGATGGTTCCTCCTTACGCAACCGCTGGAGCGTTAATCTTTGTTGGCGTGCTGATGACTTCGAGCCTG GCGCGCGTTAACTGGGATGATTTTACCGAATCGGTGCCTGCGTTTATTACCACGGTGATGATGCCCTTTACTTTCTCGAT CACCGAAGGGATTGCACTCGGCTTTATGTCGTACTGCATCATGAAAGTGTGCACCGGGCGCTGGCGCGATCTGAACCTGT GTGTGGTGGTGGTCGCAGCTCTGTTCGCATTGAAGATTATTCTGGTGGATTAG
Upstream 100 bases:
>100_bases TCATATTTTTCTCATTTTGATTTTATCAATTTGATTATTATTTATCATAAAGAAACGTCGCGAGATTATTAAAGAATATC TATTAATGTGCAATTGAAAT
Downstream 100 bases:
>100_bases TAACTGGCGAGGGTTTCCAGATATCATGAGTTCTGATTACGCAGGAGAACTCATGATCTGGATAATGCTCGCCACGCTGG CGGTAGTGTTTGTGGTTGGT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 470; Mature: 470
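The residue count is consistent with the gene length: 1413 bases form 471 codons, and the final codon of the gene sequence (TAG) is a stop codon, leaving 470 residues. A minimal sketch of that check follows, with a hypothetical helper name and the assumption that the coding sequence includes exactly one terminal stop codon.

```python
def residues_from_cds_length(n_bases: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of n_bases.

    Assumes the sequence is in frame and, if has_stop is True, that it
    ends with a single stop codon that is not translated.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(residues_from_cds_length(1413))
```

The gene starts with GTG rather than ATG; bacterial initiator codons of this kind are still translated as methionine, which matches the leading M of the protein sequence.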
Protein sequence:
>470_residues MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELIAGMTTFLTMVYIVFVNPQIL GAAQMDPKVVFVTTCLIAGIGSIAMGIFANLPVALAPAMGLNAFFAFVVVGAMGISWQTGMGAIFWGAIGLFLLTLFRIR YWMISNIPLSLRIGITSGIGLFIALMGLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVV TSCCGLFFGDVHFSGVYSIPPDISGVIGEVDLSGALTLELAGIIFSFMLINLFDSSGTLIGVTDKAGLIDSNGKFPNMNK ALYVDSVSSVAGAFIGTSSVTAYIESTSGVAVGGRTGLTAVVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSL ARVNWDDFTESVPAFITTVMMPFTFSITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD
Sequences:
>Translated_470_residues MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELIAGMTTFLTMVYIVFVNPQIL GAAQMDPKVVFVTTCLIAGIGSIAMGIFANLPVALAPAMGLNAFFAFVVVGAMGISWQTGMGAIFWGAIGLFLLTLFRIR YWMISNIPLSLRIGITSGIGLFIALMGLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVV TSCCGLFFGDVHFSGVYSIPPDISGVIGEVDLSGALTLELAGIIFSFMLINLFDSSGTLIGVTDKAGLIDSNGKFPNMNK ALYVDSVSSVAGAFIGTSSVTAYIESTSGVAVGGRTGLTAVVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSL ARVNWDDFTESVPAFITTVMMPFTFSITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD >Mature_470_residues MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELIAGMTTFLTMVYIVFVNPQIL GAAQMDPKVVFVTTCLIAGIGSIAMGIFANLPVALAPAMGLNAFFAFVVVGAMGISWQTGMGAIFWGAIGLFLLTLFRIR YWMISNIPLSLRIGITSGIGLFIALMGLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVV TSCCGLFFGDVHFSGVYSIPPDISGVIGEVDLSGALTLELAGIIFSFMLINLFDSSGTLIGVTDKAGLIDSNGKFPNMNK ALYVDSVSSVAGAFIGTSSVTAYIESTSGVAVGGRTGLTAVVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSL ARVNWDDFTESVPAFITTVMMPFTFSITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD
Specific function: Unknown
COG id: COG2252
COG function: function code R; Permeases
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the xanthine/uracil permease family. AzgA purine transporter (TC 2.A.1.40) subfamily [H]
Homologues:
Organism=Escherichia coli, GI87082309, Length=444, Percent_Identity=99.3243243243243, Blast_Score=873, Evalue=0.0
Organism=Escherichia coli, GI1790150, Length=444, Percent_Identity=74.5495495495496, Blast_Score=647, Evalue=0.0
Organism=Escherichia coli, GI1790499, Length=431, Percent_Identity=40.3712296983759, Blast_Score=301, Evalue=7e-83
Organism=Escherichia coli, GI48994909, Length=447, Percent_Identity=36.4653243847875, Blast_Score=265, Evalue=7e-72
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006043 [H]
Pfam domain/function: PF00860 Xan_ur_permease [H]
EC number: NA
Molecular weight: Translated: 49934; Mature: 49934
Theoretical pI: Translated: 6.66; Mature: 6.66
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
4.9 %Met (Translated Protein)
6.2 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
4.9 %Met (Mature Protein)
6.2 %Cys+Met (Mature Protein)
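Percentages like these are presumably plain residue frequencies over the protein length, rounded to one decimal place. The sketch below illustrates that assumed calculation on a made-up peptide; the function name and the toy sequence are hypothetical and are not part of the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    if not protein:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in protein.upper() if aa in wanted)
    return round(100.0 * hits / len(protein), 1)


if __name__ == "__main__":
    toy = "MCAAMAAAAA"  # hypothetical 10-residue peptide, not from the record
    print(residue_percent(toy, "C"))    # Cys share
    print(residue_percent(toy, "M"))    # Met share
    print(residue_percent(toy, "CM"))   # Cys+Met share
```

Passing the 470-residue protein sequence from this record in place of the toy peptide would allow the reported Cys and Met figures to be checked directly.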
Secondary structure:
>Translated Secondary Structure MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELI CEEECHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHH AGMTTFLTMVYIVFVNPQILGAAQMDPKVVFVTTCLIAGIGSIAMGIFANLPVALAPAMG HHHHHHHHHHHHHHCCCHHEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH LNAFFAFVVVGAMGISWQTGMGAIFWGAIGLFLLTLFRIRYWMISNIPLSLRIGITSGIG HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHH LFIALMGLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVV HHHHHHCCCCCCEEEECCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH TSCCGLFFGDVHFSGVYSIPPDISGVIGEVDLSGALTLELAGIIFSFMLINLFDSSGTLI HHHHHHHHCCHHCCCEEECCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEE GVTDKAGLIDSNGKFPNMNKALYVDSVSSVAGAFIGTSSVTAYIESTSGVAVGGRTGLTA EECCCCCEECCCCCCCCCCCEEEEECHHHHHHHHHCCCCEEEEEECCCCEEECCCCCHHH VVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSLARVNWDDFTESVPAFITTVM HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH MPFTFSITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure MIIITEPLLSFVLQKQGIKSPPMDKKMNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELI CEEECHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHH AGMTTFLTMVYIVFVNPQILGAAQMDPKVVFVTTCLIAGIGSIAMGIFANLPVALAPAMG HHHHHHHHHHHHHHCCCHHEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH LNAFFAFVVVGAMGISWQTGMGAIFWGAIGLFLLTLFRIRYWMISNIPLSLRIGITSGIG HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHH LFIALMGLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVV HHHHHHCCCCCCEEEECCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH TSCCGLFFGDVHFSGVYSIPPDISGVIGEVDLSGALTLELAGIIFSFMLINLFDSSGTLI HHHHHHHHCCHHCCCEEECCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEE GVTDKAGLIDSNGKFPNMNKALYVDSVSSVAGAFIGTSSVTAYIESTSGVAVGGRTGLTA EECCCCCEECCCCCCCCCCCEEEEECHHHHHHHHHCCCCEEEEEECCCCEEECCCCCHHH VVVGVMFLLVMFFSPLVAMVPPYATAGALIFVGVLMTSSLARVNWDDFTESVPAFITTVM HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH MPFTFSITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7686882; 9278503 [H]