| Definition | Escherichia coli ED1a chromosome, complete genome. |
|---|---|
| Accession | NC_011745 |
| Length | 5,209,548 |
Click here to switch to the map view.
The map label for this gene is 218692588
Identifier: 218692588
GI number: 218692588
Start: 4984570
End: 4985838
Strand: Direct
Name: 218692588
Synonym: ECED1_5055
Alternate gene names: NA
Gene position: 4984570-4985838 (Clockwise)
Preceding gene: 218692587
Following gene: 218692595
Centisome position: 95.68
GC content: 44.76
Gene sequence:
>1269_bases ATGGCATTAATCGGAGTGTATGCGGATTGGGAAGGCCTGGATGGTCCGGAGCGGATTGGTTATTTACACAGTCGCCGCAC TCGGACGCGTGAGATATTTGAGTTTGAGTACGATAAAAAGGCGCTTGCAGATCCTTCACTTAATTTCATTCAGTTAGATC CTGAAATAATGTTGTATGAGGGAGCACAGTACCCAATTCCTCCAAAGGATAAATTTGGTGCATTTAGTGATTCCTGCCCG GATCGTTGGGGGCGTATGCTAATGAAGCGTCGTTTTGAGAGGGATATTCGAGACGGGTTATGTGATAAAGACTCCCACCT TTATGAGTCTGATTATCTACTAGGTGTTCATGATCTTTATCGGGTTGGGGCTCTTCGATACAAACGTGAAGACGCTGGAG AGTTTCTTGATAATCGCATTGATGTTGCAGCCCCACCATTTACCGAAATAGCGAGTCTGGAAAGGGCTAGCCGGGCAATT GAAGAAGATCCTGATAATAAAGAGCTAATGGGGCAAAAATGGTTAAGAATGCTAATCGCTCCGGGGGGATCATTAGGGGG GACACGTCCGAAAGCCAGTGTTGTTGATGAAGCTGGGCATCTCTACATCGCTAAATTTCCCAGTGTAAAAGATGAGTATG ATGTTGGTGGATGGGAAATGGTCGTTAATGCACTTGCTGTTGGCTGTGGACTGAATGTTGCACCTGCTCAGGCGCATAAA TTTGCCAGCAATTACCATTGTTTTATGGTTCGTCGTTTTGACCGCACTAATGCGGGAAGGCGTCTGCATTTTGCTTCGGC TATGACACTAACCAGGCATCAGGATGGTGAAGATGCCTCTACAGGTGTGAGTTATCTGGAATTAGCAGATGTTCTTATCA GACATGGAGCGCAGACTAATACTGACCTTAAGGAACTCTGGTCCAGGATTGTTTTTAATATTCTTGTTTCTAATACCGAT GACCATTTACGTAATCATGGATATATCTTGATACCGGGAAAAGGTTGGCGGCTTTCTGAGGCATATGATTTGAATCCTGT TGCCAGGTCTGACGGTCTAAAACTCAATATTACAGAAAATGATAATGCACTTGATTTGGAATTAGCTCGAGAAGTTGCGG AGTATTTTCGACTTGGTTTGACAGAAGCAGATGACATTATTGCTAATTTTAGAGGGATTGTCAGTCAGTGGCGAATCATT GCCGAGCGTCTTCGATTACCTGGTCGTGAGCAAGAGTTAATGGCAGAGGCGTTCAGGGGGGCAATTTAG
Upstream 100 bases:
>100_bases CACTGGCAGAAGGCTCAACGCCATCTATTACAGGAGAAACATCAACCTCTCTTGCAGCACTATTAAAACCGATAAAAAAA CCGTTAAAAGGAAATTAATT
Downstream 100 bases:
>100_bases AATAGTTATTTAGATATATTCGCTAAATCATAAATTTGCGAAGCAATGCCAGTAGATACTGAACGTTTTGGAAGAGACGT TTAATTTATGAGAAAAATAG
Product: hypothetical protein
Products: NA
Alternate protein names: HipA Domain Protein; Xin-Antitoxin System Toxin Component HipA Family; Amidophosphoribosyl Transferase; HipA Protein; Hipa Domain Protein; HipA-Like C-Terminal Domain-Containing; HipA-Like; HipA-Like Protein; HipA Family Protein; Protein With HipA-Like Domain; HipA-Family Phosphatidylinositol 3/4-Kinase; Protein Related Capsule Biosynthesis -Like; Protein Containing HipA-Like C-Terminal Domain; Phage-Related Protein; Conserved Protein; Capsule Biosynthesis Protein HipA; HipA-Like N-Terminal Domain Family; Capsule Biosynthesis -Like Protein; Orf_Bo
Number of amino acids: Translated: 422; Mature: 421
Protein sequence:
>422_residues MALIGVYADWEGLDGPERIGYLHSRRTRTREIFEFEYDKKALADPSLNFIQLDPEIMLYEGAQYPIPPKDKFGAFSDSCP DRWGRMLMKRRFERDIRDGLCDKDSHLYESDYLLGVHDLYRVGALRYKREDAGEFLDNRIDVAAPPFTEIASLERASRAI EEDPDNKELMGQKWLRMLIAPGGSLGGTRPKASVVDEAGHLYIAKFPSVKDEYDVGGWEMVVNALAVGCGLNVAPAQAHK FASNYHCFMVRRFDRTNAGRRLHFASAMTLTRHQDGEDASTGVSYLELADVLIRHGAQTNTDLKELWSRIVFNILVSNTD DHLRNHGYILIPGKGWRLSEAYDLNPVARSDGLKLNITENDNALDLELAREVAEYFRLGLTEADDIIANFRGIVSQWRII AERLRLPGREQELMAEAFRGAI
Sequences:
>Translated_422_residues MALIGVYADWEGLDGPERIGYLHSRRTRTREIFEFEYDKKALADPSLNFIQLDPEIMLYEGAQYPIPPKDKFGAFSDSCP DRWGRMLMKRRFERDIRDGLCDKDSHLYESDYLLGVHDLYRVGALRYKREDAGEFLDNRIDVAAPPFTEIASLERASRAI EEDPDNKELMGQKWLRMLIAPGGSLGGTRPKASVVDEAGHLYIAKFPSVKDEYDVGGWEMVVNALAVGCGLNVAPAQAHK FASNYHCFMVRRFDRTNAGRRLHFASAMTLTRHQDGEDASTGVSYLELADVLIRHGAQTNTDLKELWSRIVFNILVSNTD DHLRNHGYILIPGKGWRLSEAYDLNPVARSDGLKLNITENDNALDLELAREVAEYFRLGLTEADDIIANFRGIVSQWRII AERLRLPGREQELMAEAFRGAI >Mature_421_residues ALIGVYADWEGLDGPERIGYLHSRRTRTREIFEFEYDKKALADPSLNFIQLDPEIMLYEGAQYPIPPKDKFGAFSDSCPD RWGRMLMKRRFERDIRDGLCDKDSHLYESDYLLGVHDLYRVGALRYKREDAGEFLDNRIDVAAPPFTEIASLERASRAIE EDPDNKELMGQKWLRMLIAPGGSLGGTRPKASVVDEAGHLYIAKFPSVKDEYDVGGWEMVVNALAVGCGLNVAPAQAHKF ASNYHCFMVRRFDRTNAGRRLHFASAMTLTRHQDGEDASTGVSYLELADVLIRHGAQTNTDLKELWSRIVFNILVSNTDD HLRNHGYILIPGKGWRLSEAYDLNPVARSDGLKLNITENDNALDLELAREVAEYFRLGLTEADDIIANFRGIVSQWRIIA ERLRLPGREQELMAEAFRGAI
Specific function: Unknown
COG id: COG3550
COG function: function code R; Uncharacterized protein related to capsule biosynthesis enzymes
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 47888; Mature: 47757
Theoretical pI: Translated: 5.37; Mature: 5.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 3.3 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 2.1 %Met (Mature Protein) 3.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MALIGVYADWEGLDGPERIGYLHSRRTRTREIFEFEYDKKALADPSLNFIQLDPEIMLYE CEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCEEEECCCEEEEC GAQYPIPPKDKFGAFSDSCPDRWGRMLMKRRFERDIRDGLCDKDSHLYESDYLLGVHDLY CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCEECHHHHH RVGALRYKREDAGEFLDNRIDVAAPPFTEIASLERASRAIEEDPDNKELMGQKWLRMLIA HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC PGGSLGGTRPKASVVDEAGHLYIAKFPSVKDEYDVGGWEMVVNALAVGCGLNVAPAQAHK CCCCCCCCCCCHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHH FASNYHCFMVRRFDRTNAGRRLHFASAMTLTRHQDGEDASTGVSYLELADVLIRHGAQTN HHCCCEEEEEEEECCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCC TDLKELWSRIVFNILVSNTDDHLRNHGYILIPGKGWRLSEAYDLNPVARSDGLKLNITEN CHHHHHHHHHHHHHHHCCCHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECC DNALDLELAREVAEYFRLGLTEADDIIANFRGIVSQWRIIAERLRLPGREQELMAEAFRG CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHC AI CC >Mature Secondary Structure ALIGVYADWEGLDGPERIGYLHSRRTRTREIFEFEYDKKALADPSLNFIQLDPEIMLYE EEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCEEEECCCEEEEC GAQYPIPPKDKFGAFSDSCPDRWGRMLMKRRFERDIRDGLCDKDSHLYESDYLLGVHDLY CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCEECHHHHH RVGALRYKREDAGEFLDNRIDVAAPPFTEIASLERASRAIEEDPDNKELMGQKWLRMLIA HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC PGGSLGGTRPKASVVDEAGHLYIAKFPSVKDEYDVGGWEMVVNALAVGCGLNVAPAQAHK CCCCCCCCCCCHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHH FASNYHCFMVRRFDRTNAGRRLHFASAMTLTRHQDGEDASTGVSYLELADVLIRHGAQTN HHCCCEEEEEEEECCCCCCCEEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCC TDLKELWSRIVFNILVSNTDDHLRNHGYILIPGKGWRLSEAYDLNPVARSDGLKLNITEN CHHHHHHHHHHHHHHHCCCHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECC DNALDLELAREVAEYFRLGLTEADDIIANFRGIVSQWRIIAERLRLPGREQELMAEAFRG CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHC AI CC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA