Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is abgB
Identifier: 157160848
GI number: 157160848
Start: 1448062
End: 1449507
Strand: Reverse
Name: abgB
Synonym: EcHS_A1453
Alternate gene names: 157160848
Gene position: 1449507-1448062 (Counterclockwise)
Preceding gene: 157160849
Following gene: 157160847
Centisome position: 31.22
GC content: 52.56
Gene sequence:
>1446_bases ATGCAGGAAATCTATCGTTTTATCGACGATGCGATTGAAGCCGATCGCCAACGTTATACCGATATTGCCGATCAAATCTG GGATCATCCAGAAACACGTTTTGAAGAGTTCTGGTCAGCGGAGCATCTGGCTTCGGCGCTGGAATCTGCAGGCTTCACCG TTACCCGCAACGTAGGCAATATCCCAAATGCCTTTATTGCTTCGTTTGGTCAAGGCAAACCGGTTATCGCCCTGCTGGGA GAATATGACGCCCTGGCAGGTTTAAGTCAGCAAGCAGGTTGCGCGCAACCTACATCCGTGACGCCCGGTGAAAATGGTCA CGGTTGCGGACACAATTTGCTGGGAACCGCCGCCTTTGCCGCTGCAATAGCCGTCAAGAAATGGCTGGAACAATATGGGC AAGGCGGCACGGTGCGCTTTTATGGTTGTCCTGGCGAAGAAGGCGGCTCGGGTAAAACGTTCATGGTTCGCGAGGGGGTA TTTGATGATGTGGATGCGGCACTCACCTGGCACCCGGAAGCCTTTGCCGGTATGTTCAATACCCGCACGCTGGCAAACAT TCAGGCATCATGGCGCTTTAAAGGGATCGCAGCACATGCCGCGAATTCCCCTCATTTGGGACGCAGCGCCCTTGATGCCG TAACGTTGATGACCACTGGCACCAACTTCCTCAACGAACATATTATTGAAAAAGCGCGCGTACACTATGCCATCACAAAT AGCGGCGGGATCTCGCCCAACGTGGTCCAGGCGCAGGCAGAAGTGCTTTATCTTATCCGCGCCCCCGAAATGACCGACGT GCAGCATATTTATGATCGGGTCGCCAAAATCGCCGAAGGTGCGGCATTGATGACCGAAACCACGGTTGAATGCCGCTTCG ACAAAGCCTGTTCCAGTTATCTCCCGAATCGCACCTTAGAAAATGCCATGTACCAGGCCCTATCCCATTTTGGTACCCCG GAATGGAACTCCGAAGAACTGGCTTTTGCGAAACAAATTCAGGCTACGCTCACCTCCAACGATCGGCAAAACAGTCTGAA TAATATCGCCGCAACCGGTGGCGAAAACGGCAAGGTTTTTGCACTACGTCATCGTGAAACGGTACTGGCGAATGAAGTCG CTCCATATGCCGCCACCGATAACGTGCTTGCGGCATCGACTGATGTCGGCGACGTCAGTTGGAAACTGCCTGTTGCCCAG TGTTTCAGCCCCTGTTTTGCCGTCGGTACACCGCTACATACGTGGCAACTGGTTAGCCAGGGGCGAACATCTATTGCTCA TAAAGGAATGCTGCTGGCGGCGAAAACTATGGCAGCAACCACAGTCAATCTCTTCCTTGATTCAGGGCTATTGCAAGAAT GCCAACAAGAGCATCAGCAAGTAACGGACACGCAACCGTATCACTGCCCTATCCCGAAAAACGTGACACCGTCACCTTTA AAATAA
Upstream 100 bases:
>100_bases ATCACAACGAAAAATTCGATTTTGACGAGCAGGTTCTCGCTATTGCCGTCGAAACGCTGGCGCGCACCGCGCTCAATTTT CCCTGGACGCGAGGTATCTG
Downstream 100 bases:
>100_bases CAACAACAACGCAAACACAACAACCGAGGAATGCCCATGAGTATGTCATCCATACCGTCGTCCTCCCAATCCGGGAAGCT CTATGGCTGGGTCGAAAGAA
Product: aminobenzoyl-glutamate utilization protein B
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 481; Mature: 481
Protein sequence:
>481_residues MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGNIPNAFIASFGQGKPVIALLG EYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFAAAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGV FDDVDAALTWHPEAFAGMFNTRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSYLPNRTLENAMYQALSHFGTP EWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVFALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQ CFSPCFAVGTPLHTWQLVSQGRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL K
Sequences:
>Translated_481_residues MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGNIPNAFIASFGQGKPVIALLG EYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFAAAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGV FDDVDAALTWHPEAFAGMFNTRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSYLPNRTLENAMYQALSHFGTP EWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVFALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQ CFSPCFAVGTPLHTWQLVSQGRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL K >Mature_481_residues MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGNIPNAFIASFGQGKPVIALLG EYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFAAAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGV FDDVDAALTWHPEAFAGMFNTRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSYLPNRTLENAMYQALSHFGTP EWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVFALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQ CFSPCFAVGTPLHTWQLVSQGRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL K
Specific function: Required but not essential for aminobenzoyl-glutamate utilization. May participate in hydrolysis of aminobenzoyl- glutamate to aminobenzoate, either alone or in combination with AbgA
COG id: COG1473
COG function: function code R; Metal-dependent amidase/aminoacylase/carboxypeptidase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI58082085, Length=473, Percent_Identity=25.5813953488372, Blast_Score=112, Evalue=9e-25, Organism=Escherichia coli, GI1787598, Length=481, Percent_Identity=100, Blast_Score=1004, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): ABGB_ECOLI (P76052)
Other databases:
- EMBL: U00096 - EMBL: AP009048 - PIR: D64883 - RefSeq: AP_001965.1 - RefSeq: NP_415853.1 - ProteinModelPortal: P76052 - DIP: DIP-9029N - STRING: P76052 - EnsemblBacteria: EBESCT00000000411 - EnsemblBacteria: EBESCT00000014598 - GeneID: 945950 - GenomeReviews: AP009048_GR - GenomeReviews: U00096_GR - KEGG: ecj:JW1331 - KEGG: eco:b1337 - EchoBASE: EB3134 - EcoGene: EG13351 - eggNOG: COG1473 - GeneTree: EBGT00050000010434 - HOGENOM: HBG487162 - OMA: HTWQVVA - ProtClustDB: CLSK880099 - BioCyc: EcoCyc:G6669-MONOMER - BioCyc: MetaCyc:G6669-MONOMER - Genevestigator: P76052 - InterPro: IPR017145 - InterPro: IPR010168 - InterPro: IPR002933 - InterPro: IPR011650 - PIRSF: PIRSF037227 - TIGRFAMs: TIGR01891
Pfam domain/function: PF01546 Peptidase_M20; SSF55031 Peptidase_M20_dimer
EC number: NA
Molecular weight: Translated: 52194; Mature: 52194
Theoretical pI: Translated: 5.53; Mature: 5.53
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 3.7 %Cys+Met (Translated Protein) 1.9 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 3.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGN CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHCCCEEEECCCC IPNAFIASFGQGKPVIALLGEYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFA CCHHHHHHCCCCCCEEEEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH AAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGVFDDVDAALTWHPEAFAGMFN HHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEEECCCCCCCCCCEEECCHHHHHHHC TRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN CCHHEECEECEEEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEEEC SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSY CCCCCCHHHHHHEEEEEEEECCCCCHHHHHHHHHHHHHCCCHHEECCHHHHHHHHHHHHH LPNRTLENAMYQALSHFGTPEWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVF CCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHECCCCCCCCEE ALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQCFSPCFAVGTPLHTWQLVSQ EEECCHHHHHHCCCCCCCCCCEEEECCCCCCCEEECCHHHHHHHHHHHCCCHHHHHHHHC GRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL CCHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC K C >Mature Secondary Structure MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGN CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHCCCEEEECCCC IPNAFIASFGQGKPVIALLGEYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFA CCHHHHHHCCCCCCEEEEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH AAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGVFDDVDAALTWHPEAFAGMFN HHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEEECCCCCCCCCCEEECCHHHHHHHC TRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN CCHHEECEECEEEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHEEEEEEC SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSY CCCCCCHHHHHHEEEEEEEECCCCCHHHHHHHHHHHHHCCCHHEECCHHHHHHHHHHHHH LPNRTLENAMYQALSHFGTPEWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVF CCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHECCCCCCCCEE ALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQCFSPCFAVGTPLHTWQLVSQ EEECCHHHHHHCCCCCCCCCCEEEECCCCCCCEEECCHHHHHHHHHHHCCCHHHHHHHHC GRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL CCHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC K C
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9278503; 9829935