| Field | Value |
|---|---|
| Definition | Acidovorax citrulli AAC00-1 chromosome, complete genome. |
| Accession | NC_008752 |
| Length | 5,352,772 |
The map label for this gene is ytfL [H]
Identifier: 120612312
GI number: 120612312
Start: 4044655
End: 4046013
Strand: Direct
Name: ytfL [H]
Synonym: Aave_3668
Alternate gene names: 120612312
Gene position: 4044655-4046013 (Clockwise)
Preceding gene: 120612311
Following gene: 120612315
Centisome position: 75.56
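The gene length and centisome position can be checked from the coordinates above. This is a minimal sketch that assumes the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state the formula, but this assumption reproduces the listed 75.56. The function and constant names are illustrative.

```python
# Values copied from this record.
GENOME_LENGTH = 5_352_772
GENE_START = 4_044_655
GENE_END = 4_046_013


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(GENE_START, GENE_END))     # 1359
print(centisome(GENE_START, GENOME_LENGTH))  # 75.56
```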
GC content: 67.99
Gene sequence:
>1359_bases ATGGCGGCGCCCGCCCGTCCCATGACCCTGTCGCAAAGCTTCTTCCTCATCGGCCTCCTCATCGTGGCCAGCGCCTTCTT CTCCGTCGCGGAGATCTCCCTGGCTGCCTCCCGCCGCCTGCGCCTGCGCCAGCTCGCCGACGAGGGCGAAGCGCGCGCCG AGAGCGTCATGCGCATGCAGGAGCAGCCAGGCGACTATTTCACCGTGGTGCAGGTGGGCCAGAACGCCGTGGCCATCCTC GGCGGCATCGTCGGCGAAGGCGCGCTCAGCCCCCAATTCACCGCCCTGCTCCAGTTCTGGCTCAGCGACGCGCGTGCCGA AACCTTCGGCTTCCTGGCATCGTTTCTGACCATCACCTCGCTCTTCATCCTGTTCGCCGACCTGTTCCCCAAGCGCCTGG GCATGGTCAACTCCGAGCGCCTCGCCGTGGTCGTCGCGCGGCCCATGGCGATTCTCATGGCGGTGCTGCGGCCCGTCGTC TGGCTTTACAGCCGCGCCGCCGACCTGCTGTTCCGTGTGCTCGGCCTGTCGTCACTGCGCGACGACCGCATCACGTCCGA CGACATCCTGGCCATGATGGAGGCCGGTACCCGGGCCGGCGTGCTGGCGGCGCGCGAGCAGCAGGTGATCGAGAACGTGT TCGAACTCGACTCCCGCCCGGTGAGCAGCGCCATGTCGCCGCGCGACCGGATCGCCTTCTTCCTGCGCGACGATCCGGAC CAGCTCATCCGCGCCCGCATCGCCGCGGAGCCCTTCTCCACCTACCCCGTGTGCGAGGGAGATATCGACCACGTCGTCGG CTACGTCGATGCCAAGGACCTTTTCCAGCGCGTGCTGAACAACCAGCCCATCTCCCTGAAGGACGAAGGCCTCGTGCGCA AGGTACTCATCGTGCCCGACCGCCTTTCGCTGGCCGAGGTGCTGGACCAGTTCCGCCAGGTGCACGAGGATTTCGCCGTC ATCGTCAACGAGTACAGCCTCGTCGTGGGCGTCGTCACCCTGAACGACGTGATGAGCACGGTGATGGGCGACCTGATCGG CCCCGACGACGAGGAGCAGATCGTCCGGCGCGACGAGAACTCCTGGCTGATCGACGGCGTCACGCCCGTGGGCGACGTGC TGCGGGCCCTGCACCTCGACGAACTGCCCCATGCCGGCGAATACGAAACCCTGGCGGGCTTCCTCATGGTCATGCTGCGC CGCGTGCCCCGCCGCACGGACAGCGTGAACTGGGGCGGCTACAAGTTCGAGGTGCTCGACGTGGACAGCTACCGCATCGA CCAGATCATGGTGTCGCGCCTGCAGGAAGGCGGCGCCCATCCCGCCGGGCCCGGGCCGGCACCGGCCCCGGGCGGCTGA
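The GC content listed above can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence, and that the `>1359_bases` header has been removed before the sequence is passed in; `gc_percent` is an illustrative name, not part of any library.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 80-base chunks in this record)
    is ignored. The FASTA header must already be stripped.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


# Toy input; substitute the 1359-base gene sequence to compare with the record.
print(gc_percent("ATGC"))  # 50.0
```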
Upstream 100 bases:
>100_bases GATCCGCCGCGCAAGCCGGCTGGACTGTTCCGCACGGCCGCGCCCGCACGGCCCGCGTACACTCGGGACGCTGGCGCACG GCCCTCCACACGACAACTCC
Downstream 100 bases:
>100_bases ACGCGGCGCATCGGCCTGCTCAGGCGGACCGGACGCGGTCCTCGAACAGCCAGTTGCGCTGCGCCGCGAACACCGCGTCC GGGTACGGCGAACGCCCGCG
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 452; Mature: 451
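The two residue counts follow from the gene length. As a sketch, assuming the 1359 bases include the stop codon and that the mature form differs only by removal of the initiator methionine (consistent with the mature sequence in this record starting at AAPARP):

```python
def protein_lengths(gene_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one stop codon and that maturation
    removes only the N-terminal methionine.
    """
    if gene_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = gene_bases // 3 - 1  # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(protein_lengths(1359))  # (452, 451)
```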
Protein sequence:
>452_residues MAAPARPMTLSQSFFLIGLLIVASAFFSVAEISLAASRRLRLRQLADEGEARAESVMRMQEQPGDYFTVVQVGQNAVAIL GGIVGEGALSPQFTALLQFWLSDARAETFGFLASFLTITSLFILFADLFPKRLGMVNSERLAVVVARPMAILMAVLRPVV WLYSRAADLLFRVLGLSSLRDDRITSDDILAMMEAGTRAGVLAAREQQVIENVFELDSRPVSSAMSPRDRIAFFLRDDPD QLIRARIAAEPFSTYPVCEGDIDHVVGYVDAKDLFQRVLNNQPISLKDEGLVRKVLIVPDRLSLAEVLDQFRQVHEDFAV IVNEYSLVVGVVTLNDVMSTVMGDLIGPDDEEQIVRRDENSWLIDGVTPVGDVLRALHLDELPHAGEYETLAGFLMVMLR RVPRRTDSVNWGGYKFEVLDVDSYRIDQIMVSRLQEGGAHPAGPGPAPAPGG
Sequences:
>Translated_452_residues MAAPARPMTLSQSFFLIGLLIVASAFFSVAEISLAASRRLRLRQLADEGEARAESVMRMQEQPGDYFTVVQVGQNAVAIL GGIVGEGALSPQFTALLQFWLSDARAETFGFLASFLTITSLFILFADLFPKRLGMVNSERLAVVVARPMAILMAVLRPVV WLYSRAADLLFRVLGLSSLRDDRITSDDILAMMEAGTRAGVLAAREQQVIENVFELDSRPVSSAMSPRDRIAFFLRDDPD QLIRARIAAEPFSTYPVCEGDIDHVVGYVDAKDLFQRVLNNQPISLKDEGLVRKVLIVPDRLSLAEVLDQFRQVHEDFAV IVNEYSLVVGVVTLNDVMSTVMGDLIGPDDEEQIVRRDENSWLIDGVTPVGDVLRALHLDELPHAGEYETLAGFLMVMLR RVPRRTDSVNWGGYKFEVLDVDSYRIDQIMVSRLQEGGAHPAGPGPAPAPGG >Mature_451_residues AAPARPMTLSQSFFLIGLLIVASAFFSVAEISLAASRRLRLRQLADEGEARAESVMRMQEQPGDYFTVVQVGQNAVAILG GIVGEGALSPQFTALLQFWLSDARAETFGFLASFLTITSLFILFADLFPKRLGMVNSERLAVVVARPMAILMAVLRPVVW LYSRAADLLFRVLGLSSLRDDRITSDDILAMMEAGTRAGVLAAREQQVIENVFELDSRPVSSAMSPRDRIAFFLRDDPDQ LIRARIAAEPFSTYPVCEGDIDHVVGYVDAKDLFQRVLNNQPISLKDEGLVRKVLIVPDRLSLAEVLDQFRQVHEDFAVI VNEYSLVVGVVTLNDVMSTVMGDLIGPDDEEQIVRRDENSWLIDGVTPVGDVLRALHLDELPHAGEYETLAGFLMVMLRR VPRRTDSVNWGGYKFEVLDVDSYRIDQIMVSRLQEGGAHPAGPGPAPAPGG
Specific function: Unknown
COG id: COG1253
COG function: function code R; Hemolysins and related proteins containing CBS domains
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains [H]
Homologues:
Organism=Homo sapiens, GI310128564, Length=385, Percent_Identity=24.9350649350649, Blast_Score=153, Evalue=3e-37
Organism=Escherichia coli, GI1790664, Length=423, Percent_Identity=53.9007092198582, Blast_Score=486, Evalue=1e-138
Organism=Escherichia coli, GI87082033, Length=284, Percent_Identity=30.9859154929577, Blast_Score=129, Evalue=4e-31
Organism=Escherichia coli, GI145693175, Length=418, Percent_Identity=23.6842105263158, Blast_Score=114, Evalue=1e-26
Organism=Escherichia coli, GI1788119, Length=279, Percent_Identity=26.8817204301075, Blast_Score=107, Evalue=2e-24
Organism=Escherichia coli, GI1786879, Length=272, Percent_Identity=23.1617647058824, Blast_Score=75, Evalue=6e-15
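The homologue entries follow a regular `Key=value` pattern, so they can be read into records for sorting or filtering. The following is a sketch only: the regular expression is inferred from the entries shown here, not from any published format specification, and `parse_homologues` is a hypothetical helper.

```python
import re

# Pattern inferred from the entries in this record (an assumption).
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Parse homologue entries into dictionaries with typed values."""
    hits = []
    for m in HIT.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI310128564, Length=385, "
    "Percent_Identity=24.9350649350649, Blast_Score=153, Evalue=3e-37, "
    "Organism=Escherichia coli, GI1790664, Length=423, "
    "Percent_Identity=53.9007092198582, Blast_Score=486, Evalue=1e-138,"
)
best = max(parse_homologues(sample), key=lambda h: h["score"])
print(best["organism"], best["gi"])  # Escherichia coli 1790664
```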
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR016169
- InterPro: IPR000644
- InterPro: IPR002550
- InterPro: IPR005170 [H]
Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF01595 DUF21 [H]
EC number: NA
Molecular weight: Translated: 49847; Mature: 49716
Theoretical pI: Translated: 4.55; Mature: 4.55
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
3.3 %Cys+Met (Mature Protein)
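These percentages can be recomputed from the protein sequences. A minimal sketch, assuming each value is the residue count divided by the total sequence length and rounded to one decimal place; `cys_met_content` is an illustrative name.

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence.

    Whitespace is ignored; any FASTA header must already be removed.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Toy input; substitute the translated or mature sequence from this record.
print(cys_met_content("MACM"))  # {'Cys': 25.0, 'Met': 50.0, 'Cys+Met': 75.0}
```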
Secondary structure:
>Translated Secondary Structure
MAAPARPMTLSQSFFLIGLLIVASAFFSVAEISLAASRRLRLRQLADEGEARAESVMRMQ
CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EQPGDYFTVVQVGQNAVAILGGIVGEGALSPQFTALLQFWLSDARAETFGFLASFLTITS
HCCCCEEEEEEECCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LFILFADLFPKRLGMVNSERLAVVVARPMAILMAVLRPVVWLYSRAADLLFRVLGLSSLR
HHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
DDRITSDDILAMMEAGTRAGVLAAREQQVIENVFELDSRPVSSAMSPRDRIAFFLRDDPD
CCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCHHHEEEEEECCHH
QLIRARIAAEPFSTYPVCEGDIDHVVGYVDAKDLFQRVLNNQPISLKDEGLVRKVLIVPD
HHHHHHHHCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHCCCCCCCCCCCHHEEEEECCC
RLSLAEVLDQFRQVHEDFAVIVNEYSLVVGVVTLNDVMSTVMGDLIGPDDEEQIVRRDEN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCC
SWLIDGVTPVGDVLRALHLDELPHAGEYETLAGFLMVMLRRVPRRTDSVNWGGYKFEVLD
CEEEECCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEE
VDSYRIDQIMVSRLQEGGAHPAGPGPAPAPGG
CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
>Mature Secondary Structure
AAPARPMTLSQSFFLIGLLIVASAFFSVAEISLAASRRLRLRQLADEGEARAESVMRMQ
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EQPGDYFTVVQVGQNAVAILGGIVGEGALSPQFTALLQFWLSDARAETFGFLASFLTITS
HCCCCEEEEEEECCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LFILFADLFPKRLGMVNSERLAVVVARPMAILMAVLRPVVWLYSRAADLLFRVLGLSSLR
HHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
DDRITSDDILAMMEAGTRAGVLAAREQQVIENVFELDSRPVSSAMSPRDRIAFFLRDDPD
CCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCHHHEEEEEECCHH
QLIRARIAAEPFSTYPVCEGDIDHVVGYVDAKDLFQRVLNNQPISLKDEGLVRKVLIVPD
HHHHHHHHCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHCCCCCCCCCCCHHEEEEECCC
RLSLAEVLDQFRQVHEDFAVIVNEYSLVVGVVTLNDVMSTVMGDLIGPDDEEQIVRRDEN
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCC
SWLIDGVTPVGDVLRALHLDELPHAGEYETLAGFLMVMLRRVPRRTDSVNWGGYKFEVLD
CEEEECCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEE
VDSYRIDQIMVSRLQEGGAHPAGPGPAPAPGG
CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]