| Definition | Burkholderia sp. 383 chromosome 1, complete genome. |
|---|---|
| Accession | NC_007510 |
| Length | 3,694,126 |
The map label for this gene is 78065415
Identifier: 78065415
GI number: 78065415
Start: 862474
End: 865299
Strand: Direct
Name: 78065415
Synonym: Bcep18194_A3941
Alternate gene names: NA
Gene position: 862474-865299 (Clockwise)
Preceding gene: 78065414
Following gene: 78065416
Centisome position: 23.35
GC content: 71.73
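The coordinate fields above can be cross-checked against one another. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the chromosome length, and that the coordinates are 1-based and inclusive; the record does not state either convention. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / chromosome_length, 2)


def coding_residues(length_in_bases: int) -> int:
    """Number of amino acids encoded, assuming the final codon is a stop codon."""
    return length_in_bases // 3 - 1


# Values taken from the record: Start, End, and chromosome Length.
start, end, chromosome_length = 862474, 865299, 3694126

length = gene_length(start, end)
print(length)                                # gene length in bases
print(centisome(start, chromosome_length))   # centisome position
print(coding_residues(length))               # translated residues
```

Under these assumptions the results agree with the 2826-base gene sequence, the centisome position of 23.35, and the 941 translated residues listed in this record.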
Gene sequence:
>2826_bases ATGGGACACAATCTGAAATTGACGGGTGTGGCGCTGTCGGTCGCGACGGTGTTCGGGGTACTGGCTTCGGGTTCCGCGAT GGCGGGGGCCCTCGATGCGCTGCCGATTCCGCAAGTAATCGTCAACCCGCCGACGAACAGCGTGTCGGTCGGGTTGGTCG CGACGGGTACGTCGCCGCTGGGCTCGGTCACAGTGGCGGCAGGCGGGGCGGGCGCGATCCAGACGTCGCTTGGCGACCCC GGCCAGGTGCTGTCGGGTGCCGTGGGGGCGGTGACGGGCGCACTCGGCGGTGGCGGCGGCACGGTACAGCCGCTCGCACC GGTCCAGGGCGTGCTGAATCAGGTGACGGGCGCGCTGAGCGGCGGCAACCCGGCGGGTGCGTTGACCGGTGCGCTGAATA CGGCGACGGGGACGCTGAGCAATGCGGTTGGCACGGTCACGGGTGCACTGGGTGGTATCGGCGGCGGGTCGAACCCGCTG GCGCCGGTTCAAGGTGTGGTCAATCAAGTAACGGGTGCGCTCGGTGGTGGCAACCCGGCTGGTGCGTTGACGGGTGCGCT TGGCACGGTCACGGGTGCACTGGGCGGTATCGGCGGCGGGTCGAACCCGCTGGCGCCGGTTCAAGGTGTCGTGAATCAAG TGACGGGCGCGCTAGGTAGCGGCAACCCGGCAGGCGCACTGACGGGCGCACTCGGCACGGTCACGGGCGCACTGGGCGGT ATCGGCGGTGGCTCGAACCCGCTGGCGCCGGTTCAAGGTGTGGTCAATCAAGTAACGGGCGCGCTTGGTAGCGGCAACCC GGCAGGCGCACTGACCGGTGCACTTGGCACGGTCACGGGTGCACTGGGCGGTATCGGCGGCGGGTCGAGCCCGCTGGCGC CGGTACAAGGCGTCGTGAACCAAGTGACGGGCGCGCTGGGTAGCGGTAACCCGGCAGGCGCACTGACCGGCGCACTTGGC ACGGTCACGGGTGCACTGGGTGGTATCGGCGGCGGGTCGAACCCGCTGGCGCCGGTTCAAGGCGTCGTGAACCAAGTGAC GGGCGCACTGGGTAGCGGTAACCCGGCAGGCGCACTGACCGGCGCACTCGGCACGGTGACCGGCGCGCTGGGTAATGTCG GCGGCGGTTCGTACCCGCTTGGGGCGGTTCAAGGCGTCGTCAATCAAGTCACGACGGCGCTCAATAACGGCAATCCGGCT AGTGCATTGACGGGCGCACTCGGTACGGTGGCGGGCGAGCTCGGTGCGGGCAATCCGGCAGGTACGGTGACTGGTGCGAT CGGCAACGTAACCGGGGCACTGGGCGCATTGGGCGCCATTGGTGGCGGCGCGGATCCGCTGGCGCCGGTCCAGGGCGTCG TGAATCAAGTAACGGGCGCGCTCGGCAGCGGCAACCCGGCAGGCGCACTGACCGGCGCACTCGGCACGGTCACGGGTGCA CTGGGCGGCATCGGCGGCGGTTCGAACCCGCTGGCGCCGGTCCAAGGCGTCGTGAACCAAGTGACGGGCGCACTGGGCAG CGGCAACCCGGCAGGCGCACTGACCGGCGCACTCGGCACGGTCACGGGTGCACTGGGCGGCATCGGCGGCGGTTCGAACC CGCTGGCGCCGGTCCAAGGCGTCGTGAACCAAGTGACGGGCGCACTGGGCAGCGGCAACCCGGCCGGCGCACTGACGGGC GCACTCGGCACGGTCACGGGTGCACTGGGTGGTATCGGCGGCGGCTCGAGCCCGCTGGCGCCGGTCCAAGGCGTCGTGAA CCAGGTCACGGGCGCACTGGGCGGCATCGGCGGCGGTTCGAACCCGCTGGCGCCGATCCAGAGCGTCGTCGATCAGGTCA CGGGTACGCTCGGCAGCGGCAACCCGGCCGGCGCGCTGAGCAACGCGGTCAACACGATCACGGGCACGCTCGGCAACGTG 
GGTGGAGCGGGGAGCCCGCTGGCGCCGGTGCAAGGTGTCGTCACGCAGCTCGCCGGCACGCTCGGCGGCTCGAATCCGCT GGCGCCGGTGCAAGGCGTCGTGAACCAGGTCGTCGGCACGCTGTCCGGCGCCGGTGGCGGCAGCCCGATCGCGCCGATCA CGAACCTCGTCAACGGCCTGCAGAACGCGCTGCCGACGGGCGGCAACGCAGCCGGCGCACTGACCGGCGCACTCGGCTCG GTGACGGGCGCACTCGGCAACCTCGGCGGCTCGAACCCGCTGGCGCCGGTGCAGGGCGTCGTGAATCAGGTCGTCGGCAC GCTCAGCAACAACAATGCGGTCGGCACGGCCACGAACGCACTGGGTAACGCGGTCGGCTCCGTCGTGGGCGCACTCGGCA ACCTCGGGGCCTCGAACCCGCTGGCGCCGGTGCAAGGCGTCGTGAACCAGGTCGTCGGCACGTTGTCGGGCGCAGCGGGC AACAACCCGATCGCCCCGATCACGAACCTCGTGAGCGGCCTGACGGGCGGCAGCAACCCGGCCGGCGCACTGACCGGTGC GCTCGGTTCGGTGACGGGCGCGCTCGCGAACGGTCCGGTGGCCCTCGGGCAGGCGGCCGGCGCGCTGTCGGGCGCGGCTG GTTCGACGGCAGCAGCGGGCGGCAGCCTGCTCGGCTCGGGCGCGAACGCGGCCGGCGGCACGGCAGGCGCAGTCGGCTCG CTGCTGTCGACGGGCGCCAACGCGACGGCAACGGTCGTCAACGCGGTCGGCACGACGGTGGGCACGGCACTCGGCTCCGC GCCGGGCCTGTCGGTCACGCCGCACTCGGGCAACAGCGCGCCGGGCAACCCGCTCGCACCGGTGTCGTCGCTGCTCCAGT CGCTGACGGGCGCACTGCCGAAGTAA
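A GC content figure such as the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, not the database's own method: it assumes GC content is the percentage of G and C bases among all bases, and it strips the spaces and line breaks that appear in the sequence as displayed here. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring any whitespace in the input."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy input only; paste the gene sequence (without the '>2826_bases' header)
# to compare against the GC content value in this record.
print(gc_content("GG CC\nAT"))
```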
Upstream 100 bases:
>100_bases TAGGCAGACGAATCAATCCCGTTTTTGGAATTGGCACACTTCGTGCAATCCGTCCCCATGCTTTATCAACGTTGTGCCAA CCGAGATGGGGAAAATAAAG
Downstream 100 bases:
>100_bases CCGGCCGCACACGCATGGCGGTGCCGGATGGGCCGCCGTGCGCAACGGGTCGCGCAACGCGGCATGACAGCGATGTCATG CCGCGTTGTCTTTGGGGGCC
Product: hypothetical protein
Products: NA
Alternate protein names: Collagen Alpha 2(I) Chain; Glycine-Rich Surface Protein; Signal Peptide Protein; Hemagglutinin-Related Transmembrane Protein; Surface-Exposed Adhesin Protein
Number of amino acids: Translated: 941; Mature: 940
Protein sequence:
>941_residues MGHNLKLTGVALSVATVFGVLASGSAMAGALDALPIPQVIVNPPTNSVSVGLVATGTSPLGSVTVAAGGAGAIQTSLGDP GQVLSGAVGAVTGALGGGGGTVQPLAPVQGVLNQVTGALSGGNPAGALTGALNTATGTLSNAVGTVTGALGGIGGGSNPL APVQGVVNQVTGALGGGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGG IGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGSGNPAGALTGALG TVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGNVGGGSYPLGAVQGVVNQVTTALNNGNPA SALTGALGTVAGELGAGNPAGTVTGAIGNVTGALGALGAIGGGADPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGA LGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTG ALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGGIGGGSNPLAPIQSVVDQVTGTLGSGNPAGALSNAVNTITGTLGNV GGAGSPLAPVQGVVTQLAGTLGGSNPLAPVQGVVNQVVGTLSGAGGGSPIAPITNLVNGLQNALPTGGNAAGALTGALGS VTGALGNLGGSNPLAPVQGVVNQVVGTLSNNNAVGTATNALGNAVGSVVGALGNLGASNPLAPVQGVVNQVVGTLSGAAG NNPIAPITNLVSGLTGGSNPAGALTGALGSVTGALANGPVALGQAAGALSGAAGSTAAAGGSLLGSGANAAGGTAGAVGS LLSTGANATATVVNAVGTTVGTALGSAPGLSVTPHSGNSAPGNPLAPVSSLLQSLTGALPK
Sequences:
>Translated_941_residues MGHNLKLTGVALSVATVFGVLASGSAMAGALDALPIPQVIVNPPTNSVSVGLVATGTSPLGSVTVAAGGAGAIQTSLGDP GQVLSGAVGAVTGALGGGGGTVQPLAPVQGVLNQVTGALSGGNPAGALTGALNTATGTLSNAVGTVTGALGGIGGGSNPL APVQGVVNQVTGALGGGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGG IGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGSGNPAGALTGALG TVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGNVGGGSYPLGAVQGVVNQVTTALNNGNPA SALTGALGTVAGELGAGNPAGTVTGAIGNVTGALGALGAIGGGADPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGA LGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTG ALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGGIGGGSNPLAPIQSVVDQVTGTLGSGNPAGALSNAVNTITGTLGNV GGAGSPLAPVQGVVTQLAGTLGGSNPLAPVQGVVNQVVGTLSGAGGGSPIAPITNLVNGLQNALPTGGNAAGALTGALGS VTGALGNLGGSNPLAPVQGVVNQVVGTLSNNNAVGTATNALGNAVGSVVGALGNLGASNPLAPVQGVVNQVVGTLSGAAG NNPIAPITNLVSGLTGGSNPAGALTGALGSVTGALANGPVALGQAAGALSGAAGSTAAAGGSLLGSGANAAGGTAGAVGS LLSTGANATATVVNAVGTTVGTALGSAPGLSVTPHSGNSAPGNPLAPVSSLLQSLTGALPK >Mature_940_residues GHNLKLTGVALSVATVFGVLASGSAMAGALDALPIPQVIVNPPTNSVSVGLVATGTSPLGSVTVAAGGAGAIQTSLGDPG QVLSGAVGAVTGALGGGGGTVQPLAPVQGVLNQVTGALSGGNPAGALTGALNTATGTLSNAVGTVTGALGGIGGGSNPLA PVQGVVNQVTGALGGGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGI GGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGSGNPAGALTGALGT VTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGNVGGGSYPLGAVQGVVNQVTTALNNGNPAS ALTGALGTVAGELGAGNPAGTVTGAIGNVTGALGALGAIGGGADPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGAL GGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGA LGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGGIGGGSNPLAPIQSVVDQVTGTLGSGNPAGALSNAVNTITGTLGNVG GAGSPLAPVQGVVTQLAGTLGGSNPLAPVQGVVNQVVGTLSGAGGGSPIAPITNLVNGLQNALPTGGNAAGALTGALGSV TGALGNLGGSNPLAPVQGVVNQVVGTLSNNNAVGTATNALGNAVGSVVGALGNLGASNPLAPVQGVVNQVVGTLSGAAGN NPIAPITNLVSGLTGGSNPAGALTGALGSVTGALANGPVALGQAAGALSGAAGSTAAAGGSLLGSGANAAGGTAGAVGSL LSTGANATATVVNAVGTTVGTALGSAPGLSVTPHSGNSAPGNPLAPVSSLLQSLTGALPK
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 83269; Mature: 83138
Theoretical pI: Translated: 4.58; Mature: 4.58
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
0.2 %Met (Translated Protein)
0.2 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
0.1 %Met (Mature Protein)
0.1 %Cys+Met (Mature Protein)
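The Cys/Met percentages are simple residue frequencies. The sketch below assumes each value is the count of the named residues divided by the protein length, rounded to one decimal place; the function name `residue_percent` is hypothetical and the rounding rule is an assumption.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    count = sum(1 for aa in sequence if aa in residues)
    return round(100.0 * count / len(sequence), 1)


# Toy input only; pass the translated or mature sequence from this record
# to compare against the values listed above.
print(residue_percent("MACM", "C"))    # cysteine
print(residue_percent("MACM", "M"))    # methionine
print(residue_percent("MACM", "CM"))   # cysteine plus methionine
```

For scale, two methionines in 941 residues would round to 0.2 %, and one in 940 residues to 0.1 %, which is consistent with the drop from 0.2 to 0.1 when the initiator methionine is removed from the mature protein.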
Secondary structure:
>Translated Secondary Structure MGHNLKLTGVALSVATVFGVLASGSAMAGALDALPIPQVIVNPPTNSVSVGLVATGTSPL CCCCEEEHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCEEECCCCCCEEEEEEEECCCCC GSVTVAAGGAGAIQTSLGDPGQVLSGAVGAVTGALGGGGGTVQPLAPVQGVLNQVTGALS CCEEEECCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCC GGNPAGALTGALNTATGTLSNAVGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGGGNPA CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCH GALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGG HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHC IGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVN CCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH QVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALT HHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHH GALGTVTGALGNVGGGSYPLGAVQGVVNQVTTALNNGNPASALTGALGTVAGELGAGNPA HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCC GTVTGAIGNVTGALGALGAIGGGADPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGA CCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH LGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQG HHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH VVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGGIGGGS HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC NPLAPIQSVVDQVTGTLGSGNPAGALSNAVNTITGTLGNVGGAGSPLAPVQGVVTQLAGT CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH LGGSNPLAPVQGVVNQVVGTLSGAGGGSPIAPITNLVNGLQNALPTGGNAAGALTGALGS CCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH VTGALGNLGGSNPLAPVQGVVNQVVGTLSNNNAVGTATNALGNAVGSVVGALGNLGASNP HHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC LAPVQGVVNQVVGTLSGAAGNNPIAPITNLVSGLTGGSNPAGALTGALGSVTGALANGPV CHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC ALGQAAGALSGAAGSTAAAGGSLLGSGANAAGGTAGAVGSLLSTGANATATVVNAVGTTV HHHHHHHHHCCCCCCCHHCCCHHHCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHH GTALGSAPGLSVTPHSGNSAPGNPLAPVSSLLQSLTGALPK HHHHCCCCCCEECCCCCCCCCCCCCHHHHHHHHHHHCCCCC >Mature Secondary Structure 
GHNLKLTGVALSVATVFGVLASGSAMAGALDALPIPQVIVNPPTNSVSVGLVATGTSPL CCCEEEHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCEEECCCCCCEEEEEEEECCCCC GSVTVAAGGAGAIQTSLGDPGQVLSGAVGAVTGALGGGGGTVQPLAPVQGVLNQVTGALS CCEEEECCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCC GGNPAGALTGALNTATGTLSNAVGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGGGNPA CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCH GALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGG HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHC IGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVN CCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH QVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALT HHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHH GALGTVTGALGNVGGGSYPLGAVQGVVNQVTTALNNGNPASALTGALGTVAGELGAGNPA HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCC GTVTGAIGNVTGALGALGAIGGGADPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGA CCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH LGGIGGGSNPLAPVQGVVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSNPLAPVQG HHCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH VVNQVTGALGSGNPAGALTGALGTVTGALGGIGGGSSPLAPVQGVVNQVTGALGGIGGGS HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC NPLAPIQSVVDQVTGTLGSGNPAGALSNAVNTITGTLGNVGGAGSPLAPVQGVVTQLAGT CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH LGGSNPLAPVQGVVNQVVGTLSGAGGGSPIAPITNLVNGLQNALPTGGNAAGALTGALGS CCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH VTGALGNLGGSNPLAPVQGVVNQVVGTLSNNNAVGTATNALGNAVGSVVGALGNLGASNP HHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC LAPVQGVVNQVVGTLSGAAGNNPIAPITNLVSGLTGGSNPAGALTGALGSVTGALANGPV CHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC ALGQAAGALSGAAGSTAAAGGSLLGSGANAAGGTAGAVGSLLSTGANATATVVNAVGTTV HHHHHHHHHCCCCCCCHHCCCHHHCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHH GTALGSAPGLSVTPHSGNSAPGNPLAPVSSLLQSLTGALPK HHHHCCCCCCEECCCCCCCCCCCCCHHHHHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA