Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is surf

Identifier: 159184176

GI number: 159184176

Start: 141629

End: 142459

Strand: Reverse

Name: surf

Synonym: Atu0138

Alternate gene names: NA

Gene position: 142459-141629 (Counterclockwise)
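As a consistency check on the coordinates above, the following sketch derives the gene length from Start and End and shows how a reverse-strand gene could be read out of a chromosome string. The helper names (`gene_length`, `reverse_complement`, `extract_gene`) are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# Assumption: coordinates are 1-based and inclusive, as is usual for such records.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1

def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]

def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    """Slice the forward-strand segment; flip it for reverse-strand genes."""
    segment = chromosome[start - 1:end]
    return reverse_complement(segment) if strand == "Reverse" else segment

length = gene_length(141629, 142459)
print(length)           # bases in the gene
print(length // 3 - 1)  # codons minus the stop codon
```

With the values in this record, the length works out to 831 bases, i.e. 277 codons, which agrees with the 276-residue product plus one stop codon.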

Preceding gene: 159184177

Following gene: 15887492

Centisome position: 5.01
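The centisome value appears to be the gene's 5' coordinate expressed as a percentage of the chromosome length. This is an inference from the numbers in this record, not a documented formula, and the function name `centisome` is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the replicon length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)

# Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
print(centisome(142459, 2841580))
```

Using coordinate 142459 reproduces the reported 5.01, whereas the lower coordinate 141629 would give 4.98.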

GC content: 61.73
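GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch, assuming the value above was computed over the 831-base gene sequence (the function name `gc_content` is hypothetical):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimal places."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)

print(gc_content("ATGC"))
```

Under that assumption, 61.73 over 831 bases corresponds to 513 G or C bases.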

Gene sequence:

>831_bases
ATGCGGGCGGAACGAGGCCCGCCGAAGCAGACGATCTCTCTGCCGGTTGCGCTTCGCAAGGACCGTGACATGAGCGAAAT
CGCACCCCAAAACCGGCGCATCCGCCCTGCGGCAATCGTCGTCCTGTTTTTGACCATTGCCCTGACGGGCTGCCTGCTGG
CGCTTGGAACATGGCAGGTTCAGCGCCTTTTCTGGAAACTAGACCTGATCGAGCGCGTCGACGTCCGCGCCCATGCCGAG
CCAGTCGATGCCCCTGCCGCCAGCGACTGGCCGGCCCTCGGCAATCCCTCGGATTACGAATATCGTCGCGTGAAACTGAC
CGGCACACTCCTTAACGACAGGGAAGTTCAGGTCTACACGGTCACCGATCTCGGGCCCGGCTACTGGGTCATGACCCCGC
TTCGGCGCGACGATGGTTCAAGCATCATCGTCAATCGCGGTTTCGTGCCCTCCGACCGGCGCGATCCATCCTCGCGCACG
GGAGGTGAACCGACCGGAAACGTCGAGATCGTCGGGCTGATGCGCGCGCCGGAAACCGGTGGGCTTTTCCTTAGAACCAA
CGACCCGGCAAATGGCCGCTGGTATTCCCGCAATATCCCTCAGATCACGCAGGCATCCGGGCTTTCAGACGTCGCGCCCT
TTTATGTCGATGCGGATGCGACACCCAATCCGGGTGGCCTGCCGGTTGGCGGCAAGACCATGCTCACTTTCCCCAACAAC
CACCTCTCCTACGCCGTGACATGGTATATTCTGGCGGCGATGGTGGTGGCGGCCGGCTGGTATGTGCTACGCAACCTCAA
CGCACCGAAATCGAAACGGGATGCTGATTGA
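The gene sequence above is given on the coding strand (it begins with ATG and ends with TGA), so it can be translated directly. The sketch below uses the standard codon assignments in TCAG order; `translate` is a hypothetical helper, and alternative bacterial start codons are not handled specially.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Build codon -> amino acid map; product() yields TTT, TTC, TTA, TTG, TCT, ...
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 30 bases of the gene sequence in this record.
print(translate("ATGCGGGCGGAACGAGGCCCGCCGAAGCAG"))
print(translate("GCACCGAAATCGAAACGGGATGCTGATTGA"))
```

The two fragments give the N-terminal and C-terminal residues listed in the protein sequence of this record.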

Upstream 100 bases:

>100_bases
AATCAGCGGCCTGCCGGAAGCCGGCAGGCCGCTTTCCGCGCCAACACATAACCCCATGAATCCTGGCGCGATTTATGGGG
TTATTTGCTGGAGGAAGCGC

Downstream 100 bases:

>100_bases
ACCGCTGGTTTTACCGGATGGTTCATAAAAGCGCTTCCTTCTTCACGTTTTGCTGATCTATAACGACAGTTCGGCCTGAC
ATTGCCCCGTCGGCGCAATT

Product: surfeit 1

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 276; Mature: 276

Protein sequence:

>276_residues
MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQVQRLFWKLDLIERVDVRAHAE
PVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYTVTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRT
GGEPTGNVEIVGLMRAPETGGLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN
HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD

Sequences:

>Translated_276_residues
MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQVQRLFWKLDLIERVDVRAHAE
PVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYTVTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRT
GGEPTGNVEIVGLMRAPETGGLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN
HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD
>Mature_276_residues
MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQVQRLFWKLDLIERVDVRAHAE
PVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYTVTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRT
GGEPTGNVEIVGLMRAPETGGLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN
HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD

Specific function: Unknown

COG id: COG3346

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the SURF1 family [H]

Homologues:

Organism=Homo sapiens, GI4507319, Length=229, Percent_Identity=36.2445414847162, Blast_Score=131, Evalue=6e-31,
Organism=Caenorhabditis elegans, GI17553856, Length=238, Percent_Identity=31.5126050420168, Blast_Score=107, Evalue=5e-24,
Organism=Saccharomyces cerevisiae, GI6321550, Length=179, Percent_Identity=31.8435754189944, Blast_Score=82, Evalue=6e-17,
Organism=Drosophila melanogaster, GI17864366, Length=280, Percent_Identity=29.2857142857143, Blast_Score=108, Evalue=4e-24,
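The homologue lines above are comma-separated `key=value` fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch, assuming that layout holds for every line (`parse_hit` is a hypothetical helper):

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for item in line.strip().rstrip(",").split(","):
        item = item.strip()
        if "=" in item:
            key, value = item.split("=", 1)
            record[key] = value
        elif item.startswith("GI"):
            record["GI"] = item[2:]  # bare token such as GI4507319
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

lines = [
    "Organism=Homo sapiens, GI4507319, Length=229, Percent_Identity=36.2445414847162, Blast_Score=131, Evalue=6e-31,",
    "Organism=Caenorhabditis elegans, GI17553856, Length=238, Percent_Identity=31.5126050420168, Blast_Score=107, Evalue=5e-24,",
    "Organism=Saccharomyces cerevisiae, GI6321550, Length=179, Percent_Identity=31.8435754189944, Blast_Score=82, Evalue=6e-17,",
    "Organism=Drosophila melanogaster, GI17864366, Length=280, Percent_Identity=29.2857142857143, Blast_Score=108, Evalue=4e-24,",
]
hits = sorted((parse_hit(l) for l in lines), key=lambda h: h["Evalue"])
for hit in hits:
    print(hit["Organism"], hit["Evalue"], round(hit["Percent_Identity"], 1))
```

Sorting by E-value puts the most significant match first.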

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002994 [H]

Pfam domain/function: PF02104 SURF1 [H]

EC number: NA

Molecular weight: Translated: 30438; Mature: 30438

Theoretical pI: Translated: 9.51; Mature: 9.51

Prosite motif: PS50895 SURF1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
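The percentages above can be reproduced by counting residues in the 276-residue protein sequence from this record. A minimal sketch, assuming the values are rounded to one decimal place (`percent` is a hypothetical helper):

```python
PROTEIN = (
    "MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQVQRLFWKLDLIERVDVRAHAE"
    "PVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYTVTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRT"
    "GGEPTGNVEIVGLMRAPETGGLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN"
    "HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD"
)

def percent(residues: str, seq: str) -> float:
    """Share of the given one-letter residues in seq, as a percentage."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)

print(percent("C", PROTEIN), percent("M", PROTEIN), percent("CM", PROTEIN))
```

Because the translated and mature sequences are identical in this record, both sets of figures coincide.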

Secondary structure:

>Translated Secondary Structure
MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQV
CCCCCCCCCCEEEEEEEECCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
QRLFWKLDLIERVDVRAHAEPVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYT
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCEEEEEE
VTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRTGGEPTGNVEIVGLMRAPETG
EEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCC
GLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN
CEEEEECCCCCCEEEECCCCCHHHHCCCCCCCCEEEECCCCCCCCCCCCCCCEEEECCCC
HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD
CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
>Mature Secondary Structure
MRAERGPPKQTISLPVALRKDRDMSEIAPQNRRIRPAAIVVLFLTIALTGCLLALGTWQV
CCCCCCCCCCEEEEEEEECCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
QRLFWKLDLIERVDVRAHAEPVDAPAASDWPALGNPSDYEYRRVKLTGTLLNDREVQVYT
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCEEEEEE
VTDLGPGYWVMTPLRRDDGSSIIVNRGFVPSDRRDPSSRTGGEPTGNVEIVGLMRAPETG
EEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCC
GLFLRTNDPANGRWYSRNIPQITQASGLSDVAPFYVDADATPNPGGLPVGGKTMLTFPNN
CEEEEECCCCCCEEEECCCCCHHHHCCCCCCCCEEEECCCCCCCCCCCCCCCEEEECCCC
HLSYAVTWYILAAMVVAAGWYVLRNLNAPKSKRDAD
CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
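In the prediction strings above, each residue is assigned one of three states, conventionally read as H (helix), E (strand), and C (coil). The sketch below summarises such a string as percentages; the state meanings and the helper name `state_composition` are assumptions, since the record does not define them.

```python
from collections import Counter

def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state, rounded to one decimal."""
    counts = Counter(states)
    total = len(states)
    return {state: round(100.0 * counts[state] / total, 1) for state in "HEC"}

# First 60 predicted states from the record above.
first_row = "CCCCCCCCCCEEEEEEEECCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH"
print(state_composition(first_row))
```

Concatenating all prediction rows before calling the function would give the composition for the whole protein.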

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 11557893 [H]