Definition | Opitutus terrae PB90-1 chromosome, complete genome. |
---|---|
Accession | NC_010571 |
Length | 5,957,605 |
Click here to switch to the map view.
The map label for this gene is Not Available
Identifier: 182415484
GI number:
Start: 4769841
End: 4776353
Strand: Direct
Name: Not Available
Synonym: Oter_3673
Alternate gene names: NA
Gene position: NA
Preceding gene: 182415483
Following gene: 182415485
Centisome position: NA
GC content: NA
Gene sequence:
>6513_bases ATGAACTACCGCCTGCTCGCCAGCTTGGTGACCTCCCTGCTGCTCGCCGTCACCGCTTCCGCTTACAATTTTTCGATCAG CACGCTGCACGGCGTCTCGTTCGCCAGCGGCGTAACCTCGCTGCAATTCGGCCCGGATCAGCGGCTCTACGCGGCACAGG TGAGCGGCGAGATCCTGGTGATGACCGTGCAGCGACTTGGGCCGAACAACTACCAGGTCACGGCGACCGAGACGATCAAT CTCGTCCGCGACATTCCGAACTACAACGATGATGGTACCCGGGCCGCCACGCTGACGACGCGCCAGGTGACGGGTATCCT GGTGGTGGGCACCGCGAGCAACCCGGTGATCTACGTGACCTCCAGCGACCCCCGGATCGGGGGTGGCAGCGGCGGCGTCA ACGATCTCAACCTCGACACGAACTCGGGCATCATTTCCCGACTGACCCGTACGGGGGGCGTCTGGTCCAAGGTTGATCTC GTCCGCGGGTTGCCGCGGTCGGAAGAGAATCATGCCAGCAACGGCATGCAGCTCGATGCGGCGACCAACACGCTCTACCT CGCGCAGGGCGGCAACACCAATGCCGGTGGCCCCTCGAACAATTTCGCGTTCATGTGCGAAACCGCGCTCTCGGCCGCGA TTCTTTCCATCGATCTCGACGCGATCAACGCGATGCCCGTCCAGACCGACGCCTACGGGCAGCAATACCTCTACAACCTG CCGACCGTCGACGACCCGAATCCCAGCCGGGGCCAGAACCCCGACGGCAGCGACGTCAACGATCCGTTCGGCGGCAACGA CGGCCTGAACCAGGCGCGGCTCGTCGCGGACGGCCCGGTGCAGGTCTACTCGTCCGGCTACCGCAACCCCTACGACCTCG TCATCACGCGCACGCCCGGCGCGGAGGGGCGGATGTACGCGTGGGACAACGGCGCCAACAACGGCTGGGGCGGCTACCCC AAGAACGAAGGCCCCGAGGGCAACGTCACCAACGAATACGTCGACGGCGAGCCCGGCACGGTGAACAACAAGGACAACCT GCACGTGATCACCGCCGGCTACTACGCCGGGCACCCGAATCCGATCCGCGCGAACCCCGCGGGCGCCGGTTGGTTCCACT ACGATTCCACGCAGCCTGCCGGTTCCGAAAAGATTTTCTCGCTTACGCCGACCACGGATTGGCCGCCAGTGCCGGTGGCG ATGGCCAATCCGGTCGAAGGCGATTTTCGGCAGCCGGGCGTGAACGACGGCGCATTGATGACCTACACGGCCTCCACGAA TGGCCTCGCCGAATACATCGCGACCAATTTCTCCGGTGAAATGCGCGGCAACCTCATCGCCGCGAGCTACGACGGCAAGC TGCTCCGCATCGCGCTCAGCGCCGACGGCACGACCGTGACCAACGGCGTCGAGCCGCTCGCCAGCGGTTTCGGCAACCTC CCGCTCGACGTCACCACGCCCGACCCGATCAACGGCGCGGCGTTCGTCGGCACCATTTGGGTCAGCCATTACGCGCCCGC GAAAATCAGCGTGCTCGAGCCGAGCGACTTCGACTCCCCCGGCAGCGACACGTGCACGGGCATCAACTCCGCCGCGGTCG ACGAGGATGGCGACGGCTACAGCAACGCCGACGAGATCGCCAACGACTCCGATCCGTGCTCGGCCGCGATCCGACCGGCG GATGTCGATGGCGACCACATTTCCGACCAGCTCGACACCGACGACGACAACGATGGCGTGCCCGACACGCAGGACCCCTT CCCGATCGATCCGCTCGACGGCAAGAGCGTGCCGCTGCCGCTGCGCTACGATCTCTTCAATGAGACGGGCATCGGCTTCT TCGGCGTCGGCATGACCGGCCTGATGATGAACCCCGGCCAGGACTACCTGCCGTTGATCAGTCCGGACAACGCGATCGCC GGCGGCACCGCGGGGTTGTTCACGCTCGCCGAGGTCGGCCCCGGCCGCGCGCTCGGGTCCAGCAACAACCAGAAGGACGC CTACCAGTTCGCCTTCAACTCCGACGAATTCACCACGCCGTACATCATCTCCTCGCGCCTTGGCGGACCGTTCTTCAACA GCAGTCCAACCGGTCAGCAGGCGCAGGGCATCTATCTCGGCAATGGCGATCAGGACAACTACGTCTCCGTCGCGATTCAC GGCAACGAGGGCGCGGGCGGGCTGCAGGTCGTCTATGAAAACGACGGCACGATCGTATCGCAGGACCTCTATCCGCTCGC CGGGCTCGCGGACCTCACGACGATCGATCTGTATTTCACCGTCGATCCCGTCGCCGCCACGGTGCAGCCCGGCTATCGGC TCAGCGAAAGCGATCCGGTCACCGCGTTGGGCTCGCCGATTGCCGTCGGCGGCGAGGTGCTGGCCGCGCTGATGGGCGCG CGCCCGCTCGCCATCGGATTCTTCGCCACCACCGGCGGCGGCGGCGCACCGACGTTCTCTGCGACGTGGGACTACTTCGA CATCGCGCCGATCAACAACACCGCGATCGCCAAGTTCACCATCGATCCGCCGGTCACCGACATGGCGACCGCCAGCACCT ACACGGCCGGAGCCTTCAAGATTCAGAACAACTCCACTGACGGGCAGCAAATCGAGAGCGTTTCCATCGATCTGAGCACG GCGATCTTCCCCGACATGGTGTTCGATCCGGCCGGCACCGCCGGCGATCCGGACCACAAGGCTTTCCAGCCTAACACCTA TGCCGGCGGCGCAGCCGGCGCGGTCGGCACGAACACCAAACCGCACAACGGCACGAGCGGCGAGGACGGCTACGATGGGC TCGACATCACGTTCGGCGCCTTCCCGCCCGGCGGCTCACTCGCCTTCTCGATCGATGTCGATCCAAACAACGTGAAAGGT GTCGCCGCGCCCGGGGAGAACCACGCCGCCAGCGTCTCCGGCCTCGAACTCATCGGCAGCACCGTCGAGGTGTTCTTCAG CGACGGCAGCGTCCAGCGCTCGCGGCTCGGCCGGCTCGAGAACACGCAGGATGGCTCCTATGCGTGGCTCCGCTCCGAAA AGCCGCCGAAGCCGGGACTCACCGCGCTCGGCAAGTCTTCGCCGTTCACCACCGGTGAACCGGAAACCCTTCGCGTCGCC GGGCCGACCGGCTTCAATGTCACGCTGACCCGCATCGAAGGCGCGCTCTACCTCGGTGGCGTGCCCGGCGGTGGCTACGA TATCGATCCGTTCGAAGCCAACACCGCCATCGACATCGCCGAGCAGACCGGCGTGATCCCCGCGGCAGGTTACCTCGATT TCACCGTCACCGGCACCAAGTCGAACGACGACGGCGGCTACAACTACGTCACCGCCGTCCTGAGCAACAGCTCCGGCATC AAGGGTCCGGCGTCGACTCCGGTCGTGTTCATGTTCGACCCCAGCCTCAGCCCCGACACCCAGCCGCCCACCCAGCCGGG CGCACTCACGTTCAGCAACGTGACGCCCAACAGTCTCACGGTTACGTGGACCGCCTCGACGGACAACGTCGGTGTCACCG GTTATCGCGTTTCGCGCGACGGCGCGTTGATCGCGACGGTCACCGGCCTCTCCTACAACGACTCGTCGCTCGCGCCGGCG ACCACCTATGACTACTCGGTGGTCGCTATCGACAGCGCGGGCAACCTCTCGGCCCCACGCGCCGCGAGCGTCACCACGCT CGAGGCGAGCGCGAACACGGTGGTTCGGATCAACTGCGGCGGCCCGACTTACCTCGACGCGACCGGCAACACTTGGATCG CTGACACCTACTACAACACCGGCTTTGCCTCGACGGATTCCGCGACGGTGACGGGCACCACCCTCGGCCAGCTGTTCAAA AGCTACCGCTGGGACGACACGCCGTCACCGGAACTTCTGTACACCATCCCGATCGCGAATGGTTCCTATGTCGTCCGACT TTATCTCGCGGAAACGTCCTCGAGCGCGAAGGCGCCGGGCAAGCGCGTCTTCGACGTCGATATCGAGGGCACGCGCGCGT TCGAGGATGTCGATGTCTACGTCATGGCCGGCGGCGGCAACAAGGCGCTGATCCTCGAGGCCGCGACGACGGTCAATGAC GGCAACGTGCAGATCAAATTCCTCCACCAGGTCTTCCATCCGCGCATCTACGCGATCGAGGTGCTGCCGACCTCGGGCCC GAGCGACACCGAACCGCCCACCCAGCCCGGCGCGATCACCTTCAGCGGAGTGACTTCGAGCAGCGTGACACTGGACTGGA CCGGCTCGACCGACGACGTGGGCGTCACCGGTTATCGGATCTCGCGCGACGGCTCGGTCCTCACGACGGTGAGCAGCCTG ACCTTCACCGACTCGACGGTGAGTCCGGAAACCGCCTACGACTACTCCGTCGTGGCGCTCGATGCGGCGGGCAACGCCTC GACCGCGAGCACCGCATCGGTCACGACCCTGGACGCGCCGCCGCCCGACGACACGCAACCGCCGAGCCAGCCCGGCGTGC TCTCGTTCACTGGCGTGACGACGAGCAGCGTCACCGTGAACTGGGCGGCATCGACCGACAATGTCGGCGTGGCCGGCTAC CGGATTTCGCGCAACGGCGTGGAACTGACCACGGTCACGGGCCTGACGTTCACCGACTCGACCGTCGTCGCGAACACGAC CTACGACTACTCGGTCGTGGCCTTCGACGCGGCGAACAACACCTCGCCCGAGCGCACCGGCAGCGTGGCCACGCCGCCGA GCGGCGGCGCCAGCACCGTGATCCGCGTCAACGCAGGCGGCTCGAGCTACGTCGACGGCGCCGGGAACACCTGGTCGGCG GACACTGGCTACAACACCGGCGGCACCTATTCGGTCTCCTCTTCGACGGCCATCGCCGGCACGACCGATGACCCGCTGTT CCGCACAGAGCGCTATGATGCCTCGACCGCACCGGAGCTCACCTACTCGTTCGCCGTGCCGAACGGAAACTACCTTGTCC GTCTGCTCTTCGCCGAGAACTACGGCAGCGCCAAGGGCGTCGGCAAACGCGTCTTCGACGTGGATATCGAAGGCGCCCGG GCCTTCGAAGATGTGGACATCTATGCGCAGGCGGGCGGCGGCAACAAGGCCCTGATTCTCGAGCACACGACCGCCGTCAC GGACGGTCAGCTCAACATCGGCTTCGTGCACCAGGTCCAGAATCCGAAGGTCAACGCGATCGAGATCCTTTCGGTCGACG CGCCGGCGGACACGCAACCGCCCACGCAGCCCGGCGCGATCACGTTCAGTGACGTGAGCTGGGACAGCGTCACGCTCAAT TGGGGTGCGTCGACCGACAACATCGGCGTCGCAGGATATCGGATTTCCCGCGACGGAACCGAGCTCGCCACGGTCAATGC GCTCACGTTCACCGACTCGACGGTCGCAGCGCAGACCGACTACGACTACACCGTTGTCGCGCTCGATGCGGCCGGCAACG AATCCACCGCGCGCAACGCTTCCGTCACCACCACCGCCTCGCCCGATACGCAGCCGCCGTCCGCGCCGGGCGCGCTCGTC TTCTCCGACGTCACGGCGAGCAGTCTCACCGTGAGCTGGACGGCTGCGACCGACAACGTCGAGGTAACGGGCTACCGCGT GTCGCGCGACGGCGTGCAACTGGCGACCGTCACGACGCTGTCGTTCGACGACACCGGGCTCTCCGCCGCCACGTCCTACA GTTACTCGGTCGAAGCGCTGGACGCCGCCGGCAACGCGTCGCCGGCGAGCACCGCCTCGGTCACGACGTCCACGGCGGCC GATGCGCAGCCGCCTTCCGCGCCGGGCGCGCTTGTGTTCTCCAACGTGACCGCGAGCAGCCTGACCGTGAGTTGGACGGC TGCGACCGACAACGTCGAGGTAACGGGCTACCGCGTGTCGCGCGACGGCGTGCAACTGGCGACCGTCACGACGCTGTCGT TCGACGACACCGGACTCTCCGCCGCGACGTCCTACAGCTACTCGGTCGAAGCGCTGGACGCCGCCGGCAATGCCTCGGTG GCGAGCACTGCGACGATATCGACCGCGTCGAGCGGGCCGACGCTACCGACGATTCGCGTGAACGCCGGTGGCAGCAGCTA TGTCGACAGCGCCGGCAACACGTGGTCGGCCGACCACGGCTATAACACCGGCGGCAAATATTCCGTCTCGGGCTCGACGA CGATCACGAACACGAGCGACCCGACGCTCTTCCGCAGCGAGCGCTACGACGCGAGCACCTCGCCGGAGCTCACCTATTCG TTCACCGTGCCGAACGGCACGTACGTCGTGCGGCTCTACTTTGCGGAAAACTATGCCAGCGCGAAAGGCGCCGGGCTGCG CGTGTTTGACATCGACATCGAGGGCGCACGCGCCTTCGAGGATGTCGACATCTTCGTGCAAGCCGGCGGAGCCAACCGAG CGATGATGTTGGAGAACACCGTCACGGTCACCGATGGACAACTGAACATCGGTTTCGTGCACCAGGTGCAAAATCCGAAG ATCAACGCGATCGAGATCCTGCCCGCGCCGTAG
Upstream 100 bases:
>100_bases ACCCCGGAGGTGTCCTCGGGGTTTTGCTGGGAGCGTCCCCGTTTTCGCTCCGCCCGCGGAGAGTCCTGTCCAGCAAGTTT CTTCCGATCGCTTTTCACCT
Downstream 100 bases:
>100_bases CGTTCGCTCGTCGTCAAAAGGTGCAGCCCGCCGTCCCCGGCGGGCTGCTCTTTCGGCTCGCTTGTAGCCGGCCTCGCTGA GGCCGGCAGCATGGAGCCGC
Product: fibronectin type III domain-containing protein
Products: NA
Alternate protein names: NA
Number of amino acids: NA
Protein sequence:
>2170_residues MNYRLLASLVTSLLLAVTASAYNFSISTLHGVSFASGVTSLQFGPDQRLYAAQVSGEILVMTVQRLGPNNYQVTATETIN LVRDIPNYNDDGTRAATLTTRQVTGILVVGTASNPVIYVTSSDPRIGGGSGGVNDLNLDTNSGIISRLTRTGGVWSKVDL VRGLPRSEENHASNGMQLDAATNTLYLAQGGNTNAGGPSNNFAFMCETALSAAILSIDLDAINAMPVQTDAYGQQYLYNL PTVDDPNPSRGQNPDGSDVNDPFGGNDGLNQARLVADGPVQVYSSGYRNPYDLVITRTPGAEGRMYAWDNGANNGWGGYP KNEGPEGNVTNEYVDGEPGTVNNKDNLHVITAGYYAGHPNPIRANPAGAGWFHYDSTQPAGSEKIFSLTPTTDWPPVPVA MANPVEGDFRQPGVNDGALMTYTASTNGLAEYIATNFSGEMRGNLIAASYDGKLLRIALSADGTTVTNGVEPLASGFGNL PLDVTTPDPINGAAFVGTIWVSHYAPAKISVLEPSDFDSPGSDTCTGINSAAVDEDGDGYSNADEIANDSDPCSAAIRPA DVDGDHISDQLDTDDDNDGVPDTQDPFPIDPLDGKSVPLPLRYDLFNETGIGFFGVGMTGLMMNPGQDYLPLISPDNAIA GGTAGLFTLAEVGPGRALGSSNNQKDAYQFAFNSDEFTTPYIISSRLGGPFFNSSPTGQQAQGIYLGNGDQDNYVSVAIH GNEGAGGLQVVYENDGTIVSQDLYPLAGLADLTTIDLYFTVDPVAATVQPGYRLSESDPVTALGSPIAVGGEVLAALMGA RPLAIGFFATTGGGGAPTFSATWDYFDIAPINNTAIAKFTIDPPVTDMATASTYTAGAFKIQNNSTDGQQIESVSIDLST AIFPDMVFDPAGTAGDPDHKAFQPNTYAGGAAGAVGTNTKPHNGTSGEDGYDGLDITFGAFPPGGSLAFSIDVDPNNVKG VAAPGENHAASVSGLELIGSTVEVFFSDGSVQRSRLGRLENTQDGSYAWLRSEKPPKPGLTALGKSSPFTTGEPETLRVA GPTGFNVTLTRIEGALYLGGVPGGGYDIDPFEANTAIDIAEQTGVIPAAGYLDFTVTGTKSNDDGGYNYVTAVLSNSSGI KGPASTPVVFMFDPSLSPDTQPPTQPGALTFSNVTPNSLTVTWTASTDNVGVTGYRVSRDGALIATVTGLSYNDSSLAPA TTYDYSVVAIDSAGNLSAPRAASVTTLEASANTVVRINCGGPTYLDATGNTWIADTYYNTGFASTDSATVTGTTLGQLFK SYRWDDTPSPELLYTIPIANGSYVVRLYLAETSSSAKAPGKRVFDVDIEGTRAFEDVDVYVMAGGGNKALILEAATTVND GNVQIKFLHQVFHPRIYAIEVLPTSGPSDTEPPTQPGAITFSGVTSSSVTLDWTGSTDDVGVTGYRISRDGSVLTTVSSL TFTDSTVSPETAYDYSVVALDAAGNASTASTASVTTLDAPPPDDTQPPSQPGVLSFTGVTTSSVTVNWAASTDNVGVAGY RISRNGVELTTVTGLTFTDSTVVANTTYDYSVVAFDAANNTSPERTGSVATPPSGGASTVIRVNAGGSSYVDGAGNTWSA DTGYNTGGTYSVSSSTAIAGTTDDPLFRTERYDASTAPELTYSFAVPNGNYLVRLLFAENYGSAKGVGKRVFDVDIEGAR AFEDVDIYAQAGGGNKALILEHTTAVTDGQLNIGFVHQVQNPKVNAIEILSVDAPADTQPPTQPGAITFSDVSWDSVTLN WGASTDNIGVAGYRISRDGTELATVNALTFTDSTVAAQTDYDYTVVALDAAGNESTARNASVTTTASPDTQPPSAPGALV FSDVTASSLTVSWTAATDNVEVTGYRVSRDGVQLATVTTLSFDDTGLSAATSYSYSVEALDAAGNASPASTASVTTSTAA DAQPPSAPGALVFSNVTASSLTVSWTAATDNVEVTGYRVSRDGVQLATVTTLSFDDTGLSAATSYSYSVEALDAAGNASV ASTATISTASSGPTLPTIRVNAGGSSYVDSAGNTWSADHGYNTGGKYSVSGSTTITNTSDPTLFRSERYDASTSPELTYS FTVPNGTYVVRLYFAENYASAKGAGLRVFDIDIEGARAFEDVDIFVQAGGANRAMMLENTVTVTDGQLNIGFVHQVQNPK INAIEILPAP
Sequences:
NA
Specific function: NA
COG id: COG3979
COG function: function code R; Uncharacterized protein contain chitin-binding domain type 3
Gene ontology:
Cell location: NA
Metaboloic importance: NA
Operon status: NA
Operon components: NA
Similarity: NA
Homologues:
NA
Paralogues:
NA
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: NA
Theoretical pI: NA
Prosite motif: NA
Important sites: NA
Signals:
NA
Transmembrane regions:
NA
Cys/Met content:
NA
Secondary structure: NA
PDB accession: NA
Resolution: NA
Structure class: NA
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: NA
TargetDB status: NA
Availability: NA
References: NA