格式转换工具
合并FASTA格式序列
EMBL格式转换为FASTA格式
EMBL特征抽取器
EMBL翻译信息提取
DNA序列清理
蛋白序列清理
GenBank格式转换成FASTA格式
GenBank文件特征提取器
GenBank翻译信息提取
氨基酸单字母到三字母
DNA子串提取
蛋白序列子串提取
反向互补序列计算
密码子分离器
FASTA文件分割
氨基酸三字母到单字母
滑动窗口方式提取DNA子串
滑动窗口方式提取蛋白序列子串
序列分析工具
绘制密码子谱
密码子使用频率表计算
CpG岛分析
DNA分子量计算
正则表达式查找DNA子串
DNA碱基统计
模糊方式获取DNA子串
模糊方式获取蛋白子串
DNA/蛋白序列相似度比较
根据已知蛋白序列生成cDNA
酶切突变工具
开放阅读框查找
双序列密码子比对
双序列DNA碱基比对
双序列蛋白残基比对
PCR引物信息统计
PCR产物推测
蛋白亲水分析工具
蛋白等电点计算器
蛋白分子量计算
正则表达式查找蛋白子串
蛋白序列氨基酸组成统计
DNA模拟酶切分析
常见酶切位点统计
蛋白序列到基因序列
基因序列到蛋白序列
序列图形化分析工具
颜色法标记DNA/蛋白保守区
颜色法标记蛋白残基生化相似度
DNA带数字分组格式化
蛋白序列带数字分组格式化
模板引物匹配图
酶切位点标记图
DNA翻译图谱
随机序列生成工具
随机突变DNA
随机突变蛋白序列
随机生成cDNA
随机生成DNA
随机突变特定位置的DNA
随机生成蛋白质序列
随机突变蛋白序列的特定位置
DNA随机采样
蛋白质随机采样
打乱DNA
打乱蛋白序列
Miscellaneous
IUPAC编码表
密码子表
浏览器兼容性要求
如何镜像此程序
单机版使用方法
关于序列操作套件
致谢
文献
The Sequence Manipulation Suite Copyright © 2000, 2004 Paul Stothard. Send questions and comments to stothard@ualberta.ca
Home
EMBL Feature提取器
- SMS2南京德泰生物镜像
EMBL特征提取器(EMBL Feature Extractor)可以依照EMBL中Feature的定义信息,将DNA片段提取并拼接出FASTA格式。在需要从基因组中去除内含子、获取cDNA时,此工具特别有用。
粘贴一条或多条EMBL文件内容,长度限定在200000以内。
ID AF177870 standard; DNA; INV; 3123 BP. XX AC AF177870; XX SV AF177870.1 XX DT 02-NOV-1999 (Rel. 61, Created) DT 17-AUG-2000 (Rel. 64, Last updated, Version 2) XX DE Caenorhabditis sp. CB5161 putative PP2C protein phosphatase FEM-2 (fem-2) DE gene, complete cds. XX KW . XX OS Caenorhabditis sp. CB5161 OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; OC Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3123 RA Stothard P.M., Hansen D., Pilgrim D.; RT "Isolation of PP2C sequences using degenerate-oligo PCR"; RL Unpublished. XX RN [2] RP 1-3123 RA Stothard P.M., Hansen D., Pilgrim D.; RT ; RL Submitted (17-AUG-1999) to the EMBL/GenBank/DDBJ databases. RL Biological Sciences, University of Alberta, Edmonton, AB T6G-2E9, Canada XX DR SPTREMBL; Q9U6S2; Q9U6S2. XX FH Key Location/Qualifiers FH FT source 1..3123 FT /db_xref="taxon:135651" FT /organism="Caenorhabditis sp. CB5161" FT /strain="CB5161" FT mRNA join(<265..402,673..781,911..1007,1088..1215,1377..1573, FT 1866..2146,2306..2634,2683..>2855) FT /gene="fem-2" FT /product="putative FEM-2 protein phosphatase type 2C" FT CDS join(265..402,673..781,911..1007,1088..1215,1377..1573, FT 1866..2146,2306..2634,2683..2855) FT /codon_start=1 FT /db_xref="SPTREMBL:Q9U6S2" FT /note="possible sex-determining protein" FT /gene="fem-2" FT /product="putative PP2C protein phosphatase FEM-2" FT /protein_id="AAF04557.1" FT /translation="MSDSLNHPSSSTVHADDGFEPPTSPEDNNKKPSLEQIKQEREALF FT TDLFADRRRSARSVIEEAFQNELMSAEPVQPNVPNPHSIPIRFRHQPVAGPAHDVFGDA FT VHSIFQKIMSRGVNADYSHWMSYWIALGIDKKTQMNYHMKPFCKDTYATEGSLEAKQTF FT TDKIRSAVEEIIWKSAEYCDILSEKWTGIHVSADQLKGQRNKQEDRFVAYPNGQYMNRG FT QSDISLLAVFDGHGGHECSQYAAAHFWEAWSDAQHHHSQDMKLDELLEKALETLDERMT FT VRSVRESWKGGTTAVCCAVDLNTNQIAFAWLGDSPGYIMSNLEFRKFTTEHSPSDPEEC FT RRVEEVGGQIFVIGGELRVNGVLNLTRALGDVPGRPMISNKPDTLLKTIEPADYLVLLA FT CDGISDVFNTSDLYNLVQAFVNEYDVEDYHELARYICNQAVSAGSADNVTVVIGFLRPP FT EDVWRVMKTDSDDEESELEEEDDNE" XX SQ Sequence 3123 BP; 986 A; 605 C; 597 G; 935 T; 0 other; gaacgcgaat gcctctctct ctttcgatgg gtatgccaat tgtccacatt cactcgtgtt 60 gcctcctctt tgccaacacg caagacacca gaaacgcgtc aaccaaagag aaaaagacgc 120 cgacaacggg cagcactcgc gagagacaaa ggttatcgcg ttgtgttatt atacattcgc 180 atccgggtca actttagtcc gttgaacatg cttcttgaaa acctagttct cttaaaataa 240 cgttttagaa gttttggtct tcagatgtct gattcgctaa atcatccatc gagttctacg 300 gtgcatgcag atgatggatt cgagccacca acatctccgg aagacaacaa caaaaaaccg 360 tctttagaac aaattaaaca ggaaagagaa gcgttgttta cggttagtta cctattagct 420 gcaagttttg aaaaagcgga atctgtaaaa agcggaatct gtaaaaaaaa catctaagga 480 ataattctga aaagaaaaag tttctaaatg ttaatcggaa tccaattttt atgaaattat 540 ttaaaaaaaa actaaaatta gtttctaaaa aatttttcta aagtaattgg accatgtgaa 600 ggtacaccca cttgttccaa tatgccatat ctaactgtaa aataatttga ttctcatgag 660 aatatttttc aggatctatt cgcagatcgt cgacgaagcg ctcgttctgt gattgaagaa 720 gctttccaaa acgaactcat gagtgctgaa ccagtccagc caaacgtgcc gaatccacat 780 tgtgagttgg aaatttttat ttgataacca agagaaaaaa agttctacct ttttttcaaa 840 aacctttcca aaaatgattc catctgatat aggattaaga aaaatatttt ccgaaatctc 900 tgcttttcag cgattcccat tcgtttccgt catcaaccag ttgctggacc tgctcatgat 960 gttttcggag acgcggtgca ttcaattttt caaaaaataa tgtccaggta tacactattt 1020 ttgcatattt ttcttgccaa atttggtcaa aaaccgtagt acaacccaaa aagtttcttc 1080 atttcagagg agtgaacgcg gattatagtc attggatgtc atattggatc gcgttgggaa 1140 tcgacaaaaa aacacaaatg aactatcata tgaaaccgtt ttgcaaagat acttatgcaa 1200 ctgaaggctc cttaggtagg ttagtctttt ctaggcacag aagagtgaga aaattctaaa 1260 tttctgagca gtctgctttt tgttttcctt gagtttttac ttaaagctct taaaagaaat 1320 ctaggcgtga agttcgagcc ttgtaccata ccacaacagc attccaaatg ttacagaagc 1380 gaaacaaaca tttactgata aaatcaggtc agctgttgag gaaattatct ggaagtccgc 1440 tgaatattgt gatattctta gcgagaagtg gacaggaatt catgtgtcgg ccgaccaact 1500 gaaaggtcaa agaaataagc aagaagatcg ttttgtggct tatccaaatg gacaatacat 1560 gaatcgtgga caggttagtg cgaatcgggg actcaagatt tactgaaata gtgaagagaa 1620 aacaaaagaa aactatattt tcaaaaaaaa tgagaactct aataaacaga atgaaaaaca 1680 ttcaaagcta cagtagtatt tccagctgga gtttccagag ccaaaaaaat gcgagtatta 1740 ctgtagtttt gaaattggtt tctcacttta cgtacgattt tttgattttt ttttcagact 1800 cttcatatga aaaaaaatca tgttttctcc tttacaagat ttttttgatc tcaaaacatt 1860 tccagagtga catttcactt cttgcggtgt tcgatgggca tggcggacac gagtgctctc 1920 aatatgcagc tgctcatttc tgggaagcat ggtccgatgc tcaacatcat cattcacaag 1980 atatgaaact tgacgaactc ctagaaaagg ctctagaaac attggacgaa agaatgacag 2040 tcagaagtgt tcgagaatct tggaaaggtg gaaccactgc tgtctgctgt gctgttgatt 2100 tgaacactaa tcaaatcgca tttgcctggc ttggagattc accagggtaa tcaatttttt 2160 tttagttttt ggaactttac gtcccgaaaa attattcctt tatcacctaa ttcctacagt 2220 aacccaagct ccgaattaaa taaagttaaa gcgtggtata cacataaaaa taagaaaaaa 2280 ttgttcatga aatccatttt tccagttaca tcatgtcaaa cttggagttc cgcaaattca 2340 ctactgaaca ctccccgtct gacccggagg aatgtcgacg agtcgaagaa gtcggtggcc 2400 agatttttgt gatcggtggt gagctccgtg tgaatggagt actcaacctg acgcgagcac 2460 taggagacgt acctggaaga ccaatgatat ccaacaaacc tgatacctta ctgaagacga 2520 tcgaacctgc ggattatctt gttttgttgg cctgtgacgg gatttctgac gtcttcaaca 2580 ctagtgattt gtacaatttg gttcaggctt ttgtcaatga atatgacgta gaaggtatca 2640 aactgatcgt ttttcacatc acaaaattct tgaattttcc agattatcac gaacttgcac 2700 gctacatttg caatcaagca gtttcagctg gaagtgctga caatgtgaca gtagttatag 2760 gtttcctccg tccaccagaa gacgtttggc gtgtaatgaa aacagactcg gatgatgaag 2820 agagcgagct cgaggaagaa gatgacaatg aatagtttat tgcaagtttt ccaaaacttt 2880 tccaatttcc ctgggtattg attagcatcc atatcttacg gcgattatat caattgtaac 2940 attatttctg tttctccccc cacctctcaa attttcaaat gacccttttt cttttcgtct 3000 acctgtatcg ttttccattc atctcccccc ctccactgtg gtatatcatt ttgtcattag 3060 aaagtattat tttgattttc attggcagta gaagacaaca ggatacagaa gaggttttca 3120 cag 3123 //
使用此工具之前,请详细了解
浏览器兼容性
要求.
Feature序列输出时需要
单独列出
大写
* 此工具需要浏览器支持JavaScript. 详情请见
浏览器兼容性介绍.
* 您可以
镜像此工具
到自己的网站 or 也可以单机使用。
新窗口打开
|
SMS2汉化版
|
引用文献
Thu Aug 27 17:17:36 2015