With gene1 expression analysis growing in importance for both basic researchers and medical practitioners2, researchers at Carnegie Mellon University and the University of Maryland have developed a new computational method that dramatically speeds up estimates of gene activity from RNA sequencing data. With the new method, dubbed3 Sailfish4 after the famously speedy fish, estimates of gene expression that previously5 took many hours can be completed in a few minutes, with accuracy that equals or exceeds previous methods. The researchers report on their new method is being published online April 20 by the journal Nature Biotechnology.Gigantic repositories(贮藏室) of RNA-seq data now exist, making it possible to re-analyze experiments in light of new discoveries. But 15 hours a pop really starts to add up, particularly if you want to look at 100 experiments, said Carl Kingsford, an associate professor in CMUs Lane Center for Computational Biology. With Sailfish, we can give researchers everything they got from previous methods, but faster.Though an organisms genetic6 makeup7 is static, the activity of individual genes8 varies greatly over time, making gene expression an important factor in understanding how organisms work and what occurs during disease processes. Gene activity cant be measured directly, but can be inferred by monitoring RNA, the molecules10 that carry information from the genes for producing proteins and other cellular11 activities. RNA-seq is a leading method for producing these snapshots of gene expression; in genomic medicine, it has proven particularly useful in analyzing12 certain cancers.The RNA-seq process results in short sequences of RNA, called reads. In previous methods, the RNA molecules from which they originated could be identified and measured only by painstakingly13 mapping these reads to their original positions in the larger molecules.But Kingsford, working with Rob Patro, a post-doctoral researcher in the Lane Center, and Stephen M. Mount, an associate professor in Marylands Department of Cell Biology and Molecular14 Genetics and its Center for Bioinformatics and Computational Biology, found that the time-consuming mapping step could be eliminated. Instead, they found they could allocate15(分配) parts of the reads to different types of RNA molecules, much as if each read acted as several votes for one molecule9 or another.Without the mapping step, Sailfish can complete its RNA analysis 20-30 times faster than previous methods.This numerical approach might not be as intuitive as a map to a biologist, but it makes perfect sense to a computer scientist, Kingsford said. Moreover, the Sailfish method is more robust16 -- better able to tolerate errors in the reads or differences between individuals genomes. These errors can prevent some reads from being mapped, he explained, but the Sailfish method can make use of all the RNA read votes, which improves the methods accuracy.
点击收听单词发音
1 gene n.遗传因子,基因 参考例句: A single gene may have many effects.单一基因可能具备非常多种效应。 The targeting of gene therapy has been paid close attention.其中基因治疗的靶向性是值得密切关注的问题之一。