A modernized ART for Illumina read simulation. Long Description (required)
High-performance simulation of realistic next-generation sequencing (NGS) data is a must for various algorithm development and benchmarking tasks. However, most existing simulators are either slow or generate data that does not reflect the real-world error profile of simulators. Here we introduce art_modern, a modern re-implementation of the popular ART simulator with enhanced performance and functionality. It can be used for anyone who wants to simulate sequencing data for their own research, like benchmarking of DNA- or RNA-Seq alignment algorithms, test whether the RNA-Seq pipeline built by your lab performs well or perform pressure testing of pipelines on a cluster. This simulator would be best suited for GNU/Linux-based High-End Desktops (HEDTs) with multiple cores and a fast SSD. However, it can also work on laptops or high-performance clusters (HPCs) with only one node. We believe with such simulators, the testing and benchmarking of NGS-related bioinformatics algorithms can be largely accelerated. rna-seq; simulation; ngs; illumina-sequencing; dna-seq https://github.com/YU-Zhejian/art_modern/ yuzj25@seas.upenn.edu
Step 1: Use the attribute tree to add new attributes or remove pre-selected attributes to describe the simulator.
Every sub-attribute is selected Not all sub-attributes are selectedFill Clear Expand Collapse Reset
Summary of Proposed Changes Step 2: Review list of proposed attribute addition(s) and subtraction(s).
Can't Find the Attribute You Are Looking For? If you would like to propose an attribute that you cannot find in the tree above, or if you would like to add a clarification to one or more attributes for this simulator (e.g. a specific file format for attribute /Output/File Format/Other), please list them in the Additional Comment box of the Submit tab .
Summary of Proposed Changes Current Citations/Applications