PacBio sequencing simulator

Long Description
By analyzing the characteristic features of CLR (continuous long read) data from PacBio SMRT (single-molecule real-time) sequencing, we developed a new PacBio sequencing simulator, NPBSS, for producing CLR reads. NPBSS first samples read sequences according to a log-normal read-length distribution and chooses base quality values in different proportions. It then computes the overall error probability of each base in the read with an empirical model, and derives the deletion, substitution, and insertion probabilities from that overall error probability to generate the PacBio CLR reads. Alignment results demonstrate that NPBSS fits the error rate of PacBio CLR reads better than PBSIM and FASTQSim. In addition, assembly results show that sequences simulated by NPBSS more closely resemble real PacBio CLR data.

https://github.com/NWPU-903PR/NPBSS_Octave
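The sampling steps in the description can be illustrated with a minimal Python sketch. It is not the NPBSS implementation: the log-normal parameters, quality values and proportions, and the deletion/substitution/insertion shares are all hypothetical placeholders, and the standard Phred relation stands in for the empirical error model, which is not given here. The function names `phred_to_error` and `simulate_read` are likewise made up for illustration.

```python
import random

# All numeric values below are illustrative assumptions, not NPBSS defaults.
LOG_MEAN = 8.5                                  # mean of ln(read length)
LOG_SIGMA = 0.6                                 # std. dev. of ln(read length)
QUALITY_CHOICES = [5, 8, 10, 12, 15]            # hypothetical Phred values
QUALITY_WEIGHTS = [0.10, 0.20, 0.30, 0.25, 0.15]  # hypothetical proportions
DEL_SHARE, SUB_SHARE = 0.3, 0.1                 # remainder of errors = insertions


def phred_to_error(q):
    """Standard Phred relation, used here in place of the empirical model."""
    return 10 ** (-q / 10)


def simulate_read(reference, rng):
    """Sample one read from `reference` and inject del/sub/ins errors."""
    # Read length: log-normal, clamped to [1, len(reference)].
    length = int(rng.lognormvariate(LOG_MEAN, LOG_SIGMA))
    length = max(1, min(length, len(reference)))
    start = rng.randint(0, len(reference) - length)
    template = reference[start:start + length]

    bases, quals = [], []
    for base in template:
        # Pick a quality value according to the fixed proportions.
        q = rng.choices(QUALITY_CHOICES, weights=QUALITY_WEIGHTS)[0]
        p_err = phred_to_error(q)
        r = rng.random()
        if r < p_err * DEL_SHARE:
            continue  # deletion: the template base is skipped
        if r < p_err * (DEL_SHARE + SUB_SHARE):
            # substitution: replace with a different base
            base = rng.choice([b for b in "ACGT" if b != base])
        elif r < p_err:
            # insertion: emit a random extra base before the true base
            bases.append(rng.choice("ACGT"))
            quals.append(q)
        bases.append(base)
        quals.append(q)
    return "".join(bases), quals


if __name__ == "__main__":
    rng = random.Random(0)
    reference = "".join(rng.choice("ACGT") for _ in range(20000))
    read, quals = simulate_read(reference, rng)
    print(len(read), read[:40])
```

Splitting a single per-base error probability into three error types keeps the quality value and the injected error rate tied together, which is the relationship the description attributes to NPBSS; the actual proportions and model would have to come from the tool itself.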
Current Citations/Applications
NPBSS: a new PacBio sequencing simulator for generating the continuous long reads with an empirical model. BMC Bioinformatics. https://www.ncbi.nlm.nih.gov/pubmed/?term=29788930 (Primary Citation)