GSR: Editing - ART Simulator

You may request changes to this simulator by navigating to the Basic, Details, and Citations/Applications tabs. When you are finished, open the Submit tab. To return back to the simulator view, click ART. Finally, please take note of the GSR simulator privacy policy.
ART
ART is a set of simulation tools to generate synthetic next-generation sequencing data by mimicking real sequencing process with empirical error models or quality profiles.
ART is a set of simulation tools to generate synthetic next-generation sequencing data by mimicking real sequencing process with empirical error models or quality profiles. ART supports simulation of single-end, paired-end and mate-pair reads of three major commercial next-generation sequencing platforms: Illumina's Solexa, Roche's 454 and Applied Biosystems' SOLiD. ART can perform regular genome sequencing simulation as well amplicon sequencing simulation. ART is implemented in C++ with optimized algorithms and is highly efficient in read simulation. ART outputs reads in the FASTQ format, and alignments in the ALN/MAP and/or SAM format. ART can also generate alignments in UCSC BED file format.
ART-MountRainier-2016-06-05
04-21-2011
06-05-2016
http://www.niehs.nih.gov/research/resources/software/biostatistics/art/

Attribute Tree Control

Step 1: Use the attribute tree to add new attributes or remove pre-selected attributes to describe the simulator.

Every sub-attribute is selected
Not all sub-attributes are selected
  • Target
    • Type of Simulated Data
      • Genotype at Genetic Markers
      • Diploid DNA Sequence
      • Haploid DNA Sequence
      • RNA
      • Sex Chromosomes
      • Mitochondrial DNA
      • Protein Sequence
      • Sequencing Reads
      • Phenotype
    • Variations
      • Biallelic Marker
      • Multiallelic Marker
      • Single Nucleotide Variation
      • Amino acid variation
      • Microsatellite
      • Insertion and Deletion
      • CNV
      • Inversion and Rearrangement
      • Alternative Splicing
      • Missing Genotypes
      • Genotype or Sequencing Error
      • Other
  • Simulation Method
    • Standard Coalescent
    • Exact Coalescent
    • Machine Learning
    • Forward-time
    • Resample Existing Data
    • Phylogenetic
    • Gene dropping
    • Other
  • Input
    • Data Type
      • Allele Frequencies
      • Empirical
      • Ancestral Sequence
      • Saved simulation
      • Reference genome
      • Other
    • File format
      • Arlequin
      • CREATE
      • Fstat
      • GDA
      • Genepop
      • MIGRATE
      • MS
      • SAM or BAM
      • NEXUS
      • Phylip
      • STRUCTURE
      • XML
      • Tree Sequence
      • Program Specific
      • Other
  • Output
    • Data Type
      • Genotype or Sequence
      • Phenotypic Trait
      • Individual Relationship
      • Demographic
      • Mutation
      • Methylation
      • Gene Expression
      • Protein Expression
      • Linkage Disequilibrium
      • Diversity Measures
      • Fitness
      • Sequencing Reads
        • Illumina
        • Roche 454
        • SOLiD
        • IonTorrent
        • PacBio
        • Nanopore
        • Other
      • Other
    • File Format
      • Arlequin
      • Fasta or Fastq
      • Fstat
      • Genepop
      • Linkage
      • MIGRATE
      • MS
      • PED
      • Phylip
      • NEXUS
      • STRUCTURE
      • VCF
      • SAM or BAM
      • Tree Sequence
      • Program Specific
      • Other
    • Sample Type
      • Random or Independent
      • Sibpairs, Trios and Nuclear Families
      • Extended or Complete Pedigrees
      • Case-control
      • Longitudinal
      • Other
  • Phenotype
    • Trait Type
      • Binary or Qualitative
      • Quantitative
      • Multiple
    • Determinants
      • Single Genetic Marker
      • Multiple Genetic Markers
      • Sex-linked
      • Gene-Gene Interaction
      • Environmental Factors
      • Gene-Environment Interaction
  • Evolutionary Features
    • Demographic
      • Population Size Changes
        • Constant Size
        • Exponential Growth or Decline
        • Logistic Growth
        • Bottleneck
        • Carrying Capacity
        • User Defined
      • Gene Flow
        • Stepping Stone Models
        • Island Models
        • Continent-Island Models
        • Sex or Age-Specific Migration Rates
        • Influenced by Environmental Factors
        • Admixed Population
        • User-defined Matrix
        • Other
      • Spatiality
        • Discrete Models
        • Continuous Models
        • Landscape Factors
    • Life Cycle
      • Discrete Generation Model
      • Age structured
      • Overlapping Generation
      • User-Defined transition matrices
    • Mating System
      • Random Mating
      • Monogamous
      • Polygamous
      • Haplodiploid
      • Selfing
      • Age- or Stage-Specific
      • Assortative or Disassortative
      • Other
    • Fecundity
      • Constant Number
      • Randomly Distributed
      • Individually Determined
      • Influenced by Environment
      • Other
    • Natural Selection
      • Determinant
        • Single-locus
        • Multi-locus
        • Codon-based
        • Fitness of Offspring
        • Phenotypic Trait
        • Environmental Factors
      • Models
        • Directional Selection
        • Balancing Selection
        • Multi-locus models
        • Epistasis
        • Random Fitness Effects
        • Disruptive
        • Phenotype Threshold
        • Frequency-Dependent
        • Other
    • Recombination
      • Uniform
      • Varying Recombination Rates
      • Gene Conversion Allowed
    • Mutation Models
      • Two-allele Mutation Model
      • Markov DNA Evolution Models
      • k-Allele Model
      • Infinite-allele Model
      • Infinite-sites Model
      • Stepwise Mutation Model
      • Codon and Amino Acid Models
      • Indels and Others
      • Heterogeneity among Sites
      • Others
    • Events Allowed
      • Population Merge and Split
      • Varying Demographic Features
      • Population Events
      • Varying Genetic Features
      • Change of Mating Systems
      • Other
    • Other
      • Phenogenetic
      • Polygenic background
  • Interface
    • Command-line
    • Graphical User Interface
    • Integrated Development Environment
    • Script-based
    • Web-based
  • Development
    • Tested Platforms
      • Windows
      • Mac OS X
      • Linux and Unix
      • Solaris
      • Others
    • Language
      • C or C++
      • Java
      • R
      • Python
      • Perl
      • Visual Basic
      • Other
    • License
      • GNU Public License
      • BSD
      • Creative Commons
      • MIT
      • Other
  • GSR Certification
    • Accessibility
    • Documentation
    • Application
    • Support

Summary of Proposed Changes

Step 2: Review list of proposed attribute addition(s) and subtraction(s).

To Add

    To Remove

      Can't Find the Attribute You Are Looking For?

      If you would like to propose an attribute that you cannot find in the tree above, or if you would like to add a clarification to one or more attributes for this simulator (e.g. a specific file format for attribute /Output/File Format/Other), please list them in the Additional Comment box of the Submit tab.

      You may add citations by pmid, add citations by direct entry, remove citations (using the recycling bin icon), and edit citations (using the rarely seen edit icon) that were originally entered by direct entry.

      Summary of Proposed Changes

      To Add

      To Remove

      Current Citations/Applications

      [Pubmed ID: 22199392], Huang W, Li L, Myers JR, Marth GT, ART: a next-generation sequencing read simulator., Bioinformatics, 02-15-2012, https://www.ncbi.nlm.nih.gov/pubmed/?term=22199392,Primary Citation
      [Pubmed ID: 23558742], Liao Y, Smyth GK, Shi W, The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote., Nucleic Acids Res, 05-01-2013, https://www.ncbi.nlm.nih.gov/pubmed/?term=23558742,, Application
      [Pubmed ID: 24142950], Zhang J, Kobert K, Flouri T, Stamatakis A, PEAR: a fast and accurate Illumina Paired-End reAd mergeR., Bioinformatics, 03-01-2014, https://www.ncbi.nlm.nih.gov/pubmed/?term=24142950,, Application
      [Pubmed ID: 27576524], Sessegolo C, Burlet N, Haudry A, Strong phylogenetic inertia on genome size and transposable element content among 26 species of flies., Biol Lett, 08-01-2016, https://www.ncbi.nlm.nih.gov/pubmed/?term=27576524,, Application
      [Pubmed ID: 27756224], Hutchinson S, Glover L, Horn D, High-resolution analysis of multi-copy variant surface glycoprotein gene expression sites in African trypanosomes., BMC Genomics, 10-18-2016, https://www.ncbi.nlm.nih.gov/pubmed/?term=27756224,, Application
      [Pubmed ID: 27806686], Faksri K, Xia E, Tan JH, Teo YY, Ong RT, In silico region of difference (RD) analysis of Mycobacterium tuberculosis complex from sequence reads using RD-Analyzer., BMC Genomics, 11-02-2016, https://www.ncbi.nlm.nih.gov/pubmed/?term=27806686,, Application
      [Pubmed ID: 27843713], DeMaere MZ, Darling AE, Deconvoluting simulated metagenomes: the performance of hard- and soft- clustering algorithms applied to metagenomic chromosome conformation capture (3C)., PeerJ, 11-08-2016, https://www.ncbi.nlm.nih.gov/pubmed/?term=27843713,, Application
      [Pubmed ID: 27998267], Lux M, Krüger J, Rinke C, Maus I, Schlüter A, Woyke T, Sczyrba A, Hammer B, acdc - Automated Contamination Detection and Confidence estimation for single-cell genome data., BMC Bioinformatics, 12-20-2016, https://www.ncbi.nlm.nih.gov/pubmed/?term=27998267,, Application
      This email will never be published. This email is used only for verification and communication purposes.
      Please inform the GSR team here if you would like to see an attribute added to the attribute tree (or any other changes to the simulator description system as it exists).