Reproducible Analyses
Reproducibility is important professionally, but the main person it is helping is future-you.
Exit Strategy
When an experiment is finished you should have an 'exit strategy' checklist to make sure your work is as reproducible as possible. We hope that we have made this achievable in Tapirs
- data provenance
- data archive is possible
- list of all software, sources, and versions
- conda export of environment, all software conda installable
- workflow
- human readable reports
- easy to archive
Reports
You can generate an overall snakemake report on what was run and the provinance of the data for each results with the command:
snakemake --report reports/snakemake_report.html
Other reports are written in subdirectories in reports/ by the analysis programs
Software list and versions
The full list of software, their dependencies and version numbers is written to envs/archived_envs/environment.yaml at the end of the run. This file can be used to reproduce the experimental software conditions.
Archiving
Snakemake can be asked to make an archive of code, config, and input files: snakemake --archive my-workflow.tar.gz