The software work completed so far mainly involves working with the BulldogN cluster and setting up bioinformatics pipelines, and what has been done is still in development. Two tools that are ready to be discussed are the following:
- RMS – Run My Samples, software that allows you to easily create bioinformatics pipelines in perl, python, R and/or bash. You can then run them across HPC clusters for tens to hundreds of samples (or just one), incrementally rerun them as more data arrives, and specify QC checks to stop the process if there are quality issues with the data.
- PBM – Poor Bioinformatician’s Module, a very simple script/alias combination that allows you to hide all of the details of setting your environment variables to run bioinformatics tools.