Ten simple rules — #6 Version both software and data
Bringing workflows under version control to make large-scale data processing reproducible; inspired by “Ten simple rules for large-scale data processing” (Fungtammasan 2022).
Bringing workflows under version control to make large-scale data processing reproducible; inspired by “Ten simple rules for large-scale data processing” (Fungtammasan 2022).
Verily software engineers share tips and resources for running Jupyter notebooks programmatically without writing any code.
Senior product manager Alessandro Culotti shares his team’s plans for making it possible to run a wider variety of analysis applications in Terra.
Dr. Kiran Garimella gives an overview of MAS-ISO-seq, a new method for generating a lot more data per run with long-read sequencing technologies such as PacBio, and shares a workspace that demonstrates the method’s data processing.
Sam Friedman, data scientist in the Broad’s Data Sciences Platform, explains how using the Intel OpenVINO framework enabled him to accelerate the GATK CNN tools.
Terra is developed by the Broad Institute of MIT and Harvard in collaboration with Microsoft and Verily.