Managing computational workflows can be difficult. Manually tracking which input and output files have recently been updated, so that you don’t repeat work, will likely result in you screwing something up or repeating work anyway.
A minimal standard for data analysis and other scientific computations is that they be reproducible: that the code and data are assembled in a way so that another group can re-create all of the results (e.
We often use these techniques for data reduction, sometimes blindly. Here I attempt to provide an intuitive explanation of how they work using images and examples.
Databases are key to our ability to do meaningful biological work. This ongoing blog series explores some common databases and visualizes their contents.