Webinar (2023-Mar-28) with Alex Razoumov
This webinar provides a more beginner-oriented tutorial to version control of large data files with DataLad. We start with a textbook introduction to DalaLad showing its main features on top of Git and git-annex. Next we demonstrate several simple but useful workflows. Please note that not everything fit into the 50-min presentation, but the notes below contain everything.
/project
,Webinar (2023-Feb-28) with Gemma Hoad
Webinar (2023-Feb-14) with Ian Percel
This talk is a brief introduction to version controlling data and data processing workflows. Three illustrative use cases – taken from neuroimaging, geophysics, and workflows for analyzing housing data respectively – are used to provide an introduction to the main concepts of git-based file management, collaboration, and analysis.
Webinar (2023-Jan-17) by Alex Razoumov
Many unoptimized HPC cluster workflows result in writing large numbers of files to distributed filesystems which can create significant problems for the performance of these shared filesystems. One of the ways to alleviate this is to organize write operations inside a persistent overlay directory attached to an immutable read-only container with your scientific software. These output files will be stored separately from the base container image, and to the host filesystem an overlay appears as a single large file. In this presentation, we demo running parallel OpenFOAM simulations where all output goes into overlay images, and the total number of files on the host filesystem is reduced from several million to several dozen or less. The same approach can be used in post-processing and visualization, where you can read simulation data from multiple overlays both in serial and in parallel. In this webinar we walk you through all stages of creating and using overlays. We assume no prior knowledge of the container technology.
Webinar (2021-Mar-03) by Simon Goring
Webinar (2021-Feb-17) by Alex Razoumov
Webinar (2020-Sep-30) by Alex Razoumov
Webinar (2019-Oct-30) by Sergiy Stepanenko
Webinar (2019-May-01) by Alex Razoumov
Webinar (2019-Mar-20) by Alex Garnett and Adam McKenzie