day-of-data-2021

View the Project on GitHub labordynamicsinstitute/day-of-data-2021

Cornell Day of Data 2021: Reproducibility and collaboration when your data is really large or confidential

Abstract

Many new collaborative tools, often reproducible or dynamic, are being developed or are already in use. One feature they have in common is that it is hard to use them when the data is really large (you cannot put 1 TB of data into GitHub) or confidential (don’t even try to do that with GitHub). In this workshop, I will convey some tips and tricks on how to set up a reproducible environment that accommodates data with these features.

Authors

Lars Vilhuber and David Wasser

Materials

License

CC-BY-4.0