Time | January 19, 2025 |
---|---|
8:00 | Breakfast |
9:00 | Introduction |
10:00 | Reproducible practices, A template README |
10:50 | Coffee break |
11:00 | Data provenance, data citations |
12:00 | Lunch Break |
2025-01-19
Time | January 19, 2025 |
---|---|
8:00 | Breakfast |
9:00 | Introduction |
10:00 | Reproducible practices, A template README |
10:50 | Coffee break |
11:00 | Data provenance, data citations |
12:00 | Lunch Break |
Part 1:
guides a reader through the available material and a route to replicating the results in the research paper, including
It contains information about the sources of data used in the replication package, in addition to or instead of such detailed description in the manuscript.
These may include
For simple replication packages, may appear to be trivial (a laptop and some common software)
What if requirement is expensive commercial software and a super computer cluster?
In order to assess the complexity of the task of replicating, authors should specify each of the following elements:
The README is strongly suggested, but sometimes ignored.
You should nevertheless treat all replication packages as if they should have had the same information, easily accessible.
Important: The information should describe ALL data used, regardless of whether they are provided as part of the replication archive or not, and regardless of size or scope.
For instance, if using GDP deflators, the source of the deflators (e.g. at the national statistical office) should also be listed here.
For the AEA submissions, this information is also available (somewhat different) as part of the “Data and Code Availability Form” (DCAF):
Data sources translate into datasets. Ideally, the README lists them:
To some extent, the crux of the matter: what do you need to run the analysis?
You will need to figure out if you can do it (we’ll get to that part).
Portions of the code were last run on a 12-node AWS R3 cluster, consuming 20,000 core-hours.
This should provide some details, but ideally:
In many of the READMEs you will see, not everything is as clear as what we just outlined.
You will need to find the information.
After lunch, we will talk about the report you will prepare.