Research data management using DataLad
Information
The estimated time to complete this training module is 3h.
The prerequisites to take this module are:
- the installation module.
- the introduction to the terminal module.
- the introduction to git and github module can help, but not required.
- the project management module is recommended, but not required.
Contact Pierre Bellec if you have questions on this module, or if you want to check that you completed successfully all the exercises.
Resources
This module was presented by Adina Wagner during the HBM brainhack in 2020.
The material of the tutorial is available here.
The video of her presentation is available below:
Exercise
- Follow along the tutorial with Adina. You can copy paste the commands from the datalad handbook section linked above, while following the video.
- Warning: the url for one of the books in the tutorial (
byte-of-python.pdf
) is broken, so the pdf is unreadable. This does not impact the tutorial, but just don't be surprised if that document does not open. Also it shows how important it is to create persistent URLs when you release material, such as those offered on platforms likezenodo
,osf
orfigshare
. - warning 2 to follow the tutorial you may need to install new command line tools, such as
tree
.
- Warning: the url for one of the books in the tutorial (
- Check with Pierre Bellec to validate that the history of your datalad repository includes all the steps of the tutorial.
- 🎉 🎉 🎉 you completed this training module! 🎉 🎉 🎉
More resources
If you want to learn more, check:
- the datalad handbook, which features lot of additional resources as well!
- the datalad datasets github organization, which provides an easy access to a number of data resources. This type of DataLad repositories are the easiest way to get access to datasets.
- note that for the last part of the tutorial you will need to install singularity and the
datalad-container
extension (installable throughpip
). - all of the Open Neuro datasets available on the Open Neuro github organization.
- you can also read about the YODA principles for reproducible papers.