Exercise Review 3

Creating Backups and Sharing DataLad Datasets

Michał Szczepanik

Research Center Juelich

Ole Bialas

University of Bonn

August 19, 2025

Using a Bare Git Repository as Push Target

  • bare == no worktree
    • contents of .git directly in the directory
    • when shared, pushes can’t break a worktree (there is none)
    • suited for storing on a server
  • with git-annex installed on the machine, can store both Git history and annexed content
  • good default for Git & git-annex aware services
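
A minimal sketch of the idea above, using plain Git only. The directory names (backup.git, work), the remote name "backup", and the commit identity are hypothetical placeholders; on a real server the bare repository would typically be reached through an SSH URL instead of a local path.

```shell
set -eu

# Scratch area so the example does not touch existing data
tmp=$(mktemp -d)
bare="$tmp/backup.git"
work="$tmp/work"

# A bare repository has no worktree: the files normally found
# inside .git (HEAD, config, objects, refs) sit directly in the directory
git init --quiet --bare "$bare"

# An ordinary repository with one commit to push
git init --quiet "$work"
cd "$work"
echo "hello" > notes.txt
git add notes.txt
git -c user.name="Example" -c user.email="example@example.org" \
    commit --quiet -m "Add notes"

# Register the bare repository as a remote and push the current branch
git remote add backup "$bare"
git push --quiet backup HEAD

# Inspect the push target: repository internals, no checked-out files
ls "$bare"
```

Because nothing is checked out in the push target, an incoming push only updates history and never has to reconcile with files someone might be editing there.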

What Contents Are Actually Being Pushed?

  • Any Git repository (local or online) can be added as a dataset sibling
  • The first push initializes the repository’s annex ID; the second push actually transfers the files
  • datalad push acts on the content you already have locally
    • to transfer specific files, use datalad get first
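
The points above could be tried out roughly as follows. This is a sketch under assumptions, not the workshop's exact exercise: the paths, the sibling name "backup", the file name, and the commit identity are made up, and the script simply skips itself when datalad is not installed.

```shell
# Requires DataLad (and git-annex); skip quietly otherwise
command -v datalad >/dev/null 2>&1 || { echo "datalad not found; skipping" >&2; exit 0; }
set -eu

# Hypothetical identity so commits work on an unconfigured machine
export GIT_AUTHOR_NAME="Example" GIT_AUTHOR_EMAIL="example@example.org"
export GIT_COMMITTER_NAME="Example" GIT_COMMITTER_EMAIL="example@example.org"

tmp=$(mktemp -d)
bare="$tmp/backup.git"
ds="$tmp/dataset"

# Push target: a bare Git repository (local here, could be on a server)
git init --quiet --bare "$bare"

# A small dataset with one saved file
datalad create "$ds"
cd "$ds"
echo "some measurements" > data.txt
datalad save -m "Add data"

# Register the bare repository as a sibling of the dataset
datalad siblings add -d . --name backup --url "$bare"

# push only transfers content that is present locally; in a fresh clone,
# 'datalad get' would fetch the file first (here it is already present)
datalad get data.txt

# As described above: push twice, once to set up, once to transfer files
datalad push --to backup
datalad push --to backup
```

Running datalad siblings afterwards lists the registered siblings, which is a quick way to check where a dataset can be pushed.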

Using non-Git-aware Storage Services

  • git-annex has special remotes that implement various storage services (git annex initremote)
  • you can have multiple special remotes, each with fine-grained configuration
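
As an illustration, the sketch below sets up two special remotes of the built-in directory type, which stands in for a real storage service so the example stays local. The remote names (localstore, archive), the paths, the preferred-content expressions, and the commit identity are assumptions chosen for the example; the script skips itself when git-annex is not installed.

```shell
# Requires git-annex; skip quietly otherwise
command -v git-annex >/dev/null 2>&1 || { echo "git-annex not found; skipping" >&2; exit 0; }
set -eu

# Hypothetical identity so commits work on an unconfigured machine
export GIT_AUTHOR_NAME="Example" GIT_AUTHOR_EMAIL="example@example.org"
export GIT_COMMITTER_NAME="Example" GIT_COMMITTER_EMAIL="example@example.org"

tmp=$(mktemp -d)
repo="$tmp/repo"
store="$tmp/store"
archive="$tmp/archive"
mkdir -p "$store" "$archive"

git init --quiet "$repo"
cd "$repo"
git annex init "workstation"

# Two special remotes; 'directory' is one of git-annex's built-in types
git annex initremote localstore type=directory directory="$store" encryption=none
git annex initremote archive type=directory directory="$archive" encryption=none

# Per-remote configuration: preferred content decides what each remote
# should hold when content is distributed automatically
git annex wanted localstore "include=*.dat"
git annex wanted archive anything

# Annex a file and copy its content to one of the remotes explicitly
echo "binary-ish payload" > big.dat
git annex add big.dat
git commit --quiet -m "Add big.dat"
git annex copy big.dat --to localstore

# Show which repositories hold the content
git annex whereis big.dat
```

Other special remote types take their own type-specific parameters in place of directory=, so the overall pattern stays the same while the options differ per service.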

DataLad Supported Services

  • DataLad supports all services that are supported by git-annex
  • DataLad supports additional services via dedicated extensions (check the handbook)