archivebox.misc.folders

Folder utilities for ArchiveBox.

Note: This file only contains legacy cleanup utilities. The DB is the single source of truth - use Snapshot.objects queries for all status checks.

Module Contents

Functions

fix_invalid_folder_locations

Legacy cleanup: Move folders to their correct timestamp-named locations based on index.json.

API

archivebox.misc.folders.fix_invalid_folder_locations(out_dir: pathlib.Path = DATA_DIR) tuple[list[str], list[str]][source]

Legacy cleanup: Move folders to their correct timestamp-named locations based on index.json.

This is only used during ‘archivebox init’ for one-time cleanup of misnamed directories. After this runs once, ‘archivebox update’ handles all filesystem operations.