Skip to content

HumeAI/ducl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ducl — Disk Usage Command Line toolkit

Scan filesystems (or S3 buckets), build interactive dashboards, and query scan data — all from one CLI.

Install

pip install ducl            # from PyPI
pip install ducl[s3]        # with S3 support (boto3)
pip install -e ".[dev]"     # development (editable + pytest)

On Linux the bundled pwalk2 C binary is compiled automatically at install time.

Usage

# Scan a filesystem → Feather file + dashboard
ducl scan /mnt/data -o scan.feather

# Scan an S3 bucket
ducl scan my-bucket -o bucket.feather --s3

# Build dashboard from existing Feather
ducl dashboard scan.feather ./output/

# Incremental update after rescanning a subtree
ducl update ./output/ subtree.feather

# Query scan data
ducl query scan.feather --under /mnt/data/models/ --ext wav --top 10

# Run bundled pwalk2 directly
ducl pwalk2 /mnt/data --threads 64

Development

cd v3
pip install -e ".[dev]"
pytest -v

License

MIT

About

Cluster-scale disk usage analysis tool with an interactive browser-based dashboard.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors