Initiative profile

Spawning Data Diligence

Live

Python package and API helpers for checking whether works are opted out before model training.

Website: github.com/Spawning-Inc/datadiligence
Latest update: Oct 08, 2024, PyPI release 0.1.7 published
Primary approach: New infrastructure
Pipeline: Collect / Train / Fine-tune

What it is

Data Diligence is a developer-facing compliance tool for filtering or checking data before model training. It aims to make honoring opt-outs more practical by wrapping multiple opt-out signals behind a single interface for common ML workflows.

That makes it a good fit for this catalog as downstream enforcement infrastructure: it is not the signal itself, but the tooling that helps training pipelines honor those signals.
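
To make the "single interface over multiple signals" idea concrete, here is a minimal sketch in Python. It is not the datadiligence API: every name in it (`x_robots_tag_allows`, `tdm_reservation_allows`, `is_allowed`, `filter_allowed`) is hypothetical and chosen for illustration. It assumes two header-based opt-out signals, an `X-Robots-Tag` directive of `noai` or `noimageai` and a `tdm-reservation` header set to `1`, and it assumes the response headers have already been fetched for each URL.

```python
from typing import Callable, Iterable, Mapping, Sequence, Tuple

Headers = Mapping[str, str]
# A rule inspects response headers and returns True when training use is allowed.
Rule = Callable[[Headers], bool]


def _get(headers: Headers, name: str) -> str:
    """Case-insensitive header lookup; returns '' when the header is absent."""
    for key, value in headers.items():
        if key.lower() == name.lower():
            return value
    return ""


def x_robots_tag_allows(headers: Headers) -> bool:
    """Hypothetical rule: deny when X-Robots-Tag lists 'noai' or 'noimageai'.

    Simplification: user-agent-scoped directives are not handled.
    """
    raw = _get(headers, "X-Robots-Tag")
    directives = {part.strip().lower() for part in raw.split(",")}
    return not (directives & {"noai", "noimageai"})


def tdm_reservation_allows(headers: Headers) -> bool:
    """Hypothetical rule: deny when the tdm-reservation header is '1'."""
    return _get(headers, "tdm-reservation").strip() != "1"


# Assumed default rule set; a real tool may know more (or different) signals.
DEFAULT_RULES: Sequence[Rule] = (x_robots_tag_allows, tdm_reservation_allows)


def is_allowed(headers: Headers, rules: Sequence[Rule] = DEFAULT_RULES) -> bool:
    """A work is allowed only if every configured rule allows it."""
    return all(rule(headers) for rule in rules)


def filter_allowed(
    records: Iterable[Tuple[str, Headers]],
    rules: Sequence[Rule] = DEFAULT_RULES,
) -> list:
    """Keep the URLs whose headers pass every rule, preserving input order."""
    return [url for url, headers in records if is_allowed(headers, rules)]
```

A training pipeline would call `filter_allowed` once on its collected `(url, headers)` pairs instead of checking each signal separately. A signal that is not in the rule set is never checked, so a work carrying only that signal would be treated as allowed.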

Watchouts

Coverage depends on which opt-out methods the tool knows about and, for some workflows, on access to external services maintained by Spawning.

Evidence trail