NoML

Archived

Proposed robots-style directive for keeping content searchable while asking crawlers not to use it for machine learning.

This profile remains available for historical context, but it is hidden from the default catalog view because no newer public activity was found during review.

Website: noml.info
Latest update: Mojeek publishes NoML proposal and open letter (Oct 25, 2023)
Primary approach: Preference signal
Pipeline: Collect / Train

What it is

NoML proposes a simple extension to existing robots-style signaling: a noml value that can be added to the robots HTML meta tag or to the X-Robots-Tag HTTP response header. The goal is to let publishers keep content discoverable in search while separately asking that it not be used for machine learning.

It is intentionally lightweight and easy to deploy, but it remains an advocacy proposal rather than a settled web standard.

Limitations

NoML is a proposal rather than a broadly adopted standard, and it depends on voluntary compliance by crawlers, search engines, and downstream API users.

Evidence trail

Examples

HTML meta tag

NoML proposes this as the simplest page-level signal for HTML documents.

<meta name="robots" content="noml">
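
To illustrate how a cooperating crawler might read this signal, here is a minimal Python sketch using only the standard library. The names RobotsMetaParser and page_opts_out_of_ml are hypothetical, and the sketch assumes that noml may appear alongside other comma-separated robots directives and is matched case-insensitively; the proposal text should be checked for the exact rules.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = {name.lower(): (value or "") for name, value in attrs}
        if attr_map.get("name", "").strip().lower() != "robots":
            return
        # Assumption: directives are comma-separated, like other robots values.
        for token in attr_map.get("content", "").split(","):
            token = token.strip().lower()
            if token:
                self.directives.add(token)


def page_opts_out_of_ml(html_text):
    """Return True if any robots meta tag in html_text carries noml."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    parser.close()
    return "noml" in parser.directives
```

Under these assumptions, page_opts_out_of_ml('<meta name="robots" content="noml">') returns True, while a page whose robots meta tag only says noindex returns False.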

HTTP header

For non-HTML responses, the proposal uses the parallel X-Robots-Tag form.

X-Robots-Tag: noml
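
On the publisher side, one possible way to send this header is a small WSGI wrapper, sketched below in Python. The names with_noml_header and pdf_app are hypothetical and not part of the proposal, and the sketch assumes the site is served through a WSGI application; a web server or CDN configuration could achieve the same result.

```python
def with_noml_header(app):
    """Wrap a WSGI app so each response also carries X-Robots-Tag: noml."""

    def wrapped(environ, start_response):
        def start_with_noml(status, headers, exc_info=None):
            # Append rather than replace, so any existing X-Robots-Tag stays.
            headers = list(headers) + [("X-Robots-Tag", "noml")]
            return start_response(status, headers, exc_info)

        return app(environ, start_with_noml)

    return wrapped


def pdf_app(environ, start_response):
    """Hypothetical app that serves a non-HTML body."""
    start_response("200 OK", [("Content-Type", "application/pdf")])
    return [b"%PDF-1.4 ..."]


# The wrapped application is what a WSGI server would be pointed at.
application = with_noml_header(pdf_app)
```

Appending the header instead of overwriting keeps any robots directives the application already sets; whether a given crawler honors the signal remains voluntary.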