04/20/2026
The Precinct 6 Cybersecurity Dataset is live on Hugging Face, and iTnews Australia covered the release today.
114 million labelled security records. Telemetry from 158 products across more than 70 vendors. Over 10,000 real incident graphs, collected from five US enterprises between July and August 2024 and sanitised through an open-source four-stage pipeline.
We built it with the University of Canterbury | Te Whare Wānanga o Waitaha. It is roughly 50× the size of CICIDS2017 — and, as far as we can tell, the largest dataset of its kind built from real adversary behaviour.
Apache 2.0. Built from live attack traffic, not synthetic lab data.
Dataset: https://huggingface.co/datasets/witfoo/precinct6-cybersecurity-100m iTnews: https://www.itnews.com.au/news/security-firm-releases-114m-record-dataset-built-from-live-enterprise-attack-traffic-625190
Captured on five enterprise networks in 2024.