Constellation Network and Common Crawl Revolutionize AI Data Security
SAN FRANCISCO, Dec. 19, 2024 – Constellation Network, a Web3 ecosystem validated by the U.S. Department of Defense, has announced a groundbreaking partnership with the Common Crawl Foundation. Together, they’ve launched a custom-built blockchain to create the industry’s first cryptographically secure, immutable archive of internet data specifically designed for AI training and advancement.
This collaboration tackles a major challenge in the rapidly expanding field of artificial intelligence: ensuring the trustworthiness and provenance of the massive datasets used to train AI models. The new system leverages Constellation’s blockchain technology to provide a secure and verifiable record of 17 years of internet crawl data—nearly 9 petabytes of information, a significant portion of what powers many Large Language Models (LLMs).
A Secure and Clear Solution
This innovative application-specific network, dubbed Metagraph, uses Constellation’s Directed Acyclic Graph (DAG) utility asset to secure the archived internet crawls. This approach shifts the focus from consumer-facing transaction fees to operational expenses, making it a more practical solution for businesses needing to notarize large datasets. Key features include:
- Extensive Data Archiving: A completely immutable copy of internet history, offering unprecedented clarity and traceability for AI training datasets.
- End-to-End Encryption: Robust cryptographic security ensures data integrity throughout the AI development lifecycle.
- Ethical AI Framework: A powerful solution addressing concerns about data collection, storage, and usage in LLMs.
“This integration is a critical step forward in securing the future of AI development,” said Alex Brandes, CTO of Constellation Network. “By ensuring cryptographic integrity and immutability of training data, we are addressing one of the most pressing challenges in the field today: trustworthiness and provenance of datasets. We believe our platform will grow to become a cornerstone in the field of responsible AI development, setting new standards for data integrity and trust.”
Real-World Applications and Industry Impact
The impact of this blockchain-enabled data archive is already being felt. TraceAI, a project funded by the National Science Foundation (NSF) and the Small Business Innovation Research (SBIR) program, is currently testing its own application-specific network built on Constellation. This network will enhance the immutability, auditability, and proof of authorship for its training models, and will also leverage Common Crawl’s Constellation-built solution to track data origins.
Kevin Jackson, Vice President of Space Domain Communications & Commercialization for Forward Edge-AI, added: “This represents the natural evolution of AI and machine learning model development, transforming data management from a technical challenge to a trusted business tool that drives global standardization and verification.”
Rich Skrenta, Executive Director of Common Crawl, stated: “For users of the Crawl who are concerned about the provenance of the data, especially those using it for AI models, Constellation and their Hypergraph blockchain provide an elegant solution. We are looking forward to adding the ability to securely validate the crawl as part of our standard distribution by partnering with Constellation.”
The solution is accessible now. Evidence of this integration can be found on Constellation’s transaction viewer, the “DAG Explorer,” and developers can begin using verified past crawls for AI applications. Constellation, Forward Edge-AI, and Common Crawl will continue to collaborate on further solutions.
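For developers who want a sense of what validating a crawl file could involve, the sketch below shows the general idea of checking a file against a recorded fingerprint. It is an illustration only: the announcement does not specify the hash algorithm, the record format, or any Constellation API. SHA-256 and the function names `sha256_fingerprint` and `matches_recorded_fingerprint` are assumptions made for this example.

```python
import hashlib
import hmac
import io
from typing import BinaryIO


def sha256_fingerprint(stream: BinaryIO, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a binary stream, read in chunks.

    SHA-256 is an assumption for this sketch, not a documented detail
    of the Common Crawl metagraph.
    """
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def matches_recorded_fingerprint(stream: BinaryIO, recorded_hex: str) -> bool:
    """Compare a stream's digest with a fingerprint recorded elsewhere.

    `recorded_hex` stands in for a value a developer would obtain from
    a trusted record; how that value is published is not covered here.
    """
    return hmac.compare_digest(sha256_fingerprint(stream), recorded_hex.lower())


if __name__ == "__main__":
    # Stand-in bytes for a crawl file; a real file would be opened in "rb" mode.
    sample = b"example crawl segment"
    recorded = hashlib.sha256(sample).hexdigest()  # hypothetical recorded value

    print(matches_recorded_fingerprint(io.BytesIO(sample), recorded))
    print(matches_recorded_fingerprint(io.BytesIO(sample + b"!"), recorded))
```

Reading in chunks keeps memory use flat even for very large archive files, and any change to the bytes, however small, produces a different fingerprint, which is what makes a recorded digest useful for provenance checks.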
About Constellation Network: Constellation is a leading blockchain network advancing innovation through on-chain data security, partnering with critical global stakeholders, including the U.S. Department of Defense, to deliver transformative, next-generation technologies.
About Common Crawl Foundation: The Common Crawl Foundation is a 501(c)(3) non-profit organization dedicated to providing a free and open copy of the internet.
Constellation Network and Forward Edge-AI Partner to Democratize Access to Web Data with Stardust
San Francisco, CA – Constellation Network, a leading provider of decentralized network solutions, has announced a groundbreaking partnership with Forward Edge-AI to revolutionize access to the vast trove of data offered by Common Crawl. This collaboration aims to empower researchers, businesses, and developers with unprecedented access to web information through Stardust, a decentralized data platform.
The partnership leverages Constellation Network’s Hypergraph Transfer Protocol (HGTP), a secure and scalable distributed ledger technology, to create a decentralized network for accessing and processing Common Crawl’s petabytes of web data. This innovative approach bypasses the traditional centralized infrastructure, offering enhanced security, transparency, and efficiency.
Forward Edge-AI, a pioneer in responsible and inclusive artificial intelligence, plays a crucial role in this initiative. Their expertise in AI will be instrumental in developing tools and applications that enable users to effectively analyze and utilize the massive dataset provided by Common Crawl.
“Since our founding in 2019, our goal has been to become the dominant player in artificial intelligence and lead the revolution in augmenting edge technology with human intelligence.”
This statement from Forward Edge-AI underscores their commitment to leveraging AI for the betterment of humanity, a vision that aligns perfectly with Constellation Network’s mission to democratize access to information.
Stardust, the platform born from this partnership, promises to be a game-changer for researchers, businesses, and developers. By providing seamless access to Common Crawl’s extensive web archive, Stardust empowers users to conduct in-depth research, develop innovative applications, and gain valuable insights from the wealth of information available online.
The Common Crawl Foundation, a 501(c)(3) non-profit organization, has been instrumental in making this initiative possible. Their dedication to providing a free and open copy of the internet aligns perfectly with the democratizing principles of this partnership.
About Constellation Network
Constellation Network is a decentralized network solutions provider focused on building secure and scalable distributed ledger technologies. Their Hypergraph Transfer Protocol (HGTP) is at the heart of this innovative partnership, enabling the decentralized access and processing of Common Crawl’s vast data archive.
About Forward Edge-AI
Forward Edge-AI is a leading company in the field of responsible and inclusive artificial intelligence. They are committed to developing AI solutions that benefit humanity and are playing a key role in building the tools and applications that will empower users to leverage the data accessible through Stardust.
Contact Information
Constellation Network
Email: press@constellationnetwork.io
Website: https://constellationnetwork.io/
Twitter: https://x.com/conste11ation
GitHub: https://github.com/Constellation-Labs/tessellation
DAG Explorer: https://mainnet.dagexplorer.io/
Dagnum PI
dagnum@stardust-collective.org