Added on 18 February 2025


The Common Crawl Foundation (CCF) has joined the Digital Preservation Coalition (DPC) this month, becoming the Coalition’s newest Associate Member.Common Crawl Wordmark Logo Blue

As a non-profit foundation dedicated to the Open Web, Common Crawl maintains a free, open repository of web crawl data that can be used by anyone.

Greg Lindahl, CCF’s Chief Technology Officer, says: "We envision a truly open web that enables free access to information and fosters innovation in research, business, and education.” Since its inception in 2008, Common Crawl has steadily expanded its web archive, which now exceeds 10 petabytes and continues to grow each month. This ever-expanding resource is instrumental in countless areas of research worldwide. "We look forward to collaborating with DPC members to improve the quality and integrity of our archive."

Anna Perricci, Head of DPC Americas, welcomed the news, saying: “We are delighted that the Common Crawl Foundation is now part of the Coalition. Their commitment to making large corpora of data from the web openly accessible aligns perfectly with our efforts to bring about a sustainable future for digital materials. This collaboration will bring valuable expertise to our membership and strengthen our collective ability to preserve digital heritage."

The DPC is an international charitable foundation which supports digital preservation, helping its members around the world to deliver resilient long-term access to digital content and services through community engagement, targeted advocacy work, training and workforce development, capacity building, good practice and standards, and through good management and governance. Its vision is a secure digital legacy.  

Click for more information about:


Scroll to top