The Internet Archive: The Double-Edged Sword of Information Freedom and Privacy

There is a huge difference between journalism and archiving though (at least individually vs as an organization), journalists who reference or screenshot deleted material are usually acting under their individual freedom of expression. They do not systematically “process” or “retain” the full dataset, but instead use the information in a specific context (e.g. commentary, reporting, criticism). They aren’t a “data controller” in the sense of the GDPR, they don’t have your data, they just republish it (there might be other laws and ethical issues related to publishing deleted things shared by someone but its not against the GDPR), while the Internet Archive is an organization running a structured, automated system that crawls, stores, indexes, and indefinitely retains personal data.