The BBC reports that the British Library, National Libraries of Scotland and Wales, Oxford’s Bodleian Libraries, Cambridge’s University Library and the Library of Trinity College in Dublin will be cataloging billions of web pages created in the U.K. A total of 4.8 million websites will be included, covering everything from academic journals to Stephen Hawking’s official website.
The British Library has been given the right to archive the digital world. Following new regulations coming into force on Saturday, six major libraries will be able to collect, preserve and provide long-term access to internet-based information, including blogs, e-books and even the entire U.K. web domain. An estimated 1bn pages a year will be available to researchers through the new archive.
As part of the effort, the British Library launched a survey of the top 100 U.K. sites that should be preserved right away. All sites collected through the project will be publicly available on computers in the libraries.