Part of Wellcome Collection
 
 
Discover what the Wellcome Library has for you

acq_jung.jpg
C G Jung. Wellcome Library no. 14091i (Gift of Ernst Freud)

Wonders of the web captured forever…

Launch of UK Web Archiving Consortium will dramatically boost lifespan of key web materials

Launched today, the UK Web Archiving Consortium (UKWAC) aims to expand the lifespan of website materials from around 44 days (the same life expectancy as a housefly) to a century or more. Comprising six leading UK institutions, the UKWAC will work, with the permission of rights holders, on an experimental system for archiving selected key UK websites – ensuring that invaluable scholarly, cultural and scientific resources remain available for future generations.

The UKWAC - comprising The British Library, Joint Information Systems Committee of the Higher and Further Education Councils (JISC), The National Archives, The National Library of Wales, the National Library of Scotland and the Wellcome Trust – will run for an initial period of two years, during which approximately 6,000 websites will be collected and archived.

Consortium members will obtain the permission of Website owners to archive selected sites whilst working collaboratively to explore how to develop compatible selection policies and to investigate the complex technical challenges involved in collecting and archiving web material.

Each consortium member will select and ‘capture’ content relevant to its subject and/or domain. For example, the British Library will archive sites reflecting national culture and events of historical importance. These could include web pages focusing on key events in national life, museum web pages, e-theses, selected blogs to support research material and web-based literary and creative projects by British subjects. The Wellcome Trust will preserve a record of medicine on the web whilst the National Archives will focus on archiving selected materials from six main clusters of government departments. The Scottish and Welsh national libraries will collect material reflecting the culture and history of Scotland and Wales and JISC will preserve its own project websites.

Infrastructure costs, such as software, hardware, and ongoing technical development and support will be shared equally amongst the Consortium members. UKWAC will use HTTrack – the open source web crawler to acquire files for storage. The software to carry out the archiving processes – PANDORA Digital Archiving System (PANDAS) – has already been developed and tested by the National Library of Australia and its partners for archiving Australian websites and making them accessible through PANDORA the Australian national Web Archive (see: http://pandora.nla.gov.au/index.html). PANDAS can be set to automatically tag, gather and prepare pages for public display. If pages are not suitable for immediate public access, due to commercial, cultural or privacy reasons, PANDAS can manage appropriate access restrictions.

UKWAC members have selected Magus Research Limited to help extend the PANDAS software for UK needs and provide the shared hardware and technical support they require.

David Thomas, Head of Government Archives Services at the National Archives, said: ‘From government organisations posting travel advice to newlyweds putting their wedding photos online, websites provide a unique insight into the political and social world we live in today. Through collaboration in the UKWAC, The National Archives are taking steps to ensure that government websites are preserved for future generations.’

Lynne Brindley, Chair of the Digital Preservation Coalition and Chief Executive of The British Library said: ‘The launch of UKWAC is an essential step in helping us to understand the scope of the UK web space and how we can set about developing a selective yet useful national web archive. Initially this will be on a voluntary basis, although it is anticipated that secondary legislation will, in due course, allow the BL – and the other legal deposit libraries – to collect web materials. Working with other UKWAC members, we can make real progress in developing complementary selection policies, exploring the best ways to collect and archive web materials and refining how we work together.’

For further information see: http://www.webarchive.org.uk

or contact Val McBurney in Press and Public Relations at the British Library. Telephone: 020 7412 7112, fax: 020 7412 7168, email: val.mcburney@bl.uk

Notes for editors

1. The JISC – Joint Information Systems Committee – is a joint committee of the UK further and higher education funding bodies, and is responsible for supporting the innovative use of information and communication technology (ICT) to support learning, teaching, and research. It is best known for providing the JANET network, a range of support, content and advisory services, and a portfolio of high-quality resources. In the UK Web Archiving Consortium, the JISC will be selecting a range of JISC funded project websites and working within the ac.uk domain with consortium partners. Information about the JISC, its services and programmes can be found at http://www.jisc.ac.uk/.

2. The British Library - is the national library of the United Kingdom and one of world’s greatest libraries. Its collection of 150 million items covers every age of written civilisation, every written language and every aspect of human thought. Users, including industrial companies and academic scholars, have access to the Library's collection via the Reading Rooms and global document supply services, which provide over 15,000 documents per day to 20,000 customers in 111 countries. Information on the Library's collection and services is available on the British Library website at www.bl.uk

3. The Wellcome Trust is an independent, research funding charity, established under the will of Sir Henry Wellcome in 1936. The Trust’s mission is to foster and promote research with the aim of improving human and animal health. The Wellcome Library exists as a resource to provide access to the documentary record of medicine. The Web Archiving Project is one way of translating that vision into the digital age.

4. The National Archives, Kew, www.nationalarchives.gov.uk has one of the largest archival collections in the world, spanning 1000 years of British history, from Domesday Book to newly released government papers. The free museum and research rooms in Kew, west London, are open to the public 6 days a week.
For press enquiries please contact The National Archives Press Office on 020 8392 5277 or e-mail press@nationalarchives.gov.uk

5. The National Library of Wales is one of Wales’ foremost cultural institutions, and its pre-eminent source of recorded knowledge and information. It acts as the memory of the nation, storing and giving access to recorded knowledge, in all forms, about Wales. It is a body of international standing, and contributes to a worldwide network of knowledge providers. As a member of the UK Web Archiving Consortium, The National Library of Wales will be selecting web sites of Welsh significance in Welsh, English and other relevant languages, creating a collection that will serve both current and future research needs. Information about The National Library of Wales can be found at http://www.llgc.org.uk/.

6. The National Library of Scotland is Scotland's largest library, and one of the leading research libraries in Europe. It serves both as a general research library and as the world's leading repository for the printed and manuscript record of Scotland's history and culture. It houses more than eight million printed items, over 250,000 electronic titles, and has been a Legal Deposit library since 1710. As part of our involvement in the Consortium, we aim to concentrate on sites of Scottish interest, and see this as an extension of one of our principal functions, "to create, preserve and ensure access to a comprehensive collection of the recorded knowledge, culture and history of Scotland, for the benefit of the people of Scotland and the rest of the world".

7. Magus Research (http://www.magusresearch.com) is an Internet content and information management specialist which has been delivering services to FTSE-100 companies, institutions, and global organisations since 1995. Magus has a history of creating innovative solutions that help businesses to capitalise on developing technologies.
Magus provides a broad range of technology solutions including Internet and Intranet projects from concept through deployment and beyond, with expertise from the front-end interface, information architecture and visual design, through to the back-end content management and database systems. In addition, their search and retrieval, business intelligence and alerting products are designed to filter and deliver web-based information, and combine this with information from syndicated content providers, in order to help knowledge-intensive corporations to effectively manage their use of the Internet.
As well as providing complete end-to-end solutions, many of Magus's product offerings are available as standalone modules, and completely managed service solutions.

8. The establishment of a collaborative Web-archiving project was one of the key recommendations of the Wellcome/JISC Web archiving feasibility study. For further details see the Web Archiving project pages.

ENDS

 Wellcome Library, 183 Euston Road, London NW1 2BE, UK  tel:+44 (0)20 7611 8722  email: library@wellcome.ac.uk Blog Sitemap|Privacy statement|Disclaimer