In the new issue of D-Lib Magazine: January/February 2008, Volume 14 Number 1/2. ISSN 1082-9873, is the article: Creating Preservation-Ready Web Resources, by Joan A. Smith and Michael L. Nelson.
It addresses important issues all of us building Web Resources should be thinking about.
There are innumerable departmental, community, and personal web sites worthy of long-term preservation but proportionally fewer archivists available to properly prepare and process such sites. We propose a simple model for such everyday web sites which takes advantage of the web server itself to help prepare the site's resources for preservation. This is accomplished by having metadata utilities analyze the resource at the time of dissemination. The web server responds to the archiving repository crawler by sending both the resource and the just-in-time generated metadata as a straight-forward XML-formatted response. We call this complex object (resource + metadata) a CRATE. In this paper we discuss modoai, the web server module we developed to support this approach, and we describe the process of harvesting preservation-ready resources using this technique.