File Format Identification

Comment: Here is another form of potential information loss. It challenges archivists worldwide and is an area of focus of the EU SCAPE (SCAlable Preservation Environments) project.

SCAPE Event:  Scalable Preservation Environments for Identification and Characterization

One of the biggest initial challenges in digital preservation is file format identification. While there has been a lot of work in this area, the ever-changing nature of digital formats means that, realistically, the problem will never be "solved". This first SCAPE training course will give you the knowledge and experience to confidently choose file format identification and characterization tools that have been developed or extended during the SCAPE project.
With more businesses and organizations actively deploying preservation services, there is a critical need to share more knowledge and to develop services that report on change. Panels and open discussion sessions will provide a valuable forum for discussing the latest preservation services for monitoring change.
  • Objectives:
    • Distinguish between different file types and identify the requirements for characterizing each of them.
    • Carry out a number of identification, characterization, and duplication detection experiments on example files.
    • Critically evaluate characterization and identification tools and assess their advantages and disadvantages when used in different scenarios.
    • Compare and contrast the differences in running characterization and identification tools both stand-alone and within workflows.
    • Envisage a system that combines workflows with identification, characterization and validation tools to suit a variety of scenarios.
    • Conduct an in-depth analysis of large volumes of identification and characterization data and find representative sample records suitable for preservation planning experiments.
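
To make the identification and duplicate detection objectives more concrete, here is a minimal Python sketch. It is not taken from the SCAPE course materials or from any SCAPE tool. The signature table, the `identify` and `find_duplicates` function names, and the sample data are all assumptions made for illustration. It matches a file's leading bytes against a handful of well-known "magic number" signatures and groups byte-identical files by their SHA-256 digest.

```python
import hashlib
from collections import defaultdict
from typing import Dict, List

# Illustrative signature table (an assumption for this sketch, far from complete).
# Each entry pairs the leading bytes of a format with a human-readable label.
MAGIC_SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"PK\x03\x04", "ZIP container"),
]


def identify(data: bytes) -> str:
    """Return a format label based on the leading bytes, or 'unknown'."""
    for magic, label in MAGIC_SIGNATURES:
        if data.startswith(magic):
            return label
    return "unknown"


def find_duplicates(files: Dict[str, bytes]) -> List[List[str]]:
    """Group file names whose contents are byte-identical (same SHA-256)."""
    by_digest = defaultdict(list)
    for name, data in files.items():
        by_digest[hashlib.sha256(data).hexdigest()].append(name)
    # Keep only groups with more than one member; sort names for stable output.
    return [sorted(names) for names in by_digest.values() if len(names) > 1]


if __name__ == "__main__":
    # Hypothetical in-memory samples standing in for files on disk.
    samples = {
        "report.pdf": b"%PDF-1.4\nexample body",
        "report_copy.bin": b"%PDF-1.4\nexample body",
        "image.png": b"\x89PNG\r\n\x1a\nexample body",
        "notes.txt": b"plain text with no signature",
    }
    for name, data in samples.items():
        print(name, "->", identify(data))
    print("duplicates:", find_duplicates(samples))
```

Note that `report_copy.bin` is identified as PDF despite its extension, which is why identification by content matters. A sketch like this also shows the limits of the approach: a ZIP signature alone cannot distinguish a plain archive from formats that use ZIP as a container, and plain text has no signature at all. This is part of the reason real identification tools rely on much larger curated signature sets and additional characterization steps.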