![]() |
| VERS STORY | STANDARD | ASSESSMENT | PROJECTS | DIGITAL ARCHIVE | TRAINING | TOOLKIT | PUBLICATIONS | ||
|
4.2 Encoding Metadata An Encoding is a physical representation of a Document; it is equivalent to a file on a computer. A Document may have several Encodings; for example a report may be represented as a Word file, a PDF file, and as an RTF file. Many types of Documents have many ways of being represented. For example, a colour picture could be saved in (at least) the following formats: Photoshop PSD, Amiga IFF, BMP, Photoshop EPS, FlashPix, JPEG, PCX, Photoshop PDF, PICT, Pixar, PNG, Raw, Scitex, Targa, and TIFF (not all of these would be suitable long-term preservation formats).
Figure 22. Subelements of the Encoding Metadata element. The subelements of Encoding are:
The contents of the Rendering Keywords subelement are a list of file types. The extracting program uses each file type to identify a suitable program to open a file of that type and to turn it into a file of the following type. The specification allows files to be described using the standard three-letter extensions used in Word systems (e.g. 'pdf' for PDF files), or MIME types. For example the value 'b64 pdf' instructs the extracting program that the encoding is represented in the VEO as a 'b64' (Base64) file. The program identifies an application that can open (decode) 'b64' files. The resulting file is a 'pdf' (PDF) file, and a second application is identified to open files of this type. The ability to use the contents of the Rendering Keyword subelement is obviously dependent on retaining the link between a file format and the application able to process it. This link is not expected to survive for long periods, but while it does survive the functionality is useful. Over the longer term, the contents of this subelement can be used to identify all of the Documents in a particular Encoding. This can be used, for example, in identifying Encodings that need to be migrated. An example of a minimal set of Encoding Metadata follows: <vers:EncodingMetadata> The content of the DocumentData element is a PDF file. The file conforms to 'PDF Reference', third edition, Adobe Portable Document Format, Version 1.4, Adobe Systems Incorporated, Addison Wesley, 2001, ISBN 0-201-75839-3 (http://partners.adobe.com/asn/developer/acrosdk/docs/filefmtspecs/PDFReference.pdf visited 7 January 2003) as modified in the 'Errata for PDF Reference, third edition' (http://partners.adobe.com/asn/developer/acrosdk/docs/PDF14errata.txt visited 7 January 2003). It may contain digital signatures defined by PDF Public-key Digital Signature and Encryption Specification, Version 3.2, Jim Pravetz, 12 September 2001, Adobe Systems Incorporated (http://partners.adobe.com/asn/developer/pdfs/tn/ppk_pdfspec.pdf visited 28 March 2003) and the appearance of the digital signature in a PDF document is defined in Digital Signature Appearances for Public-Key Interoperability, Adobe Systems Incorporated, September 2001 (http://partners.adobe.com/asn/developer/pdfs/tn/PPKAppearances.pdf visited 28 March 2003). The file has been encoded using Base64 which is defined in IETF RFC 2045 "Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies", Section 6.8 "Base64 Content-Transfer-Encoding".
</vers:Text> | |||||
![]() |
![]() |
|