![]() |
| VERS STORY | STANDARD | COMPLIANCE | PROJECTS | DIGITAL ARCHIVE | TRAINING | TOOLKIT | PUBLICATIONS | ||
|
3. Standard Long Term Preservation Formats 3.1. Text Usage Use for plain text document type records (e.g. text files). Standard Text files wholly in a latin script (e.g. English) must conform to ISO/IEC 8859-1:1998, Information technology – ISO 8-bit single-byte coded graphic character sets – Part 1: Latin alphabet No. 1. Text files not wholly in a latin script must conform to the latest version of the Unicode standard. (The Unicode Standard, Version 3.0, The Unicode Consortium, Addison-Wesley, 2000, ISBN 0-201-61633-5, http://www.unicode.org/unicode/uni2book/u2.html visited 14 June 2006). Unicode is functionally equivalent to ISO 10646-1:2000. Unicode text is to be encoded in UTF-8. Text encoded in UTF-8 or ISO 8859-1 must not be further encoded in Base64 when it is included in the VEO. Suggested value of File Encoding (M128) The following text is recommended for inclusion in the File Encoding (M128) element when the Encoding contains text in ISO 8859-1:1998. The content of the DocumentData element is a text file. The characters in the text file conform to ISO/IEC 8859-1:1998, Information technology – ISO 8-bit single-byte coded graphic character sets – Part 1: Latin alphabet No. 1. The ampersand (and), greater than, and less than characters have been escaped using the standard XML escape conventions '&', '<', and '>' respectively. The following text is recommended for inclusion in the File Encoding (M128) element when the Encoding contains text in Unicode using UTF-8, or it is not known whether the text is Unicode or ISO 8859-1:1998. The content of the DocumentData element is a text file. The characters in the text file conform to UTF-8 encoded Unicode. Unicode is defined in ‘The Unicode Standard’, Version 3.0, The Unicode Consortium, Addison-Wesley, 2000, ISBN 0-201-61633-5, or the equivalent ISO 10646:2000. The ampersand (and), greater than, and less than characters have been escaped using the standard XML escape conventions '&', '<', and '>' respectively. | |||||
![]() |
![]() |
|