DRAFT - DRAFT - DRAFT
This is a draft page and therefore can contain errors and omissions. You are welcome to edit this page if you have any corrections or additions. Comments are also welcome.
DBPRES comprehensive breakdown of database archiving tools and formats currently used
Abbreviations:
- ADDML: Archives Data Description Markup Language
- DANS:Data Archiving and Networked Services
- DBML: DataBase Markup Language
- DDI-L: Data Documentation Initiative
- HPDBA: HP Database Archiving software
- MIXED: Migration to Intermediate XML for Electronic Data
- ODF: Open Document Format for Office Applications
- RODA: Repositório de Objectos Digitais Autênticos (Repository of Authentic Digital Objects)
- SIARD: Software Independent Archiving of Relational Databases
- SIARDK: Software Independent Archiving of Relational Databases - custom Danish version
- SDFP: Standard Data Format for Preservation
- SPSS: Statistical Package for the Social Sciences
XML preservation formats:
- ADDML
- DBML
- DDI-L
- SIARD (SIARDK)
- SDFP
- SPSS
Danish National Archives:
- Tool: SIARDK
- Ingest format: Oracle, Microsoft SQL Server, Microsoft Access
- Preservation format: SIARD (customized)
- Access format: Oracle, Microsoft SQL Server, Microsoft Access, SQLite (via ADA workflow)
Danish Data Archive:
- Tool: SPSS (commercial), Python API, PSPP, DeXt, StatTransfer, DdiEditor
- Ingest format: SPSS
- Preservation format: DDI-L
- Access format: SPSS, DDI-L
Swedish National Archives:
- Tool: KRAM (workflow), RADAR (workflow), RALF (tool)
- Ingest format: Excel (with RALF tool)
- Preservation format: ADDML
- Access format: ?
Norwegian National Archives:
- Tool: ?
- Ingest format: ?
- Preservation format: ADDML
- Access format: ?
Portuguese National Archives:
- Tool: RODADB
- Ingest format: MS Access, MSSQL, MySQL and Generic SQL
- Preservation format: DBML
- Access format: MySQL, PostgreSQL en PHPMyAdmin
Schweizerisches Bundesarchiv:
- Tool: SIARD
- Ingest format: Oracle, Microsoft SQL Server, Microsoft Access
- Preservation format: SIARD
- Access format: Oracle, Microsoft SQL Server, Microsoft Access
DANS (NL):
- Tool: MIXED
- Ingest format: Data Perfect (only input), Access 2000 and 2002, dBase III and IV, Excel 2003
- Preservation format: SDFP (derived from SIARD and ODF)
- Access format: ?
Das Bundesarchiv Deutschland:
- Tool: SIARD, HPDBA
- Ingest format: Microsoft Access 2002 testdatabase (pilot)
- Preservation format: SIARD
- Access format: ?
SIARD and SIARDK differences:
- SIARD container: ZIP64 (> 4 GB)
- SIARDK container: ZIP (<= 4 GB)
- SIARD binary blobs: conserved in data
- SIARDK binary blobs: deleted from data with reference to object through identifier; converted to preservation format and saved as binary in SIP container
- SIARDK institute specific metadata: saved in SIARD XML