Difference between revisions of "Roadmap"

From openZIM

Revision as of 13:48, 22 November 2009

See also current Status and next steps.


Until end of 2009
  • Finalizing ZIM file format
    • ZIM file header:
      • add pointer to UrlPointerList (IndexPointerList will be renamed "TitlePointerList")
      • add pointer to MimeTypeList
    • ZIM file structure:
      • add UrlPointerList (article list ordered by URL)
      • add MimeTypeList to store MIME types in a zero-terminated list
      • implement new integer compression (UTF-8 compression / ZInt compression)
      • split version number into major and minor numbers
    • Directory Entry:
      • drop QUnicode on article titles
      • add URL
      • add rev_id (integer-compressed)
    • Cluster:
      • add LZMA compression
    • Category Namespace (U / V)
      • U contains standard article text
      • V contains article pointers to articles within that category
    • Metadata Namespace (M)
      • "language" - ISO 639-3 code
      • "creator"
      • "date" - YYYYMMDD
      • "description"
      • "relation"
      • "source" - URL
    • Index Namespace (X)
      • switch to new int compression
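
The two structure items above (a MimeTypeList stored as a zero-terminated list, and a UrlPointerList ordered by URL) can be illustrated with a minimal Python sketch. It is not the openZIM implementation. It assumes that each MIME type is a zero-terminated UTF-8 string and that an empty string ends the list, and it stands in for the URL-ordered article list with a plain sorted Python list. The function names `parse_mime_type_list` and `find_by_url` are hypothetical.

```python
from bisect import bisect_left


def parse_mime_type_list(buf: bytes) -> list:
    """Read zero-terminated strings from buf.

    Assumption (not from the roadmap): an empty string marks the end of the list.
    """
    mime_types = []
    pos = 0
    while pos < len(buf):
        end = buf.index(b"\x00", pos)  # raises ValueError if a terminator is missing
        if end == pos:
            # Empty entry: treated here as the end-of-list marker.
            break
        mime_types.append(buf[pos:end].decode("utf-8"))
        pos = end + 1
    return mime_types


def find_by_url(sorted_urls: list, url: str) -> int:
    """Binary search in a list ordered by URL; return the index, or -1 if absent."""
    i = bisect_left(sorted_urls, url)
    if i < len(sorted_urls) and sorted_urls[i] == url:
        return i
    return -1
```

Keeping the article list ordered by URL is what makes the lookup a binary search rather than a linear scan. The index returned by `find_by_url` would then select the matching pointer, and a directory entry could refer to its MIME type by position in the parsed list.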
Later
  • Layout Namespace (A / B)
    • A - HTML body
    • B - HTML header template
    • reader sets a flag when loading the library to request either the HTML body alone or the full layout built with the header template
    • MIME types used
      • html-body
      • html-layout
  • Packaging
    • Debian maintainer, to be contacted by Tommi
    • RPM? - maybe via the openSUSE Build Service
    • static binaries should be updated regularly
    • Emmanuel adds Microsoft Visual Studio project file to SVN
  • Updating
    • tool to merge two ZIM files
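
The Layout Namespace idea above (the reader's flag selects either the bare HTML body or the full layout) might look roughly like the following sketch. The roadmap does not say how a header template is combined with a body, so the `{body}` placeholder and the function name `assemble_page` are purely assumptions for illustration. Only the MIME type names `html-body` and `html-layout` come from the list above.

```python
def assemble_page(body: str, header_template: str, full_layout: bool) -> tuple:
    """Return (mime_type, content) according to the reader's flag."""
    if not full_layout:
        # Reader asked for the body only (namespace A content).
        return "html-body", body
    # Assumption: the namespace B template marks the body position with "{body}".
    return "html-layout", header_template.replace("{body}", body)
```

For example, `assemble_page("<p>Hi</p>", "<html><body>{body}</body></html>", True)` would yield the full page tagged `html-layout`, while passing `False` would return the body unchanged, tagged `html-body`.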
June 2009
  • First openZIM release of a Wikipedia DVD, for LinuxTag 2009 in Berlin.