Difference between revisions of "Roadmap"
Jump to navigation
Jump to search
Line 7: | Line 7: | ||
*** add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList") | *** add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList") | ||
*** add Pointer to MimeTypeList | *** add Pointer to MimeTypeList | ||
** ZIM file structure: | ** ZIM file structure: | ||
*** add UrlPointerList (article list ordered by URL) | *** add UrlPointerList (article list ordered by URL) | ||
Line 13: | Line 12: | ||
*** make new integer compression (UTF-8 compression / ZInt compression) | *** make new integer compression (UTF-8 compression / ZInt compression) | ||
*** break version number into major / minor number | *** break version number into major / minor number | ||
** Directory Entry: | ** Directory Entry: | ||
*** drop QUnicode on article titles | *** drop QUnicode on article titles | ||
*** add URL | *** add URL | ||
*** add rev_id int compressed | *** add rev_id int compressed | ||
** Cluster: | ** Cluster: | ||
*** add LZMA compression | *** add LZMA compression | ||
** Category Namespace (U / V) | ** Category Namespace (U / V) | ||
*** U contains standard article text | *** U contains standard article text | ||
*** V contains article pointers to articles within that category | *** V contains article pointers to articles within that category | ||
** Metadata Namespace (M) | ** Metadata Namespace (M) | ||
*** "language" - ISO Code 639-3 | *** "language" - ISO Code 639-3 | ||
Line 33: | Line 28: | ||
*** "relation" | *** "relation" | ||
*** "source" - URL | *** "source" - URL | ||
** Index Namespace (X) | ** Index Namespace (X) | ||
*** switch to new int compression | *** switch to new int compression |
Revision as of 13:48, 22 November 2009
See also current Status and next steps.
- Until end of 2009
- Finalizing ZIM file format
- ZIM file header:
- add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList")
- add Pointer to MimeTypeList
- ZIM file structure:
- add UrlPointerList (article list ordered by URL)
- add MimeTypeList to store MimeTypes in a zero-terminated list
- make new integer compression (UTF-8 compression / ZInt compression)
- break version number into major / minor number
- Directory Entry:
- drop QUnicode on article titles
- add URL
- add rev_id int compressed
- Cluster:
- add LZMA compression
- Category Namespace (U / V)
- U contains standard article text
- V contains article pointers to articles within that category
- Metadata Namespace (M)
- "language" - ISO Code 639-3
- "creator"
- "date" - YYYYMMDD
- "description"
- "relation"
- "source" - URL
- Index Namespace (X)
- switch to new int compression
- ZIM file header:
- Later
- Layout Namespace (A / B)
- A - HTML body
- B - HTML header template
- reader sets flag when loading library if it wants to get HTML body or full layout using header template
- MIME types used
- html-body
- html-layout
- Packaging
- Debian maintainer, contact by Tommi
- RPM? - maybe openSuSE buildservice
- static binaries should be updated regularly
- Emmanuel adds Microsoft Visual Studio project file to SVN
- ZIM export running on http://download.wikimedia.org/
- work in MW API
- Updating
- tool to merge two ZIM files
- June 2009
- 1. openZIM release of a Wikipedia DVD for LinuxTag 2009 in Berlin.