Difference between revisions of "Roadmap"
Jump to navigation
Jump to search
Line 4: | Line 4: | ||
* Finalizing ZIM file format | * Finalizing ZIM file format | ||
** ZIM file header: | ** ZIM file header: | ||
*** add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList") | *** <s>add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList")</s> | ||
*** add Pointer to MimeTypeList | *** <s>add Pointer to MimeTypeList</s> | ||
** ZIM file structure: | ** ZIM file structure: | ||
*** add UrlPointerList (article list ordered by URL) | *** <s>add UrlPointerList (article list ordered by URL)</s> | ||
*** add MimeTypeList to store MimeTypes in a zero-terminated list | *** <s>add MimeTypeList to store MimeTypes in a zero-terminated list</s> | ||
*** make new integer compression (UTF-8 compression / ZInt compression) | *** <s>make new integer compression (UTF-8 compression / ZInt compression)</s> | ||
*** break version number into major / minor number | *** break version number into major / minor number | ||
** Directory Entry: | ** Directory Entry: | ||
*** drop QUnicode on article titles | *** <s>drop QUnicode on article titles</s> | ||
*** add URL | *** <s>add URL</s> | ||
*** add rev_id int compressed | *** <s>add rev_id int compressed</s> (not int compressed) | ||
** Index Namespace (X) | ** Index Namespace (X) | ||
*** switch to new int compression | *** <s>switch to new int compression</s> | ||
;Later | ;Later |
Revision as of 19:28, 2 January 2010
See also current Status and next steps.
- Until end of 2009
- Finalizing ZIM file format
- ZIM file header:
add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList")add Pointer to MimeTypeList
- ZIM file structure:
add UrlPointerList (article list ordered by URL)add MimeTypeList to store MimeTypes in a zero-terminated listmake new integer compression (UTF-8 compression / ZInt compression)- break version number into major / minor number
- Directory Entry:
drop QUnicode on article titlesadd URLadd rev_id int compressed(not int compressed)
- Index Namespace (X)
switch to new int compression
- ZIM file header:
- Later
- Layout Namespace (A / B)
- A - HTML body
- B - HTML header template
- reader sets flag when loading library if it wants to get HTML body or full layout using header template
- MIME types used
- html-body
- html-layout
- Cluster compression
- add LZMA compression - DONE LZMA compression
- switch to compression streaming (only keep in memory what is really needed)
- Packaging
- Debian maintainer, contact by Tommi
- RPM? - maybe openSuSE buildservice
- static binaries should be updated regularly
- Emmanuel adds Microsoft Visual Studio project file to SVN
- Category Namespace (U / V)
- U contains standard article text
- V contains article pointers to articles within that category
- Metadata Namespace (M) - these fields should be available as variables for layout templates
- "language" - ISO Code 639-3
- "creator"
- "date" - YYYYMMDD
- "description"
- "relation"
- "source" - URL
- ZIM export running on http://download.wikimedia.org/ This is not likely to happen until after the WMF fundraiser finishes which will be after 1/2010 Tomasz 15:06, 22 November 2009 (UTC)
- work in MW API
- dumper has to add license name and link to the HTML content
- Updating
- tool to merge two ZIM files
- April 2009