LinuxTag 2010


June 9th - 12th

Travelling

Accommodation

Hotel Pension Messe

Booked:

  • Single Room 48 EUR (-5% discount)
  • Double Room 60 EUR (-5% discount)

including breakfast

How to reach

Take the S-Bahn to "Messe Nord/ICC" - the hotel is right next to the station, at the corner of Wundtstrasse / Neue Kantstrasse.

Participants

Please add yourself to the list.

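08 | 09 | 10 | 11 | 12 | 13 | Transportation | Name | Accommodation
X | X | X | X | X | X | aircraft or train from Basel | | double room, booked 2010-03-27 (shared with the next row)
X | X | X | X | X | X | aircraft or train from Basel | Annette Schneider | (same double room)
X | X | X | X | X | X | | Tommi Mäkitalo | single room, booked 2010-03-27
 | | | | | | | Mirko Lindner(?) |
 | | | | | | | Mirko Voigt(?) |
 | | | | | | | Emmanuel Engelhart | n/a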

Material List

Please add your name if you can bring something from this list, and add anything else you will bring or think we should get.

Talk

  • Title: openZIM - Wikipedia Offline - Current State
  • Language: English preferred, German possible
  • Category: Applications
  • License: Creative Commons License

Abstract

for program committee, approx. 1 page

Since 2008 we have been working on a free toolchain for offline Wikipedia, providing a container format plus reader and writer software. The first attempt was based on the Zeno file format used by Directmedia, the publisher of the Wikipedia DVD. The Wikipedia DVD 2008/2009 contained a free implementation developed by the people who today form the openZIM project, and it marked the start of the project.

Since 2009, openZIM has defined itself as "a free and open implementation of the ZIM file format". It provides free documentation of the file format as well as the tools to create ZIM files from web content and a reader to present them in a browser. The name ZIM, inspired by the Zeno format, stands for "Zeno Improved".

While the open documentation of the ZIM file format enables everyone to write software that uses it, openZIM also provides tools to create ZIM files from HTML, a reader application that lets users browse the content in a standard web browser, and some advanced tools. These are all based on zimlib, a library written in C++ that can easily be used in other applications to make them ZIM-aware.

Since our start we have released two versions of the ZIM format. The first version reached the public during LinuxTag 2009, when Wikimedia CH sponsored a batch of 500 copies of the German Wikipedia on DVD as free giveaways, which brought us a lot of valuable feedback.

During our first year we were able to attract partners such as the Wikimedia Foundation, which is working on a regular ZIM export for MediaWiki, the Israeli OLPC project, which wants to provide the Hebrew Wikipedia on XO computers, and a manufacturer of embedded devices that presents Wikipedia Offline on a device with only 8 MB of RAM. Most of these people gathered at a Developers Meeting, and we are very happy that we were able to cover all the needs that came up, even though some of them seemed mutually exclusive at first glance.

With this talk we introduce openZIM to those who have not heard of it and give a short overview of the history of offline Wikipedia in general, to provide a common basis and an understanding of the issues involved in handling huge amounts of data such as Wikipedia content. The main part of the talk focuses on these issues and how openZIM has addressed them, including the internal structures of ZIM, the tools and the zimlib library. The goal is that attendees understand how ZIM works, know where to find implementation details in case the file format is to be implemented in another programming language, and are able to work with zimlib, which can be used in any C++ application to access the ZIM format directly.

Short Abstract

for visitors / schedule, max. 450 characters

openZIM provides a free and open ZIM file format and tools - developed for and used to provide offline access to Wikipedia content.

We will give an introduction to openZIM and the ZIM format and then focus on the technical issues of handling data volumes like Wikipedia's and how we have addressed them. After the talk you will know the available tools, have an overview of the file format and of where to find more details if you plan to implement it yourself, and know how to start using zimlib in your own application.

Links

  • URL: http://openzim.org/
  • Description: website of the openZIM developer team with documentation, SVN, bugtracker and ZIM file archive

Project

General Information

Project Description

  • English (short): The openZIM project develops a file format and software to store and use Wikipedia articles and other content on offline devices - suitable for handhelds, ebook readers, DVDs or classroom servers.
  • English:
    Our mission is to make Wiki content available offline, with a primary focus on the contents of Wikipedia and other Wikimedia projects.
    The core focus of this project is on extracting and encoding all data from MediaWiki for offline use. Together with the Wikimedia Foundation we are working to create a regular process for producing ZIM files for the Wikimedia projects.
    When this is successful we would like to add support for other wiki engines so that this can become a ubiquitous file format for offline wiki data. We even believe the file format could be flexible enough to support any kind of web data.
    The general goals we want to achieve are:
    • the file format is the most efficient and works on all platforms, embedded devices and desktop computers
    • the file format is used by all kinds of publishers, even outside the Wikimedia community
    • both reader applications and content files are freely interchangeable
    • everyone can implement software for this file format because it is an open and documented standard
    • there is at least one free implementation which is open source
    • the free implementation can easily be embedded in other software, so that the file format can be used without having to implement it again
  • German (short): Das openZIM-Projekt entwickelt ein Dateiformat und Programme um Wikipedia-Artikel und andere Daten ausserhalb des Internets zu nutzen - passend für Handheld-Geräte, ebook-Leser oder Schulserver.
  • German:
    Unser Ziel ist es, Wiki-Inhalte ausserhalb des Internets verfügbar zu machen, dabei konzentrieren wir uns vorrangig auf die Inhalte der Wikipedia und der anderen Wikimedia-Projekte.
    Der Kern des Projektes besteht darin, Daten aus MediaWiki zu exportieren und für die Offline-Nutzung aufzubereiten. Mit der Hilfe der Wikimedia Foundation arbeiten wir daran, einen regelmässigen Prozess zu schaffen, um ZIM-Dateien aus den Wikimedia-Projekten zu exportieren.
    Sobald dies abgeschlossen ist, möchten wir auch andere Wiki-Software unterstützen und damit ZIM zum universellen Datenformat für Offline-Wikidaten machen. Wir denken, dass das Dateiformat flexibel genug ist, um jede Art von Internet-Daten zu speichern.
    Unsere allgemeinen Ziele sind:
    • ZIM ist das effizienteste Speicherverfahren und funktioniert auf allen Plattformen, embedded Geräten und normalen Computern
    • das Dateiformat wird von allen Herausgebern benutzt, auch ausserhalb der Wikimedia-Gemeinschaft
    • sowohl die Lesesoftware als auch die Inhalte sind beliebig austauschbar
    • jeder kann seine eigene ZIM-Software schreiben, da es sich um einen offenen, dokumentierten Standard handelt
    • es gibt mindestens eine Implementierung als Freie Software
    • die Freie Implementierung kann einfach in andere Programme eingebunden werden, um das Dateiformat zu nutzen, ohne es selbst implementieren zu müssen

Project Community

Development Process

Developer Community Size

  • Germany: 4
  • Europe: 7
  • Worldwide: 9
  • Developers employed by single company: 0

Supporting Institution

Exhibition at LinuxTag

Last year's participation

  • Did you participate at last year's LinuxTag? yes
  • Feedback on booth size and equipment: we had a small booth shared with tntnet, with two counters and one shared table. This was perfect, and we would like the same setup again this year.
    A network connection (LAN) was not available at the booth. Later we got a cable from another booth, which caused problems, e.g. some exhibitors shut down their booth early on Saturday and cut us off from the network.
  • Feedback on booth location: the location was perhaps the quietest area in the hall, directly behind the second entrance, where hardly any people were around - only a lot of commercial exhibitors.
    A spot further inside the halls, near more community projects, would be nice - e.g. Linux4Africa, SkoleLinux, OpenOffice.org.

Presentation

  • Highlights: new version of the ZIM file format, running smoothly on tiny embedded devices
  • Merchandising: n/a
  • Partner Project(s): tntnet

Booth

  • Wanted size: 9 m²
  • Number of Staff: 5
  • Equipment: posters, projector
  • Advantages for other exhibitors:
  • Required special booth equipment: board to put the projector on

Public Relations

Press Contact

  • Name: Manuel Schneider
  • Address: Scheffelstrasse 1, 79585 Steinen
  • URL: http://openzim.org/
  • Email: manuel.schneider@wikimedia.ch

News during LinuxTag

  • News: n/a