Jump to: navigation, search

LinuxTag 2010

2,395 bytes added, 09:36, 11 June 2010
* '''License:''' Creative Commons License
=== Abstract (Presentation) ===
for program committee, approx. 1 page
''With this talk we introduce openZIM for those who haven't heard about it and give a short overview over the history of Offline Wikipedia in general to provide a common basis and an understanding of the issues when dealing with huge data chunks such as Wikipedia content. The main part of the talk is focused on these issues and how openZIM has addressed these. A part of this will involve the ZIM internal structures, tools and the zimlib library. The goal is that attendants are able to understand how ZIM works, where implementational details can be found in case the file format should be implemented in any other programming language as well as enabling to work with the zimlib which can be used in any C++ application to make use of the ZIM format directly.''
=== Abstract (Workshop) ===openZIM deals with the technical issues when webcontent should be usable offline. Especially with big amounts of data, eg. Wikipedia there are many challenges to solve. 1.000.000 articles (all german Wikipedia, only text) take about 7-8 GB disc space, so they need to be compressed. On the other hand we have to deal with very small devices such as embedded Linux gadgets or mobile phones. In the past several special solutions for special needs have been implemented by individuals and abonded soonafter. There is no way to update contents for these applications as nobody can tell how it was stored. openZIM has developped an efficient file format that provides high compression (lzma), fulltext search and that can store all kind of content including images, CSS or JavaScript files. This file format is called ZIM and publicly documented to be implemented and used by everyone who wants to store webcontent offline. openZIM also provides a Free and Open Source implementation of the ZIM file format, written in C++ and published under GPL. The zimlib can be integrated in any third-party software to make it very easy to use the ZIM file format without implementing it. This workshop will provide you an insight into the ZIM file format and the tools provided by the openZIM project and third parties. ZIM runs on standard PCs, XO notebooks (OLPC), mobile phones or Linux gadgets (Ben NanoNote). Manuel will show you reader applications as webserver on Linux, a GUI browser on Windows and a Symbian application running on mobile phones. Tommi will give an introduction on how to use zimlib in another application with live coding example. We invite you to join the discussion, ask questions etc... we can tell a lot about the processes around publishing Wikipedia offline or on the technical challenges and solutions we have been working on, but we don't want to bore you with theoretical things. === Short Abstract (Presentation) ===
for visitors / schedule, max. 450 characters
''We will give a introduction in openZIM and the ZIM format and then focus on technical issues when dealing with amounts of data like Wikipedia and how we have addressed them. After the talk you know the tools that are available, have got a overview on the file format and the resources to find more details if you plan to implement it by yourself and you know how to start using zimlib in your own application.''
=== Short Abstract (Workshop) ===
openZIM provides a free and open ZIM file format and tools - developed for and used to provide offline access to Wikipedia content.
We will give a introduction in openZIM and the ZIM format and then focus on technical issues when dealing with amounts of data like Wikipedia and how we have addressed them. We also give live coding examples on how to use the zimlib in your own applications.
=== Links ===

Navigation menu