[WikiEN-l] Putting Wikipedia on a CD or DVD

Magnus Manske magnus.manske at web.de
Wed Mar 23 22:00:10 UTC 2005


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

David Gerard schrieb:
> Are there compression algorithms good for text that go smaller than bzip2
> and are fast to uncompress even though they may be slow to compress?

There's a commercial variant of the public domain sqlite engine
(database in a single file, "server" as a linkable library, understands
most of SQL) which can store fields in the sqlite database as zipped BLOBs.

A better alternative might be to write a wrapper around sqlite that can
read a bzip2ed sqlite file.

Ultimately though, en.wikipedia will grow beyond any compression
algorithm's capability. Maybe we should go ahead to DVD, waste an
additional half a gig or so on an uncompressed sqlite file, and have a
fulltext index and some pretty images.

Magnus
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFCQebqCZKBJbEFcz0RAprlAJ4xWlWMOpZZOaoco31aePUGfcgBegCdEN7r
F7GhElGDCw3C8cmTkaF9N9o=
=KAVY
-----END PGP SIGNATURE-----



More information about the WikiEN-l mailing list