Re: [Wikitech-l] An alternate parser

17 Aug 2004

On Fri, 2004-08-13 at 20:46 +0200, Magnus Manske wrote:
...
  Warning: Yeat Another Crazy Idea of Mine ahead. If
you're sick of these 
 (by bitter experience;-) delete this mail *now*.

 Still here? Great!

 OK, we all know that the current parser, while working, is not the final 
 word. It is kinda slow due to multi-pass, the source is confusing, and 
 there are some persistant bugs in it, like the template malfunctions.

 I therefore suggest a new structure:
 1. Preprocessor
 2. Wiki markup to XML
 3. XML to (X)HTML 
This is what i'm writing currently, except that the parser will return a
dom tree instead of the xml dump of it. Saves another parse step before
postpocessing (template replacement, link status updates etc and the
final xslt transform). Besides being able to save the dom tree as xml at
any stage it's also possible to pickle the python object, which is a bit
faster to wake up than xml.

Caveat: Based on python's xml features, don't know a lot about php dom
implementations.
-- 
Gabriel Wicke

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] An alternate parser