"Tim Starling"
<t.starling(a)physics.unimelb.edu.au> wrote in
message news:dsfaip$87l$1@sea.gmane.org...
I've made a new diff extension, called wikidiff2.
It uses the same diff
algorithm that we've been
using in PHP, ported to C++. I've done some benchmarks on test sets which
require lots of word-level
diffs. For the lines "a b c d" -> "a b c e" repeated many times,
on srv31,
the timings are:
PHP DifferenceEngine: 10230us per line
wikidiff (old C++ extension): 379us per line
wikidiff2 (new C++ extension): 11.5us per line
Just to double-check (3 sig-figs since that's all we've been given in some
cases):
PHP: 97.8 lines/s
wikidiff: 2640 lines/s
wikidiff2: 87000 lines/s
Strewth! Unless there's something devastatingly wrong with your coding, this
would seem likely to make a huge impact on performance.
Is this currently a serious bottleneck?
What is the next bottleneck that needs consideration?
--
Phil
[[en:User:Phil Boswell]]