[Foundation-l] Foundation-l word cloud

John Vandenberg jayvdb at gmail.com
Mon Oct 4 23:24:06 UTC 2010


On Tue, Oct 5, 2010 at 7:48 AM, Peter Gehres <in2thats12 at gmail.com> wrote:
> In looking at the contents of the gzip'ed archives, stripping out the
> headers does not look trivial, but it appears that it could be done in most
> cases.  A whole other problem is quoted text.  Any preference on whether or
> not that should be included as well? If it is included, the word are not
> entirely accurate.

If it is including quoted passages, a simple way to address this is to
remove any line starting with '>' and all attachments.

btw, very interesting Nemo!

--
John Vandenberg



More information about the foundation-l mailing list