Daniel Friesen wrote:
Perhaps some deeper segregation of those stats would be
useful. ie:
Separate the numbers of styles used between templates and pages.
Then we might have a better idea of what kind of patterns are being used
directly in pages that should actually be moved to templates or
stylesheets.
This reply confused me a little. The script I ran exclusively looked at
pages in the main namespace and exclusively looked at an XML dump, which
is unexpanded wikitext. That is, assuming people aren't doing a lot of
inline styling as arguments/parameters to templates, we should already
have a decent amount of segregation as I only looked at direct uses.
Looking at the template namespace or looking at pages post-expansion would
be annoying. I think templates aren't necessarily a bad place for inline
styling, so I'm a lot less focused on templates than I am on articles.
Vi to wrote:
Do you have a old dump to check whatever the ratio has
increased?
I'm personally not very interested in doing this, but using a similar dump
from <https://dumps.wikimedia.org/> and following the instructions laid
out in <https://phabricator.wikimedia.org/T115228> should make this fairly
easy to do, if anyone is interested. I tried to methodically document all
of the relevant source code and commands that I used, so that this same
audit or an audit on another project or dump would be less work.
MZMcBride