Wikitech-l October 2010

wikitech-l@lists.wikimedia.org

95 participants
67 discussions

by Ryan Lane

I've just finished the SVN migration. If you've logged into the new server before the migration via SSH, you'll have SSH key problems. To fix this issue, edit ~/.ssh/known_hosts, and remove the entries for formey and formey.wikimedia.org. Respectfully, Ryan Lane

13 years, 7 months

reintroduce myself

by Ashar Voultoiz

Hello, Just a quick message to reintroduce myself to people who might be wondering who is this new committer. I am from France and discovered Wikipedia in 2002. Getting interested in bug fixing, I have eventually been granted commit access by Tim or Brion back in 2003 or 2004. I haven't contributed a lot of code but have an overall knowledge of MediaWiki. I mostly fixed funny bugs, converted double quotes to single quotes and occasionally synced stuff to live (read: blank page on live site). I am back around since a few weeks and willing to contribute again to MediaWiki development. I have no aim in particular beside having fun and meeting some new people. My area of interests are in no special order : - parser (still have to understand Tim's preprocessing stuff) - ajax features - testing - IPv6 My secret project is to migrate to git. I beg your pardon for my very basic english :^b I got a short user page at : http://en.wikipedia.org/wiki/User:Hashar My main page is on the french wikipedia (french language only) : http://fr.wikipedia.org/wiki/Utilisateur:Hashar -- Ashar "hashar" Voultoiz

13 years, 7 months

Migrating SVN from mayflower to formey

by Ryan Lane

In the next hour or two we'll be migrating SVN to a new server. Nothing is changing from the usage perspective. During this time SVN may be inaccessible for a few minutes. Let me know if you are having an access issue, and I'll fix it for you. Respectfully, Ryan Lane

13 years, 7 months

Release process

by Rob Lanphier

Hi everyone, There have been a number of calls to make the release process more predictable (or maybe just faster). There are plenty of examples of projects that have very predictable release schedules, such as the GNOME project or the Ubuntu Linux distribution. It's not at all unreasonable to expect that we could achieve that same level of predictability if we're prepared to make some tradeoffs, such as: 1. Is the release cadence is more important (i.e. reverting features if they pose a schedule risk) or is shipping a set of features is important (i.e. slipping the date if one of the predetermined feature isn't ready)? For example, as pointed out in another thread + IRC, there was a suggestion for creating a branch point prior to the introduction of the Resource Loader.[1] Is our priority going to be about ensuring a fixed list of features is ready to go, or should we be ruthless about cutting features to make a date, even if there isn't much left on the feature list for that date? 2. Projects with generally predictable schedules also have a process for deciding early in the cycle what is going to be in the release. For example, in Ubuntu's most recently completed release schedule [2], they alloted a little over 23 weeks for development (a little over 5 months). The release team slated a "Feature Definition Freeze" on June 17 (week 7), with what I understand was a pretty high bar for getting new features listed after that, and a feature freeze on August 12 (week 15). Many features originally slated in the feature definition were cut. Right now, we have nothing approaching that level of formality. Should we? 3. How deep is the belief that Wikimedia production deployment must precede a MediaWiki tarball release? Put another way, how tightly are they coupled? Thoughts on these? Any other tradeoffs we need to consider? We're going to have a number of conversations over the coming days on this topic, so I wanted to add a little structure and get some (more) initial impressions now. Rob [1] MZMcBride's mail: http://lists.wikimedia.org/pipermail/wikitech-l/2010-October/049969.html ...which in turn references IRC from 2010-10-18 @ 14:08 or so: http://toolserver.org/~mwbot/logs/%23mediawiki/20101018.txt [2] Ubuntu Maverick Meerkat (10.10) release schedule: https://wiki.ubuntu.com/MaverickReleaseSchedule

13 years, 7 months

Collaboration between staff and volunteers: a two-way street

by Roan Kattouw

Since the discussion about staff collaboration with volunteers started a few weeks ago, actions and statements by staff members have undergone an increasing amount of scrutiny and criticism. That in itself is not a bad thing necessarily: staff members need to be kept on their toes and not be allowed to get away with doing bad things, and some scrutiny and criticism is needed to accomplish this. In recent weeks, however, posts on this mailing list have gone way beyond 'some' scrutiny and criticism, instead suggesting something closer to distrust and paranoia. Statements made by staff members have been picked apart, with anything that could be interpreted to suggest an exclusive, disrespectful or otherwise negative attitude towards volunteers being interpreted this way, along with the occasional ominous warning about how the world will end if this attitude won't change. This extreme behavior comes from just a few people, but I'm seeing a less extreme version of it in other people too. Unlike the former group, the latter group doesn't seem to be particularly paranoid or uncivil, but they seem to be getting increasingly critical of staff members as well. Quite understandably, staff members aren't gonna be encouraged to be more collaborative when they get the feeling that their attempts to do so more often than not result in increased scrutiny, criticism or drama and that their sometimes unfortunate but nevertheless good-faith and well-intentioned actions or words backfire the way we've seen happen a few times recently. Rather than feeling this environment encourages them to collaborate (which it should), they'll feel this environment is hostile and will be driven away from it if it continues to feel hostile. A crucial point that I think is being missed by a number of people right now is that collaboration is a two-way street. Staffers and volunteers are both responsible for making it work. While staff members have to be open to, respectful of and collaborative with volunteer developers, the reverse is also true: volunteers are supposed to make staff members feel welcome and appreciated, and treat them as their equals. Right now, the opposite seems to be happening, which I fear will lead to a negative spiral. A few weeks ago, staff members were called upon to adjust their attitudes to do their part in fostering collaboration between staff and volunteers. Volunteers, in turn, should be aware that they have a part to play too. Also, both sides should realize behaviors don't change overnight, and should give each other time to adapt and cut each other some slack in the meantime. Roan Kattouw (Catrope)

13 years, 7 months

ResourceLoader Debug Mode

by Trevor Parscal

There seems to be some confusion about how ResourceLoader works, which has been leading people to make commits like r73196 and report bugs like #25362. I would like to offer some clarification. ResourceLoader, if you aren't already aware, is a new system in MediaWiki 1.17 which allows developers to bundle collections of *resources* (like JavaScript and CSS files, or localized messages) into *modules*. Modules may represent any number of scripts, styles and messages, which are read from the file system, the database, or generated by software. When a request is made for one or more modules, each resource is packaged together and sent back to the client as a response. The way in which these requests and responses are performed depends on whether debug is on or off. When debug mode is off: * Modules are requested in batches * Resources are combined into modules * Modules are combined into a response * The response is minified When debug mode is on: * Modules are requested individually * Resources are combined into modules I think it's debatable whether debug=true mode goes far enough, since it still combines resources into modules, and I am open to contributions that can make debug=true mode even more debugging friendly by delivering the resources to the client as unchanged as possible. I also think it's debatable if debug=false mode goes far enough, since things like Google Closure Compiler have been proven to even further reduce the size of JavaScript resources, so I am also open to contributions which can make debug=false even more production friendly by improving front-end performance. The commits and bugs that I'm contending here are ones which are aiming to dilute the optimized nature of debug=false mode, when debug=true mode is really what they should be using or improving. These kinds of changes and suggestions result in software that is neither optimized for debugging or for production, making the front-end performance of the site in production slower without making it any easier to debug than it would have been by using debug=true. If you are a developer, working on your localhost, you probably want to code with... $wgResourceLoaderDebug = true; .. and then test that things work in debug=false mode before committing your code. This will result in more requests but less processing, which will be much faster when developing on localhost. I hope this helps clarify this situation. - Trevor

13 years, 7 months

processing irc://irc.wikimedia.org/en.wikipedia

by Ed Summers

A question from an IRC/wikipedia newbie. I've been experimenting with processing pubmsg events in irc://irc.wikimedia.org/en.wikipedia (thanks for the channel btw) and have been noticing some control characters that I wasn't expecting to see in the message content. I've attached a raw line form the channel, where you should be able to see an 0x03 byte (ctrl-c) at position 60. There are several others scattered throughout the line followed by integers. Is this a character encoding of some kind that I need to decode, or some artifact of the IRC protocol that I need to handle? Any advice/tips would be greatly appreciated! //Ed

13 years, 7 months

Convention for logged vs not-logged page requests

by Rob Lanphier

Hi all, In diving into a problem with logging[1], we discovered that we were unintentionally treating several special page accesses (in this case, containing included Javascript) as normal pageviews, thus throwing our pageview statistics way off. The proposed solution involves changing the way we access those Javascript requests from this form: http://en.wikipedia.org/wiki/Special:BannerController ...to this form: http://en.wikipedia.org/w/index.php?title=Special:BannerController I'm assuming this convention isn't documented anywhere (other than earlier today on the wikitech wiki[2]). Before we run off and document this as something code reviewers need to look out for, I'd like to make sure this is really how we'd like to make the distinction. Is this a sensible convention, or is there a different convention we should implement? Note that any changes to the convention would need to be implemented here: http://svn.wikimedia.org/viewvc/mediawiki/trunk/webstatscollector/filter.c?… ...so futzing with the convention isn't free, but *may* be worth it if we have arrive at a vastly superior convention. Rob [1] https://bugzilla.wikimedia.org/show_bug.cgi?id=25564 [2] http://wikitech.wikimedia.org/view/Squid_logging#Inflated_Stats

13 years, 7 months

using parserTests code for selenium test framework

by Dan Nessett

I have been tasked to evaluate whether we can use the parserTests db code for the selenium framework. I just looked it over and have serious reservations. I would appreciate any comments on the following analysis. The environment for selenium tests is different than that for parserTests. It is envisioned that multiple concurrent tests could run using the same MW code base. Consequently, each test run must: + Use a db that if written to will not destroy other test wiki information. + Switch in a new images and math directory so any writes do not interfere with other tests. + Maintain the integrity of the cache. Note that tests would *never* run on a production wiki (it may be possible to do so if they do no writes, but safety considerations suggest they should always run on a test data, not production data). In fact production wikis should always retain the setting $wgEnableSelenium = false, to ensure selenium test are disabled. Given this background, consider the following (and feel free to comment on it): parserTests temporary table code: A fixed set of tables are specified in the code. parserTests creates temporary tables with the same name, but using a different static prefix. These tables are used for the parserTests run. Problems using this approach for selenium tests: + Selenium tests on extensions may require use of extension specific tables, the names of which cannot be elaborated in the code. + Concurrent test runs of parserTests are not supported, since the temporary tables have fixed names and therefore concurrent writes to them by parallel test runs would cause interference. + Clean up from aborted runs requires dropping fossil tables. But, if a previous run tested an extension with extension-specific tables, there is no way for a test of some other functionality to figure out which tables to drop. For these reasons, I don't think we can reuse the parserTests code. However, I am open to arguments to the contrary. -- -- Dan Nessett

13 years, 7 months

Selenium Framework - test run configuration data

by Dan Nessett

Back in June the Selenium Framework had a local configuration file called LocalSeleniumSettings.php. This was eliminated by Tim Starling in a 6/24 commit with the comment that it was an insecure concept. In that commit, new globals were added that controlled test runs. Last Friday, mah ripped out the globals and put the configuration information into the execute method of RunSeleniumTests.php with the comment "@todo Add an alternative where settings are read from an INI file." So, it seems we have dueling developers with contrary ideas about what is the best way to configure selenium framework tests. Should configuration data be exposed as globals or hidden in a local configuration file? Either approach works. But, by going back and forth, it makes development of functionality for the Framework difficult. I am working on code not yet submitted as a patch that now requires reworking because how to reference configuration data has changed. We need a decision that decides which of the two approaches to use. -- -- Dan Nessett

13 years, 7 months

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Wikitech-l October 2010