[Labs-l] Bot cluster reliability issues

Ryan Lane rlane at wikimedia.org
Mon Feb 4 07:58:03 UTC 2013


On Sun, Feb 3, 2013 at 1:50 PM, Rschen7754 <rschen7754.wiki at gmail.com>wrote:

> I've been getting more and more frustrated with the complete lack of
> reliability here. Currently I am unable to run my bot anywhere: bots-2
> can't access storage anymore, bots-3 is completely dead, and bots-4 has no
> memory. I don't have root access on bots-nr1 and -nr2 and can't install the
> necessary packages.
>
>
Thanks for reporting this. Two of the four gluster servers glusterd service
had crashed. I'm working on repairing this issue right now.


> Even when I have been able to run the bot, I keep having to log in every
> day and restart the bot because it shut down for some reason or another. At
> this point, I would prefer running it on personal computing equipment
> because that would be more stable at this point.
>
> Are there any plans to resolve these issues, or do I need to find another
> hosting solution entirely?
>
>
We actively work on issues when they are reported. The current bots setup
is non-ideal and needs some love. The new contractor position will have
this as a priority, but we should likely take some time to fix some of the
more serious issues immediately.

- Ryan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wikimedia.org/pipermail/labs-l/attachments/20130203/dfe80a11/attachment.html>


More information about the Labs-l mailing list