[Engineering] CI outage - ongoing

Antoine Musso amusso at wikimedia.org
Tue Jul 5 22:47:10 UTC 2016


On 05/07/16 23:28, Chad Horohoe wrote:
> Hi folks,
>
> Right now our CI infrastructure (Zuul/Jenkins/Nodepool) are having a bad
> day and aren't able to spawn new instances to perform tests. The outage
> is ongoing and there isn't an ETA for restoration of service just yet.
>
> In the meantime: please avoid force-merging (doing the Verified+2 check
> yourself) and skipping Jenkins unless you're dealing with an urgent
> production issue that must land today. Doing so makes Zuul get extra
> noisy which makes further diagnosis difficult.
>
> Thanks for your patience!
>
> -Chad & rest of RelEng

Hello,

The issue is resolved now and the backlog has been processed.

It started around 19:40 UTC when labs lost the ability to create
instances. That fully recovered at 21:40 UTC, and the backlog was
completely processed by 22:30 UTC.



-- 
Antoine "hashar" Musso


