The biggest issues I see are the lack of any good logging, monitoring and alerting tools.  Things like icinga, logstash, grafina.  The kind of things that are standard for supporting any production system.  I've raised this before, so I won't belabor the point here.

And https://phabricator.wikimedia.org/T256426 continues to be an every-day pain in my side.  The related https://phabricator.wikimedia.org/T127367 is triaged as high priority.  It's been open for 6-1/2 years.



On Sep 7, 2022, at 10:17 AM, Slavina Stefanova <sstefanova@wikimedia.org> wrote:

On a side note, I'd be interested in hearing what you dislike about Toolforge, if you'd like to share. We (the cloud services team) are working on improving Toolforge and don't always get as much feedback, good or bad, as we'd want.