Buildbot/OutageReports/20070909-01

From MozillaWiki
Jump to: navigation, search

2007-09-04

On 2007-09-09 at 13:25PDT, try1-win32-slave experienced a service outage for 20 hours.

bug 394841

What was affected:

Windows try builds on trunk.

What was the cause of the outage:

Traceback (most recent call last):
Failure: buildbot.slave.commands.TimeoutError: SIGKILL failed to kill process

Has this type of outage happened before?

Yes.

What was done to repair:

Shell killed and reopened.

What will be done to prevent this in the future:

  • upgrade buildbot to trunk code ?
  • use KillableProcess.py ?