Adding a userspace application crash handling system to autotest

This patch adds a system that watches for user space
segmentation faults, writes core dumps and produces a
basic core dump analysis report. We believe such a system
will benefit autotest as a whole: having a core dump and
a dump analysis for each application that crashes during
an autotest run gives test engineers richer debugging
information.

The system consists of 2 parts:

 * Modifications to the test code that enable core dump
generation, register a core handler script in the kernel
and check for generated core files at the end of each
test.

 * A core handler script that writes the core to each
test's debug dir in a convenient way, along with a
report that currently identifies the process that
died and includes a gdb stack trace of it. As the system
gets in shape, we could add more scripts that do
fancier stuff (such as handlers that use frysk to get
more info such as memory maps, provided that frysk is
installed on the machine).
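
To make the two parts above more concrete, here is a minimal
sketch of the mechanism, not the code in this patch. It assumes
the Linux pipe form of /proc/sys/kernel/core_pattern described
in core(5) and is written for Python 3. The function names, the
crash.<exe>.<pid> directory layout and the report fields are
hypothetical, chosen only for illustration.

```python
import os
import shutil

# Real kernel interface, documented in core(5).
CORE_PATTERN_PATH = "/proc/sys/kernel/core_pattern"


def build_core_pattern(handler_path):
    """Return a core_pattern line that pipes cores to handler_path.

    %p is the PID, %t the dump time (seconds since the epoch) and
    %e the executable name, as documented in core(5).
    """
    return "|%s %%p %%t %%e" % os.path.abspath(handler_path)


def register_handler(handler_path, pattern_path=CORE_PATTERN_PATH):
    """Write the pattern. Needs root when targeting the real /proc file."""
    with open(pattern_path, "w") as pattern_file:
        pattern_file.write(build_core_pattern(handler_path) + "\n")


def save_core(core_stream, debug_dir, pid, timestamp, exe):
    """Copy the core from a binary stream and write a short report.

    The crash.<exe>.<pid> layout is an assumption of this sketch.
    """
    crash_dir = os.path.join(debug_dir, "crash.%s.%s" % (exe, pid))
    os.makedirs(crash_dir, exist_ok=True)
    core_path = os.path.join(crash_dir, "core")
    with open(core_path, "wb") as core_file:
        shutil.copyfileobj(core_stream, core_file)
    report_path = os.path.join(crash_dir, "report")
    with open(report_path, "w") as report:
        report.write("Program: %s\nPID: %s\nTime: %s\n"
                     % (exe, pid, timestamp))
    return core_path, report_path


def gdb_backtrace_command(exe_path, core_path):
    """Build a gdb command line that prints a backtrace and exits."""
    return ["gdb", "-batch", "-ex", "bt", exe_path, core_path]
```

In a real handler the kernel supplies the core on standard
input, so the script would call save_core() with
sys.stdin.buffer and the values it received in sys.argv, then
run the gdb command and append its output to the report.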

This is a proof of concept of the system. I am sending it
to the mailing list at this early stage so I can get
feedback on the feature. The system passes my basic
tests:

 * Run a simple long test, such as the kvm test, and
crash an application while the test is running. Reports
are generated in test.debugdir.

 * Run a slightly more complex control file, with 3 parallel
bonnie instances at once, and crash an application while the
test is running. Reports are generated in all
test.debugdirs.

3rd try:
 * Explicitly enabled core dumps using the resource module
 * Fixed a bug in the crash detection code, and factored
   it into a utility function.
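
As an illustration of these two changes, the sketch below shows
one way to enable core dumps with the standard resource module
and to collect crash results from a debug directory. It is not
the code in this patch: the function names are hypothetical, and
it assumes the handler stores each crash in a crash.*
subdirectory of the test's debug dir.

```python
import glob
import os
import resource


def enable_core_dumps():
    """Raise the soft core size limit to the hard limit.

    The new limit applies to this process and is inherited by its
    children. If the hard limit is 0, core dumps stay disabled.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_CORE)
    resource.setrlimit(resource.RLIMIT_CORE, (hard, hard))
    return resource.getrlimit(resource.RLIMIT_CORE)


def find_crash_reports(debug_dir):
    """List crash directories under debug_dir, sorted by name.

    The crash.* naming is an assumption of this sketch.
    """
    return sorted(glob.glob(os.path.join(debug_dir, "crash.*")))
```

Keeping the detection in one small function means every test can
call it from the same place once it finishes running.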

I believe we are good to go now.

Signed-off-by: Lucas Meneghel Rodrigues <lmr@redhat.com>
---
 client/common_lib/test.py     |   66 +++++++++++++-
 client/tools/crash_handler.py |  202 +++++++++++++++++++++++++++++++++++++++++
 2 files changed, 266 insertions(+), 2 deletions(-)
 create mode 100755 client/tools/crash_handler.py
