fix: sigsev in valgrind due to incorrect optimization #2

not-matthias · 2025-05-13T17:54:36Z

See linear issue for details. The main issue was having zeroStats and startInstrumentation in one function (with both being inlined). This seemed to confuse the compiler. Exporting one of the functions (startInstrumentation in this case) fixes the issue.

Also exported stopInstrumentation since we might want to call it manually later on to reduce the overhead (if there is one).

I'll add a CI step tomorrow:

Add a CI test to check for this (use just test-c)

art049

@not-matthias is this the only way? Not inlining the instrumentation calls is an issue for accuracy of the measurement as we discussed already

art049 · 2025-05-13T18:03:39Z

src/helpers/valgrind/callgrind.zig

@@ -48,6 +48,6 @@ pub inline fn startInstrumentation() void {
 /// Use this to bypass Callgrind aggregation for uninteresting code parts.
 /// To start Callgrind in this mode to ignore the setup phase, use
 /// the option "--instr-atstart=no".
-pub inline fn stopInstrumentation() void {
+pub export fn callgrind_stop_instrumentation() void {


Hmm, not being able to inline this is kindof an issue

The reason I chose to export this was the following:

The export will add 1 additional call to this function, which is not that much considering that we have a few instructions to setup/clean the stack before/afterwards.

We can call this exported function directly from within the benchmark. We can't really control how the compiler lowers the function to assembly.

The callgrind_start_benchmark function doesn't cause any overhead since we directly return to the benchmark:

stop_benchmark when using export:

stop_benchmark when using inline:

As seen on the two examples above, we will execute at least 10+ other instructions to setup the stack, which will cause overhead. The only real way to ensure that we will always call it at the start would be using inline assembly which has an additional maintenance burden.

So my proposed solution is as follows:

Keep the exports for starting and stopping valgrind, and call them in the root.

If we are using valgrind for interpreted languages such as Python, the overhead of 10 instructions will be negligible. If we want to use valgrind for Rust or C++, we can call the callgrind_start/stop_benchmark functions directly.

not-matthias · 2025-05-14T12:34:13Z

Looks like it fails to compile in CI with export, even though it works locally. Very weird 🤔

I'll invest a bit more time to try to figure out what's going on and to see if we can fix it, if that doesn't work I think it's best to just revert back to wrapping the valgrind header. There's little benefit in transpiling C -> Zig -> C.

EDIT: Turns out the issue is wrong inline assembly in the Zig std library. Using the inline assembly as defined in the valgrind header works.

     .x86_64 => asm volatile (
            \\ rolq  $3,    %%rdi ; rolq $13, %%rdi
            \\ rolq  $61,   %%rdi ; rolq $51, %%rdi
            \\ xchgq %%rbx, %%rbx
            : [_] "={rdx}" (-> usize),
            : [_] "a" (args),
              [_] "0" (default),
            : "cc", "memory"
        ),

However, this doesn't work in Zig for some reason:

zig build test --summary all
test
└─ run test
   └─ zig test Debug native failure
error: error: couldn't allocate input reg for constraint 'a'

I think the best way to deal with this is to just use the header from valgrind, then we don't have to deal with the inline assembly and differences between clang/gcc/zig. Inline assembly is currently also still unstable with major work going on (ziglang/zig#5241) so it's just safer to not yet rely on it.

not-matthias · 2025-05-14T15:04:50Z

To summarize the final results:

We're now using the original valgrind header with wrapper C functions. This way we don't introduce any errors when translating the inline assembly to Zig and then back to C. Inline assembly is currently marked as unstable, so we shouldn't rely on it.
If we want absolute precision, we should either implement the callgrind marker in the target language or call the callgrind_start/stop_instrumentation wrapper functions directly. We can't control how the compiler arranges our generic start_benchmark function and it doesn't make sense doing so.

art049

looks good, thanks for the detailed analysis

not-matthias requested a review from art049 May 13, 2025 17:54

not-matthias mentioned this pull request May 13, 2025

Cod 732 support perf profiling in python walltime CodSpeedHQ/pytest-codspeed#79

Open

art049 reviewed May 13, 2025

View reviewed changes

not-matthias force-pushed the cod-868-sigsev-when-running-valgrind-with-zig-library branch 6 times, most recently from a671f32 to 0fb92ee Compare May 14, 2025 10:26

not-matthias requested a review from art049 May 14, 2025 15:24

not-matthias force-pushed the cod-868-sigsev-when-running-valgrind-with-zig-library branch from 9d8502e to 709cb61 Compare May 14, 2025 16:50

art049 approved these changes May 14, 2025

View reviewed changes

fix: sigsev in valgrind due to incorrect optimization

acc213d

not-matthias force-pushed the cod-868-sigsev-when-running-valgrind-with-zig-library branch from 709cb61 to acc213d Compare May 14, 2025 17:17

not-matthias merged commit acc213d into main May 14, 2025
5 checks passed

not-matthias deleted the cod-868-sigsev-when-running-valgrind-with-zig-library branch May 14, 2025 17:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: sigsev in valgrind due to incorrect optimization #2

fix: sigsev in valgrind due to incorrect optimization #2

Uh oh!

not-matthias commented May 13, 2025 •

edited

Loading

Uh oh!

art049 left a comment

Uh oh!

art049 May 13, 2025

Uh oh!

not-matthias May 14, 2025

Uh oh!

not-matthias commented May 14, 2025 •

edited

Loading

Uh oh!

not-matthias commented May 14, 2025

Uh oh!

art049 left a comment

Uh oh!

Uh oh!

Uh oh!

fix: sigsev in valgrind due to incorrect optimization #2

fix: sigsev in valgrind due to incorrect optimization #2

Uh oh!

Conversation

not-matthias commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

art049 left a comment

Choose a reason for hiding this comment

Uh oh!

art049 May 13, 2025

Choose a reason for hiding this comment

Uh oh!

not-matthias May 14, 2025

Choose a reason for hiding this comment

Uh oh!

not-matthias commented May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

not-matthias commented May 14, 2025

Uh oh!

art049 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

not-matthias commented May 13, 2025 •

edited

Loading

not-matthias commented May 14, 2025 •

edited

Loading