Prototype for tracing through instrumentation #1

jasonz-dfinity · 2025-01-09T19:02:18Z

Why

Sometimes it's difficult to understand why a certain benchmark takes X instructions, when the code being benchmarked is more complicated.

Update canbench-rs-macros to produce another query call where tracing is enabled. This is needed to avoid affecting the non-tracing behavior, while allowing tracing functionality to return different results.
Update canbench-bin:
- Add a tracing flag for the binary
- Create a new instrumented wasm (not modifying the existing one) where tracing is enabled:
  - For every tracing query methods (added by canbench-rs-macros), call an exported function __prepare_tracing
  - Add a trace_func which calls ic0.performance_counter and persist the counter along with the func_id into the buffer
  - For every wasm function, move its original functionality into a block, and call trace_func before and after
  - Collect the traces and normalize them
  - Convert traces into a flamegraph and write to the file system
Update canbench-rs:
- Add an exported function to allocate the tracing buffer
- When tracing is enabled, go down a different code path to execute the benchmarked function, where performance counter before it and after it is also recorded to the buffer.
- Provide a function to get traces from the buffer (called by generated code from canbench-rs-macros

The traces are returned through the canister reply of the tracing methods, and they are subject to the message size limit. there is not yet a way to override the message size limit similar to the instruction limit, but it is theoretically possible.
Pre-allocating tracing buffer could have an effect on the heap memory allocation, which causes the benchmarked function to behave slightly differently than otherwise.

jasonz-dfinity added 2 commits January 9, 2025 18:38

Prototype for tracing through instrumentation

d349a30

Add bound checking for buffer

53fa10a