Looks like some of the culprits are torchao.float8.float8_utils (~437ms individual time) and torchao.quantization.autoquant (~220ms individual time, ~999ms cumulative).
I also noticed that the float8-related modules have very deeply nested imports for inference.
Hi folks, not a bug. In torchtune, importing the library takes ~7s. When I profile it, the majority of that time comes from torchao imports.
Just a simple 'import torchao' takes ~4s.
It's possible to do some profiling like this (sorted by cumulative time):
and sorted by self time:
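The original commands did not survive in this copy of the thread. As a rough sketch of one way to get both orderings, the snippet below assumes CPython's built-in `-X importtime` flag (available since Python 3.7), which prints a self and a cumulative time in microseconds for every imported module to stderr. The helper names `parse_importtime`, `sort_rows`, and `profile_import` are made up for illustration and are not part of torchao or torchtune.

```python
import re
import subprocess
import sys

# Matches lines such as: "import time:       123 |        456 |   json.decoder"
# The header line has no digits in the first column, so it is skipped.
_LINE = re.compile(r"^import time:\s+(\d+)\s+\|\s+(\d+)\s+\|\s+(\S.*)$")


def parse_importtime(stderr_text):
    """Return (module, self_us, cumulative_us) tuples from -X importtime output."""
    rows = []
    for line in stderr_text.splitlines():
        match = _LINE.match(line)
        if match:
            self_us, cumulative_us, name = match.groups()
            rows.append((name.strip(), int(self_us), int(cumulative_us)))
    return rows


def sort_rows(rows, sort_by="cumulative"):
    """Sort rows slowest first, by 'self' or 'cumulative' time."""
    if sort_by not in ("self", "cumulative"):
        raise ValueError("sort_by must be 'self' or 'cumulative'")
    index = 1 if sort_by == "self" else 2
    return sorted(rows, key=lambda row: row[index], reverse=True)


def profile_import(module, sort_by="cumulative", top=15):
    """Import `module` in a fresh interpreter and return its slowest imports."""
    proc = subprocess.run(
        [sys.executable, "-X", "importtime", "-c", f"import {module}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return sort_rows(parse_importtime(proc.stderr), sort_by)[:top]
```

Calling `profile_import("torchao", sort_by="self")` or `profile_import("torchao", sort_by="cumulative")` would then list the slowest modules under each ordering. Running the import in a fresh subprocess matters because modules already cached in `sys.modules` would otherwise show up as nearly free.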
Just wanted to share it here in case someone wants to take a look. Thanks!
Output sorted by self time:
Output sorted by cumulative time: