Add docs on interfacing with surrogates #804
base: main
Conversation
Thanks for this! Really awesome, and much appreciated that you're writing documentation.
I made a few comments. @hamelphi, @sbodenstein, @ernoc, could you also take a look at this?
When fusion_transport_surrogates matures more, it will hopefully help abstract away some of this, and we can supplement/modify these docs with examples using that library.
Nice! LGTM. Thanks for the contribution, Theo.
Thanks for the comments, sorry for the delay in responding!
Following some of the suggestions made by @sbodenstein, I've added a bit on saving/loading models in HLO format, which is the one backed by OpenXLA.
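For reference, a minimal sketch of what that save/load roundtrip looks like with `jax.export` (the `model_fn` and shapes here are placeholders, not the names used in the docs; API available in recent JAX versions):

```python
import jax
import jax.numpy as jnp
from jax import export

def model_fn(x):  # stand-in for any jittable JAX model
    return jnp.tanh(x)

input_spec = jax.ShapeDtypeStruct((1, 4), jnp.float32)

# Export: jax.export serializes the jitted function via StableHLO.
exported = export.export(jax.jit(model_fn))(input_spec)
with open("model.hlo", "wb") as f:
    f.write(exported.serialize())

# Load: deserialize and call; no PyTorch is needed at this point.
with open("model.hlo", "rb") as f:
    restored = export.deserialize(f.read())

output = restored.call(jnp.ones((1, 4), jnp.float32))
```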
torch_model = PyTorchMLP(hidden_dim, n_hidden, output_dim, input_dim)

This model can be converted to a Flax model as follows:
I would prefer: 'can be replicated in Flax as follows'.
params = {'params': params}

The model can then be called like any Flax model:
Have you verified this? E.g., that no params are transposed between libraries.
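One quick check (a sketch; `torch_model`, `flax_model`, `params`, and `input_dim` are the names from the snippets above): PyTorch's `nn.Linear` stores weights as `(out_features, in_features)` while Flax's `nn.Dense` kernel is `(in_features, out_features)`, so the weight matrices must be transposed when copying, and it's worth asserting the two models agree on a random input:

```python
import numpy as np
import torch
import jax.numpy as jnp

# Random input shared by both frameworks.
x = np.random.randn(1, input_dim).astype(np.float32)

with torch.no_grad():
    torch_out = torch_model(torch.from_numpy(x)).numpy()

flax_out = np.asarray(flax_model.apply(params, jnp.asarray(x)))

# Fails loudly if, e.g., a kernel was copied without the transpose.
np.testing.assert_allclose(flax_out, torch_out, rtol=1e-5, atol=1e-6)
```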
output_tensor = flax_model.apply(params, input_tensor)

Option 2: converting a Pytorch model to a JAX model
nit: Pytorch -> PyTorch
import torch_xla2 as tx

trained_model = torch.load(PYTORCH_MODEL_PATH, weights_only=False)  # Use weights_only=False if you want to load the full model
params, jax_model_from_torch = tx.extract_jax(trained_model)
I think it needs to be jitted (builds in good practice; otherwise many users might be hit by terrible performance).
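Something like this, perhaps (a sketch; per the `extract_jax` examples in the torch_xla docs, the extracted function takes the params plus a tuple of inputs, but check the calling convention of your installed version):

```python
import jax

# Wrap the extracted function in jax.jit so it is compiled once
# rather than re-traced and executed op-by-op on every call.
jax_model_from_torch = jax.jit(jax_model_from_torch)

# First call compiles; later calls with the same shapes reuse it.
output = jax_model_from_torch(params, (input_tensor,))
```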
.. code-block:: python
output_tensor = flax_model.apply(params, input_tensor)
I would show this jitted, as PyTorch users might not know that this line of code is a terrible idea.
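E.g. (using the names from the snippet above):

```python
import jax

# Compile apply once; repeated calls then reuse the compiled kernel
# instead of dispatching each op eagerly.
jitted_apply = jax.jit(flax_model.apply)
output_tensor = jitted_apply(params, input_tensor)
```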
import numpy as np

# jax.export uses StableHLO to serialize the model to a binary format
exported_model = jax.export(jax_model_from_torch)
In the docs, they first wrap the function in JIT; another reason to JIT this: https://pytorch.org/xla/master/features/stablehlo.html#using-extract-jax
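Agreed. Also note that `jax.export` is a module, so the call needs to go through `jax.export.export`, which takes the jitted function and then the (abstract) input shapes. Roughly (a sketch; the input shape is a placeholder, and the `(params, (inputs,))` calling convention is the one assumed above for the extracted function):

```python
import jax
import jax.numpy as jnp
from jax import export

# export() takes a jitted function, then the input shapes to trace with.
exported_model = export.export(jax.jit(jax_model_from_torch))(
    params, (jax.ShapeDtypeStruct((1, input_dim), jnp.float32),)
)
serialized = exported_model.serialize()
```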
However, JAX will not be able to differentiate through the InferenceSession.
To convert the ONNX model to a JAX representation, you can use the `jaxonnxruntime`_ package: |
Maybe use a hyperlink to jaxonnxruntime.
jax_model_from_onnx = ONNXJaxBackend.prepare(onnx_model)
# NOTE: run() returns a list of output tensors, in order of the output nodes
output_tensors = jax_model_from_onnx.run({"input": jnp.asarray(input_tensor, dtype=jnp.float32)}) |
Definitely needs jitting
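Agreed. Since jaxonnxruntime builds the model out of JAX ops, the call should be traceable; something like this (a sketch using the names above; the `[0]` assumes a single output node):

```python
import jax
import jax.numpy as jnp

@jax.jit
def run_model(x):
    # run() takes a dict keyed by input node name and returns a list
    # of outputs; take the first (and here, only) output tensor.
    return jax_model_from_onnx.run({"input": x})[0]

output_tensor = run_model(jnp.asarray(input_tensor, dtype=jnp.float32))
```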
Includes:
- torch_xla2
- jaxonnxruntime
Potentially closes #538