
configurable actor, environment, data loader #30


Merged: 37 commits into main, May 29, 2025

Conversation

@rizar (Collaborator) commented on May 28, 2025

Includes #23.

@rizar requested a review from @ollmer on May 28, 2025 at 15:48
@rizar changed the base branch from configurable_rollouts to main on May 29, 2025 at 18:42
@rizar changed the title from "configurable environment endpoint" to "configurable actor, environment, data loader" on May 29, 2025
@rizar requested a review from @AlexPiche on May 29, 2025 at 19:32
@AlexPiche (Collaborator) left a comment

I am very happy with the refactoring! It makes PipelineRL much more approachable. I left some minor comments. Also, counting/ and math/ could live in an examples/ folder instead of directly in pipelinerl/.

@@ -405,8 +402,10 @@ def run_preprocessing_loop(
        stats = {
            "preprocessor/published_samples": published_samples,
            "preprocessor/published_model_version": max_model_version,
            "preprocessor/samples_in_input_queue": raw_chunk_queue.qsize() * cfg.preprocess.chunk_size,
            "preprocessor/samples_in_output_queue": samples_in_queue,
            "processossor/queue/raw_samples": raw_chunk_queue.qsize() * cfg.preprocess.chunk_size,
@AlexPiche (Collaborator):

Typo in the stats key ("processossor").

@rizar (Collaborator, Author):

Yeah, it would make a lot of sense to have the examples someplace separate, but in practice right now it creates no pain, so let's do this when it does create pain.

    reward = 0  # TODO: implement verifier usage and reward calculation
    metrics = {
        "reward": reward,
        "success": reward > 0,
@AlexPiche (Collaborator):

Not sure we want to hard-code success as reward greater than 0.

@rizar (Collaborator, Author):

This is not finished anyway.
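One way to address the reviewer's concern would be to make the success cutoff configurable rather than baked in. A minimal sketch, assuming a `success_threshold` parameter; the name and default are illustrative, not from the PR:

```python
def compute_metrics(reward: float, success_threshold: float = 0.0) -> dict:
    """Build rollout metrics with a configurable success cutoff
    instead of hard-coding `reward > 0`."""
    return {
        "reward": reward,
        "success": reward > success_threshold,
    }
```

The threshold could then come from the Hydra config alongside the other rollout settings, so experiments with shaped or negative rewards can define success differently.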

@@ -238,6 +240,24 @@ def wait_for_inference_servers(urls: list[str]):
logger.info("All inference servers are up")


def wait_for_environments(cfg: DictConfig):
@AlexPiche (Collaborator):

Can we re-use wait_for_inference_servers?

@rizar (Collaborator, Author):

True, we could reuse some code here... too lazy to fix this right now.
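The reuse the reviewer suggests could look like a single generic polling helper shared by both wait functions. A sketch under assumptions: the `/health` endpoint, function name, and timeouts are hypothetical, not taken from the PR:

```python
import time
import urllib.request


def wait_for_servers(urls: list[str], timeout: float = 300.0, poll_interval: float = 5.0) -> None:
    """Poll each server's /health endpoint until all respond with 200,
    raising TimeoutError if any is still unreachable after `timeout` seconds."""
    deadline = time.monotonic() + timeout
    pending = set(urls)
    while pending:
        for url in list(pending):
            try:
                with urllib.request.urlopen(f"{url}/health", timeout=2) as resp:
                    if resp.status == 200:
                        pending.discard(url)
            except OSError:
                pass  # server not up yet; keep polling
        if pending:
            if time.monotonic() > deadline:
                raise TimeoutError(f"servers not ready: {sorted(pending)}")
            time.sleep(poll_interval)
```

Both `wait_for_inference_servers` and `wait_for_environments` could then be thin wrappers that build their URL lists (from the server list or from `cfg`) and call this helper.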

@rizar merged commit 3bf08b6 into main on May 29, 2025