Fix test typing #1235

florian-wagner-frequenz · 2025-06-17T11:30:06Z

Previously there was a test that violated type hints. Since we cannot keep both the test and the type hints on it, the test was rewritten to cover a semantically slightly different but similar edge case.
The original test case is of course prevented when using type hints (i.e., not passing None to a function that doesn't accept None).
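
To illustrate the shape of the change, here is a minimal, self-contained sketch. It does not use the SDK's real load_config; `LoggingSettings` and `load_settings` are hypothetical stand-ins, assumed only to accept a mapping and never None, so the edge case worth testing becomes the empty mapping:

```python
from collections.abc import Mapping
from dataclasses import dataclass, field
from typing import Any


@dataclass(frozen=True)
class LoggingSettings:
    """Hypothetical config object, used only for this illustration."""

    level: str = "INFO"
    loggers: dict[str, str] = field(default_factory=dict)


def load_settings(config: Mapping[str, Any]) -> LoggingSettings:
    """Hypothetical loader: the signature accepts a mapping, never None."""
    return LoggingSettings(
        level=str(config.get("level", "INFO")),
        loggers=dict(config.get("loggers", {})),
    )


def test_load_settings_empty() -> None:
    """An empty config is type-correct and must fall back to the defaults."""
    assert load_settings({}) == LoggingSettings()
```

A type checker rejects `load_settings(None)` before the test ever runs, which is why a test for that call cannot coexist with the annotation.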

Copilot

Pull Request Overview

This PR updates a test to avoid passing None and instead validates behavior with an empty config dict, aligning with current type hints.

- Renames the test from test_load_config_load_None to test_load_config_load_empty
- Updates the docstring to describe the empty-config edge case
- Changes the load_config call to pass the entire config dict instead of config.get("loggers", None)

Comments suppressed due to low confidence (1)

tests/config/test_util.py:57

[nitpick] The test name test_load_config_load_empty repeats 'load'. Consider renaming it to test_load_config_empty for clarity.

def test_load_config_load_empty(

florian-wagner-frequenz · 2025-06-17T11:30:44Z

Fixes the underlying problem preventing #1234 from proceeding

llucax

The suggested improvements to the tests are completely optional, not approving only because I think we should not add stuff that users don't care about in the release notes.

tests/config/test_util.py

RELEASE_NOTES.md

Marenz · 2025-06-18T11:41:03Z

> removed this pull request from the merge queue due to no response for status checks 1 hour ago

hmm, maybe my fix 45c4ad7 wasn't a fix?

florian-wagner-frequenz · 2025-06-18T14:17:03Z

Trying once more in the hope that this is spurious. Otherwise I would leave it to @Marenz to figure out what is going on

florian-wagner-frequenz · 2025-06-18T16:07:30Z

Yup, your problem now @Marenz

shsms · 2025-06-19T16:33:03Z

src/frequenz/sdk/actor/_actor.py

+            except Exception as exc:  # pylint: disable=broad-except
+                if isinstance(exc, RuntimeError) and "no running event loop" in str(
+                    exc
+                ):


Maybe just a loop.is_closed check here, instead of a string match?

Marenz · 2025-06-19 (edited)

Just calling asyncio.get_running_loop() will throw exactly the exception we're catching here.
We also do it this way in run_forever; I just copied it from Luca.

llucax

It looks to me that we have a more fundamental problem if this fixes the infinite loop issue. Both the actor and the battery status tracker loops should exit when there is a CancelledError, so it looks like we are forgetting to cancel at some point, or worse, someone is eating up the CancelledError in the middle.

This hack will cover up this more fundamental issue, which could come back to bite us in other ways, so I would try to find the real root cause instead of patching it just to pass the tests.

llucax · 2025-06-23T11:33:08Z

Also probably not the best idea to hijack this PR to fix the CI, why didn't you create a new PR like last time?

Marenz · 2025-06-23T11:53:15Z

> Also probably not the best idea to hijack this PR to fix the CI, why didn't you create a new PR like last time?

Because this was easier to test, and well, it's working now. It wasn't reliably reproducible with the other PR, which is how we ended up in this situation in the first place ;)

> so I would try to find the real root cause for this

This is fixing the real root cause.
I wrote it on slack last week, but here is the analysis:

I think I got it. The run loop in _battery_status_tracker will never quit except on a CancelledError. On shutdown, it gets the "event loop is closed" error and tries again, and again, and again. Because the event loop is closed, it never gives up control to let other tasks clean up properly, including the task that would trigger the CancelledError for it (componentpool_status_tracker.py, the _run method with the contextlib async exit stack).

I didn't write it explicitly, but the exact same thing is also happening in the actor-restart part. The actor stops because of the RuntimeError: Event loop is closed error, which makes it try to restart before any cancellation can be delivered.
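
The failure mode described above can be sketched as a retry loop that restarts on any `Exception`. `CancelledError` derives from `BaseException` (since Python 3.8), so it passes through `except Exception` untouched, but a `RuntimeError` raised because the loop is gone is caught and retried unless the loop is checked explicitly. The names `run_with_restart` and `loop_is_running` are hypothetical; this is not the SDK's actual run_forever:

```python
import asyncio
from collections.abc import Awaitable, Callable


def _default_loop_check() -> bool:
    """Return whether an event loop is running in the current thread."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        return False
    return True


async def run_with_restart(
    work: Callable[[], Awaitable[None]],
    loop_is_running: Callable[[], bool] = _default_loop_check,
) -> int:
    """Restart `work` on failure; return the failure count once the loop is gone."""
    failures = 0
    while True:
        try:
            await work()
        except Exception:  # pylint: disable=broad-except
            # CancelledError is not an Exception subclass, so it propagates.
            failures += 1
            if not loop_is_running():
                # Without this check the loop would retry forever.
                return failures
        # Yield control so that other tasks (and cancellation) can run.
        await asyncio.sleep(0)
```

The `loop_is_running` parameter exists only so the exit path can be exercised in a test by injecting a fake check.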

llucax · 2025-06-23T12:24:58Z

> Because this was easier to test, and well, it's working now. It wasn't reliably reproducible with the other PR, which is how we ended up in this situation in the first place ;)

You could have created another PR with the same commits as this PR for testing. Now it's like the original PR got completely lost in the noise, and we are also spamming @florian-wagner-frequenz unnecessarily :P

We're also discussing this via Slack, but for the record, what I mean by root cause is that these loops should finish, before the event loop is closed, via a CancelledError. If this is not happening, it means that one of these is happening:

- We are missing a cancel()
- We are missing an await on the cancelled task
- We are doing the above, but somehow after the loop was closed (in __del__ for example).

We should find and fix when the loop is being closed without every task being properly stopped and awaited, instead of just hiding when it happens pretending that nothing is wrong... 😱
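
A minimal sketch of the shutdown order argued for here, assuming two long-running tasks: each task is cancelled and then awaited while the loop is still running, so the loops exit via `CancelledError`. The names `worker`, `shutdown` and `main` are illustrative only:

```python
import asyncio


async def worker(stopped: list[str], name: str) -> None:
    """Run until cancelled, then record the stop and propagate the error."""
    try:
        while True:
            await asyncio.sleep(3600)
    except asyncio.CancelledError:
        stopped.append(name)
        raise  # never swallow the cancellation


async def shutdown(tasks: list[asyncio.Task[None]]) -> None:
    """Cancel every task and wait for all of them to finish."""
    for task in tasks:
        task.cancel()
    # return_exceptions=True collects each CancelledError instead of raising.
    await asyncio.gather(*tasks, return_exceptions=True)


async def main() -> list[str]:
    stopped: list[str] = []
    tasks = [
        asyncio.create_task(worker(stopped, name))
        for name in ("actor", "tracker")
    ]
    await asyncio.sleep(0)  # let both workers start before shutting down
    await shutdown(tasks)
    return stopped
```

Running `asyncio.run(main())` returns only after both workers have observed their cancellation, which is the guarantee that is lost when a `cancel()` or the subsequent `await` is missing.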

Marenz · 2025-06-23T14:29:47Z

> and we are also spamming florian-wagner-frequenz unnecessarily :P

Nah, he announced that he unsubscribed and gave over ownership to me :) (that is, until you highlighted him again now :P )
His part is reviewed and approved, nothing to get lost.

But yes, I suppose for future historians this is a less optimal conflation of things that get fixed.

Copilot AI review requested due to automatic review settings June 17, 2025 11:30

florian-wagner-frequenz requested a review from a team as a code owner June 17, 2025 11:30

florian-wagner-frequenz requested review from Marenz and removed request for a team June 17, 2025 11:30

github-project-automation bot added this to Python SDK Roadmap Jun 17, 2025

github-project-automation bot moved this to To do in Python SDK Roadmap Jun 17, 2025

github-actions bot added the part:tests Affects the unit, integration and performance (benchmarks) tests label Jun 17, 2025

Copilot AI reviewed Jun 17, 2025

florian-wagner-frequenz requested a review from llucax June 17, 2025 11:30

florian-wagner-frequenz force-pushed the fix_test_types branch from a524089 to 2e24f61 Compare June 17, 2025 11:32

github-actions bot added the part:docs Affects the documentation label Jun 17, 2025

llucax reviewed Jun 17, 2025

tests/config/test_util.py (outdated, resolved)

RELEASE_NOTES.md (outdated, resolved)

florian-wagner-frequenz force-pushed the fix_test_types branch from 2e24f61 to abeae86 Compare June 18, 2025 08:44

florian-wagner-frequenz requested a review from llucax June 18, 2025 08:44

florian-wagner-frequenz added the cmd:skip-release-notes It is not necessary to update release notes for this PR label Jun 18, 2025

florian-wagner-frequenz enabled auto-merge June 18, 2025 08:52

florian-wagner-frequenz force-pushed the fix_test_types branch 2 times, most recently from 3e55f21 to e28a775 Compare June 18, 2025 08:53

Marenz previously approved these changes Jun 18, 2025

florian-wagner-frequenz added this pull request to the merge queue Jun 18, 2025

github-project-automation bot moved this from To do to Review approved in Python SDK Roadmap Jun 18, 2025

github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Jun 18, 2025

llucax previously approved these changes Jun 18, 2025

florian-wagner-frequenz added this pull request to the merge queue Jun 18, 2025

github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Jun 18, 2025

Marenz dismissed stale reviews from llucax and themself via 3b45bbd June 19, 2025 14:06

github-actions bot added the part:tooling Affects the development tooling (CI, deployment, dependency management, etc.) label Jun 19, 2025

Marenz removed this pull request from the merge queue due to a manual request Jun 19, 2025

Marenz force-pushed the fix_test_types branch from 3b45bbd to a5521af Compare June 19, 2025 14:11

Set min numpy version to lowest prebuilt python3.13-compatible release

65b84ca

Signed-off-by: Mathias L. Baumann <[email protected]>

Marenz force-pushed the fix_test_types branch from 3242588 to 46026b2 Compare June 19, 2025 15:35

github-actions bot added the part:microgrid Affects the interactions with the microgrid label Jun 19, 2025

Marenz force-pushed the fix_test_types branch from 46026b2 to 5f92868 Compare June 19, 2025 16:00

github-actions bot added the part:actor Affects an actor or the actors utilities (decorator, etc.) label Jun 19, 2025

Marenz added 2 commits June 19, 2025 18:08

ComponentTracker: Use run_forever with eventloop fixes for _run() loop

7db59ae

Signed-off-by: Mathias L. Baumann <[email protected]>

Actor: Don't restart on RuntimeError:No Running Eventloop

82e131b

Signed-off-by: Mathias L. Baumann <[email protected]>

Marenz force-pushed the fix_test_types branch from 5f92868 to 82e131b Compare June 19, 2025 16:09

Marenz requested review from llucax and shsms June 19, 2025 16:09

shsms reviewed Jun 19, 2025

Marenz enabled auto-merge June 23, 2025 10:44

llucax reviewed Jun 23, 2025

Use get_running_loop() instead of str cmp to check for a running event loop

9bb153d

Signed-off-by: Mathias L. Baumann <[email protected]>

github-actions bot added the part:core Affects the SDK core components (data structures, etc.) label Jun 23, 2025

github-actions bot added the part:data-pipeline Affects the data pipeline label Jun 23, 2025

Marenz added 2 commits June 23, 2025 16:27

cancel_and_wait: Re-raise if it was our task

7a1ced1

Reraise the cancel error if it was our task that was cancelled. Signed-off-by: Mathias L. Baumann <[email protected]>

Fix swallowing of cancel error in Formula Engine & Voltage streamer

058a0f1

Signed-off-by: Mathias L. Baumann <[email protected]>

Marenz force-pushed the fix_test_types branch from b77d29f to 058a0f1 Compare June 23, 2025 14:27

Marenz requested review from llucax and shsms June 23, 2025 16:58
