[SPARK-51711][ML][CONNECT] Memory based MLCache eviction policy #50530

xi-db · 2025-04-07T13:45:10Z

What changes were proposed in this pull request?

Currently the ML Cache is limited by the number of cache entries (100 entries at this time), but it is not ideal because model size varies.

In this PR, we are updating the MLCache model eviction policy to be memory based, i.e. to evict old models if the total size of models is greater than a limit.

Besides, two new internal Spark confs are introduced:

spark.connect.session.connectML.mlCache.maxSize: Maximum size of the MLCache per session. The cache will evict the least recently used models if the size exceeds this limit.
spark.connect.session.connectML.mlCache.timeout: Timeout of models in MLCache. Models will be evicted from the cache if they are not used for this amount of time.

Why are the changes needed?

This improve the memory management of MLCache.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

New test and existing tests.

Was this patch authored or co-authored using generative AI tooling?

No.

xi-db · 2025-04-07T13:46:18Z

Hi @WeichenXu123 , could you review this PR of changing entry count based MLCache eviction policy to memory based?

WeichenXu123

LGTM!

zhengruifeng

let's make this jira a subtask of https://issues.apache.org/jira/browse/SPARK-51236

zhengruifeng · 2025-04-08T04:46:10Z

sql/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala

+    buildConf("spark.connect.session.connectML.mlCache.maxSize")
+      .doc("Maximum size of the MLCache per session. The cache will evict the least recently" +
+        "used models if the size exceeds this limit. The size is in bytes.")
+      .version("4.0.0")


Suggested change

.version("4.0.0")

.version("4.1.0")

I think this PR doesn't need to be included in 4.0.0

zhengruifeng · 2025-04-08T04:46:58Z

sql/connect/server/src/main/scala/org/apache/spark/sql/connect/config/Connect.scala

+    buildConf("spark.connect.session.connectML.mlCache.timeout")
+      .doc("Timeout of models in MLCache. Models will be evicted from the cache if they are not " +
+        "used for this amount of time. The timeout is in minutes.")
+      .version("4.0.0")


Suggested change

.version("4.0.0")

.version("4.1.0")

WeichenXu123

LGTM.

zhengruifeng · 2025-04-08T11:56:33Z

merged to master

Memory based MLCache eviction policy

404f5f2

github-actions bot added SQL ML PYTHON CONNECT labels Apr 7, 2025

WeichenXu123 approved these changes Apr 7, 2025

View reviewed changes

zhengruifeng reviewed Apr 8, 2025

View reviewed changes

Make config version 4.1.0, reformat code

b40c9dc

WeichenXu123 approved these changes Apr 8, 2025

View reviewed changes

zhengruifeng closed this in 92f5d38 Apr 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-51711][ML][CONNECT] Memory based MLCache eviction policy #50530

[SPARK-51711][ML][CONNECT] Memory based MLCache eviction policy #50530

xi-db commented Apr 7, 2025 •

edited

Loading

xi-db commented Apr 7, 2025

WeichenXu123 left a comment

zhengruifeng left a comment

zhengruifeng Apr 8, 2025

zhengruifeng Apr 8, 2025

WeichenXu123 left a comment

zhengruifeng commented Apr 8, 2025

[SPARK-51711][ML][CONNECT] Memory based MLCache eviction policy #50530

[SPARK-51711][ML][CONNECT] Memory based MLCache eviction policy #50530

Conversation

xi-db commented Apr 7, 2025 • edited Loading

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

xi-db commented Apr 7, 2025

WeichenXu123 left a comment

Choose a reason for hiding this comment

zhengruifeng left a comment

Choose a reason for hiding this comment

zhengruifeng Apr 8, 2025

Choose a reason for hiding this comment

zhengruifeng Apr 8, 2025

Choose a reason for hiding this comment

WeichenXu123 left a comment

Choose a reason for hiding this comment

zhengruifeng commented Apr 8, 2025

xi-db commented Apr 7, 2025 •

edited

Loading