Synchronization Issue in ML-Agents Decision Timing #6175

matinmoezzi · 2024-11-22T16:31:49Z

Is your feature request related to a problem? Please describe.
Yes, the problem arises from the desynchronization between the decision time (when Python produces an action) and Academy.EnvironmentStep() (when Unity applies actions). This desync can lead to duplicate actions being taken or duplicate states being read, which is frustrating when trying to maintain consistency and synchronization during training. The issue seems to stem from the non-blocking websocket communication between Unity and Python.

Describe the solution you'd like
I’d like a blocking mechanism in Unity that pauses rendering until the next action from Python is available. This would ensure that Academy.EnvironmentStep() only processes actions and states in sync with the decision-making process, preventing duplicates.

Describe alternatives you've considered

Increasing the frequency of EnvironmentStep() updates to match decision-making timing, but this doesn't fully address the synchronization issue.
Adjusting Time.timeScale and FixedUpdate() to better align the two processes, though this approach is limited by the non-blocking nature of the websocket mechanism.

Additional context
This issue is likely caused by the asynchronous nature of websocket-based communication. A blocking mechanism could help bridge the gap between Unity and Python processes. Feedback or alternative solutions from the community would be greatly appreciated.

The text was updated successfully, but these errors were encountered:

wlsdn2749 · 2024-12-04T11:56:04Z

Howdy!

Have you ever used RequestDecision method instead of Decision Requester Components?

main usage is

public class Test : Agent
...
private bool isShootable;
void FixedUpdate()
{
  if(isShootable)
  {
    RequestDecision();
  }
}

...

The RequestDecision method works exactly the same as the Decision Requester component.
However, decisions can be almost perfectly controlled as much as you want, so it will likely be more useful than you think

It follows the same process, such as RequestDecision -> CollectObservation -> OnActionReceived, and so on.

https://docs.unity3d.com/Packages/[email protected]/api/Unity.MLAgents.Agent.html#Unity_MLAgents_Agent_RequestDecision

matinmoezzi added the request Issue contains a feature request. label Nov 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Synchronization Issue in ML-Agents Decision Timing #6175

Synchronization Issue in ML-Agents Decision Timing #6175

matinmoezzi commented Nov 22, 2024

wlsdn2749 commented Dec 4, 2024 •

edited

Loading

Synchronization Issue in ML-Agents Decision Timing #6175

Synchronization Issue in ML-Agents Decision Timing #6175

Comments

matinmoezzi commented Nov 22, 2024

wlsdn2749 commented Dec 4, 2024 • edited Loading

wlsdn2749 commented Dec 4, 2024 •

edited

Loading