no detection of run time errors of tutorials? #235

Open
gertjan123 opened this issue Jun 8, 2021 · 15 comments

Comments

@gertjan123
Contributor

I think the test of the tutorials shuts down too quickly to trigger and raise run time errors. This is due to the test being of the form tut1().shutdown().
The potential alternative

tut1()
time.sleep(20)
tut1.shutdown()

doesn't work, as it never shuts down. Maybe the references to the agent processes are lost.

Is there a way to run the framework for 20 seconds and only shut it down then?

@BjoernLudwigPTB
Member

BjoernLudwigPTB commented Jun 10, 2021

The proper way to do what you seem to intend above would be something like

import time

tut1_network = tut1()  # tut1() sets up and starts the tutorial's agent network
time.sleep(20)
tut1_network.shutdown()

What errors are you experiencing?

You are right that the tests are not ideal. Of course the time the networks run is close to zero. The tests rather ensure that the networks are properly set up, but nothing more really. We were working on comparing to expected output in #229 but did not succeed with that yet, due to the various ways the agents print to the console in the two backends. We could improve the mechanisms with some more effort though, as far as I can see.
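For illustration only, here is a minimal sketch of what such a setup-only tutorial test looks like, using the tut1 name from this thread as a placeholder for the tutorial's entry point:

# Sketch: tut1 is assumed to set up and start the tutorial's agent network.
# The network is shut down again immediately, so the runtime is close to zero
# and runtime errors of the running agents go unnoticed.
def test_tut1_sets_up():
    tut1().shutdown()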

@gertjan123
Contributor Author

With your code it works, and it does what I intended, thank you!
I now use 60 seconds, as one of my agents only processes a buffer of data after 20 seconds. This test catches more errors than with the run time close to zero. The communication and data handling are now also tested, as is the fact that the network doesn't crash within one minute.
I would suggest using this code in the tests, or do you see a problem?

@BjoernLudwigPTB
Member

Indeed I see one significant problem. If we simply include a one minute sleep into those tutorial tests, we increase the time needed for one execution of the test suite from something like 4 minutes to something like 10 minutes. I will have a look into parallel test execution.

@BjoernLudwigPTB
Member

I would rather set an individual minimal execution time for those tests that should have one. Regarding the error checking in general, I would rather try to test against specific expected results, if needed after some time, to catch unexpected behaviour, instead of just letting the tutorials run longer. Xiang already provided one possible pattern to do that in his test_memory_monitor_agent.py. I am still not sure what is the best way to achieve that for the tutorials, but I think revising the printing mechanisms and using nbval is one option we should consider.

@BjoernLudwigPTB
Member

Quick experiments with parallel execution of the tests (via pytest-parallel and pytest-xdist) suggest that this would require some thorough preparation, due to the several servers started during the test executions.
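One possible preparation, sketched here purely as an idea: give each pytest-xdist worker its own port, derived from the PYTEST_XDIST_WORKER environment variable that pytest-xdist sets ("gw0", "gw1", ...). The fixture name, the base port, and the assumption that the tests could pass such a port to the servers they start are all made up for illustration:

# conftest.py (sketch): a per-worker port so that the servers started by
# parallel test runs do not clash. Nothing here is existing code in this
# repository; it is only one possible approach.
import os

import pytest

@pytest.fixture
def unique_port():
    worker = os.environ.get("PYTEST_XDIST_WORKER", "gw0")  # unset without xdist
    worker_index = int(worker.lstrip("gw") or "0")
    return 3333 + worker_index  # 3333 is an arbitrary base port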

@gertjan123
Contributor Author

The 60 seconds mainly come from the agent data generation rate that is set somewhere, and from the minimum buffer size. If we increase the data generation rate, the tests can go much faster.
That means setting a fast data generation rate for testing and a nice, moderate data generation rate for human watchers of the tutorial. So it means an additional test parameter.

@gertjan123
Contributor Author

"I would rather set an individual minimum execution time for those tests that should have one."
All tests involving the agents should have one. It has often happened to me that my network started, but crashed at run time.
Should we hard code the minimum execution time using the sleep command in the test_tutorial files?

@gertjan123
Contributor Author

"Regarding the error checking in general I would rather try to test against specific expected results, if needed after some time, to catch unexpected behaviour, instead of just letting the tutorials run longer."
That is even nicer, but still, running the agent network for a few seconds with a large data generation rate is better than not sending or plotting any data at all.

@gertjan123
Contributor Author

Of course, all depends on the time and budget we have to improve the tests.

@gertjan123
Contributor Author

If the agents should calculate something, they can print the output, we can grab it, and compare it with the reference outcome.
(I don't know how to do it exactly, though it should be possible.)
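One possible sketch, using pytest's built-in capsys fixture; note that output printed by agents running in separate processes may well escape this kind of capture, which could be part of the difficulty mentioned in #229:

# Sketch only: tut1 and the expected substring are placeholders. capsys
# captures what the test process prints; output from agents living in their
# own processes may not show up here.
import time

def test_tut1_output(capsys):
    network = tut1()
    time.sleep(20)  # let the agents produce and exchange some data
    network.shutdown()
    captured = capsys.readouterr()
    assert "expected output" in captured.out  # placeholder for a reference value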

But how to automatically test that the plots look nice? Even if the network hasn't crashed, the plots can still look bad or erroneous.

@BjoernLudwigPTB
Member

BjoernLudwigPTB commented Jun 10, 2021

"I would rather set an individual minimum execution time for those tests that should have one."
All tests involving the agents should have one. It has often happened to me that my network started, but crashed at run time.
Should we hard code the minimum execution time using the sleep command in the test_tutorial files?

If we agree on that, I would suggest including such a minimum_runtime_during_tests in conftest.py which then gets used wherever needed.
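A minimal sketch of what that could look like; the fixture name follows the suggestion above, while the value and the way a tutorial test would consume it are assumptions:

# conftest.py (sketch)
import pytest

@pytest.fixture
def minimum_runtime_during_tests():
    return 20  # seconds, value still to be agreed on

# test_tutorial_1.py (sketch): keep the network alive for that long before
# shutting it down, so that runtime errors get a chance to surface.
import time

def test_tut1(minimum_runtime_during_tests):
    tut1_network = tut1()
    time.sleep(minimum_runtime_during_tests)
    tut1_network.shutdown()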

@BjoernLudwigPTB
Member

BjoernLudwigPTB commented Jun 10, 2021

The 60 seconds mainly come from the agent data generation rate that is set somewhere, and from the minimum buffer size. If we increase the data generation rate, the tests can go much faster.
That means setting a fast data generation rate for testing and a nice, moderate data generation rate for human watchers of the tutorial. So it means an additional test parameter.

That's right, although the tutorials do not take any parameters as they are right now. We would have to introduce that additional parameter first into the tutorials with an appropriate default and use that to speed up the agents during tests. As you already indicated, we may have to introduce another buffer_size parameter to make the agents remember everything during test execution.
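For illustration, one possible shape for such a tutorial signature; the parameter names, the defaults, and the hypothetical build_tutorial_network helper are suggestions only, not existing code:

# Sketch only: a tutorial entry point that keeps human-friendly defaults but
# lets the tests speed up data generation and enlarge the buffers.
def tut1(loop_wait=1.0, buffer_size=1000):
    # build_tutorial_network is a hypothetical helper standing in for the
    # set-up code the tutorial already contains.
    return build_tutorial_network(loop_wait=loop_wait, buffer_size=buffer_size)

# Readers of the tutorial keep the defaults:
#     tut1()
# The tests speed the agents up and keep everything in the buffer:
#     tut1(loop_wait=0.01, buffer_size=10_000)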

@gertjan123
Contributor Author

BTW: test_timeout = 10 in conftest.py doesn't work for my computer. You should either increase it to 20, or, preferably, buy me a faster computer :-).

@BjoernLudwigPTB
Member

BTW: test_timeout = 10 in conftest.py doesn't work for my computer. You should either increase it to 20, or, preferably, buy me a faster computer :-).

Oh, that is probably the reason for #240, right?! So you do actually observe failing tests due to the 10 seconds, and with 20 seconds those same tests are passing?

@gertjan123
Copy link
Contributor Author

Exactly!
