
Run AsyncConsumer tasks concurrently #1933

Open

wants to merge 2 commits into base: main
Conversation

@primal100 commented Oct 14, 2022

For #1924.

Exploratory PR which uses asyncio.create_task for dispatching tasks in a consumer, and add_done_callback to raise exceptions, rather than awaiting each task one by one.
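To make the idea concrete, here is a minimal, self-contained sketch of the create_task + add_done_callback pattern described above. The `dispatch` handler, the message values, and the `consumer_loop` are all illustrative stand-ins, not the actual Channels code.

```python
import asyncio

results = []
errors = []

async def dispatch(message):
    # Toy handler standing in for the consumer's dispatch method.
    if message == "bad":
        raise ValueError("invalid message")
    await asyncio.sleep(0)
    results.append(message)

def _on_done(task: asyncio.Task) -> None:
    # Surface any uncaught exception from the finished dispatch task.
    # Recording it here (or logging it) means the parent loop keeps
    # running instead of being killed by the failure.
    if not task.cancelled() and task.exception() is not None:
        errors.append(task.exception())

async def consumer_loop(messages):
    tasks = []
    for message in messages:
        task = asyncio.create_task(dispatch(message))
        task.add_done_callback(_on_done)
        tasks.append(task)
    # Wait for outstanding tasks before shutting down; return_exceptions
    # stops gather itself from re-raising.
    await asyncio.gather(*tasks, return_exceptions=True)

asyncio.run(consumer_loop(["a", "bad", "b"]))
```

After this runs, "a" and "b" have both been handled despite the failure in between, and the ValueError is available in `errors` rather than having propagated to the parent task.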

A couple of notes:

  1. The existing behaviour is that if one of the dispatch tasks raises an uncaught exception, that exception is propagated to the parent task running the consumer. As far as I can see, this kills the consumer, meaning no more tasks will be processed by it. With create_task and add_done_callback as in this PR, the exception is still raised by the dispatch task and appears in the console, but it does not interrupt the parent task, which continues processing new requests.
    To make the existing tests pass, I added some extra code to preserve the existing behaviour, which is why the code is a bit messier. But I am wondering which behaviour is more desirable. At the moment, if a user sends invalid data that causes an unhandled exception in one of the dispatch tasks, the entire consumer stops working. There may be other reasons why the existing behaviour is desirable, though: it forces the programmer to handle any possible exception caused by user input, which is a good thing.

  2. I did try using the Python 3.11 TaskGroup backport for earlier versions, which actually implements the existing behaviour from point 1 and would also make the code cleaner, but I discovered it's not exactly plug-and-play on earlier Python versions, so it's not really an option. I could implement a similar but much simpler TaskGroup to make the code a bit cleaner, if you think it's needed.

  3. One backward-compatibility consideration, as raised in the issue, is that tasks will finish out of order. Personally I think this is fine, as it is a feature of asynchronous programming which async protocols and implementations should be able to handle. What I didn't know when I created the issue is that SyncConsumer inherits from AsyncConsumer, so I think a change in this behaviour would also affect SyncConsumer. Still, I don't think it should be an issue, as network protocols should be able to deal with this. That said, I am mostly used to working with Channels from a runworker perspective, so someone with more experience implementing HTTP protocols may have a better idea of whether this is a problem from a backward-compatibility point of view. One option would be to add a class boolean attribute which controls either the old or the new behaviour. Overall I think the new behaviour is worth it, as it allows the consumer to run in a truly asynchronous way, which is faster.

  4. Everything is OK with the existing tests. Do you want me to add profiling tests to measure the performance improvement?
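For contrast with point 1, the existing sequential behaviour can be sketched as follows: each dispatch is awaited directly, so an uncaught exception propagates into the consumer task and stops it from processing any further messages. Again, `dispatch` and `consumer_loop` are illustrative names, not the Channels API.

```python
import asyncio

handled = []

async def dispatch(message):
    # Toy handler: "bad" simulates invalid user input causing an
    # unhandled exception.
    if message == "bad":
        raise ValueError("invalid message")
    handled.append(message)

async def consumer_loop(messages):
    for message in messages:
        await dispatch(message)  # an exception here kills the whole loop

try:
    asyncio.run(consumer_loop(["a", "bad", "b"]))
except ValueError:
    pass  # the consumer is dead; "b" was never processed
```

Only "a" ends up in `handled`: the failure on the second message prevents the third from ever being dispatched, which is the consumer-killing behaviour point 1 describes.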
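The "much simpler TaskGroup" floated in point 2 might look something like this hypothetical sketch. It only tracks spawned tasks and re-raises the first failure when the group is drained; it is deliberately far less capable than Python 3.11's asyncio.TaskGroup (no cancellation of siblings, no context-manager protocol).

```python
import asyncio

class SimpleTaskGroup:
    """A minimal task group: collect tasks, then wait and re-raise."""

    def __init__(self):
        self._tasks = []

    def create_task(self, coro):
        task = asyncio.create_task(coro)
        self._tasks.append(task)
        return task

    async def wait(self):
        # Wait for every task; re-raise the first exception, if any.
        results = await asyncio.gather(*self._tasks, return_exceptions=True)
        for result in results:
            if isinstance(result, BaseException):
                raise result

out = []

async def work(n):
    await asyncio.sleep(0)
    out.append(n)

async def main():
    group = SimpleTaskGroup()
    for n in (1, 2, 3):
        group.create_task(work(n))
    await group.wait()

asyncio.run(main())
```

All three tasks run concurrently and complete before `wait()` returns; if any of them raised, the exception would surface in the parent, matching the existing consumer behaviour.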

@carltongibson (Member)

Hi @primal100 — thanks for this — interesting!

Ref point 3: e.g. AsyncHttpConsumer.http_request waits for the whole body to be available before passing off to the handle method, so should be OK. (That was the only obvious case that came to mind... 🤔)

One option would be to add a class boolean attribute which controls either the old or the new behaviour.

Fancy putting that in a commit (maybe just a diff 🤔) to have a look at too?
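As a rough sketch of what that flag could look like (the attribute name `dispatch_concurrently` and the consumer class here are hypothetical, not part of the Channels API):

```python
import asyncio

class SketchConsumer:
    # Old, sequential behaviour by default; subclasses opt in to the
    # new concurrent dispatch by flipping this class attribute.
    dispatch_concurrently = False

    def __init__(self):
        self.handled = []

    async def dispatch(self, message):
        # Stand-in for the real dispatch method.
        await asyncio.sleep(0)
        self.handled.append(message)

    async def run(self, messages):
        if self.dispatch_concurrently:
            # New behaviour: every message becomes its own task.
            tasks = [asyncio.create_task(self.dispatch(m)) for m in messages]
            await asyncio.gather(*tasks)
        else:
            # Old behaviour: await each dispatch in turn.
            for m in messages:
                await self.dispatch(m)

class ConcurrentConsumer(SketchConsumer):
    dispatch_concurrently = True

seq = SketchConsumer()
asyncio.run(seq.run(["a", "b"]))
conc = ConcurrentConsumer()
asyncio.run(conc.run(["a", "b"]))
```

Sequential dispatch preserves message order; concurrent dispatch handles every message but may complete them out of order, which is the compatibility trade-off discussed in point 3.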

@carltongibson (Member)

Do you want me to add profiling...

Always happy to see some numbers 🙂
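For what it's worth, a micro-benchmark along the lines of point 4 could look like the following. The 50 ms sleep stands in for I/O-bound handler work and is purely illustrative; this is not a Channels test.

```python
import asyncio
import time

async def handler():
    await asyncio.sleep(0.05)  # stand-in for I/O-bound work

async def sequential():
    # Existing behaviour: await each handler one by one (~0.5 s total).
    for _ in range(10):
        await handler()

async def concurrent():
    # New behaviour: run all handlers as tasks (~0.05 s total).
    await asyncio.gather(*(asyncio.create_task(handler()) for _ in range(10)))

start = time.perf_counter()
asyncio.run(sequential())
seq_time = time.perf_counter() - start

start = time.perf_counter()
asyncio.run(concurrent())
conc_time = time.perf_counter() - start

print(f"sequential: {seq_time:.3f}s, concurrent: {conc_time:.3f}s")
```

For I/O-bound handlers like these, the concurrent version finishes in roughly the time of the single slowest handler rather than the sum of all of them.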

Successfully merging this pull request may close these issues.

AsyncConsumer runs tasks sequentially (should be in parallel?)