
Document communication protocol #3357

Kobzol opened this issue Jan 8, 2020 · 5 comments

Comments

@Kobzol
Contributor

Kobzol commented Jan 8, 2020

Hi! Together with @spirali, we are attempting to rewrite the Dask scheduler in Rust (#3139). We had some initial success, but when we moved to more advanced Dask programs (for example, distributed pandas), the communication protocol between the clients/workers and the scheduler became very difficult to handle.

Right now we are facing two issues:

  1. The serialization format of the protocol is rather unfriendly to statically typed languages, as described here.
  2. We haven't found proper, complete documentation of the Dask communication protocol (which messages are exchanged between client/scheduler and worker/scheduler, what their parameters are, which parameters are optional, etc.). It is difficult to reimplement the scheduler without the protocol being documented. There is some information at https://distributed.dask.org/en/latest/protocol.html, but it covers only a very small part of the protocol.

For example, we thought that the compute-task message, which is sent from the scheduler to workers, needs to have function and args parameters containing the necessary code and data to run on the worker. However, when running the following Python script:

import dask
from dask.distributed import Client

client = Client("tcp://localhost:8786")
df = dask.datasets.timeseries(start="2020-01-30", end="2020-01-31")
print(len(df))

df.groupby("name")["x"].mean().compute()
df[(df["x"] > 0) | (df["y"] < 0)].compute()

the scheduler sends some compute-task messages to workers which do not contain the function and args keys:

{
    'duration': 0.5,
    'key': "('series-groupby-count-chunk-series-groupby-count-agg-24c2448278b10581d563ee8a2bb5c45b', '0)'",
    'nbytes': {"('make-timeseries-11817facbee00a90e53448d7973e9de2', 0)": 8205680},
    'op': 'compute-task',
    'priority': (0, 1, 1),
    'who_has': {"('make-timeseries-11817facbee00a90e53448d7973e9de2', 0)": [   'tcp://127.0.0.1:45605']}
}

Another problem is that the definition of the task itself (inside the tasks dictionary in update-graph messages) may itself be serialized. Even if it only uses msgpack, the serialized format can be quite complex, and we are not sure how to reimplement the (de)serialization in Rust. Without proper documentation of the format, we can only guess how to implement the scheduler correctly.
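To make the ambiguity concrete, here is a minimal sketch (hypothetical shapes based on this thread, not Dask's actual wire format) of the two forms a task definition can apparently take, and a helper that tells them apart:

```python
# Hypothetical sketch: in our experience a task definition arrives either as a
# plain dict carrying the callable and its arguments, or as an opaque,
# recursively serialized blob (msgpack framing, sub-headers, maybe compression).
# The names "func"/"args" follow the issue text, not a documented schema.
def classify_task(task):
    """Return 'plain' for a dict-style task, 'serialized' for an opaque blob."""
    if isinstance(task, dict) and "func" in task and "args" in task:
        return "plain"
    if isinstance(task, (bytes, bytearray)):
        return "serialized"
    raise ValueError(f"unrecognized task definition: {type(task).__name__}")

print(classify_task({"func": b"<pickled fn>", "args": b"<pickled args>"}))  # plain
print(classify_task(b"\x82\xc4..."))                                        # serialized
```

A reimplementation has to branch on shapes like these everywhere, which is exactly why a documented schema would help.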

I suspect that there may be some legacy cruft hidden inside the Dask communication protocol (both the serialization format and the message API itself), and it may be worthwhile to document it properly and possibly make some simplifications. If this documentation already exists somewhere, please let us know. What do you think?

@TomAugspurger
Member

cc @mrocklin if you have time to comment.

My understanding is that the protocol developed organically as we needed things.

Another problem is that the definition of the task itself (inside tasks dictionary in update-graph messages) may be serialized. Even if it only uses msgpack, the serialized format can be quite complex and we are not sure how to reimplement the (de)serialization in Rust.

Is deserialization necessary? I (possibly incorrectly) thought that the scheduler didn't deserialize things, but I may be thinking of something else.

The documents in docs/ are the extent of whatever documentation we have on this. And the usual disclaimer that they may be out of date.

I suspect we'd be happy to take simplifications to the protocol as long as they don't harm the current implementation.

@Kobzol
Contributor Author

Kobzol commented Jan 15, 2020

Right, (de)serialization is an overloaded term in this scenario. I'll describe a specific use case.
We wanted to add the simplest possible functionality: receive a task from the client (update-graph) and then send it to a worker (compute-task).

The update-graph message sends tasks in a dictionary keyed by the task id/key. The definition of a task is a dict containing the attributes func and args. The compute-task message also has func and args attributes, which we passed on from the task definition. This worked fine for simple Dask scripts.
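As an illustration only (simplified field names taken from this thread, not a verified schema), the naive pass-through described above looks roughly like this:

```python
# Illustrative sketch of the naive forwarding described above. The message
# shapes are simplified guesses based on this thread; real Dask messages
# carry additional fields (priority, who_has, nbytes, ...).
update_graph = {
    "op": "update-graph",
    "tasks": {
        "task-key-1": {"func": b"<serialized fn>", "args": b"<serialized args>"},
    },
}

def to_compute_task(key, task):
    # Forward func/args verbatim -- works only for the plain-dict task form,
    # and breaks once the task definition is an opaque serialized blob.
    return {
        "op": "compute-task",
        "key": key,
        "func": task["func"],
        "args": task["args"],
    }

msg = to_compute_task("task-key-1", update_graph["tasks"]["task-key-1"])
```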

However, with more complex pipelines (distributed pandas), we have observed that the definition of a task may also be a msgpack object that is recursively serialized, with sub-headers, possible compression and other complications. We thought that since the scheduler should be language agnostic, we could simply take this msgpack blob and pass it on to the worker. But the compute-task message expects a dict with func and args, and we didn't know how to extract these attributes from the serialized task definition (later we also found out that sometimes the compute-task message is sent by the Dask scheduler without the func/args attributes at all, but we have no idea how that works).

I suspect that the language-agnostic property of the Dask scheduler might not be strictly enforced. The low-level loads and dumps methods in protocol/core.py call a deserialize function from distributed/protocol/serialize.py, which transforms the complex serialized msgpack values into Python objects; that is something we cannot do in Rust. It's possible the scheduler is implementable without this deserialization, but without proper documentation of the protocol it's pretty difficult to find out what the scheduler should do.
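The two policies under discussion can be sketched like this (pickle stands in for Dask's deserialize machinery here; this is a simplified illustration, not the real protocol/core.py code):

```python
import pickle  # stand-in for distributed/protocol/serialize.py; illustration only

def loads(frames, deserialize=True):
    """Sketch of the policy choice: materialize Python objects, or pass bytes through."""
    header, payload = frames
    if deserialize:
        # What a Python scheduler can do, but a Rust one cannot:
        return pickle.loads(payload)
    # Language-agnostic alternative: treat the value as an opaque blob.
    return {"header": header, "payload": payload}

frames = ({"serializer": "pickle"}, pickle.dumps(("add", 1, 2)))
print(loads(frames))                     # ('add', 1, 2): a live Python object
print(loads(frames, deserialize=False))  # header plus untouched bytes
```

A scheduler that only ever takes the second branch could, in principle, stay language agnostic; the question is whether the rest of the protocol allows it.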

So to go forward, we would need to either document or simplify the protocol. If there is no documentation because the protocol grew organically, we will probably have to find a simple subset of the protocol and change the complex client code (for example, the Dask messages generated from pandas tables) to use this simpler version. Without documentation I'm not sure we could reimplement the existing protocol in Rust, since there will probably be a ton of edge cases, intended or not (Python is super dynamic, so what is an edge case in Rust might not be a problem in Python).

@TomAugspurger
Member

Thanks. I suspect that improvements to the docs would be wholeheartedly welcome. And changes to the protocol / the implementation that make things easier in other languages would be welcome as long as they don't overly harm the current implementation's performance and readability.

@mrocklin
Member

mrocklin commented Jan 18, 2020 via email

@Kobzol
Contributor Author

Kobzol commented Jan 29, 2020

FYI, I created work-in-progress documentation of the Dask messages. I couldn't find a better (terser) schema format, so I just used TypeScript.

The documentation contains a subset of Dask messages (more or less exactly the subset that our Rust scheduler currently attempts to support).

Apart from some inconsistent naming conventions and the Task definition structure (which is pretty... polymorphic :-) ), there are some small quirks that we had to replicate. For example, the {"op": "stream-start"} message, which is sent from the scheduler to the client after it registers, needs to be inside a message list of size one, because the client asserts this (https://github.com/dask/distributed/blob/master/distributed/client.py#L1067).
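The quirk can be stated as a tiny sketch (first_batch_for_client is a hypothetical helper, not a Dask API; only the length-one constraint comes from the linked assertion):

```python
# Sketch of the quirk: the client asserts that the very first batch it receives
# contains exactly one message, and that message must be stream-start.
# first_batch_for_client is a hypothetical helper name, not a Dask API.
def first_batch_for_client():
    return [{"op": "stream-start"}]  # must be a list of size one

batch = first_batch_for_client()
# Mirrors the client-side check linked above:
assert len(batch) == 1 and batch[0]["op"] == "stream-start"
```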
