Skip to content

How to configure temporary directory to spill data #2009

Answered by samster25
djouallah asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @djouallah,

Great question! You should be able to do via

import json
import ray
import daft

ray.init(
    _system_config={
        "object_spilling_config": json.dumps(
            {"type": "filesystem", "params": {"directory_path": "/tmp/spill"}},
        )
    },
)

daft.context.set_runner_ray()

check out these docs if you want to try it out in a distributed fashion.

https://docs.ray.io/en/latest/ray-core/objects/object-spilling.html#single-node

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by djouallah
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants