
Add example script to export webknossos volume annotations #7

Draft · wants to merge 2 commits into base: main

Conversation

@kabilar (Member) commented Oct 9, 2024

Hi @jingjingwu1225, here is an example script to export Webknossos annotations, based on their documentation. I have tested it on the LINC JupyterHub at magnification level 8-8-1 and all lower resolutions. Please let me know if it crashes at higher resolution levels.

In order to run:

  1. Install the following Python packages: webknossos, tifffile, and numpy==1.26.1
  2. The WK_TOKEN can be found in the main menu under Auth Token (screenshot omitted).
  3. Use the command:
    WK_URL="https://webknossos.lincbrain.org" WK_TOKEN="<add_token>" python export_annotation.py

Please update the script to include the following changes:

  1. Export to a single Zarr instead of TIFF files
  2. Export all z slices to the same Zarr
  3. Export all segment_ids to the same Zarr
  4. Export all resolution levels to the same Zarr
  5. Review the annotations to ensure a 1-1 mapping between the annotations in the exported Zarrs and the annotations on webknossos.lincbrain.org.

cc @aaronkanzer @balbasty

@kabilar (Member Author) commented Oct 9, 2024

Upon a quick review of some of the generated TIFFs, the annotations are visible in the images.

@kabilar (Member Author) commented Oct 9, 2024

I added points 4 and 5 above.

@kabilar (Member Author) commented Oct 9, 2024

Converting this pull request to a draft since it shouldn't be merged as is, but it is here for reference.

@kabilar kabilar marked this pull request as draft October 9, 2024 21:36
@kabilar (Member Author) commented Oct 9, 2024

Perhaps using a smaller buffer_size for the get_buffered_slice_reader() method would reduce the risk of filling up the JupyterHub node memory at higher resolution levels.

Also, get_buffered_slice_writer() seems to write data to disk as soon as the buffer is full. Will have to explore more to see what formats can be written to disk.

@kabilar (Member Author) commented Oct 16, 2024

Hi @jingjingwu1225 @balbasty @aaronkanzer, I am still testing, but the following seems to work for me without putting stress on the Webknossos server or the JupyterHub instance. Please let me know what you think.

import zarr

annotation_zarr_link = 'https://webknossos.lincbrain.org/data/annotations/zarr/v2yszt4hvDxpIXKK/Volume/'
annotation_name = 'JW_MR243_20240927'
local_path = '/home/jovyan'

# Open the remote annotation group read-only, then copy it into a local directory store
source_group = zarr.convenience.open(store=annotation_zarr_link, mode='r')
dest_group = zarr.hierarchy.group(store=local_path)
zarr.convenience.copy(source=source_group, dest=dest_group, name=f'{annotation_name}.zarr')

@balbasty (Collaborator) commented:

That's very neat! Do you know if it's possible to specify the chunking options for the output array in the copy operation?

@kabilar (Member Author) commented Oct 16, 2024

It does look like we can pass any keyword arguments to zarr.convenience.copy that would then get passed to create_dataset when copying the array. But we may only be able to use a single chunk size for all resolution levels. I am still exploring but would we want different chunk sizes for the different resolution levels?

@balbasty (Collaborator) commented:

We've used the same chunk size across levels so far, so I don't see this as too much of a problem.

@kabilar (Member Author) commented Oct 16, 2024

Great. I am now testing with chunks=(1, 128, 128, 1). Will let you know how it goes.

If we do need different chunk sizes, we could loop through the resolution levels (see code snippet below) and write them individually to the destination group.

Input

for array_name, array in source_group.arrays():
    print(array_name, array)

Output

1 <zarr.core.Array '/1' (1, 126976, 99630, 73) uint32>
128-128-1 <zarr.core.Array '/128-128-1' (1, 992, 778, 73) uint32>
16-16-1 <zarr.core.Array '/16-16-1' (1, 7936, 6226, 73) uint32>
2-2-1 <zarr.core.Array '/2-2-1' (1, 63488, 49815, 73) uint32>
256-256-1 <zarr.core.Array '/256-256-1' (1, 496, 389, 73) uint32>
32-32-1 <zarr.core.Array '/32-32-1' (1, 3968, 3113, 73) uint32>
4-4-1 <zarr.core.Array '/4-4-1' (1, 31744, 24907, 73) uint32>
64-64-1 <zarr.core.Array '/64-64-1' (1, 1984, 1556, 73) uint32>
8-8-1 <zarr.core.Array '/8-8-1' (1, 15872, 12453, 73) uint32>

@kabilar (Member Author) commented Oct 17, 2024

Hi team, it looks like we also need to pass the arguments fill_value=0 and write_empty_chunks=False to ensure that only chunks with non-zero values are stored. See Zarr API docs.

Webknossos is offline right now so will finish up testing tomorrow.

@kabilar (Member Author) commented Oct 17, 2024

Also, I switched to a chunk size of (1, 4096, 4096, 1) since that was previously decided upon.
