Feature/image benchmarking #880

qh681248 · 2024-11-28T16:24:29Z

PR Type

Feature

Description

This PR extends the original benchmark framework and adds a visual benchmark. The benchmarking process follows these steps:

1. Load an input image and downsample it using a specified factor.
2. Convert to grayscale and extract non-zero pixel locations and values.
3. Generate coresets with varying algorithms.
4. Plot the original image alongside coresets generated by each algorithm.
5. Save the resulting plot as an output file.
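
The preprocessing and selection steps above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: it uses plain NumPy slicing and channel averaging instead of `cv2`, a synthetic image instead of a file on disk, and a uniform random subset as a stand-in for a real coreset solver. All function names here (`downsample`, `to_grayscale`, `extract_points`, `random_subset`) are hypothetical.

```python
import numpy as np


def downsample(image: np.ndarray, factor: int) -> np.ndarray:
    """Keep every `factor`-th pixel along both image axes."""
    return image[::factor, ::factor]


def to_grayscale(image: np.ndarray) -> np.ndarray:
    """Average the colour channels of an (H, W, 3) image."""
    return image.mean(axis=2)


def extract_points(gray: np.ndarray) -> np.ndarray:
    """Return an (N, 3) array of (row, column, value) for non-zero pixels."""
    rows, cols = np.nonzero(gray)
    return np.column_stack((rows, cols, gray[rows, cols]))


def random_subset(points: np.ndarray, size: int, seed: int = 0) -> np.ndarray:
    """Stand-in for a coreset solver: sample `size` points without replacement."""
    rng = np.random.default_rng(seed)
    indices = rng.choice(len(points), size=size, replace=False)
    return points[indices]


# Synthetic 8x8 RGB image: black background with a bright 4x4 square.
image = np.zeros((8, 8, 3), dtype=np.uint8)
image[2:6, 2:6] = 200

points = extract_points(to_grayscale(downsample(image, factor=2)))
subset = random_subset(points, size=2)
```

In this toy case, downsampling by 2 leaves a 4x4 image in which four pixels are non-zero, so `points` has four rows and the stand-in "coreset" keeps two of them.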

How Has This Been Tested?

Existing tests pass as expected.

New tests introduced with this change verify that...

Does this PR introduce a breaking change?

Checklist before requesting a review

I have made sure that my PR is not a duplicate.
My code follows the style guidelines of this project.
I have ensured my code is easy to understand, including docstrings and comments where necessary.
I have performed a self-review of my code.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
New and existing unit tests pass locally with my changes.
Any dependent changes have been merged and published in downstream modules.
I have updated CHANGELOG.md, if appropriate.

github-actions · 2024-11-28T16:27:31Z

Performance review

Commit `22efbbd` - Merge `89244b6` into `5039cff`

Statistically significant changes

basic_rpc:
- OLD: compilation 0.7774 units ± 0.08315 units; execution 0.1256 units ± 0.000803 units
- NEW: compilation 0.8478 units ± 0.08107 units; execution 0.1472 units ± 0.009568 units
- Significant increase in execution time (17.23%, p=5.124e-05)

Normalisation values for new data:
Compilation: 1 unit = 406.66 ms
Execution: 1 unit = 888.55 ms
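
As a rough worked example, the normalised figures above can be converted back to wall-clock time. The sketch below only converts the NEW measurements, on the assumption that the stated normalisation values apply to the new data only; the OLD figures may have been normalised against a different baseline.

```python
# Normalisation factors reported for the new data (milliseconds per unit).
COMPILATION_MS_PER_UNIT = 406.66
EXECUTION_MS_PER_UNIT = 888.55

# Mean timings for basic_rpc, in normalised units, as reported above.
new_compilation_units = 0.8478
new_execution_units = 0.1472
old_execution_units = 0.1256

new_compilation_ms = new_compilation_units * COMPILATION_MS_PER_UNIT
new_execution_ms = new_execution_units * EXECUTION_MS_PER_UNIT

# Relative change in execution time, computed from the rounded means.
relative_increase = (new_execution_units - old_execution_units) / old_execution_units

print(f"compilation: {new_compilation_ms:.1f} ms")
print(f"execution: {new_execution_ms:.1f} ms")
print(f"execution change: {relative_increase:.1%}")
```

The rounded means give an increase of about 17.2%, close to the reported 17.23%; the small gap is presumably because the report works from unrounded values.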

github-actions · 2024-11-29T10:01:24Z

Performance review

Commit `5302c89` - Merge `540fb7e` into `87dd4a7`

No significant changes to performance.

bk958178

Thanks, @qh681248. Some minor changes requested.

bk958178 · 2024-12-03T16:20:48Z

.pylintrc

@@ -556,7 +556,7 @@ contextmanager-decorators=contextlib.contextmanager
 # system, and so shouldn't trigger E1101 when accessed. Python regular
 # expressions are accepted.
 generated-members=
-
+    cv2.*


Can you tell me why this has been added? Are we accessing non-existent members of cv2?

bk958178 · 2024-12-03T16:22:38Z

benchmark/david_benchmark.py

+
+The benchmarking process follows these steps:
+1. Load an input image and downsample it using a specified factor.
+2. Convert to grayscale and extract non-zero pixel locations and values.


What is meant by 'non-zero pixel locations and values'? Does this mean pixels with a value above zero, i.e., pixels that are not completely black?
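
For reference, one reading of that docstring line is sketched below on a tiny hand-made grayscale array. This assumes the extraction behaves like `np.nonzero`, which the diff excerpt does not confirm: under that reading, any pixel whose grayscale value is exactly 0 (pure black) is dropped, and every other pixel contributes its (row, column) location and its intensity.

```python
import numpy as np

# Tiny grayscale image: 0 is black, larger values are brighter.
gray = np.array(
    [
        [0, 12, 0],
        [255, 0, 3],
    ],
    dtype=np.uint8,
)

# np.nonzero returns the row and column indices of every entry that is not 0.
rows, cols = np.nonzero(gray)
locations = np.column_stack((rows, cols))
values = gray[rows, cols]
```

Here `locations` holds the three positions (0, 1), (1, 0) and (1, 2), and `values` holds the matching intensities 12, 255 and 3. Note that a very dark pixel such as the one with value 3 is still kept.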

bk958178 · 2024-12-03T16:23:06Z

benchmark/david_benchmark.py

+The benchmarking process follows these steps:
+1. Load an input image and downsample it using a specified factor.
+2. Convert to grayscale and extract non-zero pixel locations and values.
+3. Generate coresets with varying algorithms.


Change 'varying' to 'various' or 'different'.

bk958178 · 2024-12-03T16:24:55Z

benchmark/david_benchmark.py

+Benchmark performance of different coreset algorithms on pixel data from an image.
+
+The benchmarking process follows these steps:
+1. Load an input image and downsample it using a specified factor.


Can you briefly describe why we downsample the image?

bk958178 · 2024-12-03T16:29:31Z

benchmark/david_benchmark.py

+    )
+
+    # Set up the original data object and coreset parameters
+    data = Data(pre_coreset_data)


Do we need to convert this to a JAX array first? For example: `data = Data(jnp.asarray(pre_coreset_data))`

bk958178 · 2024-12-03T16:30:17Z

benchmark/david_benchmark.py

+
+    # Set up the original data object and coreset parameters
+    data = Data(pre_coreset_data)
+    coreset_size = 8_000 // (downsampling_factor**2)


Why is this the coreset size? Can you add detail to the docstring above to explain?
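
One plausible reading, which the PR itself does not state, is that dividing by `downsampling_factor**2` keeps the coreset a fixed fraction of the image: downsampling by a factor `f` along each axis cuts the pixel count by roughly `f**2`. The sketch below checks that arithmetic for a hypothetical 1000x1000 image, assuming downsampling keeps every `f`-th row and column; `pixel_count` and `coreset_size` are illustrative helpers, not functions from the PR.

```python
def pixel_count(height: int, width: int, downsampling_factor: int) -> int:
    """Pixels left after keeping every `downsampling_factor`-th row and column."""
    rows = -(-height // downsampling_factor)  # ceiling division
    cols = -(-width // downsampling_factor)
    return rows * cols


def coreset_size(downsampling_factor: int, base_size: int = 8_000) -> int:
    """Coreset size as in the diff: base size divided by the area reduction."""
    return base_size // (downsampling_factor**2)


for factor in (1, 2, 4):
    pixels = pixel_count(1_000, 1_000, factor)
    size = coreset_size(factor)
    print(factor, pixels, size, size / pixels)
```

For this hypothetical image the ratio of coreset size to total pixels stays at 0.008 for each factor. The ratio to non-zero pixels would depend on the image content.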

bk958178 · 2024-12-03T16:32:19Z

benchmark/david_benchmark.py

+
+    # Initialize each coreset solver
+    key = random.PRNGKey(0)
+    solvers = initialise_solvers(Data(jnp.array(pre_coreset_data)), key)


Why are we doing Data(jnp.array(pre_coreset_data)) again here? Surely this can just be:

solvers = initialise_solvers(data, key).

NB: I'm not sure which is correct for data - either Data(jnp.array(pre_coreset_data)) or Data(jnp.asarray(pre_coreset_data))
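
On the `jnp.array` versus `jnp.asarray` question, a small sketch of the difference as documented by JAX: `jnp.array` copies its input by default (`copy=True`), while `jnp.asarray` leaves the copy decision open and may avoid one where possible. For a NumPy input, both return a JAX array with the same values. The array below is a hypothetical stand-in for `pre_coreset_data`; whether `Data` performs its own conversion is not shown in the diff excerpt.

```python
import jax
import jax.numpy as jnp
import numpy as np

# Hypothetical stand-in for `pre_coreset_data`: rows of (row, column, value).
pre_coreset_data = np.array([[0.0, 1.0, 12.0], [1.0, 0.0, 255.0]])

copied = jnp.array(pre_coreset_data)  # copy=True by default
converted = jnp.asarray(pre_coreset_data)  # may avoid a copy where possible

same_type = isinstance(copied, jax.Array) and isinstance(converted, jax.Array)
same_values = bool(jnp.array_equal(copied, converted))
```

Either call should work for a one-off conversion; the more relevant point for the review comment is to convert once and pass the same `data` object to both places.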

qh681248 added 2 commits November 28, 2024 16:17

feat: initial commit for David benchmarking

2e447f8

chore: sync requirements-doc.txt with uv.lock

89244b6

feat: refactored code in mnist_benchmark to avoid code repetition in …

540fb7e

…david_benchmark.py

bk958178 self-requested a review December 3, 2024 15:54

bk958178 requested changes Dec 3, 2024
