Milan multiple compression #11083

milanro · 2022-07-13T14:30:14Z

Description

Facility for adding extensions to the SharedPropertyTree without duplicating the code. Added LZ4 compression algorithm.

Does this introduce a breaking change?

No

milanro · 2022-07-13T15:30:42Z

@ruiterr @DLehenbauer This PR is an extension of already implementedcompression at PropertyDDS. This allows us to add various algorithms without code duplication.

milanro · 2022-07-19T14:33:42Z

@DLehenbauer I checked backward compatibility and now this PR can be reviewed and merged.

DLehenbauer · 2022-07-26T18:03:15Z

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExt.ts

+
+export class LZ4PropertyTree extends SharedPropertyTree {
+    public static create(runtime: IFluidDataStoreRuntime, id?: string, queryString?: string) {
+        return runtime.createChannel(id, DeflatedPropertyTreeFactory.Type) as LZ4PropertyTree;


I suspect this should be 'LZ4PropertyTreeFactory.Type'.

When loading a document, the type string is used to find the right factory to instantiate the DDS.

The way the unit test is written, the LZ4 and Deflate factories are never both registered in the same container instance, so the bug doesn't surface.

@DLehenbauer thank you for finding this

DLehenbauer · 2022-07-26T18:57:57Z

@milanro, @dstanesc - I wanted to introduce you to @justus-camp, who has started work on op compression at the Fluid framework layer.

@justus-camp - Milan has been adding experimental compression support to the Property DDS, and is ramping up on tackling compression at the fluid runtime level. @dstanesc has done work to synthetically create real-world data for benchmarking.

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts

DLehenbauer · 2022-07-26T19:23:32Z

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts

+ * This class contains builders of the compression methods used to compress
+ * of summaries and messages with the plugable compression algorithm.
+ */
+class CompressionMethods {


Most JS developers would find a function more natural than a class here:

function createCompressionMethods(encoder, decoder) { return { messageEncoder: { encode: (...) => { ... }, decode: (...) => { ... }, }, summaryEncoder: { encode: (...) => { ... }, decode: (...) => { ... }, }, }; };

Possibly this function should be part of CompressedPropertyTreeFactory.

@DLehenbauer The idea here is complete separation of compression methods from the CompressedPropertyTreeFactory. The only relevant method here is getEncDec and how it is implemented is the responsibility of the extended Factory class. CompressionMethods is only a helper, just a utility class which helps the extended factories to reduce the code if they want this behavior but that is only one way of many. Puristic approach would require here the method getEncDec to be abstract. I have chosen the default behavior using the CompressionMethods but did not want to bind them too closely with CompressedPropertyTreeFactory so I kept them outside in another util class.

Concerning the method usage instead of class usage, I can do it but it looks to me that it might produce messy code putting various responsibilities in one place (one method). I was trying to follow single responsibility pattern and separate the functionality in dedicated methods referencing them in the returned object. I could create 5 functions instead of the class for building encoders, decoders and createCompressionMethods but it looked to me more self explaining if I put them to one class.

Nevertheless I will follow your suggestions if you still think that it is more reasonable to do it by function within the CompressedPropertyTreeFactory.

…ompressedPropertyTreeFactory

milanro · 2022-08-02T13:37:47Z

@DLehenbauer I implemented the createCompressionMethods method and fixed the Typo you suggested above.

DLehenbauer · 2022-08-09T15:30:54Z

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts

+                        // eslint-disable-next-line @typescript-eslint/dot-notation
+                        change["isZipped"] = "1";
+                        change.changeSet = zippedStr;
+                    }


Nit: "zipped" implies deflate to most people. Perhaps substitute "compressed"?

DLehenbauer · 2022-08-09T15:33:53Z

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts

+                decode: (transferChange: IPropertyTreeMessage) => {
+                    // eslint-disable-next-line @typescript-eslint/dot-notation
+                    if (transferChange["isZipped"]) {
+                        const zipped = new Uint8Array(stringToBuffer(transferChange.changeSet, "base64"));


Just FYI - In the new SharedTree, we're using "changeset" rather than "changeSet". (i.e., "changeset" is a single word, like "hashtable".)

However, 'changeSet' is okay if you prefer to be consistent with PropertyTree.

This implementation is dedicated to Property DDS only and will not be used in any other context so I would suggest to stay consistent with Property DDS.

DLehenbauer · 2022-08-09T15:44:15Z

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts

+    public abstract getDecodeFce();
+    private createCompressionMethods(encodeFn, decodeFn): ISharedPropertyTreeEncDec {
+        return {
+            messageEncoder: {


FYI: @justus-camp has landed runtime-level per-op compression yesterday:
#11208

Thanks for pointing to this. I will go through. At first time I do not see any reusable code which we can use here or in general summary compression (such which would determine the compression algorithm based on configuration, environment or any further conditions). The API is dedicated to the fluid operation message : unpackRuntimeMessage(message: ISequencedDocumentMessage) and then hard-coding usage of lz4. So I still need to keep this implementation as is. I would like to point you to #11366 where we could discuss the reusability.

github-actions · 2022-08-09T15:55:43Z

This commit is queued for merging with the next branch! Please ignore this PR for now. Contact @microsoft/fluid-cr-infra for help.

milanro added 3 commits July 7, 2022 16:06

Multiple compress algorithms facility.

0d8090e

Test of LZ4PropertyTree

77afecb

Merge branch 'main' into milan_multiple_compression

7abd34a

milanro requested a review from msfluid-bot as a code owner July 13, 2022 14:30

github-actions bot added the area: dds: propertydds label Jul 13, 2022

tylerbutler added the community-contribution label Jul 13, 2022

github-actions bot added the base: main PRs targeted against main branch label Jul 13, 2022

Copyright fixed!

d846201

DLehenbauer reviewed Jul 26, 2022

View reviewed changes

experimental/PropertyDDS/packages/property-dds/src/propertyTreeExtFactories.ts Show resolved Hide resolved

DLehenbauer reviewed Jul 26, 2022

View reviewed changes

milanro added 3 commits August 1, 2022 13:29

Improper type for LZ4PropertyTree fixed (Typo caused by copy/paste).

7bbdd3b

Merge branch 'main' into milan_multiple_compression

7772d7e

Compresion methods builder moved from class to function / method of C…

938a703

…ompressedPropertyTreeFactory

DLehenbauer reviewed Aug 9, 2022

View reviewed changes

DLehenbauer approved these changes Aug 9, 2022

View reviewed changes

DLehenbauer merged commit 685c9fb into microsoft:main Aug 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Milan multiple compression #11083

Milan multiple compression #11083

milanro commented Jul 13, 2022 •

edited

Loading

milanro commented Jul 13, 2022

milanro commented Jul 19, 2022

DLehenbauer Jul 26, 2022 •

edited

Loading

milanro Aug 1, 2022

DLehenbauer commented Jul 26, 2022

DLehenbauer Jul 26, 2022 •

edited

Loading

milanro Aug 1, 2022

milanro commented Aug 2, 2022

DLehenbauer Aug 9, 2022

DLehenbauer Aug 9, 2022

milanro Aug 9, 2022

DLehenbauer Aug 9, 2022

milanro Aug 9, 2022

github-actions bot commented Aug 9, 2022

Milan multiple compression #11083

Milan multiple compression #11083

Conversation

milanro commented Jul 13, 2022 • edited Loading

Description

Does this introduce a breaking change?

milanro commented Jul 13, 2022

milanro commented Jul 19, 2022

DLehenbauer Jul 26, 2022 • edited Loading

Choose a reason for hiding this comment

milanro Aug 1, 2022

Choose a reason for hiding this comment

DLehenbauer commented Jul 26, 2022

DLehenbauer Jul 26, 2022 • edited Loading

Choose a reason for hiding this comment

milanro Aug 1, 2022

Choose a reason for hiding this comment

milanro commented Aug 2, 2022

DLehenbauer Aug 9, 2022

Choose a reason for hiding this comment

DLehenbauer Aug 9, 2022

Choose a reason for hiding this comment

milanro Aug 9, 2022

Choose a reason for hiding this comment

DLehenbauer Aug 9, 2022

Choose a reason for hiding this comment

milanro Aug 9, 2022

Choose a reason for hiding this comment

github-actions bot commented Aug 9, 2022

milanro commented Jul 13, 2022 •

edited

Loading

DLehenbauer Jul 26, 2022 •

edited

Loading

DLehenbauer Jul 26, 2022 •

edited

Loading