Additional Metadata for Schema? #192

Open
jamesmunns opened this issue Nov 28, 2024 · 6 comments

@jamesmunns (Owner)

CC @max-heller and #179

There have been a couple of asks for additional schema metadata. Off the top of my head:

  • Things like max size (for bounded types, and possibly annotations for unbounded types)
  • Things like descriptions/comments (though some of this veers into postcard-rpc's Endpoints and Topics, which tend to benefit from metadata as well)

Open questions would be:

  • Should (some or all of) these fields affect the schema hash calculation?
  • How can users opt-in/out of sending this information over the wire?
@max-heller (Collaborator)

@jamesmunns (Owner, Author)

Whatcha mean by custom enum discriminants? (Postcard specifically states that it uses "lexical ordering")

@max-heller (Collaborator)

max-heller commented Nov 28, 2024

Whatcha mean by custom enum discriminants? (Postcard specifically states that it uses "lexical ordering")

enum Foo {
    A = 1,
    ...
}

Similar to comments, serde and postcard don't care about discriminants and use a 0-indexed "lexical ordering", but some use cases of Schema as a reflection mechanism might.
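
For illustration, a quick check of that behavior (a sketch assuming postcard's alloc feature for to_allocvec): the declared discriminants never reach the wire, only the 0-based variant index does.

use serde::Serialize;

#[derive(Serialize)]
enum Foo {
    A = 1, // explicit discriminant, ignored by serde/postcard
    B = 7,
}

fn main() {
    // postcard writes the 0-based variant index as a varint, not 1 and 7:
    assert_eq!(postcard::to_allocvec(&Foo::A).unwrap(), vec![0u8]);
    assert_eq!(postcard::to_allocvec(&Foo::B).unwrap(), vec![1u8]);
}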

@max-heller (Collaborator)

Things like max size (for bounded types, and possibly annotations for unbounded types)

Annotations as in something like this?

#[postcard(serialized_size(max = 512))]
bytes: Vec<u8>,

Should (some or all of) these fields affect the schema hash calculation?

One way this could work is by having a wrapper type for customizing hashing behavior:

struct HashBy<T> {
    // Which fields to include in the hash
    fields: Fields,
    value: T,
}
// Could be a bitset or something more compact
struct Fields {
    names: bool,
    max_size: bool,
    ...
}
impl Hash for HashBy<NamedType> {} // hash only the pieces selected by `fields`
...

How can users opt-in/out of sending this information over the wire?

I could see this working with a SerializeWith<T> wrapper (similar to the one above for hashing) combined with optional/defaulted fields for max size, comments, etc. on the deserializing side.
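
As a rough sketch of the optional/defaulted-fields half of that idea (the type and field names below are made up, not postcard-schema's real types), Option<T> keeps omitted metadata down to a single None tag byte in postcard's non-self-describing format, as long as both peers agree the field exists:

use serde::{Deserialize, Serialize};

// Hypothetical metadata-carrying schema node; not postcard-schema's actual types.
#[derive(Serialize, Deserialize)]
struct FieldMeta {
    name: String,
    doc: Option<String>,   // None => no comment sent over the wire
    max_size: Option<u32>, // None => unbounded / unannotated
}

fn main() {
    let bare = FieldMeta { name: "bytes".into(), doc: None, max_size: None };
    let full = FieldMeta {
        name: "bytes".into(),
        doc: Some("payload".into()),
        max_size: Some(512),
    };
    // Each omitted Option costs one byte (the None tag); present metadata
    // adds its encoded length on top (assumes postcard's alloc feature).
    let a = postcard::to_allocvec(&bare).unwrap();
    let b = postcard::to_allocvec(&full).unwrap();
    assert!(a.len() < b.len());
}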

One other open question:

  • How much will additional (unused) metadata affect type and binary sizes? Ideally it could be optimized out and comment strings wouldn't end up getting embedded in binaries if only the basic schema is needed, but I'm not sure just how smart the compiler is with consts

@jamesmunns (Owner, Author)

re: hashing, I specifically meant what postcard-rpc does for creating a Key from a NamedType.

re: annotations, it could mean that! It would specifically be an annotation used when deriving postcard-schema::Schema (from the postcard-derive crate). This data would show up in NamedType (or somewhere similar). It's unclear if/how this would affect postcard itself (e.g. should we reject serializing/deserializing types that exceed this annotation? for example, a String with a max of 512 that contains 600 bytes).

re: enum discriminants, hmm, that makes sense; I wonder if including them adds more confusion than value.

re: "How much will additional (unused) metadata affect type and binary sizes", I would assume for "non-postcard-rpc users", it would be elided. However postcard-rpc supports sending the schemas for all endpoints, so I would assume it would be included in those cases. I don't assume the compiler is smart enough (yet) to totally remove unused fields (only totally unused consts).

@max-heller (Collaborator)

re: hashing, I specifically meant what postcard-rpc does for creating a Key from a NamedType.

The least surprising option would probably be to consider only the pieces that break wire compatibility if changed, i.e. only the serde-relevant pieces.
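
For example (a sketch assuming postcard's alloc feature): field names are one piece that doesn't affect postcard's wire format, so a rename alone doesn't break wire compatibility.

use serde::Serialize;

#[derive(Serialize)]
struct V1 { count: u32, name: String }

// Same shape with renamed fields: identical postcard wire format.
#[derive(Serialize)]
struct V2 { total: u32, label: String }

fn main() {
    let a = postcard::to_allocvec(&V1 { count: 7, name: "x".into() }).unwrap();
    let b = postcard::to_allocvec(&V2 { total: 7, label: "x".into() }).unwrap();
    // Identical bytes: a name-only change keeps wire compatibility.
    assert_eq!(a, b);
}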

should we reject serializing/deserializing types that exceed this annotation? for example a String with a max of 512 but contains 600 bytes

It would be nice to have a serializer/deserializer flag to reject oversized inputs (re: #135), but that might be tricky to integrate with the various postcard::from_*() helpers.
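
Until something like that exists, a caller-side guard layered over the existing helpers is probably the workaround; a minimal sketch, assuming postcard's alloc feature and the hypothetical 512-byte bound from the annotation above:

use serde::Deserialize;

#[derive(Deserialize)]
struct Msg {
    bytes: Vec<u8>,
}

// Manual post-deserialization guard; postcard itself does not enforce any
// size annotation today, and 512 is the hypothetical bound from above.
fn from_bytes_bounded(input: &[u8]) -> Result<Msg, &'static str> {
    let msg: Msg = postcard::from_bytes(input).map_err(|_| "deserialize failed")?;
    if msg.bytes.len() > 512 {
        return Err("`bytes` exceeds annotated max of 512");
    }
    Ok(msg)
}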

re: enum discriminants, hmm, that makes sense, I wonder if this adds more confusion than is useful.

It might, but I wanted to mention it since discriminants are meaningful in some cases.
