Add NNEF support for `copy` operation #1318

mmagician · 2024-01-31T18:50:14Z

For a .nnef model such as:

graph.nnef:

version 1.0;

graph main(external1) -> (copy1)
{
    external1 = external<scalar>(shape = [1, 28, 28]);
    copy1 = copy(external1);
}

graph.quant:

"external1": zero_point_linear_quantize(zero_point = 0, scale = 0.003921568859368563, bits = 8, signed = false, symmetric = false);
"copy1": zero_point_linear_quantize(zero_point = -128, scale = 0.003921568859368563, bits = 8, signed = true, symmetric = false);

Let me know if I should include a sample .nnef model for testing somewhere?

mmagician · 2024-01-31T18:51:11Z

core/src/ops/math/mod.rs

+element_wise!(copy, Copy, [i8, i16, i32, i64, f16, f32, f64, TDim] => |_, _| {
+ Ok(())
+};
+q: [i8, u8, i32] => |x: f32| x);


Not really sure what this q part does, for now just copied from the other ops

The q section is for dealing with quantized datum types by converting to f32.

kali · 2024-02-01T07:48:18Z

Thanks for your contribution.

But... tract has immutable tensor semantics, so it does not need a copy operator. I must say I fail to see why NNEF needs one to be honest. I assume it's for some kind of aesthetic completion.

So unless I miss something, it should be mapped to... well nothing, or eventually to the operator Identity.

Am I missing something ?

mmagician · 2024-02-09T12:21:09Z

You're right in that it's an identity operator- although it does carry the quantization information that's applied to the input & output.

kali · 2024-02-09T12:39:09Z

Mmm... So should we map it to a cast operator instead ?

I'm saying this knowing tract cast semantics are weak: half of them are conversions and the other half are reinterprets. This need sorting out. But in the meantime, we may be lucky and your model may work...

mmagician · 2024-02-09T12:59:02Z

I'm happy to adapt the PR as you suggest. Will a cast preserve quantization then?

kali · 2024-02-09T13:16:13Z

Maybe I misunderstood. When you mentioned the quantization, I guessed that the copy operator was helping with converting from one conversion to another (as defined per a graph.quant file). Did I got this wrong ?

So if that really the case, you want an operator that will act as a conversion (like, actually recomputing stuff as the bytes representing the same values in the input and output quantization scheme will be different) and not a reinterpret cast (that would just switch the quantization parameter, not computing anything, not altering the bytes in the tensor).

I checked the code, and I think a cast operator between two quantization in tract will do a conversion. But as I said, this is a dark corner of tract at this stage, so we may have surprises.

add NNEF support for copy operation

01c7b4b

mmagician commented Jan 31, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NNEF support for `copy` operation #1318

Add NNEF support for `copy` operation #1318

mmagician commented Jan 31, 2024

mmagician Jan 31, 2024

kali Feb 1, 2024

kali commented Feb 1, 2024

mmagician commented Feb 9, 2024

kali commented Feb 9, 2024 •

edited

mmagician commented Feb 9, 2024

kali commented Feb 9, 2024

Add NNEF support for copy operation #1318

Are you sure you want to change the base?

Add NNEF support for copy operation #1318

Conversation

mmagician commented Jan 31, 2024

mmagician Jan 31, 2024

Choose a reason for hiding this comment

kali Feb 1, 2024

Choose a reason for hiding this comment

kali commented Feb 1, 2024

mmagician commented Feb 9, 2024

kali commented Feb 9, 2024 • edited

mmagician commented Feb 9, 2024

kali commented Feb 9, 2024

Add NNEF support for `copy` operation #1318

Add NNEF support for `copy` operation #1318

kali commented Feb 9, 2024 •

edited