refactor: break fraudAssessment into evaluations #442

bajtos · 2025-01-09T14:22:18Z

Break Measurement.fraudAssessment into two new fields:

taskingEvaluation for the tasking-related checks
Example: DUP_INET_GROUP
majorityEvaluation for the result of the majority seeking process based on committees.
Example: MINORITY_RESULT

After this change, places filtering "accepted" measurements have to explicitly spell out how they define "accepted".

Some places are interested in tasking evaluation results only and consider minority results as "accepted" too. Example: RSR calculated from individual measurements.
Other places are stricter and want only measurements in majority. Example: which measurements to reward.

However, this pull request is intended to be pure refactoring with no change in the functionality. It should simply expand the check fraudAssessment === 'OK' into taskingEvaluation === 'OK' && majorityEvaluation === 'OK' and surface the places where we may want to include minority measurements too. Such changes can be implement in follow-up pull requests.

This is a follow-up for #396 (comment)

See also #439

Break `Measurement.fraudAssessment` into two new fields: - `taskingEvaluation` for the tasking-related checks Example: DUP_INET_GROUP - `majorityEvaluation` for the result of the majority seeking process based on committees. Example: MINORITY_RESULT After this change, places filtering "accepted" measurements have to explicitly spell out how they define "accepted". - Some places are interested in tasking evaluation results only and consider minority results as "accepted" too. Example: RSR calculated from individual measurements. - Other places are stricter and want only measurements in majority. Example: which measurements to reward. Signed-off-by: Miroslav Bajtoš <[email protected]>

Signed-off-by: Miroslav Bajtoš <[email protected]>

bajtos

🧵 TODO items blocking the conversion from "draft" to "ready" 👇🏻

lib/evaluate.js

bin/evaluate-measurements.js

bajtos · 2025-01-09T14:28:59Z

lib/round.js

-    /** @type {import('./preprocess.js').Measurement[]} */
+    /** @type {string[]} */
    this.measurementBatches = []
+    /** @type {Measurement[]} */
    this.measurements = []


The field measurementBatches stores a list of CIDs, see here:

spark-evaluate/lib/preprocess.js

Line 119 in 6709840

round.measurementBatches.push(cid)

bajtos

More TODOs 👇🏻

lib/platform-stats.js

lib/retrieval-stats.js

Signed-off-by: Miroslav Bajtoš <[email protected]>

lib/retrieval-stats.js

bin/dry-run.js

juliangruber · 2025-01-10T14:14:16Z

bin/evaluate-measurements.js

@@ -98,22 +98,24 @@ async function processRound (roundIndex, measurements, resultCounts) {
  })

  for (const m of round.measurements) {
-    if (m.fraudAssessment !== 'OK') continue
+    if (m.taskingEvaluation !== 'OK') continue


This is also a change in behavior. Previously, majority failures would be rejected here, now they are kept.

Before I continue with the review, can we please resolve this and the above comment? It looks like this question affects most of the changes.

Great catch, I'll fix this place to preserve the current behaviour and add a FIXME comment similar to the one on lines 109-111 below.

I'd also like to point out that this is a helper script for SPs to inspect recent Spark measurements, it's known to be flawed (see #396), and yet nobody complained about that bug yet - so again, I think the impact of a breaking change in this file is low.

I re-read the PR description and now see that this change in behavior might be intentional. Are you confident that all changes in behavior in this PR are intentional?
(...)
Before I continue with the review, can we please resolve this and the above comment? It looks like this question affects most of the changes.

I started this PR with the intention to combine refactoring with changes fixing the conditions determining which measurements are considered as accepted. I quickly realise that would make it difficult to reason about the changes in the future, so I decided to pivot and change this pull request into pure refactoring with no change in functionality.

I updated the pull request description to clearly spell out this intention.

I am 90% confident that this pull request does not introduce any change in the functionality in the lib files. I wasn't paying as much attention to bin files, but I think you caught the remaining places to fix and this PR is not introducing any unintentional changes now.

I searched for all occurences of fraudAssessment === in the pull request and verified that they are all expanded to taskingEvaluation === 'OK' && majorityEvaluation === 'OK'.

I found one place I had to fix - see 16938d8

Co-authored-by: Julian Gruber <[email protected]>

Signed-off-by: Miroslav Bajtoš <[email protected]>

bajtos requested review from juliangruber and NikolasHaimerl January 9, 2025 14:22

fixup! remove unused import

9e59ed6

Signed-off-by: Miroslav Bajtoš <[email protected]>

bajtos commented Jan 9, 2025

View reviewed changes

This was referenced Jan 9, 2025

refactor: remove unused param honestMeasurements #443

Merged

fix: fetch-recent-miner-measurements & committees #396

Open

bajtos commented Jan 9, 2025

View reviewed changes

lib/platform-stats.js Show resolved Hide resolved

lib/retrieval-stats.js Show resolved Hide resolved

lib/retrieval-stats.js Show resolved Hide resolved

Merge branch 'main' into refactor-fraud-assessment-step2

07be2cc

bajtos mentioned this pull request Jan 10, 2025

Include non-majority measurements in retrieval result breakdown #446

Open

bajtos added 5 commits January 10, 2025 10:16

fixup! add FIXME comments

1392633

Signed-off-by: Miroslav Bajtoš <[email protected]>

test: we don't reward measurements that are not in majority

11d896e

Signed-off-by: Miroslav Bajtoš <[email protected]>

Merge branch 'main' into refactor-fraud-assessment-step2

8731b08

test: counts only majority measurements as accepted

01dc39f

Signed-off-by: Miroslav Bajtoš <[email protected]>

test: records histogram of "score per inet group"

c08c10a

Signed-off-by: Miroslav Bajtoš <[email protected]>

bajtos marked this pull request as ready for review January 10, 2025 09:43

NikolasHaimerl reviewed Jan 10, 2025

View reviewed changes

lib/retrieval-stats.js Show resolved Hide resolved

juliangruber reviewed Jan 10, 2025

View reviewed changes

bajtos and others added 3 commits January 10, 2025 15:23

Update bin/dry-run.js

aa08724

Co-authored-by: Julian Gruber <[email protected]>

fix evaluate-measurements

f8a0505

Signed-off-by: Miroslav Bajtoš <[email protected]>

preserve behaviour in bin/evaluate-measurements.js

16938d8

Signed-off-by: Miroslav Bajtoš <[email protected]>

bajtos requested review from juliangruber and NikolasHaimerl January 10, 2025 14:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: break fraudAssessment into evaluations #442

refactor: break fraudAssessment into evaluations #442

bajtos commented Jan 9, 2025 •

edited

Loading

bajtos left a comment

bajtos Jan 9, 2025

bajtos left a comment

juliangruber Jan 10, 2025

bajtos Jan 10, 2025

bajtos Jan 10, 2025

bajtos Jan 10, 2025

refactor: break fraudAssessment into evaluations #442

Are you sure you want to change the base?

refactor: break fraudAssessment into evaluations #442

Conversation

bajtos commented Jan 9, 2025 • edited Loading

bajtos left a comment

Choose a reason for hiding this comment

bajtos Jan 9, 2025

Choose a reason for hiding this comment

bajtos left a comment

Choose a reason for hiding this comment

juliangruber Jan 10, 2025

Choose a reason for hiding this comment

bajtos Jan 10, 2025

Choose a reason for hiding this comment

bajtos Jan 10, 2025

Choose a reason for hiding this comment

bajtos Jan 10, 2025

Choose a reason for hiding this comment

bajtos commented Jan 9, 2025 •

edited

Loading