Add an AST ngram extractor to the Aura framework #1

RootLUG · 2020-11-08T13:10:34Z

There is already an experimental ngram.py in the repository root that is able to extract n-gram features from the source code in the JSON format. This extractor needs to be finished & refactored to port the changes from the new Aura v2.

This extractor should be disabled by default as it would produce huge amounts of data that is not needed during a standard scan but can be enabled when collecting the dataset for the ML.

The text was updated successfully, but these errors were encountered:

RootLUG added the enhancement New feature or request label Nov 8, 2020

RootLUG added this to the Basic support for extracting ML features milestone Nov 8, 2020

RootLUG self-assigned this Nov 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add an AST ngram extractor to the Aura framework #1

Add an AST ngram extractor to the Aura framework #1

RootLUG commented Nov 8, 2020

Add an AST ngram extractor to the Aura framework #1

Add an AST ngram extractor to the Aura framework #1

Comments

RootLUG commented Nov 8, 2020