feat(fmt): An attempt at aesthetic items into PL #4639

max-sixty · 2024-06-20T00:45:44Z

By adding comments (named "aesthetics" here, and includes linewraps) to PL, this is an attempt to get around the complications of combining lexer + parser output in prqlc fmt, which #4397 has hit in a few incarnations.

This very very nearly works — with chumsky we can create a function that wraps anything that might have a trailing or following comment, implement a trait on the AST items that contain it — and away we go. (though it did require lots of debugging in the end...). The AST would then be really easy to write back out.

This requires comments to lead or follow tokens that are part of an AST item. I think there's literally a single case where it doesn't work, which is when a comment follows the final trailing comma of a tuple or array. So apart from that case, a comment always leads or follows a token that's part of an AST item.

...so tests fail at the moment, on that case.

Next we need to consider:

Can we workaround that one case? We don't actually care about whether there's a trailing comma — it has no semantic meaning — and we're going to override that when we write it out, so we could likely hack around it...
Are there actually other cases of this model failing? I know this approach — of putting aesthetic items into AST — is not generally favored. And at a meta level we should be skeptical of the claim "there's just a single case of something not working" — bad models generally have multiple failures!

This is an attempt to get around the complications of managing lexer + parser output, which PRQL#4397 has hit in a few incarnations by just adding comments ('aesthetics') to PL. This very very nearly works -- with chumsky we can create a function that wraps anything that might have a comment, implement a trait on the AST items that contain it, and away we go (though it did require a lot of debugging in the end). This would then be really easy to write back out. I think there's literally a single case where it doesn't work -- where a comment doesn't come directly before or directly after an AST item -- in the final trailing comma of a tuple or array. So tests fail at the moment. Next we need to consider: - Can we workaround that one case? We don't actually care about whether there's a trailing comma, so we could likely hack around it... - Are there actually other cases of this model failing? I know this approach -- of putting aesthetic items into AST -- is not generally favored, and it's really rare that there's even a single case of something not working.

Extracting this from PRQL#4639

max-sixty · 2024-06-24T18:17:28Z

One thing we could do land from this is doc-comments — which must be attached to an item, and we may want to push through the AST. It's much less important to the project than getting prqlc fmt to work, but would let us merge something from this work rather than having a bunch of PRs & branches gradually accumulating merge conflicts...

aljazerzen · 2024-06-24T20:50:33Z

Oh, this is an interesting idea: instead of discarding all comments and new lines, we keep them in the first AST, so they can be re-incorporated into codegen.

It's a shame that it doesn't work for all the cases. It would be a beautiful solution for formatting comments. Could we parse trailing comma as an aesthetic too?

re doc comments: yes, they are very similar. I think they should be allowed only on statements, so that's even simpler to parse.

max-sixty · 2024-06-24T21:59:08Z

It's a shame that it doesn't work for all the cases. It would be a beautiful solution for formatting comments. Could we parse trailing comma as an aesthetic too?

Yeah, I added some more thoughts at #4397 (comment). It's possible to do it this way, but not as elegant as I first thought — I think it would require lots of backtracking and custom parsers (i.e. not just delimited_by...) to be able to distinguish between the two cases in that comment...

max-sixty mentioned this pull request Jun 20, 2024

refactor!: prqlc-parser major reorg changes, remove prqlc-ast #4634

Merged

2 tasks

max-sixty added 4 commits June 19, 2024 21:27

Merge branch 'main' into add-comments-to-pl

dda1b0c

Merge branch 'main' into add-comments-to-pl

36bfd06

clarify explanation

d4a8fb7

internal: Add Clone to parsers

3ee3cad

Extracting this from PRQL#4639

max-sixty mentioned this pull request Jun 20, 2024

internal: Add Clone to parsers #4642

Merged

max-sixty added 2 commits June 19, 2024 21:42

Merge branch 'add-clone-to-parsers' into add-comments-to-pl

b7269a5

Merge branch 'main' into add-comments-to-pl

9900fc0

max-sixty mentioned this pull request Jun 24, 2024

feat: Add comments to format output #4397

Open

This was referenced Jul 1, 2024

Add comments to fmt #4116

Open

feat: Add DocComments to PR #4701

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(fmt): An attempt at aesthetic items into PL #4639

feat(fmt): An attempt at aesthetic items into PL #4639

max-sixty commented Jun 20, 2024 •

edited

Loading

max-sixty commented Jun 24, 2024

aljazerzen commented Jun 24, 2024

max-sixty commented Jun 24, 2024

feat(fmt): An attempt at aesthetic items into PL #4639

Are you sure you want to change the base?

feat(fmt): An attempt at aesthetic items into PL #4639

Conversation

max-sixty commented Jun 20, 2024 • edited Loading

max-sixty commented Jun 24, 2024

aljazerzen commented Jun 24, 2024

max-sixty commented Jun 24, 2024

max-sixty commented Jun 20, 2024 •

edited

Loading