LSP: action to remove redundant tuple wrapper from case expression's subject #2982

giacomocavalieri · 2024-04-13T07:21:44Z

The compiler warns about unnecessary wrapping of tuples in a case's subject, it would be nice if an action could automatically fix that:

case #(wibble, wobble) {
//   ~~~~~~~~~~~~~~~~~ Action to remove redundant tuple
}

And an action to batch remove it from all cases in case there's more than one, much like with unused imports

lpil · 2024-04-15T18:50:03Z

Good idea!

nicklimmm · 2024-04-25T03:46:15Z

This looks interesting and might be simple enough for newcomers, I'd like to work on this!

lpil · 2024-04-25T10:01:42Z

Awesome! Though I think it might be quite hard to write the code that edits the existing expression to remove the tuple. We don't have any mechanism for that today.

nicklimmm · 2024-04-25T11:44:39Z

I'm thinking of using the Visitor pattern to visit the case expressions (need to implement this before continuing). I believe the pattern could be reused for future use.

An example: syn crate for Rust syntax tree - syn::visit and syn::visit_mut

A general idea to solve this issue:

Visit each TypedExpr::Case and UntypedExpr::Case
For each subject, if it is a tuple -> add lsp_types::TextEdit with the corresponding range and new text without #()

I've found something similar to walking the AST in UntypedExprFolder, I'm not sure why it exists though.

nicklimmm · 2024-04-26T13:35:35Z

I'm currently writing a proof of concept and looks promising so far. Will push a draft PR soon.

lpil · 2024-04-26T13:44:40Z

The untyped AST isn't available here, and if it were it's an AST not a CST so a parse->modify-print loop would result in formatting and comments to all be discarded.

nicklimmm · 2024-04-26T13:56:44Z

The untyped AST isn't available here, and if it were it's an AST not a CST so a parse->modify-print loop would result in formatting and comments to all be discarded.

Does this mean that we shouldn't walk on TypedModule or TypedExpr to identify the case expression nodes of concern?

lpil · 2024-04-26T14:38:50Z

You can identify them but there's not an easy to way to perform the edit after that.

nicklimmm · 2024-04-27T07:42:10Z

You can identify them but there's not an easy to way to perform the edit after that.

I think I get what you mean now, I'm guessing that we should modify the (typed) AST -> prettified string output -> final text edits.

But from what I've observed, the prettifier is only implemented for untyped AST, while we only have typed AST in this case. 🤔

The AST modification looks something like this:

flowchart LR
    before --> |collapse| after
    subgraph before
    C[TypedExpr::Case] --> S{subjects}
    S --> T1
    S --> T2
    T1[TypedExpr::Tuple]
    T1 --> E1[TypedExpr]
    T1 --> E2[TypedExpr]
    T2[TypedExpr::Tuple]
    T2 --> E3[TypedExpr]
    T2 --> E4[TypedExpr]
    end

    subgraph after
    C2[TypedExpr::Case] --> S2{subjects}
    S2 --> E21[TypedExpr]
    S2 --> E22[TypedExpr]
    S2 --> E23[TypedExpr]
    S2 --> E24[TypedExpr]
    end

lpil · 2024-04-27T10:27:29Z

The formatter modifies the formatting of the code. We don't want to change any code beyond removing the tuple so it's not possible to implement it by editing and printing an AST. If we had a CST we could modify and print that, but we don't have a CST or a parser for one yet.

nicklimmm · 2024-04-27T15:27:24Z

Should we implement CSTs before tackling this issue (and possibly many more LSP-related ones)? Will definitely take a lot of effort to build the infrastructure.

I've found out that rust-analyzer uses rowan for their lossless syntax trees and ungrammar to define their CST structure (which is then used for codegen the types and trait impls to Rust code).

lpil · 2024-04-27T17:20:05Z

Whether it's worthwhile to implement CSTs for this isn't clear to me. It would be a very large amount of work and it's only one of the options. I thing it is likely that something else may be more appropriate. Something more lightweight which outputs patches to the text file.

Rather I was explaining why printing an AST wouldn't work, a CST would be required.

nicklimmm · 2024-04-27T17:58:56Z

I'll try to explore the available options and share some insights later on.

lpil · 2024-04-27T20:56:33Z

Just had a thought. We can always delete the first 2 bytes from the start and last byte from the end as we dont need to remove the commas. No need to know anything but the code span of each tuple!

giacomocavalieri · 2024-04-27T23:12:52Z

Mmh would that work even if I wrote the code like this? # ( 1,2 ) (notice the space between the hashtag and the open parentheses) because technically that's a valid tuple and I guess we can't assume the code we're analysing is well formatted

nicklimmm · 2024-04-28T03:26:19Z

Mmh would that work even if I wrote the code like this? # ( 1,2 ) (notice the space between the hashtag and the open parentheses) because technically that's a valid tuple and I guess we can't assume the code we're analysing is well formatted

We can delete from the start span of the tuple to the start span of the first element (removing the opening), and from the end span of the last item to the end span of the tuple (removing the closing).

We need to handle comments in between those to prevent any deletion.

nicklimmm · 2024-04-28T17:13:09Z

After some thought, I'm thinking of a simpler method to retain comments and newlines: delete #, (, a trailing comma for the final tuple element (if applicable), and ).

Determining both # and ) is trivial, which is just at the start and the end of the tuple span, respectively.

To get the correct ( even with comments after the #, we scan for the first ( that doesn't belong to a comment. Similar idea for the trailing comma, but we scan from right to left.

Example

Even though this looks cursed, but this is valid:

// Essentially: `case #(1) { a -> 0 }`
case #
  
// ( <- should not delete this one
(
// (
1
// ,
,
// , <- should not delete this one
)

{
    a -> 0
}

The result after the change should look like this:

// Essentially: `case 1 { a -> 0 }`
case 
  
// ( <- should not delete this one

// (
1
// ,

// , <- should not delete this one


{
    a -> 0
}

Could there be any edge case that I miss?

lpil · 2024-04-28T22:08:37Z

Ah! I thought #( was one token. My bad! I forgot about trailing commas too.

nicklimmm · 2024-04-29T15:11:35Z

Seems like the method of deleting each of #, (, and ) works. The (linked) PR is up.

lpil added help wanted Contributions encouraged good first issue Good for newcomers priority:medium labels Apr 15, 2024

nicklimmm mentioned this issue Apr 28, 2024

LSP: code action to remove redundant tuple in case subject #3057

Merged

lpil added area:language-server and removed stream:language-server labels May 3, 2024

lpil closed this as completed in #3057 May 15, 2024

lpil added this to the LS01 milestone May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LSP: action to remove redundant tuple wrapper from case expression's subject #2982

LSP: action to remove redundant tuple wrapper from case expression's subject #2982

giacomocavalieri commented Apr 13, 2024 •

edited

lpil commented Apr 15, 2024

nicklimmm commented Apr 25, 2024

lpil commented Apr 25, 2024

nicklimmm commented Apr 25, 2024 •

edited

nicklimmm commented Apr 26, 2024

lpil commented Apr 26, 2024

nicklimmm commented Apr 26, 2024

lpil commented Apr 26, 2024

nicklimmm commented Apr 27, 2024

lpil commented Apr 27, 2024

nicklimmm commented Apr 27, 2024 •

edited

lpil commented Apr 27, 2024 •

edited

nicklimmm commented Apr 27, 2024

lpil commented Apr 27, 2024

giacomocavalieri commented Apr 27, 2024 •

edited

nicklimmm commented Apr 28, 2024 •

edited

nicklimmm commented Apr 28, 2024 •

edited

lpil commented Apr 28, 2024

nicklimmm commented Apr 29, 2024

LSP: action to remove redundant tuple wrapper from case expression's subject #2982

LSP: action to remove redundant tuple wrapper from case expression's subject #2982

Comments

giacomocavalieri commented Apr 13, 2024 • edited

lpil commented Apr 15, 2024

nicklimmm commented Apr 25, 2024

lpil commented Apr 25, 2024

nicklimmm commented Apr 25, 2024 • edited

nicklimmm commented Apr 26, 2024

lpil commented Apr 26, 2024

nicklimmm commented Apr 26, 2024

lpil commented Apr 26, 2024

nicklimmm commented Apr 27, 2024

lpil commented Apr 27, 2024

nicklimmm commented Apr 27, 2024 • edited

lpil commented Apr 27, 2024 • edited

nicklimmm commented Apr 27, 2024

lpil commented Apr 27, 2024

giacomocavalieri commented Apr 27, 2024 • edited

nicklimmm commented Apr 28, 2024 • edited

nicklimmm commented Apr 28, 2024 • edited

Example

lpil commented Apr 28, 2024

nicklimmm commented Apr 29, 2024

giacomocavalieri commented Apr 13, 2024 •

edited

nicklimmm commented Apr 25, 2024 •

edited

nicklimmm commented Apr 27, 2024 •

edited

lpil commented Apr 27, 2024 •

edited

giacomocavalieri commented Apr 27, 2024 •

edited

nicklimmm commented Apr 28, 2024 •

edited

nicklimmm commented Apr 28, 2024 •

edited