Syntactic completion #1257

let-def · 2021-02-04T21:31:07Z

This PR implements completion based on language grammar (this complements the existing completion based on semantic information).

Here is a rough explanation of the algorithm, taken from the Syntactic_completion module:

Generate syntactic completion by analysing the parser stack.
The task is split in a few steps:

First enumerate all reachable states by simulating all possible
reductions.
This is done by the Lookahead, Level and Stack modules. Lookahead keep
tracks of a set of lookahead terminals: rather than simulating
separately for each possible lookahead token, we regroup lookahead
tokens that trigger the same reduction, such that a given reduction is
simulated only once.
Level keep track of all goto transitions to simulate on a given stack
frame (hence the reductions are grouped per stack "level").
Stack modules simulate the stack reduction.

Auxiliary information are provided by:

Parser_complete.state_to_reduction_table: the list of reductions to
simulate when in a given state, already structured per level

Parser_complete.state_goto_table: a naive but sufficient
representation of the LR goto table.
TODO: replace it by Menhir builtin goto table when possible (need to
patch Menhir).

After that, we have the list of all states that were reached, and for
each state, the set of lookahead tokens that led to it.
The list is ordered: the state that is the deepest in the stack comes
first. This is done to favor completions that will close the most
syntactic constructions over completions that might open new nested
constructions.
In practice, it means that in this example :
module M = struct
let v = if true then x
Completing after the x, the suggestions will be:

first end, to close the structure

then in, to transforme the module let into an expression let

finally else, to turn the if then into an if then else.
This order will be preserved by subsequent transformations.

Then we turn each reached state into an "item set":

Parser_complete.state_closure_table associates to each state the
states that can be reached by following "null" reductions.
(e.g. if we are in let . rec? we can each let rec? . by assuming
the rec flag is missing: rec? is a nullable reduction)
TODO: maybe we can remove this step by simulating the closure from
production definition, the runtime cost should be negligible.

Parser_complete.items_table associates a state to its itemset, in the
form of a list of pair of (production, dot position)

The item sets are transformed into sequence of symbols by looking
them up in Parser_complete.productions table, that contains the
definition of each production.
Extra step: we need to simulate the "closure" of the itemset.
For instance if we have an item that looks like . expression, we don't
want to stop there and just suggest "expression", rather we want to
expand the expression to its definition.
This is done using Parser_complete.nonterminal_to_productions that lists
all the productions that can produce a non-terminal.

Now we have a list of symbols that constitutes valid continuations of
the current parsing. We need to turn them into readable definitions that
can be presented to the user.
First, we keep only the ones that starts with tokens we consider
"interesting" (mostly keywords) using "is_interesting_terminal".
After that, for each starting terminal, we only keep the shortest
sentence that can complete it. For instance,
{ if ... then ... , if ... then ... else ... ,
let ... = ... , let ... = ... in ... }
is simplified too
{ if ... then ... , let ... = ... }
Then we turn terminals into text using Parser_printer and replace
non-terminals by "..."

[completion_for_parser] runs to whole pipeline.

TODO:

The pipeline is not really efficient, some intermediate structures could be prune early to avoid a lot of redundant computations
Add tests to the testsuite
Tested in VIM. What about other editors?

let-def · 2021-02-04T21:36:02Z

Here is an example showing syntactic completion in an expression:

let-def · 2021-02-04T21:37:48Z

TODO:
Syntactic completion engine does not handle well completing when the cursor is on an existing word.
Fix handling when the cursor is

in the middle of a word
at the end of a word

let-def added 8 commits January 31, 2021 13:41

WIP: completion generator

acbbb90

LR0 completion transitions generation

5a83047

WIP Syntactic completion

5cfb430

fix

1c6db5f

Filter interesting terminals/tokens and replace non-terminals by ...

e73d0b9

Cleanup dune rules, document syntactic_completion module

9f77d4d

Commit generated parser completion data

a20fc66

Add copyright headers

7905caf

let-def added 2 commits February 5, 2021 13:40

Mreader_lexer.for_completion: snapshot parser before inserted identifier

bd8b4f4

Forgot the mli :)

1865e89

trefis force-pushed the master branch from cf6700e to a8fa9db Compare July 13, 2021 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Syntactic completion #1257

Syntactic completion #1257

let-def commented Feb 4, 2021

let-def commented Feb 4, 2021

let-def commented Feb 4, 2021 •

edited

Syntactic completion #1257

Are you sure you want to change the base?

Syntactic completion #1257

Conversation

let-def commented Feb 4, 2021

let-def commented Feb 4, 2021

let-def commented Feb 4, 2021 • edited

let-def commented Feb 4, 2021 •

edited