Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add comments into the AST #14352

Open
wants to merge 7 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
116 changes: 110 additions & 6 deletions lib/elixir/lib/code.ex
Original file line number Diff line number Diff line change
Expand Up @@ -1036,12 +1036,13 @@ defmodule Code do
[
unescape: false,
literal_encoder: &{:ok, {:__block__, &2, [&1]}},
include_comments: true,
token_metadata: true,
emit_warnings: false
] ++ opts

{forms, comments} = string_to_quoted_with_comments!(string, to_quoted_opts)
to_algebra_opts = [comments: comments] ++ opts
forms = string_to_quoted!(string, to_quoted_opts)
to_algebra_opts = opts
doc = Code.Formatter.to_algebra(forms, to_algebra_opts)
Inspect.Algebra.format(doc, line_length)
end
Expand Down Expand Up @@ -1203,6 +1204,12 @@ defmodule Code do
* `:emit_warnings` (since v1.16.0) - when `false`, does not emit
tokenizing/parsing related warnings. Defaults to `true`.

* `:include_comments` (since v1.19.0) - when `true`, includes comments
in the quoted form. Defaults to `false`. If this option is set to
`true`, the `:literal_encoder` option must be set to a function
that ensures all literals are annotated, for example
`&{:ok, {:__block__, &2, [&1]}}`.

## `Macro.to_string/2`

The opposite of converting a string to its quoted form is
Expand Down Expand Up @@ -1242,18 +1249,95 @@ defmodule Code do
* atoms used to represent single-letter sigils like `:sigil_X`
(but multi-letter sigils like `:sigil_XYZ` are encoded).

## Comments

When `include_comments: true` is passed, comments are included in the
quoted form.

There are three types of comments:
- `:leading_comments`: Comments that are located before a node,
or in the same line.

Examples:

# This is a leading comment
foo # This one too

- `:trailing_comments`: Comments that are located after a node, and
before the end of the parent enclosing the node(or the root document).

Examples:

foo
# This is a trailing comment
# This one too

- `:inner_comments`: Comments that are located inside an empty node.

Examples:

foo do
# This is an inner comment
end

[
# This is an inner comment
]

%{
# This is an inner comment
}

A comment may be considered inner or trailing depending on wether the enclosing
node is empty or not. For example, in the following code:

foo do
# This is an inner comment
end

The comment is considered inner because the `do` block is empty. However, in the
following code:

foo do
bar
# This is a trailing comment
end

The comment is considered trailing to `bar` because the `do` block is not empty.

In the case no nodes are present in the AST but there are comments, they are
inserted into a placeholder `:__block__` node as `:inner_comments`.
"""
@spec string_to_quoted(List.Chars.t(), keyword) ::
{:ok, Macro.t()} | {:error, {location :: keyword, binary | {binary, binary}, binary}}
def string_to_quoted(string, opts \\ []) when is_list(opts) do
file = Keyword.get(opts, :file, "nofile")
line = Keyword.get(opts, :line, 1)
column = Keyword.get(opts, :column, 1)
include_comments? = Keyword.get(opts, :include_comments, false)

Process.put(:code_formatter_comments, [])
opts =
if include_comments? do
[
preserve_comments: &preserve_comments/5,
token_metadata: true,
] ++ opts
else
opts
end

case :elixir.string_to_tokens(to_charlist(string), line, column, file, opts) do
{:ok, tokens} ->
:elixir.tokens_to_quoted(tokens, file, opts)
with {:ok, tokens} <- :elixir.string_to_tokens(to_charlist(string), line, column, file, opts),
{:ok, quoted} <- :elixir.tokens_to_quoted(tokens, file, opts) do
if include_comments? do
quoted = Code.Normalizer.normalize(quoted)
quoted = Code.Comments.merge_comments(quoted, Process.get(:code_formatter_comments))

{:ok, quoted}
else
{:ok, quoted}
end
else
{:error, _error_msg} = error ->
error
end
Expand All @@ -1275,7 +1359,27 @@ defmodule Code do
file = Keyword.get(opts, :file, "nofile")
line = Keyword.get(opts, :line, 1)
column = Keyword.get(opts, :column, 1)
:elixir.string_to_quoted!(to_charlist(string), line, column, file, opts)
include_comments? = Keyword.get(opts, :include_comments, false)

Process.put(:code_formatter_comments, [])

opts =
if include_comments? do
[
preserve_comments: &preserve_comments/5,
token_metadata: true,
] ++ opts
else
opts
end

quoted = :elixir.string_to_quoted!(to_charlist(string), line, column, file, opts)

if include_comments? do
Code.Comments.merge_comments(quoted, Process.get(:code_formatter_comments))
else
quoted
end
end

@doc """
Expand Down
Loading