Move token_stream_parse implementation to Wasm #18

dtolnay · 2019-10-31T07:34:57Z

watt::sym::token_stream_parse exists because I was too lazy to implement a Rust syntax tokenizer in wasm (i.e. copy it from proc-macro2). We could likely get a slight speedup by running the tokenizer in wasm as well.

We'll need to import something like https://github.com/alexcrichton/proc-macro2/blob/1.0.6/src/strnom.rs and use that to implement token stream parsing rather than calling out to a host func.

It's possible we can simplify the implementation from proc-macro2 by omitting the parts that deal with parsing comments, as those won't appear in our use case.
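
To make the "no comment handling" idea concrete, here is a minimal sketch of a tokenizer loop that has no comment branch at all. This is not the strnom code from proc-macro2: the `Token` enum, `lex`, and `take_while` are hypothetical names made up for illustration, and string/char literals, lifetimes, floats, and multi-character punctuation are deliberately left out.

```rust
use std::iter::Peekable;
use std::str::Chars;

/// Hypothetical token kinds for this sketch; not the proc-macro2 or watt types.
#[derive(Debug, PartialEq)]
enum Token {
    Ident(String),
    Number(String),
    Punct(char),
    Open(char),
    Close(char),
}

/// Consume characters while `pred` holds and return them as a String.
fn take_while(chars: &mut Peekable<Chars<'_>>, pred: impl Fn(char) -> bool) -> String {
    let mut out = String::new();
    while let Some(&c) = chars.peek() {
        if !pred(c) {
            break;
        }
        out.push(c);
        chars.next();
    }
    out
}

/// Lex `src` into a flat token list. There is no branch for `//` or `/* */`:
/// the assumption is that comments never reach this function.
fn lex(src: &str) -> Result<Vec<Token>, String> {
    let mut tokens = Vec::new();
    let mut chars = src.chars().peekable();
    while let Some(&c) = chars.peek() {
        if c.is_whitespace() {
            chars.next();
        } else if c.is_alphabetic() || c == '_' {
            let ident = take_while(&mut chars, |d| d.is_alphanumeric() || d == '_');
            tokens.push(Token::Ident(ident));
        } else if c.is_ascii_digit() {
            let digits = take_while(&mut chars, |d| d.is_ascii_digit());
            tokens.push(Token::Number(digits));
        } else if "([{".contains(c) {
            chars.next();
            tokens.push(Token::Open(c));
        } else if ")]}".contains(c) {
            chars.next();
            tokens.push(Token::Close(c));
        } else if c == '"' || c == '\'' {
            // Out of scope for this sketch; a real lexer must handle these.
            return Err("string and char literals are not handled".to_string());
        } else if c.is_ascii_punctuation() {
            chars.next();
            tokens.push(Token::Punct(c));
        } else {
            return Err(format!("unexpected character {:?}", c));
        }
    }
    Ok(tokens)
}
```

With this sketch, `lex("a + (b1)")` yields an identifier, a punctuation token, an opening delimiter, another identifier, and a closing delimiter. A real implementation would also need to build nested groups and track spans.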


alexcrichton · 2019-10-31T13:15:57Z

One thing I should note on this is that I was surprised that #[derive(Serialize)] called this function, but then I remembered this is part of quote!, which suggests it may be relatively hot if quote! calls it a lot. In that sense I think we could perhaps speed things up by a mild amount, but as the linked post shows it's only 1-2ms for a massive macro.

mystor · 2019-10-31T15:32:06Z

Does that include time spent running watt::sym::token_stream_serialize to get the parsed data into wasm? Running parsing in wasm should eliminate a bunch of calls to these other methods, which might increase the impact a bit.

In addition, given the higher minimum rustc version for watt, it may be reasonable to use rustc_lexer as the lexer within wasm, probably based on the work in dtolnay/proc-macro2#202. I'm guessing it's been optimized a bit more than strnom.

alexcrichton · 2019-10-31T16:24:24Z

Oh right, that's an excellent point! Yes, the timing information for watt::sym::token_stream_parse includes neither the time spent in watt::sym::token_stream_serialize nor the time to deserialize inside of wasm itself. Those would all largely be avoided if we move this to wasm. While still likely to be modest, I think it'd be a clear win speed-wise.
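
As a rough illustration of why timing only the parse call undercounts, here is a toy model of the three steps (parse on the host, serialize, deserialize on the other side), each timed separately. Everything here is a stand-in: `host_parse`, `serialize`, `deserialize`, `timed`, and `round_trip` are hypothetical names, the "tokens" are just whitespace-separated strings, and the byte format is invented for the sketch. None of it reflects watt's actual wire format or API.

```rust
use std::time::{Duration, Instant};

/// Run `f` and return its result together with the elapsed wall-clock time.
fn timed<T>(f: impl FnOnce() -> T) -> (T, Duration) {
    let start = Instant::now();
    let out = f();
    (out, start.elapsed())
}

/// Stand-in for parsing on the host: split on whitespace.
fn host_parse(src: &str) -> Vec<String> {
    src.split_whitespace().map(str::to_owned).collect()
}

/// Stand-in for serialization: length-prefixed UTF-8 strings (invented format).
fn serialize(tokens: &[String]) -> Vec<u8> {
    let mut buf = Vec::new();
    for t in tokens {
        buf.extend_from_slice(&(t.len() as u32).to_le_bytes());
        buf.extend_from_slice(t.as_bytes());
    }
    buf
}

/// Stand-in for deserialization inside wasm; assumes well-formed input.
fn deserialize(mut bytes: &[u8]) -> Vec<String> {
    let mut out = Vec::new();
    while bytes.len() >= 4 {
        let (len, rest) = bytes.split_at(4);
        let n = u32::from_le_bytes([len[0], len[1], len[2], len[3]]) as usize;
        let (body, rest) = rest.split_at(n);
        out.push(String::from_utf8(body.to_vec()).expect("valid UTF-8"));
        bytes = rest;
    }
    out
}

/// Time each step separately; the cost of the host-side path is the sum of
/// all three durations, not just the first one.
fn round_trip(src: &str) -> (Vec<String>, [Duration; 3]) {
    let (tokens, t_parse) = timed(|| host_parse(src));
    let (bytes, t_ser) = timed(|| serialize(&tokens));
    let (decoded, t_de) = timed(|| deserialize(&bytes));
    (decoded, [t_parse, t_ser, t_de])
}
```

Summing the three durations returned by `round_trip` gives the figure that would be comparable to an in-wasm parse; looking at the first entry alone corresponds to the measurement discussed above.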


mystor · 2019-11-04T16:34:08Z

@dtolnay benchmarked an impl of this in #26 (review), and found it had equivalent (or worse) performance, even on large inputs. I'm inclined to think that we should leave token_stream_parse implemented by the host runtime.

dtolnay added the "help wanted" (Extra attention is needed) label Oct 31, 2019

mystor added a commit to mystor/watt that referenced this issue Nov 2, 2019

Handle lexing TokenStreams from within wasm code

c4e626a

Fixes dtolnay#18

mystor mentioned this issue Nov 2, 2019

Lex TokenStreams from within wasm #26

Closed

dtolnay closed this as completed Nov 4, 2019
