Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use proper parser #3

Open
DavidLeoni opened this issue Jan 24, 2018 · 0 comments
Open

Use proper parser #3

DavidLeoni opened this issue Jan 24, 2018 · 0 comments

Comments

@DavidLeoni
Copy link
Member

Currently parsing seems to be done by means of regexes --> bad thing!

Instead, we should use a proper parser. Alternatives found so far:

parsoid

  • parser used for Visual editor of Wikimedia
  • works in NodeJS
  • TODO does it work in browser?
  • mainly parses to HTML

wtf_wikipedia

  • works both in NodeJS and browser
  • according to the author creates proper json structure, while Parsoid doesn't

RemexHTML

  • new official PHP parser, replaces old one called Tidy

Maybe we could use WTF to get structured json so to have an easy structure for mindmaps , while we could use Parsoid for creating HTML to display in tooltips

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant