typoscan is a utility that creates to-do lists for
AutoWikiBrowser (AWB) from Wikipedia dumps.
It retrieves the list of regular expressions to scan for from
http://en.wikipedia.org/wiki/WP:AWB/T, reads a dump file on
STDIN, and writes the title of every page that matches any
of the regular expressions to STDOUT. Diagnostic output is
directed to STDERR.
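The data flow described above can be sketched in Python. This
is an illustration only, not typoscan's own code: the function
names scan() and main() and the hard-coded pattern are
hypothetical, and the real tool fetches its patterns from the
WP:AWB/T page instead.

```python
import re
import sys
import xml.etree.ElementTree as ET


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}title'."""
    return tag.rsplit("}", 1)[-1]


def scan(dump, patterns):
    """Yield the title of every page whose text matches any pattern."""
    title, text = None, ""
    for _, elem in ET.iterparse(dump, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            if title is not None and any(p.search(text) for p in patterns):
                yield title
            title, text = None, ""
            elem.clear()  # drop the finished page to keep memory bounded


def main(dump=None, out=None, log=None):
    """Read a dump, print matching titles, report a count as diagnostics."""
    dump = dump or sys.stdin.buffer
    out = out or sys.stdout
    log = log or sys.stderr
    # Hypothetical stand-in for the pattern list retrieved from WP:AWB/T.
    patterns = [re.compile(r"\bteh\b")]
    count = 0
    for title in scan(dump, patterns):
        print(title, file=out)
        count += 1
    print(f"{count} matching pages", file=log)
```

Calling main() with no arguments wires the sketch to STDIN,
STDOUT and STDERR, so titles and diagnostics can be redirected
separately.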
AWB ignores some parts of an article, such as the contents
of <nowiki> tags, so the output may contain false positives:
pages that match only in text that AWB will not change.
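One way such false positives could be reduced is to drop
<nowiki> spans before matching. The sketch below is an
assumption about how a scanner might do this, not something
typoscan is documented to do; strip_nowiki() is a hypothetical
helper, and it covers only this one kind of ignored span.

```python
import re

# Non-greedy match so that each <nowiki>...</nowiki> pair is removed
# separately; DOTALL lets a span run across line breaks.
NOWIKI = re.compile(r"<nowiki>.*?</nowiki>", re.DOTALL | re.IGNORECASE)


def strip_nowiki(text):
    """Return text with every <nowiki>...</nowiki> span removed."""
    return NOWIKI.sub("", text)
```

Applying strip_nowiki() to the page text before running the
typo patterns would keep a page from being listed when its
only match sits inside such a span.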
Licence
=======
typoscan is licensed under the GNU General Public License
version 3 or later. You can find a copy of the licence in
COPYING.
The test data files (tests/*.xml, tests/typos-patterns.wiki)
are licensed under the Creative Commons
Attribution-ShareAlike 3.0 Unported License. You can find a
copy of the licence at
http://creativecommons.org/licenses/by-sa/3.0/.
For the article files (tests/*.xml), you can find the list
of contributors at
http://en.wikipedia.org/w/index.php?oldid=$REVISIONID where
$REVISIONID is the value of /page/revision/id. For
tests/typos-pattern.wiki, you can find it at
http://en.wikipedia.org/w/index.php?oldid=559776106.
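The contributor URL for an article file can be derived
mechanically from its revision id. The following is a minimal
sketch, assuming each file contains a <revision> element with
a direct <id> child; contributors_url() is a hypothetical
helper, and XML namespaces are ignored because the exact
export schema of the test files is not stated here.

```python
import xml.etree.ElementTree as ET

BASE = "http://en.wikipedia.org/w/index.php?oldid="


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}id'."""
    return tag.rsplit("}", 1)[-1]


def contributors_url(xml_source):
    """Return the oldid URL for the first revision found, or None."""
    root = ET.parse(xml_source).getroot()
    for elem in root.iter():
        if local(elem.tag) != "revision":
            continue
        # Only direct children count, so page and contributor ids are skipped.
        for child in elem:
            if local(child.tag) == "id" and child.text:
                return BASE + child.text.strip()
    return None
```

contributors_url() accepts a file name or an open file object,
so it can be applied to each of the article files in turn.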