Source code and TeX for "User Friendly Automatic Construction of Background Knowledge: Mode Construction from ER Diagrams." K-CAP 2017
Explore further on hayesall.com.
- (2019-12-16): Harsha Kokel extended this in "Just-another Walk-ER" (GitHub, ReadTheDocs).
If you build on this code or the ideas of the paper, please consider citing.
@inproceedings{hayes2017userfriendly,
author = {Hayes, Alexander L. and Das, Mayukh and Odom, Phillip and Natarajan, Sriraam},
title = {User Friendly Automatic Construction of Background Knowledge: Mode Construction from ER Diagrams},
booktitle = {Proceedings of the Knowledge Capture Conference},
series = {K-CAP 2017},
year = {2017},
isbn = {978-1-4503-5553-7},
location = {Austin, TX, USA},
pages = {30:1--30:8},
articleno = {30},
numpages = {8},
url = {http://doi.acm.org/10.1145/3148011.3148027},
doi = {10.1145/3148011.3148027},
acmid = {3148027},
publisher = {ACM},
address = {New York, NY, USA},
}
Modes are used to restrict/guide the search space and are a powerful tool in getting relational algorithms such as BoostSRL to work. If your algorithm does not learn anything useful, then the first debug point would be the modes (in the background.txt file).
Walk-ER is a system for defining background knowledge for use in relational learning algorithms by exploring entity/attribute/relationships in Entity-Relational Diagrams. Refer to the BoostSRL Basic Modes Guide for more information about modes.
- Java 1.8
- Python (2.7, 3.5)
Download the latest version from the GitHub repository. This includes five datasets:
$ git clone https://github.com/hayesall/Walk-ER.git
- Files representing the ER-Diagrams are in the
diagrams/
directory. - Datasets used in the experiments are in the
datasets/
directory.
Walk-ER can either be invoked from a terminal.
-
Options overview (output of
python walker.py -h
):usage: WalkER_rewrite.py [-h] [-v] [--number NUMBER] [-w | -s | -e | -r | -rw] diagram_file positional arguments: diagram_file optional arguments: -h, --help show this help message and exit -v, --verbose Increase verbosity to help with debugging. --number NUMBER Select number of features to walk to (assumes that Important features are ordered from most important to least important). Defaults to number_attributes + number_relations if chosen number is greater than both. -w, --walk [Default] Walk graph from target to features. -s, --shortest Walk the graph from target to features. If there are multiple paths, take the shortest. If the shortest are equal lengths, walk both. -e, --exhaustive Walk graph from every feature to every feature. -r, --random Ignore features the user selected and walk (-w) from the target to random features. -rw, --randomwalk Walk a random path from the target until reaching a depth limit (specified with --number).
$ python walker.py -w diagrams/imdb.mayukh
//target is workedunder
mode: actor(+personid).
mode: female_gender(+personid).
mode: genre(+personid,-genreid).
mode: movie(-movieid,+personid).
mode: workedunder(+personid,+personid).
$ python walker.py -rw --number 10 diagrams/imdb.mayukh
//target is workedunder
mode: actor(+personid).
mode: female_gender(+personid).
mode: genre(+personid,-genreid).
mode: movie(+movieid,+personid).
mode: workedunder(+personid,+personid).
mode: workedunder(+personid,-personid).
mode: workedunder(-personid,+personid).
- Mayukh Das and Sriraam Natarajan gratefully acknowledge the support of the CwC Program Contract W911NF-15-1-0461 with the US Defense Advanced Research Projects Agency (DARPA) and the Army Research Office (ARO).
- Phillip Odom and Sriraam Natarajan acknowledge the support of the Army Research Office (ARO) grant number W911NF-13-1-0432 under the Young Investigator Program.
- Icon in the logo is "Trail" by Martina Krasnayovรก from the Noun Project, used under a Creative Commons (CC) Attribution 3.0 United States License.