Med2Vec in Pytorch

This is a re-implementation of Med2Vec [1] in Pytorch. It simply embeds clinical concepts into a distributed representation using skip-gram model with an aditional code loss.

To run the code first obtain the ADMISSION.CSV and DIAGNOSES_ICD.CSV from MIMIC-III database here.

Compile the data by running bash gen_data.sh make sure you set the correct paths to the files.

The directories are structured as follows:

./base: base trainer, data loader.
./configs: json files for experiments. This where you pass in arguments to the model/trainer and what have you.
./trainer: contains training logic, and anything that must be done to train the model.
./model: directory containing the med2vec model.

To train the model run the following:

python train_med2vec.py -c ./configs/config.json

note: make sure the directories are set appropiatly in ./configs/config.json.

[1] Choi, Edward, et al. "Multi-layer representation learning for medical concepts." Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
base		base
configs		configs
data_loader		data_loader
model		model
trainer		trainer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gen_data.py		gen_data.py
gen_data.sh		gen_data.sh
test.py		test.py
train_med2vec.py		train_med2vec.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Med2Vec in Pytorch

About

Releases

Packages

Languages

License

sajaddarabi/Med2Vec-Pytorch

Folders and files

Latest commit

History

Repository files navigation

Med2Vec in Pytorch

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages