We provide the official Keras implementation of training Semantic Genesis from scratch on unlabeled images as well as the usage of the pre-trained Semantic Genesis reported in the following paper:
Learning Semantics-enriched Representation via Self-discovery, Self-classification, and Self-restoration
Fatemeh Haghighi1, Mohammad Reza Hosseinzadeh Taher1, Zongwei Zhou1, Michael B. Gotway2, Jianming Liang1
Arizona State University1, Mayo Clinic 2
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2020
Paper | Code | Poster | Slides | Graphical abstract | Talk (YouTube, YouKu)
Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-supervised Learning
Fatemeh Haghighi1, Mohammad Reza Hosseinzadeh Taher1,Zongwei Zhou1,Michael B. Gotway2, Jianming Liang1
Arizona State University1, Mayo Clinic, 2
IEEE Transactions on Medical Imaging (TMI)
paper | code
- Linux
- Python 3.7.5
- Keras 2.2.4+
- TensorFlow 1.14.0+
$ git clone https://github.com/fhaghighi/SemanticGenesis.git
$ cd SemanticGenesis/
$ pip install -r requirements.txt
Download the pre-trained Semantic Genesis as following and save into ./keras/Checkpoints/semantic_genesis_chest_ct.h5
directory.
Backbone | Platform | model | |
---|---|---|---|
Semantic Genesis | U-Net 3D | Keras | download |
Semantic Genesis learns a generic semantics-enriched image representation that can be leveraged for a wide range of target tasks. Specifically, Semantic Genesis provides a pre-trained U-Net network, which the encoder can be utilized for the target classification tasks and encoder-decoder for the target segmentation tasks.
As for the target classification tasks, the 3D deep model can be initialized with the pre-trained encoder using the following example:
# prepare your own data
X, y = your_data_loader()
# prepare the 3D model
import keras
from models.ynet3d import *
input_channels, input_rows, input_cols, input_deps = 1, 64, 64, 32
num_class, activate = 2, 'softmax'
weight_dir = './keras/Checkpoints/semantic_genesis_chest_ct.h5'
semantic_genesis = ynet_model_3d((input_channels, input_rows, input_cols, input_deps), batch_normalization=True)
print("Load pre-trained Semantic Genesis weights from {}".format(weight_dir))
semantic_genesis.load_weights(weight_dir)
x = semantic_genesis.get_layer('depth_7_relu').output
x = keras.layers.GlobalAveragePooling3D()(x)
output = keras.layers.Dense(num_class, activation=activate)(x)
model = keras.models.Model(inputs=semantic_genesis.input, outputs=output)
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy","categorical_crossentropy"])
# train the model
model.fit(X, y)
As for the target segmentation tasks, the 3D deep model can be initialized with the pre-trained encoder-decoder using the following example:
# prepare your own data
X, Y = your_data_loader()
# prepare the 3D model
from unet3d import *
from models.ynet3d import *
input_channels, input_rows, input_cols, input_deps = 1, 64, 64, 32
num_class = 2
weight_dir = './keras/Checkpoints/semantic_genesis_chest_ct.h5'
semantic_genesis = ynet_model_3d((input_channels, input_rows, input_cols, input_deps), batch_normalization=True)
print("Load pre-trained Semantic Genesis weights from {}".format(weight_dir))
semantic_genesis.load_weights(weight_dir)
model = unet_model_3d((1,config.input_rows,config.input_cols,config.input_deps), batch_normalization=True)
for layer in tuple(model.layers):
if "input" not in layer.name and "max_pooling3d" not in layer.name \
and "up_sampling3d" not in layer.name and "concatenate_" not in layer.name \
and "conv3d_1" not in layer.name and "activation" not in layer.name \
and not layer.name.startswith("conv3d"):
layer.set_weights(semantic_genesis.get_layer(layer.name).get_weights())
models.compile(optimizer="adam", loss=dice_coef_loss, metrics=[mean_iou,dice_coef])
# train the model
model.fit(X, Y)
$ git clone https://github.com/fhaghighi/SemanticGenesis.git
$ cd SemanticGenesis/
$ pip install -r requirements.txt
For your convenience, we have provided our own self-discoverd 3D anatomical patterns from LUNA16 dataset as well as their pseudo labels.
Download the data from this repository and put it in the self_discovery/Semantic_Genesis_data/
directory. We have provided the training and validation samples for C=50 classes of anatomical patterns. In our study, we trained Semantic Genesis using C=44 classes. For each sample of anatomical pattern, we have extracted 3 multi-resolution cubes from each patient, where each of the three resolutions are saved in files named as 'train_dataN_vwGen_ex_ref_fold1.0.npy', N=1,2,3. For each 'train_dataN_vwGen_ex_ref_fold1.0.npy' file, there is a corresponding 'train_labelN_vwGen_ex_ref_fold1.0.npy' file, which contains the pseudo labels of the discovered anatomical patterns.
- The processed anatomical patterns directory structure
Semantic_Genesis_data/
|-- train_data1_vwGen_ex_ref_fold1.0.npy : training data - resolution 1
|-- train_data2_vwGen_ex_ref_fold1.0.npy : training data - resolution 2
|-- train_data3_vwGen_ex_ref_fold1.0.npy : training data - resolution 3
|-- val_data1_vwGen_ex_ref_fold1.0.npy : validation data
|-- train_label1_vwGen_ex_ref_fold1.0.npy : training labels - resolution 1
|-- train_label2_vwGen_ex_ref_fold1.0.npy : training labels - resolution 2
|-- train_label3_vwGen_ex_ref_fold1.0.npy : training labels - resolution 3
|-- val_label1_vwGen_ex_ref_fold1.0.npy : validation labels
Step 1: Divide your training data into the train and validation folders, and put them in the dataset
directory.
Step 2: Train an auto-encoder using your data. The pre-trained model will be saved into self_discovery/Checkpoints/Autoencoder/
directory.
python -W ignore self_discovery/train_autoencoder.py
--data_dir dataset/
Step 3: Extract and save the deep features of each patient in the dataset using the pre-trained auto-encoder:
python -W ignore self_discovery/feature_extractor.py
--data_dir dataset/
--weights self_discovery/Checkpoints/Autoencoder/Unet_autoencoder.h5
Step 4: Extract 3D anatomical patterns from train and validation images. The data and their labels will be save into self_discovery/Semantic_Genesis_data
directory.
python -W ignore self_discovery/pattern_generator_3D.py
--data_dir dataset/
--multi_res
python -W ignore keras/train.py
--data_dir self_discovery/Semantic_Genesis_data
Your pre-trained Semantic Genesis will be saved at ./keras/Checkpoints/semantic_genesis_chest_ct.h5
.
If you use our source code and/or refer to the baseline results published in the paper, please cite our paper by using the following BibTex entry:
@InProceedings{haghighi2020learning,
author="Haghighi, Fatemeh and Hosseinzadeh Taher, Mohammad Reza and Zhou, Zongwei and Gotway, Michael B. and Liang, Jianming",
title="Learning Semantics-Enriched Representation via Self-discovery, Self-classification, and Self-restoration",
booktitle="Medical Image Computing and Computer Assisted Intervention -- MICCAI 2020",
year="2020",
publisher="Springer International Publishing",
address="Cham",
pages="137--147",
isbn="978-3-030-59710-8",
url="https://link.springer.com/chapter/10.1007%2F978-3-030-59710-8_14"
}
@misc{haghighi2021transferable,
title={Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-supervised Learning},
author={Fatemeh Haghighi and Mohammad Reza Hosseinzadeh Taher and Zongwei Zhou and Michael B. Gotway and Jianming Liang},
year={2021},
eprint={2102.10680},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
We thank Fatemeh Haghighi and Mohammad Reza Hosseinzadeh Taher for their implementation of Semantic Genesis in Keras. Credit to Models Genesis by Zongwei Zhou. We build 3D U-Net architecture by referring to the released code at ellisdg/3DUnetCNN. This is a patent-pending technology.