Four-Directional-TCN-Attention-Mechanism-for-Abnormal-Heart-Sound-Detection

This is the model that I originally proposed during my undergraduate graduation theis 'Abnormal heart sound detection based on deep learning'.

With this model, my thesis was graded excellent and I was awarded the Graduation (Thesis) Innovation Award (3%).

Overview:

The existing methods for detecting abnormal heart sound are easily interfered by noise and cannot extract the weak pathological features well. In view of the above problems, we creatively propose a feature map mask learning strategy based on four-directional temporal convolutional network, so as to fully capture the potential time-frequency correlation information of heart sound signal, and use it to design a convolutional neural network with attention mechanism.
The backbone we use is the residual attention network, in which they use the traditional down sampling and upsampling strategy to construct the masking branch. Instead we use the four-directional TCN to do that.

The intuition behind the 4-D TCN are the following:

The image that we feed in the neural network is the mel spectrogram with log scale for the original heart sound, which is time related, so as the feature map in the neural network. In order to fully capture the time related information in the feature map, we use four TCN that sweeps horizontally and vertically in both directions across the feature map to fully capture the time-frequency correlation information of heart sound signal. What's more, the horizontally sweeps can enable the natwork to learn the time related information, while the vertically sweeps can help the network learn the information of energy of different frequency bands. The 4D-TCN model will create four mask, corresponding to the up, down, left, right direction.
Actually, you can plug in this softmask branch in any CNN architecture you like!

The model architecture

The 4D-TCN atchitecture

The parameter of the network architecture

For more details, please refer to the code.

How do we scan the feature map in four direction? (A quick look)

The size of the feature map: Bs, H, W, C
We use the patches of (2, 2) and can reshape the tensor into Bs, #H, psh, #W, psw, C
Then, we transpose the tensor: Bs, #H, # W, psh, psw, C   (psh, psw == 2, 2)
So the information of each path (H, W) is in (psh, psw, C)
Therefore, we can further reshape the tensor into Bs * #H, #W, psh * psw *C
This is a 3D tensor and therefore can feed into a TCN. (left scan)
If we reverse the information in the #W direction, we can get the right scan.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
model		model
4dtcn.png		4dtcn.png
README.md		README.md
param1.png		param1.png
param2.png		param2.png
tcn_masking_model.png		tcn_masking_model.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Four-Directional-TCN-Attention-Mechanism-for-Abnormal-Heart-Sound-Detection

This is the model that I originally proposed during my undergraduate graduation theis 'Abnormal heart sound detection based on deep learning'.

With this model, my thesis was graded excellent and I was awarded the Graduation (Thesis) Innovation Award (3%).

Overview:

The intuition behind the 4-D TCN are the following:

The model architecture

The 4D-TCN atchitecture

The parameter of the network architecture

How do we scan the feature map in four direction? (A quick look)

The model is implemented by tensorflow 1.x.

About

Releases

Packages

Languages

GuoshenLi/Four-Directional-TCN-Attention-Mechanism-for-Abnormal-Heart-Sound-Detection

Folders and files

Latest commit

History

Repository files navigation

Four-Directional-TCN-Attention-Mechanism-for-Abnormal-Heart-Sound-Detection

This is the model that I originally proposed during my undergraduate graduation theis 'Abnormal heart sound detection based on deep learning'.

With this model, my thesis was graded excellent and I was awarded the Graduation (Thesis) Innovation Award (3%).

Overview:

The intuition behind the 4-D TCN are the following:

The model architecture

The 4D-TCN atchitecture

The parameter of the network architecture

How do we scan the feature map in four direction? (A quick look)

The model is implemented by tensorflow 1.x.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages