-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
0f20cc1
commit e7c3a78
Showing
3 changed files
with
12 additions
and
1 deletion.
There are no files selected for viewing
11 changes: 11 additions & 0 deletions
11
2024/2024_11_04_Unlocking_the_Power_of_Spatial_and_Temporal_Information/README.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
# Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training | ||
|
||
## Abstract | ||
|
||
We will discuss pre-training multimodal vision-language models for applications in computer-aided radiology. The multimodal models we will examine are trained jointly on raw medical images and corresponding free-text radiology reports. Radiology reports, generated abundantly within typical clinical workflows, serve as a valuable source of medical image annotations but have yet to be fully leveraged in modeling efforts. | ||
|
||
I will present a [recent ICML 2024 conference paper](https://icml.cc/virtual/2024/poster/34857) that addresses this issue. I will begin with examples to illustrate the rationale for developing multimodal models in radiology and provide an overview of recent work and public dataset that form the basis of this research. Then, I will detail the paper’s main contributions: (1) extending the multimodal framework to account for multiple representations of anatomy in chest radiographs, and (2) advancing temporal modeling of longitudinal data. | ||
|
||
## Source paper | ||
|
||
[Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training](https://arxiv.org/abs/2405.19654) |
Binary file added
BIN
+2.47 MB
...tial_and_Temporal_Information/Unlocking_the_Power_of_Spatial_and_Temporal_Information.pdf
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters