type of text encoder #7

dt-yuhui · 2024-12-26T17:24:37Z

Hi,
I noticed that the text encoder in your paper and HF repo is "ncbi/MedCPT-Query-Encoder".
However, in your GitHub repo it is "FremyCompany/BioLORD-2023".
So which one did you finally choose?

dt-yuhui · 2024-12-26T18:11:14Z

Also, I'm interested in how to pretrain the text encoder with knowledge enhancement.
Will this code be made public?

qiaoyu-zheng · 2024-12-27T03:21:16Z

Thanks for your interest. In fact, they are similar. The text encoder in our first version is MedCPT, fine-tuned on our data. However, we later found that BioLORD is also a good choice and needs no further fine-tuning. You can use BioLORD directly with our code. I think it's an easier way to reproduce the results.
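For anyone following along, here is a minimal sketch of loading BioLORD as an off-the-shelf text encoder. It assumes the `sentence-transformers` package is installed and that the model weights can be downloaded. The helper names `embed` and `cosine` are hypothetical and are not taken from this repo, so how the embeddings plug into the training code may differ.

```python
import math

# Model ID quoted earlier in this thread.
BIOLORD = "FremyCompany/BioLORD-2023"


def embed(texts, model_name=BIOLORD):
    """Return one L2-normalized embedding (list of floats) per input string.

    Hypothetical helper, not part of this repo.
    """
    # Imported lazily so that cosine() below works without the dependency.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)  # downloads weights on first use
    return model.encode(texts, normalize_embeddings=True).tolist()


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    # Example terms are illustrative only.
    a, b = embed(["pneumonia", "lung infection"])
    print(cosine(a, b))
```

No fine-tuning step appears here because, per the reply above, BioLORD is used as-is.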

dt-yuhui · 2024-12-27T04:56:03Z

Thanks for your quick reply!

So there's no knowledge enhancement in the model if we choose "biolord" as the text encoder?
But I'm still quite interested in how to inject domain-specific knowledge into a pretrained text encoder, such as how to mix synonyms and explanations, and which loss is used for the ICD tree.

I couldn't find the relevant code in this repo. If possible, could you send this part of the code to my email: [email protected].
Thanks again :)
