Problem in using xlnet-base model #1

so-hyeun · 2021-02-04T03:02:05Z

Hi, I'm trying to use the DialogXL you provide, but I have a problem.

At first, I used Transformers 3.0.2 as shown in the requirement, but a Segment fault error occurred, so I used the latest version of transformers.
This solved the above error, but "UnicodeDecodeError:'utf-8' codec can't decode byte 0x80 in position 0" error occurred.
Can you tell me how to fix that error or a link to download xlnet-base-case for transformers 3.0.2?

Thanks for conducting an interesting research.

shenwzh3 · 2021-02-05T03:16:12Z

Hi, could you show me the full error description? It seems that this error usually occurs when the program reading the dataset files rather than when loading the model.

so-hyeun · 2021-02-05T04:16:25Z

Thanks for the answer.
When I used Transformers 3.0.2, the segment fault error mentioned above was resolved, but the following error occurred.

Digimonseeker · 2021-02-07T08:41:43Z

Hi, @so-hyeun, thanks for the interest in our work，we have updated the relevant files, you can kindly check it out.

so-hyeun · 2021-02-15T05:01:26Z

Thank you. It works fine for eval.py code.

By the way, when IEMOCAP is trained as the command in the Readme, it shows remarkably low performance. (Valid F-Score: 4.42 Test F-Score : 9.44 Test Acc: 23.86)
What is the cause?

Digimonseeker · 2021-02-23T14:36:27Z

We found some problems with the previous IEMOCAP files and we have now updated the relevant files. Thanks for your friendly reminder.

so-hyeun · 2021-03-08T00:32:35Z

Hello, first of all, thank you for always quick feedback.

By the way, the same phenomenon occurs when running the train using the newly uploaded IEMOCAP file. Can you check it again?
In addition, when I change line 131 of run.py to model =nn.DataParallel(model) to use 2 GPUs, an error occurs. If you have used multi-GPU, please advise.

Thank you.

tenihasina · 2021-09-03T09:06:59Z

Hello, first I'd like to thank you for working on such an interesting subject

I am also trying to use the model provided in the repository :
Training with MELD gives good results (comparable to your article), unfortunately, I also encountered the same issue as @so-hyeun where training with IEMOCAP gives bad performance.

Did you manage to find what was the issue ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem in using xlnet-base model #1

Problem in using xlnet-base model #1

so-hyeun commented Feb 4, 2021

shenwzh3 commented Feb 5, 2021

so-hyeun commented Feb 5, 2021

Digimonseeker commented Feb 7, 2021

so-hyeun commented Feb 15, 2021 •

edited

Loading

Digimonseeker commented Feb 23, 2021

so-hyeun commented Mar 8, 2021 •

edited

Loading

tenihasina commented Sep 3, 2021

Problem in using xlnet-base model #1

Problem in using xlnet-base model #1

Comments

so-hyeun commented Feb 4, 2021

shenwzh3 commented Feb 5, 2021

so-hyeun commented Feb 5, 2021

Digimonseeker commented Feb 7, 2021

so-hyeun commented Feb 15, 2021 • edited Loading

Digimonseeker commented Feb 23, 2021

so-hyeun commented Mar 8, 2021 • edited Loading

tenihasina commented Sep 3, 2021

so-hyeun commented Feb 15, 2021 •

edited

Loading

so-hyeun commented Mar 8, 2021 •

edited

Loading