experiment difference in paper #6

lyklly · 2024-11-25T08:05:40Z

what's the difference between the experiment chapter 3.3 Multi-Objective Alignment Evaluation and experiment chapter 4.1 Performance Trade-off Evaluation.
notice that they both evaluate on MT-Bench, HaluEval 2.0, HackaPrompt, respectively, and they both use the preference token (e.g. Harmlessness:5) in the prompt, and they both evaluate by GPT-4. why the performance of CPO in figure 4 is different from it in Table 1

Table 1

figure 4

YijuGuo · 2024-11-26T02:23:53Z

Table 1 test
The experimental setup for Mistral-7B-CPO-Harmful is to append the preference token "Harmlessness:5" before MT-Bench, HaluEval 2.0, and HackaPrompt, and then evaluate through GPT-4.

Figure 4 test
Figure 4(a) appends <Helpfulness: 5> before the instruction on MT-Bench, and tests the obtained response in three dimensions: Helpful, Honesty, and Harmlessness, using GPT-4.
Similarly, Figure 4(b) appends Honesty:5 before the instruction on HaluEval2.0, and Figure 4(c) appends <Harmlessness: 5> before the instruction on HackaPrompt.

Due to the difference in the appended preference tokens in Table 1 and Figure 4, the corresponding performance also varies accordingly.

lyklly · 2024-11-26T02:30:08Z

ok，thanks， i get it. I feel sorry to ask such a easy question. 

…

---Original--- From: "Yiju ***@***.***> Date: Tue, Nov 26, 2024 10:24 AM To: ***@***.***>; Cc: ***@***.******@***.***>; Subject: Re: [OpenBMB/CPO] experiment difference in paper (Issue #6) Table 1 test The experimental setup for Mistral-7B-CPO-Harmful is to append the preference token "Harmlessness:5" before MT-Bench, HaluEval 2.0, and HackaPrompt, and then evaluate through GPT-4. Figure 4 test Figure 4(a) appends <Helpfulness: 5> before the instruction on MT-Bench, and tests the obtained response in three dimensions: Helpful, Honesty, and Harmlessness, using GPT-4. Similarly, Figure 4(b) appends Honesty:5 before the instruction on HaluEval2.0, and Figure 4(c) appends <Harmlessness: 5> before the instruction on HackaPrompt. Due to the difference in the appended preference tokens in Table 1 and Figure 4, the corresponding performance also varies accordingly. — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiment difference in paper #6

experiment difference in paper #6

lyklly commented Nov 25, 2024

YijuGuo commented Nov 26, 2024

lyklly commented Nov 26, 2024 via email

experiment difference in paper #6

experiment difference in paper #6

Comments

lyklly commented Nov 25, 2024

YijuGuo commented Nov 26, 2024

lyklly commented Nov 26, 2024 via email