Replies: 1 comment 2 replies
-
@RXZAN what 'print("score-->", score)' print? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Checked other resources
Commit to Help
Example Code
Description
When I use RAG evaluation, sometimes the evaluation score is empty, and sometimes the evaluation score is normal. When I look at the output in Langsmith, I find that there seems to be a problem with the output format. How can I solve this problem so that I can get the correct score format reply.
Format for correct output:
{
"Explanation": "The student answer does not contain any conflicting statements. Although it mentions a plausible explanation for why the script address might be related to the second step, it is factually accurate relative to the ground truth answer.",
"Score": "1"
}
Output format when errors occur:
To grade this student answer, I need to follow the given criteria.
Step 1: The first criterion is to grade the student answers based ONLY on their factual accuracy relative to the ground truth answer.
The student answer does not mention any specific parameters, but instead asks for clarification or more information. This means that the student's answer does not contain any conflicting statements with the ground truth.
Step 2: According to the second criterion, I need to ensure that the student answer does not contain any conflicting statements.
Since the student's answer does not claim anything about the parameters, it is safe from conflicting statements.
Step 3: The third criterion states that it is OK if the student answer contains more information than the ground truth answer, as long as it is factually accurate relative to the ground truth answer.
In this case, the student's answer contains less information than the ground truth, but what it does contain (asking for clarification) is indeed factually accurate.
Based on these steps and the criteria provided, I can now assign a score.
The student's answer does not provide any specific parameters and instead asks for more information. This means that their answer does meet all of the criteria mentioned in the prompt.
However, it seems like there was an expectation of providing actual parameters, which the student failed to do. But since they didn't claim anything conflicting or extra that wasn't accurate, I can give them a score based on this understanding.
Score: 0
System Info
Beta Was this translation helpful? Give feedback.
All reactions