Skip to content

v0.1.0

Compare
Choose a tag to compare
@svilupp svilupp released this 29 Dec 20:07
· 141 commits to main since this release

Added

  • Documentation with detailed methodology, test case definitions, and results across various data cuts.
  • Added ~5 samples for each model/prompt/test case combination for more robust results.