Roadmap for v0.5.0 #79
Comments
CC @bauersimon
Seems like a PR does not make sense.
Blog post idea: misleading comments... how much does it take to confuse the most powerful AI? (credit to @ahumenberger)
Maybe not only comments. What about obfuscated code, e.g. function and variable names that are just random strings?
Take a look at https://x.com/dottxtai/status/1798443290913853770
Looking through logs... Java consistently has more code than Go for the same tasks, which yields more coverage. So a model that solves all Java tasks but no Go tasks is automatically ranked higher than a model that does the opposite.
Closed with #297
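The Java versus Go ranking skew mentioned above can be illustrated with a small sketch. It assumes, purely for illustration, that a model's score is the sum of covered statements over all solved tasks. The `score` helper and all statement counts are hypothetical and not taken from the actual evaluation.

```go
package main

import "fmt"

// score sums the covered statements of all solved tasks.
// This is an assumed, simplified stand-in for a coverage-based score.
func score(coveredPerTask []int) int {
	total := 0
	for _, covered := range coveredPerTask {
		total += covered
	}
	return total
}

func main() {
	// Hypothetical statement counts for the same three tasks in each language.
	javaTasks := []int{12, 15, 9} // the Java versions contain more code
	goTasks := []int{8, 10, 6}    // the Go versions are shorter

	// Model A solves every Java task and no Go task, model B the opposite.
	fmt.Println("model A (Java only):", score(javaTasks))
	fmt.Println("model B (Go only):", score(goTasks))
}
```

Both hypothetical models solve the same number of tasks, yet the Java-only model ends up with the higher raw score simply because its tasks contain more statements.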
The v0.5.0 release is mainly meant to introduce more variety. There are three main goals.
Tasks:
The nice thing about generating tests is that the result is easy to check automatically: the generated tests need to compile and provide 100% coverage. But one can only write such tests if one understands the source, so implicitly we are evaluating the language understanding of the LLM.
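The two criteria above (compiles, 100% coverage) can be expressed as a simple check. This is a minimal sketch: the `TestResult` type and its fields are hypothetical names for illustration, not the project's actual data model.

```go
package main

import "fmt"

// TestResult is a hypothetical summary of running one generated test file.
type TestResult struct {
	Compiles          bool // whether the generated test code compiled
	CoveredStatements int  // statements executed by the generated tests
	TotalStatements   int  // statements in the source under test
}

// Accepted reports whether the generated tests compile and cover every statement.
func (r TestResult) Accepted() bool {
	return r.Compiles && r.TotalStatements > 0 && r.CoveredStatements == r.TotalStatements
}

func main() {
	results := []TestResult{
		{Compiles: true, CoveredStatements: 4, TotalStatements: 4},  // accepted
		{Compiles: true, CoveredStatements: 3, TotalStatements: 4},  // partial coverage
		{Compiles: false, CoveredStatements: 0, TotalStatements: 4}, // does not compile
	}
	for _, r := range results {
		fmt.Println(r.Accepted())
	}
}
```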
symflower v36847
symflower test with a deeper execution coverage export (symflower v36800)
Require at least symflower v36800 #144