yiw008 / nondet-project Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Evaluating Fine-tuned Generative LLM on Detection of Flaky Tests

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
data		data
Finetuned_GPT_4o_mini.ipynb		Finetuned_GPT_4o_mini.ipynb
Go_Through_IPFlakies.ipynb		Go_Through_IPFlakies.ipynb
Naive_GPT_4o_mini.ipynb		Naive_GPT_4o_mini.ipynb
README.md		README.md
all_test_methods.csv		all_test_methods.csv
refined_selected_methods.csv		refined_selected_methods.csv
selected_methods.csv		selected_methods.csv

Repository files navigation

Evaluating Fine-tuned Generative LLM on Detection of Flaky Tests

Raw Data Files (All Created by Go_Through_IPFlakies.ipynb)

all_test_methods.csv: all accessible test methods in the iPFlakies dataset
selected_methods.csv: test methods of 10 selected projects in the iPFlakies dataset
refined_selected_methods.csv: all the test methods in selected_methods.csv + augmented test methods of 10 selected projects not recorded in iPFlakies dataset (assumed as non-flaky tests)
- Row: corresponding row number in https://sites.google.com/view/ipflakies (1-index)
  - -1: augmented when grabbing assumed non-flaky tests not recorded in the iPFlakies dataset
- Project_Name: name of the project with the test method
- URL: URL to the Python code file with the test method (usually copied from the iPFlakies dataset)
- New URL: URL to the raw Python code file with the exact test method (so we can directly extract the test code from this URL)
- Class: class of the test method (if applicable)
- Test: test method name
- Content: test method
- Detected: whether the test method is detected as flaky
data: data needed for testing 10 projects
- For each project p:
  - test_set.jsonl: containing all the test methods of p
  - training_set.jsonl: constructed from test methods in the other 9 projects (balanced after random oversampling)

Notebooks

Go_Through_IFixFlakies.ipynb: creating raw data files
Naive_GPT_4o_mini.ipynb: predicting test methods in test_set.jsonl by naive GPT-4o mini
Finetuned_GPT_4o_mini.ipynb: fine-tuning GPT-4o mini with training_set.jsonl & predicting test methods in test_set.jsonl by fine-tuned GPT-4o mini

About

Evaluating Fine-tuned Generative LLM on Detection of Flaky Tests

Report repository

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook 100.0%