Advent-of-Code 2022: ChatGPT Edition

OpenAI's ChatGPT model came out today (November 30), one day before the start of Advent-of-Code 2022. I thought it would be interesting to let ChatGPT solve each day's puzzle and see how close it gets to the correct solution. Each day, I'll use ChatGPT's output as a starting point and modify it minimally until it works. I think this is an interesting task for a few reasons:

Advent-of-Code puzzles are generally unique and worded with lots of natural language, so it's unlikely that the model has seen similar inputs before.
Advent-of-Code is a decent proxy for measuring basic programming skills.
The multiple parts to each puzzle make it a good test of the model's ability to generalize to updated instructions and retain context. That said, I'll only do part 1 of each puzzle to keep things simple, especially because many of the same errors appear in both parts, which makes part 2 less interesting.

Prompting ChatGPT

Each day, I'll copy the prompt from the puzzle as-is and see what happens. Based on days 1 and 2, the prompts generally work well, but I've had to make a few modifications, like telling it to read from a file or spelling out the input format. I'm trying to give ChatGPT the best odds of solving the problem correctly so that I measure the model's capabilities on the algorithmic portion of the problem and not on other tedious tasks.

Usage

You can run main.py and enter the day number to run both ChatGPT's solution and the corrected solution for that day.

Each day's solutions are located in solutions/day-XX: chatgpt.py is ChatGPT's solution and correct.py is the corrected version. You can find the corresponding input files in inputs/day-XX.

Recap

Day	Puzzle	Errors
1	Calorie Counting	Split newlines incorrectly in the input file.
2	Rock Paper Scissors	Read input file incorrectly. Off-by-one error. Used dictionary keys that didn't match the prompt.
3	Rucksack Reorganization	Worked perfectly.
4	Camp Cleanup	Split input on hyphens instead of commas. Loop iteration should have been pairwise. Indexed second element in a pair wrong.
5	Supply Stacks	Too hard. Couldn't get past parsing the input.

Details

Day 1: Calorie Counting

The first error I ran into was a ValueError: invalid literal for int() with base 10: ''. The model was splitting the input on newlines incorrectly, which caused it to read an empty string as a number. This was easily fixed by splitting the input on \n\n instead of \n.
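As a rough illustration of that fix (not the exact code in correct.py), here is a minimal sketch that splits on blank lines first and only then converts each line to an integer. The function name max_calories and the inline sample string are assumptions made for this example; the sample values are the ones from the puzzle description.

```python
def max_calories(text: str) -> int:
    """Return the largest per-elf calorie total (part 1)."""
    # Elves are separated by blank lines, so split on "\n\n" first.
    # Splitting on "\n" alone leaves '' entries, and int('') raises
    # the ValueError described above.
    groups = text.strip().split("\n\n")
    # Inside a group, every line is a single integer.
    totals = [sum(int(line) for line in group.splitlines()) for group in groups]
    return max(totals)


# Sample input from the puzzle description, inlined for illustration.
sample = "1000\n2000\n3000\n\n4000\n\n5000\n6000\n\n7000\n8000\n9000\n\n10000\n"
print(max_calories(sample))  # 24000
```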

Day 2: Rock Paper Scissors

Right off the bat, ChatGPT seemed confused about how the strategy guide works, so I had to prompt it a bit to explain that the dictionary keys should be A, B, and C instead of Rock, Paper, and Scissors. After that, I hit a KeyError: ' ', which was caused by how it read in the input. Then there were a few minor errors, like an off-by-one in the number of characters to read per line (5 instead of 4), which resulted in an IndexError. ChatGPT also forgot that A corresponds to X and so on, so I had to add an extra map for that. After that, it worked perfectly.
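To make those fixes concrete, here is a hedged sketch of a part 1 scorer. It is not the code from correct.py; the names SHAPE_SCORE, MINE_TO_SHAPE, BEATS, round_score, and total_score are all made up for this example. It keys everything on A, B, and C, adds the extra X/Y/Z map, and uses line.split() so that neither the space nor a trailing newline ends up as a dictionary key.

```python
# Score for the shape you play: Rock (A) = 1, Paper (B) = 2, Scissors (C) = 3.
SHAPE_SCORE = {"A": 1, "B": 2, "C": 3}
# The "extra map": the second column uses X, Y, Z for the same three shapes.
MINE_TO_SHAPE = {"X": "A", "Y": "B", "Z": "C"}
# Each key beats its value (Rock beats Scissors, and so on).
BEATS = {"A": "C", "B": "A", "C": "B"}


def round_score(line: str) -> int:
    """Score one strategy-guide line such as 'A Y'."""
    # split() handles the separating space and any trailing newline,
    # avoiding fixed character offsets into the line.
    theirs, mine_code = line.split()
    mine = MINE_TO_SHAPE[mine_code]
    if mine == theirs:
        outcome = 3  # draw
    elif BEATS[mine] == theirs:
        outcome = 6  # win
    else:
        outcome = 0  # loss
    return SHAPE_SCORE[mine] + outcome


def total_score(text: str) -> int:
    """Sum the scores of all non-empty lines."""
    return sum(round_score(line) for line in text.splitlines() if line.strip())


# Sample strategy guide from the puzzle description.
print(total_score("A Y\nB X\nC Z\n"))  # 15
```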

Day 3: Rucksack Reorganization

Not sure what to say here because it just worked perfectly on the first try. No notes!

Day 4: Camp Cleanup

I prompted it a bit to get it to parse the input but couldn't get it to work all the way, so I added a line that splits on the comma in addition to the hyphen in order to separate the two ranges in each pair. Another issue was that the code double counted because it wasn't iterating by pairs, so I changed the iteration to range(0, len(input), 2). Finally, for some reason it indexed the second element of the pair with pairs[1 - i], which I changed to pairs[i + 1]. That was the last bug.
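The sketch below shows one way those three fixes could fit together. It assumes, based on the description above, that the ranges end up in a single flat list called pairs, so that entries i and i + 1 form one pair; the function name count_full_overlaps is made up for this example, and the actual correct.py may be structured differently.

```python
def count_full_overlaps(text: str) -> int:
    """Count pairs where one range fully contains the other (part 1)."""
    pairs = []  # flat list: entries 2k and 2k + 1 belong to the same line
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the comma first to separate the two ranges,
        # then on the hyphen to get each range's endpoints.
        for part in line.strip().split(","):
            low, high = part.split("-")
            pairs.append((int(low), int(high)))

    count = 0
    # Step by 2 so each pair is visited exactly once (no double counting).
    for i in range(0, len(pairs), 2):
        first, second = pairs[i], pairs[i + 1]
        first_contains = first[0] <= second[0] and second[1] <= first[1]
        second_contains = second[0] <= first[0] and first[1] <= second[1]
        if first_contains or second_contains:
            count += 1
    return count


# Sample input from the puzzle description.
sample = "2-4,6-8\n2-3,4-5\n5-7,7-9\n2-8,3-7\n6-6,4-6\n2-6,4-8\n"
print(count_full_overlaps(sample))  # 2
```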

Day 5: Supply Stacks

This day actually threw me for a loop because parsing the input is so challenging. After spending a while on prompting, I wasn't able to produce anything meaningful. I might actually stop after today if the puzzles keep getting harder because I don't want to spend all this time messing with input parsing. This highlights one of the model's limitations well: we still need human input to coax the model into understanding how to break down a hard problem.

ishan0102/aoc-2022-chatgpt