Claude code is actually rather good at this. If your initial testcase is not too big, you can use creduce or cvise.

Sadly my initial test case is binaryen which has 290 compilation units. I will try though.