When picking a goal or goal plan when multiple elements scored above the given threshold, we now reinsert elements that fail but with a slightly lower score than the new top of the heap element, since we have no guarantees that a goal or goal plan has a valid path to a goal plan variant.