What kind of bug would make machine learning suddenly 40% worse at NetHack?

NetHack is used as a testbed for machine learning experiments, and it illustrates how hard it can be to keep model performance consistent.