Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

@patatahooligan@lemmy.world

Not even close. The paper is questioning LLMs ability to reason. The article talks about fundamental flaws of LLMs and how we might need different approaches to achieve reasoning. The benchmark is only used to prove the point. It is definitely not the headline.

@rickdg@lemmy.world

Once there’s a benchmark, LLMs can optimise for it. This is just another piece of news where people call “game over” but the money poured into R&D isn’t stopping anytime soon. Wasn’t synthetic data supposed to be game over for LLMs? Its limitations have been identified and it’s still being leveraged.

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

Technology

Our Rules

Approved Bots

Apple study exposes deep cracks in LLMs’ “reasoning” capabilitiesplus-square

Apple study exposes deep cracks in LLMs’ “reasoning” capabilitiesplus-square

Technology

Our Rules

Approved Bots

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities