NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Epicycles All the Way Down (2025) (strangeloopcanon.com)
OutOfHere 51 minutes ago [-]
The force equation example is disturbing, but it's easy to prevent by disallowing the inclusion of random decimal numbers in the formula, with the latter also suggesting over-fitting to the data. It is immediately obvious that such numbers make the equation inelegant and therefore likely to be wrong. If you're going to use symbolic construction, be careful in what formulations you allow, also having an appropriate penalty for complexity.

As for chess, although an LLM knows the rules of chess, it is not expected to have been trained on many optimal chess games. As such, is it fair to gauge its skill in chess, especially without showing it generated images of its candidate moves? Even if representational and training limitations were addressed, we know that LLMs are architecturally crippled in that they have no neural memory beyond their context. Imagine a next-gen LLM that if presented with a chess puzzle would first update its internal weights for playing optimal chess via a simulation of a billion games, and then return to address the puzzle you gave it. Even with the current arch, it could equivalently create a fork of itself for the same purpose, a new trained model in effect, but the rushing human's desire for wanting the answer immediately comes in the way.

suddenlybananas 31 minutes ago [-]
>As for chess, although an LLM knows the rules of chess, it is not expected to have been trained on many optimal chess games

Well, it's read every book ever written on chess so you would expect it to be at least half-way decent.

ogogmad 2 hours ago [-]
The recent news of multiple solutions to Erdos problem 1196 produced by LLMs without any human help, makes any suggestion that LLMs have hit a wall in reasoning seem less credible. To give you an idea, problem 1196 had been worked on by different experts for years. Now suddenly, LLMs have come along and solved the problem in a multitude of ways. Perhaps LLMs will eventually stall, but this paradigm still has some juice left to squeeze.
DarkNova6 1 hours ago [-]
But are we talking pure LLMs, or existing AI solvers augmented with LLMs? Because while the latter is impressive, it doesn't state much outside of this specific domain.

If anything, I see greater verticality of specialized software that is using LLMs at their core, but with much aid and technology around it to really make the most out of it.

FrustratedMonky 51 minutes ago [-]
"are we talking pure LLMs, or existing AI solvers augmented with LLM"

Why do these distinctions matter?

is it an LLM, or symbolic, or a combo, or a dozen technologies stitched together. Who cares. It is all automation. It is all artificial.

DarkNova6 10 minutes ago [-]
True, it's an achievement either way. But if an "out of the box LLM" can solve difficult math problems it is an achievement by the LLM vendor. Otherwise it is an achievement by the people doing the vertical integration.

In the context of evolving LLM this is the crucial distinction.

suddenlybananas 29 minutes ago [-]
The distinctions matter since computational proofs have been around for decades.
throwaway210426 2 hours ago [-]
Needs a “[November 2025]” title. It is already outdated
suddenlybananas 2 hours ago [-]
Why?
throwaway210426 1 hours ago [-]
It was silly at the time but even sillier now (eg see other comment on Erdos 1196)
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 15:15:37 GMT+0000 (Coordinated Universal Time) with Vercel.