logoalt Hacker News

iammjmtoday at 7:57 AM4 repliesview on HN

Why doesn’t it just call tools such as Mathematica for such operations?


Replies

ACCount37today at 11:03 AM

For the same reason you don't run "4+6" on a calculator.

External tool call has an overhead. It requires a round trip into an external tool. It requires an LLM to run in agentic autoregression - it can't be used in prefill.

Which means that having native arithmetic capabilities is useful. Forward pass arithmetics are an LLM version of quick mental math.

An LLM can read "#define SILLY_TIME_CONST (3*20*60*60*1000)" and have "SILLY_TIME_CONST is 60 h expressed as 216000000 ms" already cached by the end of the line, before it even emits its first token.

show 2 replies
defrosttoday at 8:06 AM

This is more how an LLM thinks about math internally - an LLM version of drilled tables being used for mental arithmetic "as humans do".

When humans stall on these tasks, they reach for pen and paper, a slide rule, a calculator, etc.

Mathematica is overkill for arithmetic, in addition it's licenced and can cost a bit extra.

If an LLM were to reach for a light cheap arithmetic tool something like bc would be a good first stop - a CLI tool with a language that supports arbitrary precision numbers with interactive execution of statements.

https://en.wikipedia.org/wiki/Bc_(programming_language)

jampekkatoday at 9:37 AM

They do. I asked CharGPT for 327 x 48 and it used the "ChatGPT Instruments" calculator.

Previously it used to run Python scripts, and may still do for more complex calculations.

show 1 reply
breezybottomtoday at 12:34 PM

ChatGPT does, and has since 2023