Discussion about this post

Sebastian Hanke

I'd love to see a dedicated deep dive on AI vs climate change, covering both sides of the coin.

Tony Curzon Price

Claude 4's mess-up of arithmetic subtraction is a bit of a non-issue, isn't it? Obviously, if Claude had instead been asked, "Write a Python script to work out 9.11 - 9.9 and execute it", it would have had no trouble. It is presumably not hard for Claude to have a script running in the background with logic along the lines of: "Does this look like a maths question? If so, write a bit of code, execute it and slot in the answer." That costs a few more tokens to estimate how likely a problem is to be a maths problem, but not many. Hardly a step to full-blown reasoning.
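A minimal sketch of the kind of routing logic described above, purely for illustration. The pattern `ARITHMETIC` and the function `maybe_compute` are hypothetical names, not anything Claude is known to run, and the "does this look like maths?" check is reduced here to a simple regular expression over two numbers and one operator. `Decimal` is used because plain binary floats would not return exactly -0.79 for this subtraction.

```python
import re
from decimal import Decimal

# Hypothetical detector: two decimal numbers joined by +, -, * or /.
ARITHMETIC = re.compile(r"(-?\d+(?:\.\d+)?)\s*([-+*/])\s*(-?\d+(?:\.\d+)?)")


def maybe_compute(prompt: str):
    """Return an exact Decimal result if the prompt contains simple
    arithmetic, otherwise None (meaning: let the model answer unaided)."""
    match = ARITHMETIC.search(prompt)
    if match is None:
        return None
    left, op, right = match.groups()
    a, b = Decimal(left), Decimal(right)
    if op == "+":
        return a + b
    if op == "-":
        return a - b
    if op == "*":
        return a * b
    if b == 0:
        # Division by zero has no answer to slot in.
        return None
    return a / b


print(maybe_compute("What is 9.11 - 9.9?"))  # -0.79
print(maybe_compute("Tell me a joke"))       # None
```

A real system would need a far more robust detector than a single regular expression, which is the part of the comment's proposal that this sketch leaves open.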
