Dude said something like "you could hook this up to a calculator". Anyone know if that is implying this generation of model could interface with some kind of symbol processor? Or is he just saying, "in theory", there could be a model that did that?
The math seems much improved, and it would be a cool trick if it were emulating a symbol processor under the hood. But humans can do that too, and we opt for calculators and computers for a reason. IMO, the first really useful thing to come from a human-machine interface would be adding a highly reliable Turing machine to your cognition.
If we could do that with one of these models, we could be reasonably confident that long proofs and the like were carried out under strict rules, and that the model wasn't falling into gut-feel, "this equation looks like x" type holes. Those holes seem like a hazard and make me very uncertain any time I see a paper about using ML to answer what are essentially logical problems.
He likely meant techniques such as ToolFormer[1], where the language model outputs a "request" (in some text syntax) that another system can parse, run, and report back on.
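Roughly, the loop looks like this. A toy sketch in Python; the [Calculator(...)] tag, the generate callback, and the eval-based calculator are placeholders of my own, not ToolFormer's actual syntax or training setup:

```python
import re

# Toy sketch of the ToolFormer-style loop: the model emits a tagged call like
# "[Calculator(1299 * 3 * 1.08)]" inside its text, an outer harness parses it,
# runs the tool, and splices the result back next to the call.
CALL_PATTERN = re.compile(r"\[Calculator\((?P<expr>[^)]+)\)\]")

def run_calculator(expr: str) -> str:
    # Stand-in for a real, trusted evaluator (a CAS, bc, WolframAlpha, ...).
    return str(eval(expr, {"__builtins__": {}}, {}))

def answer_with_tools(generate, prompt: str) -> str:
    draft = generate(prompt)  # model output containing zero or more tool calls
    return CALL_PATTERN.sub(
        lambda m: f"{m.group(0)} -> {run_calculator(m.group('expr'))}", draft)

# Canned "model" so the sketch runs without any API:
fake_generate = lambda _prompt: "That comes to [Calculator(1299 * 3 * 1.08)] dollars."
print(answer_with_tools(fake_generate, "What do three $1299 laptops cost with 8% tax?"))
```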
It can still do something similar; you just have to tell it how.
Prompt:
"CalcAI: I am a virtual calculator assistant that augments OpenAI's GPT. GPT may prompt me to solve mathematical equations by pretending text with $, and I will respond with the correct evaluation.
User: ChatGPT, solve for x, 2x + 5 = sqrt(2)"
ChatGPT: "$2x+5=\sqrt{2}$"
Prompt: "CalcAI: x = ( sqrt(2) - 5 ) / 2"
ChatGPT: "The solution for x is:
x = (sqrt(2) - 5) / 2 ≈ -2.07."
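If you wanted to automate that trick rather than play CalcAI by hand, a thin wrapper can watch the model's reply for the $-prefixed equation, hand it to a real solver, and feed the answer back as the next prompt. A rough sketch with sympy as the stand-in calculator; chat_completion is a placeholder for whatever model API you're calling, and I've skipped LaTeX parsing by assuming the plain sqrt(2) spelling:

```python
from sympy import Eq, Symbol, solve
from sympy.parsing.sympy_parser import (
    parse_expr, standard_transformations, implicit_multiplication_application)

# Rough sketch of automating the CalcAI convention above: a reply the model
# prefixes with "$" is treated as an equation for a real solver, and the
# solver's answer is fed back as the next prompt.
TRANSFORMS = standard_transformations + (implicit_multiplication_application,)

def solve_request(reply: str) -> str:
    lhs, rhs = reply.strip().strip("$").split("=")          # e.g. "2x + 5" and "sqrt(2)"
    x = Symbol("x")
    sol = solve(Eq(parse_expr(lhs, transformations=TRANSFORMS),
                   parse_expr(rhs, transformations=TRANSFORMS)), x)[0]
    return f"CalcAI: x = {sol} ≈ {float(sol):.4f}"

def chat_with_calculator(chat_completion, user_message: str) -> str:
    reply = chat_completion(user_message)
    while reply.strip().startswith("$"):                    # model is asking the calculator
        reply = chat_completion(solve_request(reply))
    return reply

print(solve_request("$2x + 5 = sqrt(2)$"))   # CalcAI: x = -5/2 + sqrt(2)/2 ≈ -1.7929
```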
Why can't calculators or WolframAlpha serve as a computational oracle for ChatGPT?
It would seem as simple as assigning probability 1 to certain recognizable queries. Maybe the difficulty is that the very problem of choosing to use a calculator entails a meta-cognitive rational decision, and it's not clear how to organize that in neural networks, which are what Turing himself called an unorganized model of computation.
Right, so the decision whether or not to inject an outside query into a particular response and then iterate on the result will be something learned by the model, and therefore meta-cognitive, as you say. Getting the model to a good balance of when to do so is an interesting problem. However, we could at least see whether the model tried a query and display the iterative steps it took. Then at least the cases where it did use a calculator would be verifiable.
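One cheap way to get that verifiability: have the wrapper keep a structured trace of every outside query and its result, so the calculator steps can be shown alongside the answer and re-checked independently of the model's prose. A hypothetical sketch; all names here are illustrative:

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical sketch: record every external query the model makes so the
# iterative steps are visible and can be replayed by an independent checker.
@dataclass
class ToolCall:
    query: str
    result: str

@dataclass
class Trace:
    calls: List[ToolCall] = field(default_factory=list)

    def record(self, tool: Callable[[str], str], query: str) -> str:
        result = tool(query)
        self.calls.append(ToolCall(query, result))
        return result

    def replay_ok(self, tool: Callable[[str], str]) -> bool:
        # Re-run each recorded query and confirm the stored result still matches.
        return all(tool(c.query) == c.result for c in self.calls)

calculator = lambda q: str(eval(q, {"__builtins__": {}}, {}))   # stand-in tool
trace = Trace()
trace.record(calculator, "(2**0.5 - 5) / 2")                    # the model's calculator query
print([(c.query, c.result) for c in trace.calls], trace.replay_ok(calculator))
```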