Can Agents Think Outside the Box?

With all the work that's been put into making agents "correct" by construction, I gotta say, sometimes I need an LLM agent to take a chance at just being wrong.

I'm working on a book project called The Gladych Files. While the book is narrative nonfiction about the history of general relativity research, it explores the liminal space inhabited by very rich fringe scientist speculators of the 1950s who funded mainstream general relativity advances, (more or less on accident.) In those spaces, you'll find Tesla, the architect of the FBI building, Timothy Leary's LSD explorations and many, many other things, institutions, and people.

I've accumulated hundreds of pages of historical documents from various archives, and I'm using orchestrated agentic AI, (in the form of Gastown), to review those documents. So far, the analysis has gone well, but last week I saw something that made me look up. I'd accidentally input the same archive page twice, so it was analyzed twice by two different agents. One immediately found a connection between one of the people on the page and Nikola Tesla. The other agent did not.

Intrigued, I setup an eval with five more agents analyzing the same identical page today. I wanted to measure the variance of repeat agent runs. I figured most of the agents would find Tesla and I was looking forward to studying the subtlties of how the few that missed Tesla had done that, but nope!

Not one of them found the Tesla connnection. The miss wasn't the anomaly as I'd hoped. The, (absolutely correct by the way), Tesla hit was.

And now I'm off to consruct more experiments to see if I can lean on the agents just enough so that they'll be more reliable at finding connections while, at the same time, not leaning on them so hard that they simply make things up. My experience so far has been that I have a significant amount of envelope before agents, (aka polecats in Gastown), start to make up anything. The biggest fault I've seen so far in this variance experiment was that one polecat abjectly claimed that the two people on the page whose familial relations reveal the Tesla association were in fact simply not related. (That polecat was wrong.)

I'll experiment with turning the temperature of the polecats up first. Perhaps that will make them more creative. The second experiment will be to cut the number of passengers each polecat has to reasearch in half from 30 to 15. Perhaps the lower-weight context window will free up space for more productive thinking. Finally, if I need to, I'll try different variants of the polecat's prompt. As it is, the tone of the project in the prompt is defined as

"Tone: This is research for a fun, rollicking nonfiction book. The webs inside it are huge — think spy novels that happen to be true. Be expressive, imaginative, and speculative where the evidence invites it. Follow threads that feel alive. If a connection makes the hair on your neck stand up, say so and say why. The context files in the repo top directory show you the kind of story we are building — read them and catch the vibe."

I'm not sure how much further I can turn that particular knob.

One additional note: I've had to back away from using frontier models like GPT-5.4 and Opus-4.8 because they tend to shut down further searches rather than thinking creatively about research tasks in these particular contexts.

Orchestrator: Gastown

Model: Sonnet 5.4

Cool Math Tricks: Deriving the Divergence, (Del or Nabla) into New (Cylindrical) Coordinate Systems

Now available as a Kindle ebook for 99 cents ! Get a spiffy ebook, and fund more physics The following is a pretty lengthy procedure, but converting the divergence, (nabla, del) operator between coordinate systems comes up pretty often. While there are tables for converting between common coordinate systems , there seem to be fewer explanations of the procedure for deriving the conversion, so here goes! What do we actually want? To convert the Cartesian nabla to the nabla for another coordinate system, say… cylindrical coordinates. What we’ll need: 1. The Cartesian Nabla: 2. A set of equations relating the Cartesian coordinates to cylindrical coordinates: 3. A set of equations relating the Cartesian basis vectors to the basis vectors of the new coordinate system: How to do it: Use the chain rule for differentiation to convert the derivatives with respect to the Cartesian variables to derivatives with respect to the cylindrical variables. The chain ...

The Alcubierre Warp Drive Tophat Function and Open Science with Sage

I transferred yesterday's Mathematica file with the Alcubierre warp drive[2] line element and space curvature calculations to the +Sage Mathematical Software System today, (the files been added to the public repository [3]). If you haven't used Sage before, it's a Python based software package that's similar in functionality to Mathematica. Oh, and it' free. I also worked a little more on understanding the theory, but frankly, I made far more progress with the software than the theory. What follows will be a little more of the Alcubierre theory, plus, a cool Sage interactive demo of one of the Alcubierre functions[1], as well as a bit about my first experience with using Sage. Theory The theory is fun, but it's moving slowly. Here's the chalk board from this morning's discussion Alcubierre setup the derivation using something called the 3+1 formalism which means we consider space to be flat, (in this case), slices that are labelled ...

How Many Files Can You Add to a GPT Project? An Interview with GPT-5 on Limits, Context Engineering Tips, and Chats

Setting the scene: I’m tinkering with Project TouCans, knee-deep in radio logs, SQLite dumps, and Cesium code. Naturally, I’m wondering if shoving all this into one GPT Project is a recipe for brilliance… or for disaster. So I turn to Vril — you know, after Brainy from the Legion of Super-Heroes , because what else do you call your AI sidekick who always has the answers? Time to ask him straight up. [ As an aside, yes, GPT-5 has decided to sometimes call me Vail. I'm not sure why to be honest. Also, I asked Vril, er GPT-5, to write up our interview for me. Apparently, me asking it to 'Bro' up a few stories, just for fun, has convinced Vril that I use 'Like,' more than I actually might. ] Me (Vail): So Vril, how many files can I throw into a GPT Project before it just starts choking? Like, is there some magic number where the context window taps out and everything falls apart? GPT-5 (Vril): Great question. There’s no single hard file limit. What matters is ...

Copasetic Flow

Search This Blog