Hacker News

The wrinkle is that the AI doesn't have a truly global view, and so it slowly degrades even good structure, especially if run without human feedback and review. But you're right that good structure really helps.




Yet it still fumbles even when limiting context.

Asked it to spot-check a simple rate limiter I wrote in TS. Super basic algorithm: let one action through at most every 250ms, sleeping if necessary. It found bogus errors in my code 3 times because it failed to see that I was using a mutex to prevent reentrancy. This was about 12 lines of code in total.
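For concreteness, here's a minimal sketch of what such a mutex-guarded rate limiter might look like. This is my guess at the shape, not the commenter's actual code; the names (`RateLimiter`, `acquire`) and the promise-chain mutex are assumptions.

```typescript
const sleep = (ms: number) =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

class RateLimiter {
  private last = 0;
  // Promise chain acting as a mutex: each caller awaits the previous one,
  // so the read-modify-write of `last` can't interleave (no reentrancy).
  private lock: Promise<void> = Promise.resolve();

  constructor(private readonly intervalMs: number = 250) {}

  async acquire(): Promise<void> {
    const prev = this.lock;
    let release!: () => void;
    this.lock = new Promise<void>((r) => (release = r));
    await prev; // take the mutex
    try {
      const wait = this.last + this.intervalMs - Date.now();
      if (wait > 0) await sleep(wait); // sleep until the interval has passed
      this.last = Date.now();
    } finally {
      release(); // hand the mutex to the next caller
    }
  }
}
```

Without the `lock` chain, two concurrent `acquire()` calls could both read the same `last` and both proceed, which is exactly the reentrancy the mutex prevents.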

My rubber duck debugging session was insightful only because I had to reason through the lack of understanding on its part and argue with it.


Once you've gone through that, you might want to ask it to codify what it learned from you so you don't have to repeat it next time.

I would love to see that code.

Try again with gpt-5.3-codex xhigh.

The goalposts have been moved so many times that they’re not even on the playing field.

Nahh, just trying to make it concrete. I could just ask which model they used instead.

Try again with Opus 4.5

Try again with Sonnet 4

Try again with GPT-4.1

Here I thought these things were supposed to be able to handle twelve lines of code, but they just get worse.


I have to 1000% agree with this. In a large codebase they also miss stuff. Actually, even at 10k LOC the problems begin, UNLESS your code is perfectly designed.

But which codebase is perfect, really?


AGENTS.md is for that global view.

You can't possibly cram everything into AGENTS, and LLMs still don't give equal weight to everything in their context, i.e. they still ignore instructions.

The 'global view' doc should be in DESIGN.md so that humans know to look for it there, and AGENTS.md should point to it. Similar for other concerns. Unless something really is solely of interest to robots, it shouldn't live directly in AGENTS.md AIUI.
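A minimal sketch of that split (file names per the comment above; the contents are purely illustrative):

```
# AGENTS.md  (entry point for agents)
Read DESIGN.md first for the global view: architecture, module
boundaries, and invariants.
Robot-only notes (build/test commands, style rules) go below.

# DESIGN.md  (human-facing, linked from AGENTS.md)
The high-level architecture doc, kept where humans already look for it.
```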

Am I stupid or do these agents regularly not read what’s in the agents.md file?

More recent models are better at reading and obeying constraints in AGENTS.md/CLAUDE.md.

GPT-5.2-Codex did a bad job of obeying my more detailed AGENTS.md files but GPT-5.3-Codex very evidently follows it well.


Perhaps I’m not using the latest and greatest in terms of models. I tend to avoid using tools that require excessive customization like this.

I find it infinitely frustrating to attempt to make these piece-of-shit “agents” do basic things like running the unit/integration tests after making changes.


Opus 4.5 successfully ignored the first line of my CLAUDE.md file last week

Thank god it’s not just me. It really makes me feel insane reading some of the commentary online.

Each agent uses a different file, like claude.md etc (maybe you already knew that).

And it requires a bit of prompt engineering like using caps for some stuff (ALWAYS), etc.


You’re not stupid. But the agents.md file is just an md file at the end of the day.

We’ve been acting as if it’s assembly code that the agents execute without question or confusion, but it’s just some more text.


That’s not what Claude and Codex put there when you ask them to init it. Also, the global view is most definitely bigger than their tiny, lorem-ipsum-on-steroids context, so what do you do then?

You know you can put anything there, not just what they init, right? And you can reference other doc files.

I should probably stop commenting on AI posts because when I try to help others get the most out of agents I usually just get downvoted, like now. People want to hate on AI, not learn how to use it.


It's still not truly global, but that seems a bit pie in the sky anyway.

People still do useful work without a global view, and there's still a human in the loop with the same ol' amount of global view they ever had.




