Congrats on the launch! The agent CLIs and SDKs were built for local use, so there's a ton of infra work involved in running these agents in production. Genuinely excited for this space to mature.
I have been building an OSS self-hostable agent infra suite at https://ash-cloud.ai
Yeah, with sandbox pre-warming and disk co-location it's fast enough to avoid the cold-start UX penalty.
On write amplification: we persist at the message level, not per SSE chunk. The sandbox's workspace filesystem (Claude Code's native JSONL files) is the source of truth for resume, and the DB is for queryability, tracing, etc., so fire-and-forget works fine here.
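A minimal sketch of that split, assuming a JSONL transcript file in the workspace and a hypothetical `db_write` callable (not a real API from any specific SDK): the durable append happens inline, while the DB insert is queued to a background thread and allowed to fail.

```python
import json
import queue
import threading
from pathlib import Path

class MessageSink:
    """Persist per message, not per SSE chunk. The workspace JSONL file
    is the source of truth for resume; the DB write is fire-and-forget
    on a worker thread, so a lost write only affects queryability."""

    def __init__(self, workspace: Path, db_write):
        self.jsonl = workspace / "transcript.jsonl"
        self.db_write = db_write  # hypothetical DB insert callable
        self.q: queue.Queue = queue.Queue()
        threading.Thread(target=self._drain, daemon=True).start()

    def persist(self, message: dict) -> None:
        # Durable append first: resume reads this file, never the DB.
        with self.jsonl.open("a") as f:
            f.write(json.dumps(message) + "\n")
        self.q.put(message)  # fire and forget

    def _drain(self) -> None:
        while True:
            msg = self.q.get()
            try:
                self.db_write(msg)  # best-effort: tracing/queries only
            except Exception:
                pass  # a dropped DB row is tolerable here
            self.q.task_done()
```

The design choice is simply that the durable write and the queryable write have different consistency requirements, so only the former sits on the hot path.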
You can fire them in parallel for simple cases. The issue is when you have multi-agent setups. If context isn't persisted before a sub-agent reads it, you get stale state. Single source of truth matters when agents are reading and writing to the same context.
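The ordering invariant can be sketched like this (illustrative names, not any particular framework's API): the parent's write must be committed to the single source of truth before a sub-agent is spawned, so the sub-agent's read can never observe stale state.

```python
import threading

class ContextStore:
    """Illustrative single-source-of-truth context store. commit()
    blocks until the entry is durable in the store, so any reader
    started afterwards sees it."""

    def __init__(self):
        self._entries: list[str] = []
        self._lock = threading.Lock()

    def commit(self, entry: str) -> None:
        with self._lock:
            self._entries.append(entry)

    def snapshot(self) -> list[str]:
        with self._lock:
            return list(self._entries)

def run_subagent(store: ContextStore) -> list[str]:
    # Spawned only AFTER the parent's commit() returns, so this
    # snapshot includes everything the parent wrote.
    return store.snapshot()
```

With fire-and-forget writes instead, `run_subagent` could race the parent's persistence and plan against a context missing the latest messages.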
I like this insight. We always knew we wanted good docs, but they're demotivating to maintain if people aren't reading them. LLMs by their nature won't be onboarded to the codebase through meetings and conversations, so if we want them to have a proper onboarding we're forced to be less lazy with our docs, and we get the validation of knowing they're actually being used.
The bitter lesson strikes again… now for graphics rendering. NeRFs had a volume-rendering prior, and Gaussian splats had a rasterization prior. This just… throws it all away. No priors, no domain knowledge, just data and attention.
This is the way.
It should be possible to call the GPL library in a separate process (surya can batch-process from the CLI) so the GPL obligations stay with the tool rather than extending to your code; ocrmypdf does this with Ghostscript.
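The pattern is just a process boundary: exchange data with the GPL program over pipes or files instead of linking against it. A minimal sketch (the command passed in stands for whatever GPL CLI you're wrapping; `cat` below is only a stand-in for testing):

```python
import subprocess

def run_external(cmd: list[str], data: bytes) -> bytes:
    """Run a tool in a separate process, talking only over stdin/stdout.
    This is the ocrmypdf/Ghostscript arrangement: no linking, just a
    process boundary, so the external program's license stays with it."""
    proc = subprocess.run(cmd, input=data, capture_output=True, check=True)
    return proc.stdout
```

For batch OCR you'd do the same with temp files: write the PDFs to a directory, invoke the CLI on it, and read back its output files.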