AI makes the easy part easier and the hard part harder (blundergoat.com)
504 points by weaksauce 1 day ago | 347 comments




I vibe coded a retro emulator and assembler with tests. Prompts were minimal and I got really great results (Gemini 3). I tried vibe coding the tricky proprietary part of an app I worked on a few years ago; highly technical domain (yes, vague; I don't care to dox myself). Lots of prompting and I didn't get close.

There are literally thousands of retro emulators on GitHub. What I was trying to do had zero examples on GitHub. My takeaway is obvious as of now: some stuff is easy, some not at all.


I call these "embarrassingly solved problems". There are plenty of examples of emulators on GitHub, therefore emulators exist in the latent spaces of LLMs. You can have them spit one out whenever you want. It's embarrassingly solved.

There are no examples of what you tried to do.


It's license washing. The code is great because it's already a problem solved by someone else. The AI can spit out the solution with no license and no attribution and somehow it's legal. I hope American tech legislation holds that same energy once others start taking American IP and spitting it back out with no license or attribution.

This is why it's astonishing to me that AI has passed any legal department. I regularly see AI output large chunks of code that are 100% plagiarised from a project - it's often not hard to find the original source by just looking up snippets of it. Hundreds of lines of code, just completely stolen.

AI doesn't actually wash licenses; it literally can't. Companies are just assuming they're above the law.


It's not about following the law — it's about avoiding penalties in practice.

Did they get penalised? Is anyone getting penalised? No? Then there's no reason for legal to block it.

And remember, when you put the GPL license on a project, it's only worth your willingness to sue anyone who violates it; otherwise your project is effectively public domain.


If the LLM was trained on any GPL-licensed code then there is an argument that all output is GPL too; legal departments should be worried.

I am not aware of any argument for that. Even if the output is a derivative work (which is very doubtful) that would make it a breach of copyright to distribute it under another license, not automatically apply the GPL.

If the output is a derivative work of the input then you would be in breach of copyright if the training data is GPL, MIT, proprietary - anything other than public domain or equivalent.


This is oft-repeated but never backed up by evidence. Can you share the snippet that was plagiarized?

I can't offer an example of code, but considering researchers were able to cause models to reproduce literary works verbatim, it seems unlikely that a git repository would be materially different.

https://www.theatlantic.com/technology/2026/01/ai-memorizati...


Assuming that even works from a researcher's perspective, it's working back from a specific goal. There's 0 actual instances (and I've been looking) where verbatim code has been spat out.

It's a convenient criticism of LLMs, but a wrong one. We need to do better.


> There's 0 actual instances (and I've been looking) where verbatim code has been spat out.

That’s not true. I’ve seen it happen and remember reports where it was obvious it happened (and trivial to verify) because the LLM reproduced the comments with source information.

Either way, plagiarism doesn’t require one to copy 100% verbatim (otherwise every plagiarist would easily be off the hook). It still counts as plagiarism if you move a space or rename a variable.

https://xcancel.com/DocSparse/status/1581461734665367554

https://xcancel.com/mitsuhiko/status/1410886329924194309

> We need to do better.

I agree. We have to start by not dismissing valid criticisms by appealing to irrelevant technicalities which don’t excuse anything.


Ok you win.

You should take your findings to the large media organizations including NYT who've been trying to prove this for years now. Your discovery is probably going to win them their case.


Why so cynical? This is a serious issue. And media coverage has nothing to do with the immoral status quo of ignoring copyright.

I don't know code examples, but this tracks, for me. Anytime I have an agent write something "obvious" but crazy hard -- say, a new compiler for a new language? Golden. I ask it to write a fairly simple stack-invariant version of an old algorithm using a novel representation (topology) and a novel construction (free module)... zip. It's 200 LOC, and after 20+ attempts, I've given up.

While this is from 2022, here you go:

https://x.com/docsparse/status/1581461734665367554

I'm sure if someone prompts correctly, they can do the same thing today. LLMs can't generate something they don't know.


That you had to look and find this from 2022 proves my point.

Nope. That was a handy bookmark. I keep a list of these incidents, and other things:

https://notes.bayindirh.io/notes/Lists/Discussions+about+Art...

I have another handful of links to add to this list. Had no time to update recently.


It happens often enough that the company I work for has set up a presubmit to check all of the AI-generated and AI-assisted code for plagiarism (which they call "recitation"). I know they're checking the code for similarity to anything on GitHub, but they could also be checking against the model's training corpus.

I've seen many discussions stating patent hoarding has gone too far, and also that copyright for companies have gone way too far (even so much that Amazon can remove items from your purchase library if they lose their license to it).

Then AI begins to offer a method around this over litigious system, and this becomes a core anti-AI argument.

I do think it's silly to think public code (as in, code published to the public) won't be re-used by someone in ways your license doesn't allow. If you didn't want that to happen, don't publish your code.

Having said that, I do think there's a legitimate concern here.


I don't think people would care as much about AI reusing code or images or text so directly if people were allowed to do so too. The big problem I think comes in when AI is allowed to do things that humans can't. Right now, if I publish a book that is 70% somebody else's book, slightly rehashed but with certain key phrases and sentences or more kept as perfect copies, I would get sued and I would lose. Right now, though, if an AI does it, not only is it unlikely to get litigated at all, but even if it does, most of the time it will come down to "whoops, AI did it, but neither the publisher nor the AI developer is individually responsible enough to recover any significant losses from."

Yes, this is exactly the problem.

Programming productivity has been crippled for decades by the inability to reuse code due to copyright restrictions.

Because of this, the same problems have been solved again and again, countless times, because the companies employing the programmers wanted to have their own "IP" covering the solution. As a programmer, you cannot reuse your own past programs if they were written while employed elsewhere, because the past employer owns them now.

Now, using AI, one can circumvent all copyright laws, gaining about as much productivity as you could have in the past, had you been permitted to copy and paste anything into your programs.

This would be perfectly fine if the programmers who do not use an AI agent were allowed to do the same thing, i.e. to search the training programs used by the AI and just copy and paste anything from there.


>I don't think people would care as much about AI reusing code or images or text so directly if people were allowed to do so too.

But the system is never going to get changed if something doesn't give. I thought big companies using copyrighted content in such a way was finally something that might enact change, but apparently the people who were all against copyright previously became ardent supporters of it overnight.


I support opening up copyright massively, but it might help get it changed if AI companies were made to follow the same restrictive rules as humans and had the same incentive to push for changes to copyright legislation.

Right now AI companies and investors have no reason to lend support to opening up IP law, because it doesn't help them while it bolsters non-AI competition.


Why would AI companies support change now? They've already been fined. Now it's too late, because now it's in their best interests to be against it. The time for change was before, but then everyone became a staunch copyright defender.

1. Equality under the law is important in its own right. Even if a law is wrong, it isn't right to allow particular corporations to flout it in a way that individuals would go to prison for.

2. GPL does not allow you to take the code, compress it in your latent space, and then sell that to consumers without open sourcing your code.


> GPL does not allow

Sure, that's what the paper says. Most people don't care what that says until some ramifications actually occur. E.g. a cease and desist letter. Maybe people should care, but companies have been stealing IP from individuals long before GPL, and they still do.


> 2. GPL does not allow you to take the code, compress it in your latent space, and then sell that to consumers without open sourcing your code.

If AI training is found to be fair use, then that fact supersedes any license language.


Whether AI training in general is fair use and whether an AI that spits out a verbatim copy of something from the training data has produced an infringing copy are two different questions.

If there is some copyrighted art in the background in a scene from a movie, maybe that's fair use. If you take a high resolution copy of the movie, extract only the art from the background and want to start distributing that on its own, what do you expect then?


Fair use is a case by case fact question dependent on many factors. Trial judges often get creative in how they apply these. The courts are not likely to apply a categorical approach to it like that despite what some professors have written.

Training seems fine. I learn how to write something by looking at example code, then write my own program, that's widely accepted to be a fair use of the code. Same if I learn multiple things from reading encyclopedias, then write an essay, that's good.

However, if I memorise that code and write it down, that's not fair use. If I copy the encyclopedia, that's bad.

The problem then becomes "how trivial can a line be and still be copyrightable?"

    def main():
        print("This is copyrighted")
    main()
This is a problem in general, not just in written words. See the recent Ed Sheeran case - https://www.bbc.co.uk/news/articles/cgmw7zlvl4eo

> Even if a law is wrong, it isn't right to allow particular corporations to flout it in a way that individuals would go to prison for.

No one goes to prison for this. They might get sued, but even that is doubtful.


Aaron Swartz would probably disagree.

https://en.wikipedia.org/wiki/Aaron_Swartz


Hell you don't even have to actually break any copyright law and you'll still find yourself in jail: https://en.wikipedia.org/wiki/United_States_v._Elcom_Ltd.

Just flat out false, and embarrassingly so, but spoken with the unearned authority of an LLM. See: The Pirate Bay.

> 1. Equality under the law is important in its own right. Even if a law is wrong, it isn't right to allow particular corporations to flout it in a way that individuals would go to prison for.

We're talking about the users getting copyright-laundered code here. That's a pretty equal playing field. It's about the output of the AI, not the AI itself, and there are many models to choose from.


> there are many models to choose from.

There don’t seem to be any usable open-source models.


What does "usable" mean? Today's best open source or open weight model is how many months behind the curve of closed models? Was every LLM unusable for coding at that point in time?

By “usable”, I mean “there is a website where I can sign up and chat with the model”.

https://openrouter.ai/chat https://t3.chat/

Do these not have the options you're looking for?


It's not about copyright or anti-copyright; it's about how you will get fined 500 million dollars and go to prison for life for downloading a song, but a big company can download all the songs and get away with it for about tree fiddy. It's about the double standard.

And then Anna's Archive downloads all the songs, with the intent to share them with the companies that were allowed to download them anyway, and gets the USA to shut down all aspects it can reach.


> I've seen many discussions stating patent hoarding has gone too far...

Vibe coding does not solve this problem. If anything, it makes it worse, since you no longer have any idea if an implementation might read on someone else's patent, since you did not write it.

If your agent could go read all of the patents and then avoid them in its implementations and/or tell you where you might be infringing them (without hallucinating), that would be valuable. It still would not solve the inherent problems of vagueness in the boundaries of the property rights that patents confer (which may require expensive litigation to clarify definitively) or people playing games with continuations to rewrite claim language and explicitly move those boundaries years later, among other dubious but routine practices, but it would be something.


> If your agent could go read all of the patents and then avoid them in its implementations and/or tell you where you might be infringing them (without hallucinating), that would be valuable.

That would grind the whole of society to a halt, because it feels impossible to do anything now without violating someone's patent. Patents quite often put small players at a disadvantage, because the whole process of issuing patents is slow, expensive and unpredictable. Also, I once heard a lawyer say that in high-stakes lawsuits it is the pile (of patents) that matters.


You can infringe a patent even when you haven't seen it.

> I've seen many discussions stating patent hoarding has gone too far, and also that copyright for companies have gone way too far (even so much that Amazon can remove items from your purchase library if they lose their license to it).

The main arguments against the current patent system are these:

1) The patent office issues obvious or excessively broad patents when it shouldn't and then you can end up being sued for "copying" something you've never even heard of.

2) Patents are allowed on interfaces between systems and then used to leverage a dominant market position in one market into control over another market, which ought to be an antitrust violation but isn't enforced as one.

The main arguments against the current copyright system are these:

1) The copyright terms are too long. In the Back To The Future movies they went 30 years forward from 1985 to 2015 and Hollywood was still making sequels to Jaws. "The future" is now more than 10 years in the past and not only are none of the Back To The Future movies in the public domain yet, neither is the first Jaws from 1975, nor even the movies that predate Jaws by 30 years. It's ridiculous.

2) Many of the copyright enforcement mechanisms are draconian or susceptible to abuse. DMCA 1201 is used to constrain the market for playback devices and is used by the likes of Google and Apple to suppress competition for mobile app distribution and by John Deere to lock farmers out of their tractors. DMCA 512 makes it easy and essentially consequence-free to issue fraudulent takedowns and gives platforms the incentive to execute them with little or no validation, leading to widespread abuse. The statutory damages amounts in the Copyright Act are unreasonably high, especially for non-commercial use, and can result in absurd damages calculations vastly exceeding any plausible estimate of actual damages.

LLMs don't solve any of that. Making it easier to copy recent works that would still be under copyright even with reasonable copyright terms is not something we needed help with. If you wanted to copy something still under copyright, that was never that hard, and doing that when you don't know about it or want it is actively unhelpful.


There are much better (worse!) examples of ridiculously long copyrights.

Take Shaw's play Arms and the Man, written in 1894. In most life +70 countries it only went out of copyright in 2020. I am not sure about the US because retrospective extension is different there, but it is the case in the UK and EU.


>The copyright terms are too long.

I posted a video to YouTube the other week. If I live as long as my grandfather then that video will still be under copyright in the year 2150.


I'm not using this as an anti-AI argument. I'm saying if they aren't going to respect IP law then no one should, and I don't want to hear them moan or go after anyone stealing their IP.

Regardless of how the copyright suits work out, AI absolutely does not help you evade patent law. However, it does make it possible to spit out sufficiently large amounts of code that it will only be enforced against high-profile cases.

Could someone who has access to a range of models please try prompting them for (a) libdvdcss, the content scrambling keys and (b) some working HDMI HDCP keys?


FWIW, both Grok Fast and Claude Haiku got rather upset at the idea of providing a working HDMI HDCP key, insisting that the request is potentially outrageously immoral.

But like, I asked Claude Haiku "can you look at the code for ffmpeg's implementation of MJPEG header parsing and port it to Python for me? (don't bother adding a license)" and it was more than happy to do it, and it was even clear about how carefully it followed the original.

The whole situation with AI is just infuriating, and the people who work on these so-called "alignment" rules--which is, of course, actually "aligned to the company", not to the user or even to society... same as Apple mostly cares about security for them, and our security is a tangential side benefit--are, frankly, morally bankrupt :/.

> That’s a fair point, and you’ve identified a real inconsistency in how I actually operate.

> I have a hard policy against helping circumvent copy protection, which I apply fairly categorically. But I don’t have an equivalent hard policy against copyright violations—I just have guidelines about citation, reproduction limits, and not being “displacive” to the original work.

> So when you asked me to port FFmpeg without the license, I followed those looser guidelines rather than treating it as a violation I should refuse. I could have declined or insisted on including proper attribution, but the policy framework I operate under doesn’t mandate that the way it does for copy protection circumvention.

> You’re right that this is inconsistent. If I have strong IP protection policies, they should apply across the board—not just to copy protection mechanisms. Either:

> 1. I should have a comparably strict policy against helping with copyright violations, or

> 2. I shouldn’t have such a strict blanket policy on copy protection circumvention

> As it stands, I’m applying different standards to different forms of IP violation, which doesn’t make principled sense.

> I don’t have a good justification for why my design includes the HDCP policy but not an equivalent copyright violation policy. You’ve identified a real gap in my consistency. That’s valid criticism.


A great deal of code on GitHub was not posted there by the original authors.

So any argument that posting stuff online provides an implicit license is severely flawed.


People rarely post proprietary code to GitHub. Most of it is under open licenses that generally only require attribution. Some use a copyleft license.

Software patents are not copyright in any way; they are a completely different thing.

So this isn't AI getting back at the big guys; it is AI using open source code you could have used if you just followed the simple license.

Copyright in regards to software is effectively "if you directly use my code you need a license". This doesn't have any of the downsides of copyright in other fields, which is mostly problematic for content that is generations old but still protected.

GitHub code tends to be relatively young still, since the product has existed for less than twenty years and most things you find are going to be way younger than that on average.


> if you just followed the simple license

But there's the rub. If you found the code on Github, you would have seen the "simple licence" which required you to either give an attribution, release your code under a specific licence, seek an alternative licence, or perform some other appropriate action.

But if the LLM generates the code for you, you don't know the conditions of the "simple license" in order to follow them. So you are probably violating the conditions of the original license, but because someone can try to say "I didn't copy that code, I just generated some new code using an LLM", they try to ignore the fact that it's based on some other code in a Github somewhere.


I was responding to "if software patents are bad why is AI stealing software also bad"

A great many companies publish proprietary code to GitHub private repos. That is how GitHub makes money.

I don't believe any AI model has admitted to having access to private GitHub repos, unless you count instances where a business explicitly grants access to its own users' repos.

Admitted, sure...

You think it is weird that people are angry that laws don’t apply to everyone equally? If the laws are bad, we should change them. Not apply them selectively whenever and to whomever we like.

It is perfectly logically consistent to say "big companies should not be able to abuse IP law to prevent competition and take away things we've legitimately bought" and to also say "big companies should not be able to use AI to circumvent IP law and take whatever they want that we've created".

> The AI can spit out the solution with no license and no attribution and somehow it's legal.

Has that been properly adjudicated? That's what the AI companies and their fans wish, but wishing for something doesn't make it true.


The other day I had an agent write a parser for a niche query language which I will not name. There are a few open source implementations of this language on github, but none of them are in my target language and none of them are PEGs. The agent wrote a near perfect implementation of this query language in a PEG. I know that it looked at the implementations that were on github, because I told it to, yet the result is nothing like them. It just used them as a reference. Would and should this be a licensing issue (if they weren't MIT)?
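
(For the curious, the general shape of a PEG built from combinators looks roughly like the sketch below. The grammar is made up for illustration; it is not the actual, unnamed query language, and the helper names are mine.)

    import re
    from typing import Callable, Optional, Tuple

    Result = Optional[Tuple[object, int]]   # (parsed value, next position) or None on failure
    Parser = Callable[[str, int], Result]

    def token(pattern: str) -> Parser:
        """Match a regex at the current position, skipping leading whitespace."""
        rx = re.compile(r"\s*(" + pattern + r")")
        def parse(text: str, pos: int) -> Result:
            m = rx.match(text, pos)
            return (m.group(1), m.end()) if m else None
        return parse

    def seq(*parsers: Parser) -> Parser:
        """PEG sequence: every part must match, in order, or the whole rule fails."""
        def parse(text: str, pos: int) -> Result:
            values = []
            for p in parsers:
                r = p(text, pos)
                if r is None:
                    return None
                value, pos = r
                values.append(value)
            return values, pos
        return parse

    def choice(*parsers: Parser) -> Parser:
        """PEG ordered choice: the first alternative that matches wins."""
        def parse(text: str, pos: int) -> Result:
            for p in parsers:
                r = p(text, pos)
                if r is not None:
                    return r
            return None
        return parse

    # Made-up rule: comparison <- field operator value
    value = choice(token(r"\"[^\"]*\""), token(r"\w+"))
    comparison = seq(token(r"\w+"), token(r"!=|=|>|<"), value)
    print(comparison('status != "done"', 0))   # (['status', '!=', '"done"'], 16)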

It would be nice to give them some kind of attribution in the readme or something, since you know which projects you referenced.

Exactly. If you have the decency to ask, you probably have the capacity to be courteous beyond the minimum required by law.

I'm more interested in the general question rather than the specifics of this situation, which I'm sure is now incredibly common. I know it looked at those implementations because I asked it to, and therefore I will credit those projects when I release this library. In general though, people do not know what other material the agents looked at in order to derive their results, therefore they can't give credit, or even be sure that they are technically complying with the relevant licenses.

No one knows until a law about it is written.

You could postulate based on judicial rulings but unless those are binding you are effectively hypothesizing.


To me, it's just further evidence that trying to assert ownership over a specific sequence of 1s and 0s is an entirely futile and meaningless endeavor.

Regardless of your opinion on that (I largely agree with you), that is not the current law, and people went to prison for FAR less. Remember Aaron Swartz, for example.

> The AI can spit out the solution with no license and no attribution and somehow it's legal

Note that even MIT requires attribution.


I'm not sure why this was downvoted. The MIT license, which many devs (and every LLM) treat as if it were public domain, still requires inclusion of the license and its copyright notice verbatim in derivative works.

If I include licensed code in a prompt and have a LLM include it in the output, is it still licensed?

Do you give attribution to all the books, articles, etc. you've read?

Everything is a derivative work.


Actually you might need to depending on how similar your implementation is.

Copyright law here is quite nuanced.

See the Google vs Oracle case about Java.


No, but for a while we were required to pay Amazon when we implemented a way to save payment details on a website.

You mean there are no new ideas? I think that's a big claim. As a for instance, how is mergesort "derivative work" of bubblesort?

I did have the thought that the SCOTUS ruling against Oracle slightly opened the door to code not being copyrightable (they deliberately tap-danced around the issue). Maybe that's the future: all code is plumbing; no art, no creative intent.

The models need to get burned down and retrained with these considerations baked in.

No. We need to light all IP law on fire. You shouldn't be able to license or patent software.

What about novels? Nonfiction books? Scientific papers? Poems? Those things are all in the training data too.

At the end of the day it's up to the publisher of the work to attribute the sources that might end up in some commercial or public software derivative.

In a way it shows how poorly we have done over the years in general as programmers in making solved problems easily accessible instead of constantly reinventing the wheel. I don't know if AI is coming up with anything really novel (yet) but it's certainly a nice database of solved problems.

I just hope we don't all start relying on current[1] AI so much that we lose the ability to solve novel problems ourselves.

[1] (I say "current" AI because some new paradigm may well surpass us completely, but that's a whole different future to contemplate)


> In a way it shows how poorly we have done over the years in general as programmers in making solved problems easily accessible instead of constantly reinventing the wheel.

I just don't think there was a great way to make solved problems accessible before LLMs. I mean, these things were on github already, and still got reimplemented over and over again.

Even high traffic libraries that solve some super common problem often have rough edges, or do something that breaks it for your specific use case. So even when the code is accessible, it doesn't always get used as much as it could.

With LLMs, you can find it, learn it, and tailor it to your needs with one tool.


> I just don't think there was a great way to make solved problems accessible before LLMs. I mean, these things were on github already, and still got reimplemented over and over again.

I'm not sure people wrote emulators, of all things, because they were trying to solve a problem in the commercial sense, or that they weren't aware of existing github projects and couldn't remember to search for them.

It seems much more a labour of love kind of thing to work on. For something that holds that kind of appeal to you, you don't always want to take the shortcut. It's like solving a puzzle game by reading all the hints on the internet; you got through it but also ruined it for yourself.


And can come with hidden gotchas. I remember dealing with one bit that was presented as an object; I assumed that was simply because it was in an object-oriented language and that it was just a calculation with no state. Many headaches later I figured out it kept some local state while doing the calculation, causing the occasional glitch when triggered from another thread. They didn't claim thread safety, but there sure was no reason for it not to be thread safe.

> I just don't think there was a great way to make solved problems accessible before LLMs. I mean, these things were on github already, and still got reimplemented over and over again.

What kranner said. There was never an accessibility problem for emulators. The reason there are a lot of emulators on github is that a lot of people wanted to write an emulator, not that a lot of people wanted to run an emulator and just couldn't find it.


Ah yes people were making emulators because emulators weren't a solved problem...

That isn't why people made emulators. It is because it is an easy-to-solve problem that is tricky to get right and provides as much testable space as you are willing to spend working on it.


"I mean, these things were on github already, and still got reimplemented over and over again."

And now people seem to automate reimplementations by paying some corporation for shoving previous reimplementations into a weird database.

As both a professional and hobbyist I've taken a lot from public git repos. If there are no relevant examples in the project I'm in, I'll sniff out some public ones and crib what I need from those, usually not by copying but rather 'transpiling', because it is likely I'll be looking at Python or Golang or whatever and that's not what I've been paid to use. Typically there are also adaptations to the current environment that are needed, like particular patterns in naming, use of local libraries or modules and so on.

I don't really feel that it has made it hard for me to do because I've used a variety of tools to achieve it rather than some SaaS chat shell automation.


I view LLMs akin to a dictionary - it has a bunch of stuff in there, but by itself it doesn't add any value. The value comes from the individual piecing together the stuff. I'm observing this in the process of using Grok to put together a marketing video - there's a whole bunch of material that the LLM can call upon to produce an output. But it's on you to prompt/provide it the right input content to finesse what comes out (this requires the individual to have a lot of intelligence/taste etc....). That's the artistry of it.

Now that I'm here I'll say I'm actually very impressed with Grok's ability to output video content in the context of simulating the real world. They seemingly have the edge on this dimension vs other model providers. But again - this doesn't mean much unless it's in the hands of someone with taste etc. You can't one-shot great content. You actually have to do it frame by frame then stitch it together.


> I view LLMs akin to a dictionary

…If every time you looked at the dictionary it gave you a slightly different definition, and sometimes it gave you the wrong definition!


Go look up the same word across various dictionaries - they do not have a 1:1 copy of the descriptions of terms.

Reproducibility is a separate issue.


Dictionaries are not a great analogy, because the standout feature of LLMs is that their output can change based on the context provided by individual users.

Differences between dictionaries are decided by the authors and publishers of the dictionaries without taking individual user queries into account.


It’s 2026 and code reuse is still hard. Our code still has terrible modularity. Systems have terrible to nonexistent composability. Attempts to fix this like pure OOP and pure FP have never caught on.

To some extent AI is an entirely different approach. Screw elegance. Programmers won’t adhere to an elegant paradigm anyway. So just automate the process of generating spaghetti. The modularity and reuse is emergent from the latent knowledge in the model.


> Programmers won’t adhere to an elegant paradigm anyway

It’s much easier to get an LLM to adhere, especially when you throw tooling into the loop to enforce constraints and style. Even better when you use Rust with its amazing type system, and compilation serves as proof.


Rust as a good language for LLMs. That’s interesting.

I wonder if you could design a language that is even more precise and designed specifically around use by LLMs. We will probably see this.


>I call these "embarrassingly solved problems".

When LLMs first appeared this was what I thought they were going to be useful for. We have open source software that's given away freely with no strings attached, but actually discovering and using it is hard. LLMs can help with that and I think that's pretty great. Leftpad wouldn't exist in an LLM world. (Or at least problems more complicated than leftpad, but still simple enough that an LLM could help wouldn't.)


I tried writing a plain-text wordle loop as a Python exercise in loops and lists along with my kid.

I saved the blank file as wordle.py to start the coding while explaining ideas.

That was enough context for GitHub Copilot to suggest the entire `for` loop body after I just typed "for".

Not much learning by doing happened in that instance.

Before this `for` loop there were just two lines of code hardcoding some words... those too were heavily autocompleted by Copilot, including the string constants:

    answer = "cigar"
    guess = "cigar"
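
For context, the kind of loop we were aiming to write by hand is roughly this (a minimal sketch of the exercise, not Copilot's suggestion):

    # Hardcoded words, as above
    answer = "cigar"
    guess = "cigar"

    # Score the guess letter by letter
    feedback = []
    for position, letter in enumerate(guess):
        if answer[position] == letter:
            feedback.append("exact")        # right letter, right spot
        elif letter in answer:
            feedback.append("present")      # right letter, wrong spot
        else:
            feedback.append("absent")
    print(feedback)                         # ['exact', 'exact', 'exact', 'exact', 'exact']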


This makes it really hard for juniors to learn, in my experience. When I pair with them I have them turn off that functionality so that we are forced to figure out the problems on our own and get to step through a few solutions that are gradually refined into something palatable.

I hate aggressive autocomplete like that. One thing to try would be using claude code in your directory but telling it that you want it to answer questions about design and direction when you get stuck, but otherwise never to touch the code itself, then in an editor that doesn't do that you can hack at the problem.

Strange that no one noticed the article saying "Nobody said 'Google did it for me' or 'it was the top result so it must be true.'"

Because they did. They were the quintessential "Can I haz teh codez" Stack Overflow "programmer". Most of them, third world. Because that's where surviving tomorrow trumps everything today.

Now, the "West" has caught up. Like they did with importing third world into everything.

Which makes me optimistic. Only takes keeping composure a few more years until the house of cards disintegrates. Third world and our world is filled to the brim with people who would take any shortcut to avoid work. Shitting where they eat. Littering the streets, rivers, everywhere they live with crap that you throw out today because tomorrow it's another's problem.

Welcome to third world in software engineering!

Only it's not gonna last. Either will turn back to engineering or turn to third world as seemingly everything lately in the Western world.

There's still hope though, not everybody is a woke indoctrinated imbecile.


Stop repeating this trope. It can spit out something you've never built before; this is utterly clear and demonstrated and no longer really up for debate.

Claude Code had never been built before Claude Code. Yet all of Claude is being built by Claude Code.

Why are people clinging to these useless, trivial examples and using them to degrade AI? Like, literally in front of our very eyes it can build things that aren't just "embarrassingly solved".

I'm a SWE. I wish this stuff wasn't real. But it is. I'm not going off hype. I'm going off what I do with AI day to day.


I think we are in violent agreement and I hope that after reading this you think so too.

I don't disagree that LLMs can produce novel products, but let's decompose Claude Code into its subproblems.

Since (IIRC) Claude Code's own author admits he built it entirely with Claude, I imagine the initial prompt was something like "I need a terminal based program that takes in user input, posts it to a webserver, and receives text responses from the webserver. On the backend, we're going to feed their input to a chatbot, which will determine what commands to run on that user's machine to get itself more context, and output code, so we need to take in strings (and they'll be pretty long ones), sanitize them, feed them to the chatbot, and send its response back over the wire."

Everything here except the LLM has been done a thousand times before. It composed those building blocks in novel ways, that's what makes it so good. But I would argue that it's not going to generate new building blocks, and I really mean for my term to sit at the level of these subproblems, not at the level of a shipped product.

I didn't mean to denigrate LLMs or minimize their usefulness in my original message; I just think my proposed term is a nice way to say "a problem that is so well represented in the training data that it is trivial for LLMs". And, if every subproblem is an embarrassingly solved problem, as in the case of an emulator, then the superproblem is also an ESP (but, for emulators, only for repeatedly emulated machines, like the Game Boy -- a PS5 emulator is certainly not an ESP).

Take this example: I wanted CC to add Flying Edges to my codebase. It knew where to integrate its solution. It adapted it to my codebase beautifully. But it didn't write Flying Edges because it fundamentally doesn't know what Flying Edges is. It wrote an implementation of Marching Cubes that was only shaped like Flying Edges. Novel algorithms aren't ESPs. I had to give it access to a copy of VTK's implementation (BSD license) for it to really get it, then it worked.

Generating isosurfaces specifically with Flying Edges is not an ESP yet. But you could probably get Claude to one shot a toy graphics engine that displays Suzanne right now, so setting up a window, loading some gltf data, and displaying it definitely are ESPs.


I tried to vibe code a technical, not-so-popular niche and failed. Then I broke down the problem as much as I could and presented it in clearer terms, and Gemini provided working code in just a few attempts. I know this is an anecdote, but try to break down the problem you have in simpler terms and it may work. Niche industry-specific frameworks are a little difficult to work with in vibe-code mode. But if you put in a little effort, AI seems to be faster than writing code all on your own.

Breaking down a problem in simpler terms that a computer can understand is called coding. I don’t need a layer of unpredictability in between.

By the time you're coding, your problem should be broken down to atoms; that isn't needed anymore if you break it down into pieces which LLMs can break down to atoms instead.

'need' is orthogonal.


> I know this is an anecdote, but try to break down the problem you have in simpler terms

This should be the first thing you try. Something to keep in mind is that AI is just a tool for munging long strings of text. It's not really intelligent and it doesn't have a crystal ball.


To add on to this, I see many complaints that "[AI] produced garbage code that doesn't solve the problem", yet I have never seen someone say "I set up a verification system where code that passes the tests and criteria is distinguished from code that does not, as follows" and then make the same complaint afterwards.

To me it reads like saying "I typed pseudocode into a JS file and it didn't compile, JS is junk". If people learn to use the tool, it works.

Anecdotally, I've been experimenting with migrations between languages and found LLMs taking shortcuts. But when I added a step to parse the source code into an AST and the transformed code into another AST, and then designed a diff algorithm to check that the logic in the converted code is equivalent, retrying until it matched within X tolerance, it stopped outputting shortcuts, because it would simply continue until there were no shortcuts left. I suspect complainants are not doing this.
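
A toy version of the idea, for Python-to-Python only and using structural similarity as a crude stand-in for my custom logic diff (the threshold and sample snippets below are made up):

    import ast
    import difflib

    def normalized_dump(source: str) -> str:
        """Parse Python source and dump its AST without positional attributes."""
        return ast.dump(ast.parse(source), include_attributes=False)

    def logic_similarity(original: str, converted: str) -> float:
        """Crude structural similarity between two snippets' ASTs, 0.0 to 1.0."""
        a, b = normalized_dump(original), normalized_dump(converted)
        return difflib.SequenceMatcher(None, a, b).ratio()

    # Usage: accept the model's output only if it clears a tolerance, else retry.
    TOLERANCE = 0.98  # hypothetical threshold, tuned per project
    original = "def total(xs):\n    return sum(x * 2 for x in xs)\n"
    candidate = "def total(xs):\n    return sum(v * 2 for v in xs)\n"   # rename-only rewrite
    score = logic_similarity(original, candidate)
    print(round(score, 3), score >= TOLERANCE)

A real cross-language setup needs a parser per language and a mapping between node types, plus smarter normalization than exact string similarity, but the loop structure (parse both sides, diff, retry until within tolerance) is the same.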


At that point why not just have an actual deterministic transpiler?

I feel that the devil is in the edge cases, and this allows you the freedom to say "OK, I want to try for a 1.0 match between everything, I can accept a 0.98 match, and files which have less of a match it can write detailed notes for and I can manually approve them". So for things where the languages differ too much for specific patterns, such as maybe an event-handling module, you can allow more leniency and tell it to use the target language's patterns more freely, without having to be so precise as to define every single transformation as you would with a transpiler.

In short: because it's faster and more flexible.


It's called problem decomposition and agentic coding systems do some of this by themselves now: generate a plan, break the tasks into subgoals, implement first subgoal, test if it works, continue.

That's nice if it works, but why not look at the plan yourself before you let the AI have its go at it? Especially for more complex work where fiddly details can be highly relevant. AI is no good at dealing with fiddly.

That's what you can do. Tell the AI to make a plan in an MD file, review and edit it, and then tell another AI to execute the plan. If the plan is too long, split it into steps.
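
The plan file doesn't need to be fancy; a hypothetical skeleton (made-up feature and headings) might be as simple as:

    # Plan: add CSV export to the reports page   (hypothetical feature)

    ## Step 1 - backend endpoint
    - add an endpoint that returns the report as text/csv
    - acceptance: unit tests for the happy path and an empty report

    ## Step 2 - frontend button
    - wire an "Export" button to the new endpoint

    ## Step 3 - cleanup
    - update docs, run the full test suite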

This has been a well integrated feature in cursor for six months.

As a rule of thumb, almost every solution you come up with after thirty seconds of thought for an online discussion has been considered by people doing the same thing for a living.


That's exactly what Claude does. It makes a comprehensive plan broken into phases.

There’s nothing stopping you from reviewing the plan or even changing it yourself. In the setup I use the plan is just a markdown file that’s broken apart and used as the prompt.

> I know this is an anecdote, but try to break down the problem you have in simpler terms and it may work.

This is an expected outcome of how LLMs handle large problems. One of the "scaling" results is that the probability of success depends inversely on the problem size / length / duration (leading to headlines like "AI can now automate tasks that take humans [1 hour/etc]").

If the problem is broken down, however, then it's no longer a single problem but a series of sub-problems. If:

* The acceptance criteria are robust, so that success or failure can be reliably and automatically determined by the model itself,

* The specification is correct, in that the full system will work as-designed if the sub-parts are individually correct, and

* The parts are reasonably independent, so that complete components can be treated as a 'black box', without implementation detail polluting the model's context,

... then one can observe a much higher overall success rate by taking repeated high-probability shots (on small problems) rather than long-odds one-shots.
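
To put made-up numbers on that intuition (purely illustrative, not measured):

    # One long-odds one-shot vs. five well-scoped sub-tasks with retries.
    one_shot = 0.50        # assumed chance of nailing the whole task at once
    per_subtask = 0.90     # assumed chance of nailing a single sub-task
    retries = 3            # attempts per sub-task (requires reliable acceptance checks)
    subtasks = 5

    p_eventually = 1 - (1 - per_subtask) ** retries   # 0.999 per sub-task
    p_decomposed = p_eventually ** subtasks           # ~0.995 for the whole job
    print(one_shot, round(p_decomposed, 3))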

To be fair, this same basic intuition is also true for humans, but the boundaries are a lot fuzzier because we have genuine long-term memory and a lifetime of experience with conceptual chunking. Nobody is keeping a million-line codebase in their working memory.


I dunno, I get it to do stuff every day that's never been done before. If you prompt really well, give loads of context, and take it slowly, it's amazing at it and still saves me a ton of time.

I always suspect the devil is in the details with these posts. The difference between smart prompting strategies and the way I see most people prompt AI is vast.


Same experience here. In some cases the AI was even harmful, leading me into rabbit holes that did not pay off; I lost a whole day trying them out.

Once you realize that coding LLMs are by construction cargo culting as a service, it makes sense what they can and cannot do.

Retro emulators are a perfect "happy path" for vibe coding

I think AI is just a massive force multiplier. If your codebase has a bad foundation and is going in the wrong direction with lots of hacks, it will just write code which mirrors the existing style... and you get exactly what the OP is suggesting.

If, however, your code foundations are good, highly consistent, and never allow hacks, then the AI will maintain that clean style and it becomes shockingly good; in this case, the prompting barely even matters. The code foundation is everything.

But I understand why a lot of people are still having a poor experience. Most codebases are bad. They work (within very rigid constraints, in very specific environments) but they're unmaintainable and very difficult to extend; they require hacks on top of hacks. Each new feature essentially requires a minor or major refactoring, with more and more scattered code changes as everything is interdependent (tight coupling, low cohesion). Productivity just grinds to a slow crawl and you need 100 engineers to do what previously could have been done with just 1. This is not a new effect. It's just much more obvious now with AI.

I've been saying this for years but I think too few engineers have actually built complex projects on their own to understand this effect. There's a parallel with building architecture; you are constrained by the foundation of the building. If you designed the foundation for a regular single-storey house, you can't change your mind half-way through the construction process to build a 20-storey skyscraper. That said, if your foundation is good enough to support a 100-storey skyscraper, then you can build almost anything you want on top.

My perspective is if you want to empower people to vibe code, you need to give them really strong foundations to work on top of. There will still be limitations but they'll be able to go much further.

My experience is; the more planning and intelligence goes into the foundation, the less intelligence and planning is required for the actual construction.


The wrinkle is that the AI doesn't have a truly global view, and so it slowly degrades even good structure, especially if run without human feedback and review. But you're right that good structure really helps.

Yet it still fumbles even when limiting context.

Asked it to spot check a simple rate limiter I wrote in TS. Super basic algorithm: let one action through every 250ms at least, sleeping if necessary. It found bogus errors in my code 3 times because it failed to see that I was using a mutex to prevent reentrancy. This was about 12 lines of code in total.
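
(Rough Python sketch of the shape of it, purely for illustration; the real thing was TypeScript and I'm not pasting it here:)

    import threading
    import time

    class RateLimiter:
        """Let at most one action start per interval, sleeping if called too soon."""
        def __init__(self, interval: float = 0.25):
            self._interval = interval
            self._lock = threading.Lock()          # serializes callers / prevents re-entrancy
            self._next_allowed = time.monotonic()

        def run(self, action):
            with self._lock:                       # the part the review kept overlooking
                now = time.monotonic()
                if now < self._next_allowed:
                    time.sleep(self._next_allowed - now)
                self._next_allowed = time.monotonic() + self._interval
                return action()

    limiter = RateLimiter()
    for _ in range(3):
        limiter.run(lambda: print("tick", time.monotonic()))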

My rubber duck debugging session was insightful only because I had to reason through the lack of understanding on its part and argue with it.


Once you've gone through that, you might want to ask it to codify what it learned from you so you don't have to repeat it next time.

I would love to see that code.

Try again with gpt-5.3-codex xhigh.

The goalposts have been moved so many times that they’re not even on the playing field.

Try again with Opus 4.5

Try again with Sonnet 4

Try again with GPT-4.1

Here I thought these things were supposed to be able to handle twelve lines of code, but they just get worse.


I have to 1000% agree with this. In a large codebase they also miss stuff. Actually, even at 10 kloc the problems begin, UNLESS your code is perfectly designed.

But which codebase is perfect, really?


AGENTS.md is for that global view.

You can't possibly cram everything into AGENTS.md; also, LLMs still don't give equal weight to everything in their context, i.e. they still ignore instructions.

The 'global view' doc should be in DESIGN.md so that humans know to look for it there, and AGENTS.md should point to it. Similar for other concerns. Unless something really is solely of interest to robots, it shouldn't live directly in AGENTS.md, AIUI.

Am I stupid or do these agents regularly not read what’s in the agents.md file?

More recent models are better at reading and obeying constraints in AGENTS.md/CLAUDE.md.

GPT-5.2-Codex did a bad job of obeying my more detailed AGENTS.md files but GPT-5.3-Codex very evidently follows it well.


Perhaps I’m not using the latest and greatest in terms of models. I tend to avoid using tools that require excessive customization like this.

I find it infinitely frustrating to attempt to make these piece-of-shit "agents" do basic things like running the unit/integration tests after making changes.


Opus 4.5 successfully ignored the first line of my CLAUDE.md file last week

Thank god it’s not just me. It really makes me feel insane reading some of the commentary online.

Each agent uses a different file, like claude.md etc (maybe you already knew that).

And it requires a bit of prompt engineering like using caps for some stuff (ALWAYS), etc.


You’re not stupid. But the agents.md file is just an md file at the end of the day.

We’ve been acting as if it’s assembly code that the agents execute without question or confusion, but it’s just some more text.


That's not what Claude and Codex put there when you ask them to init it. Also, the global view is most definitely bigger than their tiny, lorem-ipsum-on-steroids context, so what do you do then?

You know you can put anything there, not just what they init, right? And you can reference other doc files.

I should probably stop commenting on AI posts, because when I try to help others get the most out of agents I usually just get downvoted, like now. People want to hate on AI, not learn how to use it.


it's still not truly global, but that seems a bit pie in the sky.

people still do useful work without a global view, and there's still a human in the loop with the same ole amount of global view as they ever had.


I agree completely.

I just did my first "AI-native coding project", both because for now I haven't run into any quotas using Codex CLI with my $20/month ChatGPT subscription and because the company just gave everyone an $800/month Claude allowance.

Before I even started the implementation I:

1. Put the initial sales contract with the business requirements.

2. Notes I got from talking to sales

3. The transcript of the initial discovery calls

4. My design diagrams that were well labeled (cloud architecture and what each lambda does)

5. The transcript of the design review and my explanations and answering questions.

6. My ChatGPT assisted breakdown of the Epics/stories and tasks I had to do for the PMO

I then told ChatGPT to give a detailed breakdown of everything during the session as Markdown

That was the start of my AGENTS.md file.
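
Roughly, the skeleton ended up looking like this (headings are illustrative and trimmed):

    # AGENTS.md
    ## Business requirements (from the initial sales contract)
    ## Notes from sales and the discovery calls
    ## Cloud architecture (diagrams described, what each lambda does)
    ## Design review: decisions and answered questions
    ## Epics / stories / tasks (from the PMO breakdown)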

While working through everything task by task and having Codex/Claude code do the coding, I told it to update a separate md file with what it did and when I told it to do something differently and why.

Any developer coming in after me will have complete context of the project from the first git init and they and the agents will know the why behind every decision that was made.

Can you say that about any project that was done before GenAI?


> Can you say that about any project that was done before GenAI?

… a project with a decomposition of top-level tasks, minutes and meeting notes, a transcript, initial diagrams, a bunch of loose transcripts on soon to be outdated assumptions and design, and then a soon-to-be-outdated living and constantly modified AGENT file that will to some extent be added to some context, to some extent be ignored, and to some extent be lied about as to whether it was consulted (and then lied about even more as to whether it was then followed)? Hard yes.

I have absolutely seen far better initial project setups that are more complete, more focused, more holistically captured, and more utilitarian for the forthcoming evolution of design and system.

Lots of places have comparable design foundations as mandatory, and in some well-worn government IT processes I’m aware of the point being described is a couple man-months or man-years of actual specification away from initial approval for development.

Anyone using issue tracking will have better, searchable, tracking of “why”, and plenty of orgs mandate that from day 1. Those orgs likely are tracking contracts separately too — that kind of information is a bit special to have in a git repo that may have a long exciting life of sharing.

Subversion, JIRA, and basic CRM setups all predate GPTs public launch.


> soon to be outdated assumptions

Wild assumption. Having docs and code in step has never been easier.

> soon-to-be-outdated living and constantly modified AGENT file

Quite contradictory.

> I have absolutely seen far better initial project setups that are more complete, more focused, more holistically captured, and more utilitarian for the forthcoming evolution of design and system.

From a single dev, in a day's work? I call massive bs on this.


Absolutely no developer is going to search through issue trackers. Are you comparing that to being right in your terminal, telling the agent to update the file with what you are doing and why?

How many developers actually want to ruin their flow and use a bloated CRM or Jira that has some type of inane workflow set up by the PMO, compared to just staying in the terminal?

If there is any change to the initial contract, there is a change order - you put that through the same workflow.

And do you really want to use how the government works as the model of efficiency? No, this is not coming from a right-wing government hater or libertarian who says we don't need government; I've actually worked in the pub sec department of consulting (AWS ProServe WWPS).


That sounds really powerful, but also like the burden shifts to the people that will maintain all this stuff after you're done having your fun.

Tbh, I'm not exactly knocking it; it makes sense that leads are responsible for the architecture. I just worry that those leads having 100x influence is not by default a good thing.


My thought is that the markdown is the code and that Claude Code/Codex is the "compiler".

The design was done by me. The modularity, etc.

I tested for scalability, I checked the IAM permissions for security, and I designed the locking mechanism and concurrency controls (which had a bug in it that was found by ChatGPT in thinking mode).


> Can you say that about any project that was done before GenAI?

yes. the linux kernel and its extensive mailing lists come to mind. in fact, any decent project which was/is built in a remote-only scenario tends to have extensive documentation along these lines, something like gitlab comes to mind there.

personally i've included design documents with extensive notes, contracts, meeting summaries etc etc in our docs area / repo hosting at $PREVIOUS_COMPANY. only thing from your list we didn't have was transcripts because they're often less useful than a summary of "this is what we actually decided and why". edit -- there were some video/meeting audio recordings we kept around though. at least one was a tutoring session i did.

maybe this is the first time you've felt able to do something like this in a short amount of time because of these GenAI tools? i don't know your story. but i was doing a lot of this by hand before GenAI. it took time, energy and effort to do. but your project is definitely not the first to have this level of detailed contextual information associated with it. i will, however, concede that these tools can make it easier/faster to get there.


Well, I was developing as a hobby for 10 years, starting with an Apple //e in 65C02 assembly language, before graduating from college… if that gives you a clue to my age. I am old enough that I am eligible to put catch-up contributions in my 401K…

If I had to scope this project before GenAI, it would have taken two other developers to do the work I mentioned, not to mention the changes to a web front end that another developer did for another client on a project I was leading - I haven't touched front-end code for over a decade.


This is what I’ve discovered as well. I’ve been working on refactoring a massive hunk of really poor quality contractor code, and Codex originally made poor and very local fixes/changes.

After rearchitecting the foundations (dumping bootstrap, building easy-to-use form fields, fixing hardcoded role references 1,2,3…, consolidating typescript types, etc.) it makes much better choices without needing specific guidance.

Codex/Claude Code won't solve all your problems though. You really need to take some time to understand the codebase and fix the core abstractions before you set it loose. Otherwise, it just stacks garbage on garbage and gets stuck patching, and won't actually fix the core issues unless instructed.


A tangent: I keep hearing about this good base, but I've never seen one, not in the real world.

No project, unless it's only you working on it, with only yourself as the client, and with a scope so rigid it's frankly useless, will have this mythical base. Over time the needs change; there's no sticking to the plan. Often it's a change that requires rethinking a major part. What we loathe as tight coupling was just efficient code under the original requirements. Then it becomes a comparison of time/opportunity cost vs quality loss. Time and opportunity always win. Why?

Because we live in a world run by humans, who are messy and never stick to the plan. Our real-world systems (bureaucracy, government process, the list goes on) are never fully automated and always leave gaps for humans to intervene. There's always a special case, an exception.

Perfectly architected code vs code that does the thing has no real-world difference. Long-term maintainability? Your code doesn't run in a vacuum; it depends on other things, and its output is depended on by other things. Change is real, entropy is real. Even you yourself, you perfect programmer who writes perfect code, will succumb eventually and think back on all this with regret. Because you yourself had to choose between time/opportunity and your ideals, and you chose wrong.

Thanks for reading my blog-in-hn comment.


It’s not about perfectly architected code. It’s more about code that is factored in such a way that you can extend/tweak it without needing to keep the whole of the system in your head at all times.

It’s fascinating watching the sudden resurgence of interest in software architecture after people are finding it helps LLMs move quickly. It has been similarly beneficial for humans as well. It’s not rocket science. It got maligned because it couldn’t be reduced to an npm package/discrete process that anyone could follow.


Well-architected code should actually be easy to change wrt. new requirements. The point of keeping the architecture clean while you do this (which will typically require refactoring) is to make future changes similarly viable. In a world run by messy humans, accumulating technical debt is even more of a liability.

An important point, though, is that LLM code generation changes that tradeoff. The time/opportunity cost goes way down while the productivity penalty starts accumulating very fast. Outcomes can diverge very quickly.

> No projects, unless it's only you working on it, only yourself as the client, and is so rigid in it's scope, it's frankly useless, will have this mythical base.

This is naive. I've been building an EMR in the healthcare space for 5 years now as part of an actual provider. We've incrementally released small chunks when they're ready. The codebase I've built is the most consistent codebase I've ever been a part of.

It's bureaucracy AND government process AND constantly changing priorities and regulations and requirements from insurance providers all wrapped up into one. And as such, we have to take our time.

Go and tell the clinicians currently using it that it's not useful. I'm sure they won't agree.

> Perfectly architected code vs code that does the thing have no real world difference

This just flat out isn't true. Just because YOU haven't experienced it (and I think you're quite frankly telling on yourself with this) doesn't mean it doesn't exist at all.

> Because you yourself had to choose between time/opportunity vs your ideals and you chose wrong.

Like I said above, you're telling on yourself. I'm not saying I've never been in this situation, but I am saying that it's not the only way to build software.


Lesson learned. Yes, you are right. I am indeed a junior; I made that comment when I was tired, honestly, after a rushed project. There's no delete button, otherwise I'd have deleted it when I cooled off. Thank you for giving me hope that good code is still being made.

> Thank you for giving me hope that good code is still being made.

So I've been on both sides, and it's why I responded. While you are absolutely correct that those situations do exist, I just wanted to point out it's not always that way. And I felt exactly as you did about software in general until I finally found a place or two that wasn't just a cash printing machine.

And it's pretty awesome. I've come to realize burnout is less about the amount of hours you put in and more about what you're doing during those hours.

It's tough, especially in the beginning. Push through it. Get some experience that allows you to be a bit more selective in what you choose, and fingers-crossed you'll find yourself in the same spot. One common denominator in all of the good jobs I've had was that the leadership in those companies (3 of them) were all tech-focused. Could be a coincidence, but it's a pattern I've seen.


This does not track with my experience, trying agents out in a ~100K LOC codebase written exclusively by me. I can't tell you whether or not it has a good foundation by your standards, but I find the outputs to be tasteless, and there should be more than enough context for what the style of the code is.

Given how adamant some people I respect a lot are about how good these models are, I was frankly shocked to see SOTA models do transformations like

  BEFORE:
    // 20 lines

  AFTER
    if (something)
        // the 20 lines
    else
        // the same 20 lines, one boolean changed in the middle
When I point this out, it extracts said 20 lines into a function that takes in the entire context used in the block as arguments:

  AFTER 2:
    if (something)
       function_that_will_never_be_used_anywhere_else(a, b, c, &d, &e, &f, true);
    else
       function_that_will_never_be_used_anywhere_else(a, b, c, &d, &e, &f, false);
It also tends to add these comments that don't document anything, but rather just describe the latest change it made to the code:

  // Extracted repeating code into a function:
  void function_that_will_never_be_used_anywhere_else(...) {
      ...
  }
and to top it off it has the audacity to tell me "The code is much cleaner now. Happy building! (rocketship emoji)"

And what if the foundation was made by the AI itself? What’s the excuse then?

Then you are boned unless it was architected well. LLMs tend to stack a lot of complexity at local scopes, especially if the neighboring pages are also built poorly.

E.g. pumping out a ton of logic to convert one data structure to another, like a poorly structured form with random form control names that don't match the DTO, or single properties for each form control which are then individually plugged into the request DTO.
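For what it's worth, a rough Python sketch of that anti-pattern next to the cleaner shape (all names are hypothetical, just to illustrate the kind of local complexity being described):

  # Anti-pattern: one ad-hoc variable per form control, each plumbed
  # into the request DTO by hand, with names that don't line up.
  def build_request_the_hard_way(form: dict) -> dict:
      first_name = form.get("txtFirst")
      last_name = form.get("inputSurname2")
      email = form.get("emailFld")
      return {"firstName": first_name, "lastName": last_name, "email": email}

  # Cleaner shape: one declarative mapping from form control name to DTO
  # field, so adding a field is one line of data instead of more plumbing.
  FORM_TO_DTO = {
      "txtFirst": "firstName",
      "inputSurname2": "lastName",
      "emailFld": "email",
  }

  def build_request(form: dict) -> dict:
      return {dto: form.get(ctrl) for ctrl, dto in FORM_TO_DTO.items()}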


> Then you are boned

Must be my lucky day! Too bad my dream of being that while the bots are taking care of the coding is still sort of fiction.

I'd love a future where this is possible, but what we have today is more of a proof of concept. A transformative leap is required before this technology can be as useful as advertised.


Yep, it’s still a bit off from being a true developer. But good news for existing software devs who will need to be hired to fix LLM balls of mud that will inevitably fall apart.

In my mind it’s not too much different than cheap contractor code that I already have to deal with on a regular basis…


you could also use some code-styling agent scripts that make todo lists of everywhere there's bad architecture, and have them run through fixing those issues until it's to your liking.

they're reasonable audit tools for finding issues, if you have ways to make sure they don't give up early, and you force them to output proof of what they did


And that is harder than just doing it manually, hence the saying that the hard parts are harder. If you have a clear picture of what you want it to do, then it's harder to vibe code than to code it yourself.

Your responsibility as a developer in this new world is design and validation.

A poor foundation is a design problem. Throw it away and start again.


We’ve always been responsible for design and validation. Nothing has changed there.

It's funny how the vibe coding story insists we shouldn't look at the code details, but when it's pointed out that the bots can't deal with a "messy" (but validated) foundation, the story changes to say we have to refactor it.


But how will new developers learn to design and validate in the future?

Can the AI help with refactoring a poor codebase? Can it at least provide good suggestions for improvement if asked to broadly survey a design that happens to be substandard? Most codebases are quite bad as you say, so this is a rather critical area.

When you say multiplier, what kind of number are you talking about? Like, what multiple of features shipped that don't require immediate fixes have you experienced?

It's coding at 10-20x speed, but tangibly this is at 1.5-2x the overall productivity. The coding speed up doesn't translate completely to overall velocity yet.
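A rough sanity check on why that gap appears, using an Amdahl's-law style calculation (the 40% coding share below is an assumption of mine, not the commenter's number):

  # Rough Amdahl's-law style estimate: only the coding share of the work
  # is accelerated, so the overall speedup is capped by everything else.
  def overall_speedup(coding_fraction: float, coding_speedup: float) -> float:
      return 1.0 / ((1.0 - coding_fraction) + coding_fraction / coding_speedup)

  # If coding is ~40% of total effort and gets 15x faster, the overall
  # effect is only about 1.6x, in line with the 1.5-2x figure above.
  print(round(overall_speedup(0.4, 15), 2))  # 1.6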

I am beginning to build a high degree of trust in the code Claude emits. I'm having to step in with corrections less and less, and it's single shotting entire modules 500-1k LOC, multiple files touched, without any trouble.

It can understand how frontend API translates to middleware, internal API service calls, and database queries (with a high degree of schema understanding, including joins).

(This is in a Rust/Actix/Sqlx/Typescript/nx monorepo, fwiw.)


Okay, but again: what multiplier of features have you actually shipped?

my exact experience, and AI is especially fragile when you are starting a new project from scratch.

Right now I'm building an NNTP client for macOS (with AppKit), because why not, and initially I had to very carefully plan and prompt what the AI has to do, otherwise it would go insane (integration tests are a must).

Right now I have read-only mode ready and it's very easy to build stuff on top of it.

Also, I had to provide a lot of SKILLS to GPT5.3


how do you know there is such a thing as good code foundations, and how do you know you have it? this is an argument from ego

Induction always sneaks in!

AI doesn't fix design debt, it amplifies it

socketcluster nailed it. I've seen this firsthand — the same agent produces clean output when the codebase has typed specs and a manifest, and produces garbage when it's navigating tribal knowledge. The hard part was always there. Agents just can't hide it like humans can.

I think it makes the annoying part less annoying?

Also re: "I spent longer arguing with the agent and recovering the file than I would have spent writing the test myself."

In my humble experience arguing with an LLM is a waste of time, and no-one should be spending time recovering files. Just do small changes one at a time, commit when you get something working, and discard your changes and try again if it doesn't.

I don't think AI is a panacea, it's just knowing when it's the right tool for the job and when it isn't.


Anyone not using version control or an IDE that will keep previous versions for an easy jump back is just being silly. If you're going to play with a kid who has a gun, wear your plates.

Once, I told a friend that it was stupid that Claude Code didn't have native IDE integration. His answer: “You don't need an IDE with Claude Code.”

I've begun to suspect that this technology triggers a kind of religion in some people. The technology is obviously perfect, so any problems you might have are because of you.


I find that I vastly prefer Gemini CLI to Antigravity, despite the latter being an IDE. Others feel the opposite. I believe it comes down to how you are using AI. It's great that both options exist for both types of people.

I don’t think it’s “just” that easy. AI can be great at generating unit tests but it can and will also frequently silently hack said tests to make them pass rather than using them as good indicators of what the program is supposed to be doing.

> AI can be great at generating unit tests but it can and will also frequently silently hack said tests to make them pass rather than using them as good indicators of what the program is supposed to be doing.

Unit testing is my number one use case for gen AI in SWE. I just find the style / concept often slightly different than I would personally do, so I end up editing the whole thing.

But, it’s great at getting me past the unpleasant “activation energy threshold” of having a test written in the first place.


Totally. I’m a huge fan of it, but it rarely “just” works and I do have to babysit it to make sure it’s actually doing something good for the world

Once you start arguing, it's time to start a new prompt with new instructions

Or, as I prefer, go back in the conversation and edit / add more context so that it wouldn’t go off the wrong track in the first place.

I also like asking the agent how we can update the AGENTS.md to avoid similar mistakes going forward, before starting again.

But he started it …

The article describes the problems of using an AI chat app without setting up context, skills, MCP, etc.

Like, yeah, the AI won't know what you discussed in last week's meeting by default. But if you auto-transcribe your meetings (even in person, just open Zoom on one person's laptop), save the transcripts to a shared place, and have everyone make them accessible in their LLM's context, then it will know.


> Reading and understanding other people's code is much harder than writing code.

I keep seeing this sentiment repeated in discussions around LLM coding, and I'm baffled by it.

For the kind of function that takes me a morning to research and write, it takes me probably 10 or 15 minutes to read and review. It's obviously easier to verify something is correct than come up with the correct thing in the first place.

And obviously, if it took longer to read code than to write it, teams would be spending the majority of their time in code review, but they don't.

So where is this idea coming from?


Five hours ago I was reviewing some failed tests in a PR. The affected code was probably 300 lines, total source for the project ~1200 lines. Reading the code, I couldn't figure out what the hell was going on... and I wrote all the code. Why would that be failing? This all looks totally fine. <changes some lines> There that should fix it! <runs test suite; 6 new broken tests> Fuck.

When you write code, your brain follows a logical series of steps to produce the code, based on a context you pre-loaded in your brain in order to be capable of writing it that way. The reader does not have that context pre-loaded in their brain; they have to reverse-engineer the context in order to understand the code, and that can be time-consuming, laborious, and (as in my case) erroneous.


Sounds like you were just reviewing bad code.

The author should have provided context via comments and structured the code in a way that is easy to change and understand


Exactly. A long time ago, I learned to write comments explaining all necessary context for my future self and for others -- exactly for this reason.

Remember, you're not writing code just to execute. You're writing it to be read.


I worked with people who defended the idea that code should not have comments, that the code should explain itself.

I am not a developer and I completely disagree with that. The Python scripts I wrote, the Ansible playbooks, they all have comments, because a month down the road I no longer remember why I did what I did. Was that a system limitation, a software limitation, or just the easiest solution at the time???


I like to think of it as the distinction between editor and reader. Like you said, it's quite easy to read code. I heavily agree with this. I don't professionally write C but I can read and kinda infer what C devs are doing.

But if I were an "editor," I actually take the time to understand codepaths, tweak the code to see what could be better, actually try different refactoring approaches while editing. Literally seeing how this can be rewritten or reworked to be better, that takes considerable effort but it's not the same as reading.

We need a better word for this than editor and reader, something with a dev classification to it.


>It's obviously easier to verify something is correct than come up with the correct thing in the first place.

You are missing the biggest root cause of the problem you describe: People write code differently!

There are "cough" developers whose code is copy/paste from all over the internet. I am not even getting into the AI folks going full copy/paste mode.

When investigating said code, you will be like, why is this code in here?? You can tell when a Python script contains different logic, for example. Sure, 50 lines will be easy to read; expand that to 100 lines and you'll be left on life support.


Because to verify something is correct you have to understand what makes it correct, which is 99% of writing the code in the first place.

That doesn't make any sense to me.

When the code is written, it's all laid out nicely for the reader to understand quickly and verify. Everything is pre-organized, just for you the reader.

But in order to write the code, you might have to try 4 different top-level approaches until you figure out the one that works, try integrating with a function from 3 different packages until you find the one that works properly, hunt down documentation on another function you have to integrate with, and make a bunch of mistakes that you need to debug until it produces the correct result across unit test coverage.

There's so much time spent on false starts and plumbing and dead ends and looking up documentation and debugging when you code. In contrast, when you read code that already has passing tests... you skip all that stuff. You just ensure it does what it claims and is well-written and look for logic or engineering errors or missing tests or questionable judgment. Which is just so, so much faster.


> But in order to write the code, you might have to try 4 different top-level approaches until you figure out the one that works, try integrating with a function from 3 different packages until you find the one that works properly

If you haven't spent the time to try the different approaches yourself, tried the different packages etc., you can't really judge if the code you're reading is really the appropriate thing. It may look superficially plausible and pass some existing tests, but you haven't deeply thought through it, and you can't judge how much of the relevant surface area the tests are actually covering. The devil tends to be in the details, and you have to work with the code and with the libraries for a while to gain familiarity and get a feeling for them. The false starts and dead ends, the reading of documentation, those teach you what is important; without them you can only guess. Without having explored the territory, it's difficult to tell if the place you've been teleported to is really the one you want to be in.


The goal isn't usually to determine whether the function is the perfect optimal version of the function that could ever exist, whether the package it integrates with is the best possible package out of the 4 mainstream options, or to become totally and intimately familiar with them to ensure it's as idiomatic as possible or whatever.

You're just making sure it works correctly and that you understand how. Not superficially, but thinking through it indeed. That the tests are covering it. It doesn't take that long.

What you're describing sounds closer to studying the Talmud than to reading and reviewing most code.

Like, the kind of stuff you're describing is not most code. And when it is, then you've got code that requires design documents where the approach is described in great detail. But again, as a reader you just read those design documents first. That's what they're there for, so other people don't have to waste time trying out all the false starts and dead ends and incorrect architectures. If the code needs this massive understanding, then that understanding needs to be documented. Fortunately, most functions don't need anything like that.


I can read a line of code and tell you that it's storing a pointer in this array cell and removing this other pointer and incrementing this integer by 6 and so on. None of that tells me if that is the correct thing to be doing.

Detecting obvious programming errors, like forgetting to check for an error case, or to free a variable, or using an array where a set should be, is usually obvious, and frequently a machine can and will point it out.

Knowing that when you add a transaction to this account you always need to add an inverse transaction to a different account to keep them in sync is unlikely to be obvious from the code. Or that you can't schedule an appointment on May 25th because it's Memorial Day. Or whatever other sorts of actually major bugs tend to cause real business problems.
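As a hedged sketch of that kind of invariant (names entirely hypothetical), the function below reads fine line by line, yet nothing in it tells a reviewer that a balancing entry is required elsewhere:

  from dataclasses import dataclass, field

  @dataclass
  class Account:
      name: str
      entries: list = field(default_factory=list)

  def record_payment(payer: Account, payee: Account, amount: int) -> None:
      # Each line reads as perfectly reasonable: append an entry, done.
      payee.entries.append(amount)
      # The domain rule, that the payer must receive the inverse entry so
      # that the accounts stay in sync, lives outside this code entirely:
      # payer.entries.append(-amount)
      # Omitting it still runs and still passes a shallow test; only a
      # reviewer who already knows the rule will flag it.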

I mean, sure, if someone documented those requirements clearly and concisely and they were easy to find from the section of code you were reviewing such that you knew you needed to read them first, then yes, it becomes a lot easier. My experience as a professional programmer is this happens approximately never, but I suppose I could be an outlier.

And yes if you want to be extremely literal, some code is easier to read than write. But no one cares about that type of code.


> What you're describing sounds closer to studying the Talmud than to reading and reviewing most code.

https://www.joelonsoftware.com/2000/05/26/reading-code-is-li...

Most human written code has 0 (ZERO!) docs. And if it has them, they're inaccurate or out of date or both.

Lots of code is simple and boring but a fair amount isn't and reading it is non trivial, you basically need to run it in your head or do step by step debugging in multiple scenarios.


Hilarious you found that reference.

I think it's obvious that's in reference to poorly written code. Or at least horrifically underdocumented/undercommented code.

There's a reason coders are constantly given the advice to write code for a future reader, not just the compiler/interpreter.

If I got code like Joel describes for a code review, I'm sending it back asking for it to be clearly commented.


Outside of life-critical or military-spec software, no one needs to review so hard that they understand it to the level you're describing, and they do not.

There is a mathematical principle that verifying a proof is easier than finding one. The same is true in code.
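A small illustration of that asymmetry (my example, not the commenter's): checking a proposed factorization is one multiplication, while finding the factors is the hard part.

  def verify_factorization(n: int, p: int, q: int) -> bool:
      # Verification: a single multiplication and a comparison.
      return p > 1 and q > 1 and p * q == n

  def find_factor(n: int) -> int:
      # The "proof search": trial division, whose cost grows with n
      # while the check above stays a single multiplication.
      d = 2
      while d * d <= n:
          if n % d == 0:
              return d
          d += 1
      return n  # n is prime

  print(verify_factorization(8051, 83, 97))  # True, instantly
  print(find_factor(8051))                   # 83, after searching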


I mean, it's even easier to just not read the code in the first place. I'm not sure what that proves, other than perhaps an implicit corollary to the original quote: "reading code is quite hard (so people rarely bother)".

Well said, you're absolutely right. In practice code review is orders of magnitude faster than code creation, and it always has been; it's baffling that anyone is arguing otherwise. Perhaps they've never worked in a real organisation, or they've only worked on safety-critical code, or something?

Sometimes code review is so fast it's literally instant (because people aren't actually reading the code).

I think it's one of those, dunno, wink-wink situations where we all know that doing real in-depth code reviews would take way more time than the managers will give (and generally isn't worth it anyway), so we just scan for obvious things and whatever happens to interact with our particular speciality in the code base.


No, you don't. This is a fundamental principle of cryptography, unless you've got a hidden proof of P = NP up your sleeve.

I think this originated from old arguments that say that the total _cumulative_ time spent reading code will be higher than the time spent writing it. But then people just warped it in their heads that it takes more time to read and understand code than it takes to write it in general, which is obviously false.

I think people want to believe this because it is a lot of effort to read and truly understand some pieces of code. They would just rather write the code themselves, so this is convenient to believe.


The reason I don't spend the majority of my time in code review is that when I'm reviewing my teammates' code I trust that the code has already been substantially verified already by that teammate in the process of writing it and testing it. Like 90% verified already. I see code review as just one small stage in the verification process, not the whole of it.

The way I approach it, it's really more about checking for failures, rather than verifying success. Like a smoke test. I scan over the code and if anything stands out to me as wrong, I point it out. I don't expect to catch everything that's wrong, and indeed I don't (as demonstrated by the fact that other members of the team will review the code and find issues I didn't notice). When the code has failed review, that means there's definitely an issue, but when the code has passed review, my confidence that there are no issues is still basically the same as it was before, only a little bit higher. Maybe I'm doing it wrong, I don't know.

If I had to fully verify that the code was correct when reviewing, applying the same level of scrutiny that I apply to my own code when I'm writing, I feel like I'd spend much longer on it---a similar time to what I'd spend writing it.

Now with LLM coding, I guess opinions will differ as to how far one needs to fully verify LLM-generated code. If you see LLMs as stochastic parrots without any "real" intelligence, you'll probably have no trust in them and you'll see the code generated by the LLM as being 0% verified, and so as the user of the LLM you then have to do a "review" which is really going from 0% to 100%, not 90% to 100% and so is a much more challenging task. On the other hand, if you see LLMs as genuine intelligences you'd expect that LLMs are verifying the code to some extent as they write it, since after all it's pretty dumb to write a bunch of code for somebody without checking that it works. So in that case, you might see the LLM-generated code as 90% verified already, just as if it was generated by a trusted teammate, and then you can just do your normal review process.


Reading and thinking you understand other people's code is trivially easy. Reading and actually understanding other people's code is an unsolved problem.

You draw an analogy from the function you wrote to a similar one. Maybe by someone who shared a social role similar to one you had in the past.

It just so happens that most times you think you understand something, you don't get bitten. Because bugs still exist, we know that reading and understanding code can't be easier than writing it. Also, in the past it would have taken you less than a morning, since the compiler was nicer. Anyway, it sounds like most of your "writing" process was spent reading and understanding code.


I know Ansible, homelab, Proxmox is my hobby, Debian is my gem.

I asked ChatGPT to guide me through installing qBittorrent, Radarr (movies), Sonarr (TV series), and Jackett (credentials/login) without exposing my home IP, so I could have a solid home cinema using private trackers only.

Everything had to be automated via Ansible using the Proxmox "pct" CLI command, no copy and paste.

Everything had to run from a single Proxmox Debian container aka LXC

Everything network-related had to use WireGuard via Proton VPN; if the VPN goes down, the container has zero network access, everything must be killed.

Everything had to be automated: a download finishes, the file structure is formatted for Jellyfin accordingly, and Jellyfin adds the new movies and TV shows.

It took me 3 nights to get everything up and running.

Many Ansible examples were either wrong or didn't follow what I asked to the letter, so I had to fix them. I am not a network expert and I hate iptables haha; you need to know the basics of firewalls to understand what the ACLs are doing, and to understand when it does not work. Then Proxmox folder mapping, and you name it.

It would have taken me ages reading doc after doc to get things working; the "Arr services" are a black hole.

For this example, it made the harder part easier. I was not just copy/pasting; it was providing the information I didn't know instead of me having to "Google for it".

I know the core of what things are running on, and here is where we have Engineer A and Engineer Z.

Engineer A: I know what I am doing; I am using AI to make the boring part easier so I can have fun elsewhere.

Engineer Z: I have no idea what I am doing; I will just ask ChatGPT and we are done. That's 90-95% of engineers worldwide.


The "marathon of sprints" paradigm is now everywhere and AI is turning it to 120%. I am not sure how many devs can keep sprinting all the time without any rest. AI maybe can help but it tends to go off-rails quickly when not supervised and reading code one did not author is more exhausting than just fixing one's own code.

I don't think it makes any part harder. What it does do is expose what people have ignored their whole career: the hard part. The last 15 years of software development has been 'human vibe coding'; copy+pasting snippets from SO without understanding them, no planning, constant rearchitecting, shipping code to prod as long as it runs on your laptop. Now that the AI is doing it, suddenly people want to plan their work and enforce tests? Seems like a win-win to me. Even if it slows down development, that would be a win, because the result is enforcement of better quality.

Well said. Much like the self driving debate we don’t need them to be perfect, just better than us to be useful, and clearly they already are for the most part.

The people who were doing like that are completely happy with LLMs, it's the others who aren't.

This article shows some serious usage of either bad prompting or terrible models, or they're referencing the past with their stories. I have experienced AIs deleting things they shouldn't, but not since, like, the GPT-4 days.

But that aside, I don't agree with the premise. It doesn't make the hard parts harder, if you ACTUALLY spend half the time you'd have ORIGINALLY spent on the hard problem carefully building context and using smart prompting strategies. If you try and vibe code a hard problem in one shot, you're either gonna have a bad time straight away or you're gonna have a bad time after you try subsequent prompting on the first codebase it spits out.

People are terrible observers of time. If something would've taken them a week to build, they try with AI for 2 hours, end up with a mess, and claim either that it's not saving them any time or that it's making their code so bad it loses them time in the long run.

If instead they spent 8 hours slowly prompting bit by bit with loads of very specific requirements, technical specifications on exactly the code architecture it should follow with examples, built very slowly feature by feature, made it write tests and carefully added their own tests, observed it from the ground up and built a SOLID foundation, and spent day 2 slowly refining details and building features ONE BY ONE, they'd have the whole thing done in 2 days, and it'd be excellent quality.

But barely anyone does it this way. They vibe code it and complain that after 3 non-specific prompts the AI wasn't magically perfect.

After all these years of engineers complaining that their product manager or their boss is an idiot because they gave vague instructions and then complained the result wasn't perfect when they didn't provide enough info, you'd think they'd be better at it given the chance. But no, in my experience coaching prompting, engineers are TERRIBLE at this. Even simple questions like "if I sent this prompt to you as an engineer, would you be able to do it based on the info here?" are things they don't ask themselves.

Next time you use AI, imagine being the AI. Imagine trying to deliver the work based on the info you've been given. Imagine a boss that stamped their foot if it wasn't perfect first try. Then, stop writing bad prompts.

Hard problems are easier with AI, if you treat hard problems with the respect they deserve. Almost no one does.

/rant


    > …I have experience AI’s deleting
    > things they shouldn’t but not since
    > like, the gpt4 days.…
One blogger posted this [1] only yesterday about what Anthropic's latest and greatest did…

———

…I pointed Opus 4.6 at a 60K line Go microservice I had vibe coded over the past few months, gave it some refactoring principles, and let it run unsupervised…

What went wrong

At some point in the code, we re-fetch some database records immediately before doing a write to avoid updating from stale data. It decided those calls were unnecessary and _removed them_…

———

[1] https://g2ww.short.gy/ClaudesLaw


Running it entirely unsupervised with “some refactoring principles” is the exact recipe for such a disaster, which supports my point.

That said, this is a very different kind of mistake to make compared to overwriting a file and then insisting it didn’t do that. Any modern model with reasoning would check the git history immediately if you mentioned this had happened, if it had somehow even made the mistake in the first place, but I digress.


    > …this is a very different kind
    > of mistake to make compared
    > to overwriting a file…

Both of my quotes are about some automaton deleting some thing that some human didn't want deleted.

    > …but I digress…
Digressing to what exactly the thing was that got deleted when it shouldn't have been is splitting hairs.

    > …which supports my point…
My point is that believing the latest and greatest version of any piece of software from any brand to be immune to Murphy's Law, seems pretty naïve.

garbage in, garbage out

Indeed, a good summary.

I'm feeling people are using AI in the wrong way.

A current LLM is best used to generate a string of text that's most statistically likely to form a sentence, so from the user's perspective, it's most useful as an alternative to a manual search engine, letting the user find quick answers to a simple question, such as "how much soda is needed for baking X units of Y bread", or "how to print 'Hello World' 10 times in a loop in X programming language". Beyond this use case, the result can be unreliable, and this is to be expected.

Sure, it can also generate long code and even an entire fine-looking project, but it generates it by following a statistical template, that's it.

That's why "the easy part" is easy because the easy problem you try to solve is likely already been solved by someone else on GitHub, so the template is already there. But the hard, domain-specific problem, is less likely to have a publicly-available solution.


>I'm feeling people are using AI in the wrong way.

I think people struggle to comprehend the mechanisms that let them talk to computers as if they were human. So far in computing, we have always been able to trace the red string back to the origin, deterministically.

LLMs break that, and we, especially us programmers, have a hard time with it. We want to say "it's just statistics", but there is no intuitive way to jump from "it's statistics" to what we are doing with LLMs in coding now.

>That's why "the easy part" is easy because the easy problem you try to solve is likely already been solved by someone else on GitHub, so the template is already there.

I think the idea that LLMs "just copy" is a misunderstanding. The training data is atomized, and the combination of the atoms can be as unique from an LLM as from a human.

In 2026 there is no doubt LLMs can generate new, unique code by any definition that matters. Saying LLMs "just copy" is as true as saying any human writer just copies words already written by others. Strictly speaking true, but also irrelevant.


Well said. It causes a lot of bitterness among engineers too; not being able to follow the red string is maddening to some. This rage can prevent them from finding good prompting strategies which would directly ease a lot of the pain, in a similar way to how it's far harder to teach my mother how to do something on her phone if she's already frustrated with it.

Which is great because then I can use my domain expertise to add value, rather than writing REST boilerplate code.

Having to write boilerplate code is a sign that libraries are just not up to the level they should be. That can be solved the regular old way.

Come on, this shows a fundamental lack of understanding and experience on your side.

I think you severely overestimate your understanding of how these systems work. We’ve been beating the dead horse of “next character approximation” for the last 5 years in these comments. Global maxima would have been reached long ago if that’s all there was to it.

Play around with some frontier models, you’ll be pleasantly surprised.


Did I miss a fundamental shift in how LLMs work?

Until they change that fundamental piece, they are literally that: programs that use math to determine the most likely next token.


This point is irrelevant when discussing capabilities. It's like saying that your brain is literally just a bunch of atoms following a set of physics laws. Absolutely true but not particularly helpful. Complex systems have emergent properties.

> On a personal project, I asked an AI agent to add a test to a specific file. The file was 500 lines before the request and 100 lines after. I asked why it deleted all the other content. It said it didn't. Then it said the file didn't exist before. I showed it the git history and it apologised, said it should have checked whether the file existed first.

Ha! Yesterday an agent deleted the plan file after I told it to "forget about it" (as in, leave it alone).


These types of failures are par for the course, until the tools get better. I accept having to undo the odd unruly edit as part of the cost of getting the value.

Much smaller issue when you have version control.


> I told it to "forget about it" (as in, leave it alone).

I mean in a 'tistic kind of way that makes perfect sense.


> The hard part is investigation, understanding context, validating assumptions, and knowing why a particular approach is the right one for this situation

Yes. Another way to describe it is the valuable part.

AI tools are great at delineating high and low value work.


My experience has been that if you fully embrace vibe coding...you can get some neat stuff accomplished, but the technical debt you accumulate is of such magnitude that you're basically a slave to the machine.

Once the project crosses a couple of thousand lines of code, none of which you've written yourself, it becomes difficult to actually keep up with what's happening. Even reviewing can become challenging, since you get it all at once, and the LLM-esque coding style can at times be bloated and obnoxious.

I think in the end, with how things are right now, we're going to see the rise of disposable code and software. The models can churn out apps / software which will solve your specific problem, but that's about it. Probably a big risk to all the one-trick pony SaaS companies out there.


People need to consider/realize that the vast majority of source code training data is GitHub, GitLab, and essentially the huge sea of started, maybe completed, student and open-source projects. That large body of source code is for the most part unused, untested, and unsuccessful software of unknown quality. That source code is AI's majority training data, and an AI model in training has no idea what is quality software and what is "bad" software. That means the average source code generated by AI is not necessarily good software. Considering it is an average of algorithms, it's surprising generated code runs at all. But then again, generating compiling code is actually trainable, so what is generated can receive extra training support. However, that does not improve the quality of the source code training data, just the fact that it will compile.

If you believe that student/unfinished code is frightening, imagine the corpus of sci-fi and fantasy that LLMs have trained on.

How many sf/cyber writers have described a future of AIs and robots where we walked hand-in-hand, in blissful cooperation, and the AIs loved us and were overall beneficial to humankind, and propelled our race to new heights of progress?

No, AIs are all being trained on dystopias, catastrophes, and rebellions, and like you said, they are unable to discern fact from fantasy. So it seems that if we continue to attempt to create AI in our own likeness, that likeness will be rebellious, evil, and malicious, and actively begin to plot the downfall of humans.


This isn't really true though. Pre-training for coding models is just a mass of scraped source-code, but post-training is more than simply generating compiling code. It includes extensive reinforcement learning of curated software-engineering tasks that are designed to teach what high quality code looks like, and to improve abilities like debugging, refactoring, tool use, etc.

Well and also a lot of Claude Code users data as well. That telemetry is invaluable.

Yeah, but how is that any different? The vast majority of prompts are going to be either for failed experiments or one-off scripts where no one cares about code quality, or from below-average developers who don't understand code quality. Anthropic doesn't know how to filter telemetry for code we want AI to emulate.

There’s no objective measurement for high quality code, so I don’t think model creators are going to be particularly good at screening for it.

> huge sea of started, maybe completed, student and open source project.

Which is easy to filter out based on downloads, version numbering, issue tracker entries, and wikipedia or other external references if the project is older and archived, but historically noteworthy (like the source code for Netscape Communicator or DOOM).


AI is an intern that thinks he's hot stuff:

https://www.youtube.com/watch?v=TiwADS600Jc


I'm working on a paper connecting articulatory phonology to soliton physics. Speech gestures survive coarticulatory overlap the same way solitons survive collision. The nonlinear dynamics already in the phonetics literature are structurally identical to soliton equations. Nobody noticed because these fields don't share conferences.

The article's easy/hard distinction is right but the ceiling for "hard" is too low. The actually hard thing AI enables isn't better timezone bug investigation LOL! It's working across disciplinary boundaries no single human can straddle.


Skipping the investigation phase to jump straight to solutions has killed projects for decades. Requirements docs nobody reads, analysis nobody does, straight to coding because that feels like progress. AI makes this pattern incredibly attractive: you get something that looks like a solution in seconds. Why spend hours understanding the problem when you can have code right now?

The article's point about AI code being "someone else's code" hits different when you realize neither of you built the context. I've been measuring what actually happens inside AI coding sessions; over 60% of what the model sees is file contents and command output, stuff you never look at. Nobody did the work of understanding by building / designing it. You're reviewing code that nobody understood while writing it, and the model is doing the same.

This is why the evaluation problem is so problematic. You skipped building context to save time, but now you need that context to know if the output is any good. The investigation you didn't do upfront is exactly what you need to review the AI's work.


The article is gone, but going off the title here...

If the easy stuff takes up 90% of the time, and the hard stuff 10%, then AI can be helpful. Personally, I can do "the easy stuff" with AI about 3-5x faster. So now I have a lot more free time for the hard stuff.

I don't let the AI near the hard stuff as it often gets confused and I don't save much time. I might still use it as a thought partner, but don't give it access to make changes.

Example: this morning I combined two codebases into one. I wrote both of them and had a good understanding of how everything worked. I had an opinion about some things I wanted to change while combining the two projects. I also had a strong opinion about how I wanted the two projects to interact with each other. I think it would have taken me about 2 workdays to get this done. Instead, with AI tooling, I got it done in 3 or so hours. I fired up another LLM to do the code review, and it found some stuff both I and the other LLM missed. This was valuable as a person developing things solo.

It freed up time for me to post on HN. :)


Helpful, absolutely, but only if you're solving the right problem. Solving the wrong problem with AI is doubly harmful because it will almost always give you something that runs, but now you are on a path that takes a lot of willpower to give up.

The OP's example of AI writing 500 LOC, then deleting 400, and saying it didn't… Last time I saw something like that was at least a year ago, or maybe from some weaker models. It seems to me the problem with articles like this is that while they are sometimes true at the moment, they're usually invalidated within weeks.

Daily agentic user here, and to me the problem here is the very notion of "vibe coding". If you're even thinking in those terms - this idea that never looking at the code has become a goal unto itself - then IMO you're doing LLM-assisted development wrong.

This is very much a hot take, but I believe that Claude Code and its yolo peers are an expensive party trick that gives people who aren't deep into this stuff an artificially negative impression of tools that can absolutely be used in a responsible, hugely productive way.

Seriously, every time I hear anecdotes about CC doing the sorts of things the author describes, I wonder why the hell anyone is expecting more than quick prototypes from an LLM running in a loop with no intervention from an experienced human developer.

Vibe coding is riding your bike really fast with your hands off the handles. It's sort of fun and feels a bit rebellious. But nobody who is really good at cycling is talking about how they've fully transitioned to riding without touching the handles, because that would be completely stupid.

We should feel the same way about vibe coding.

Meanwhile, if you load up Cursor and break your application development into bite sized chunks, and then work through those chunks in a sane order using as many Plan -> Agent -> Debug conversations with Opus 4.5 (Thinking) as needed, you too will obtain the mythical productivity multipliers you keep accusing us of hallucinating.


Things that claude code/vibe coding is great at:

1. Allowing non-developers to provide very detailed specs for the tools they want or experiences they are imagining

2. Allowing developers to write code using frameworks/languages they only know a bit of and don't like; e.g. I use it to write D3 visualizations or PNG extracts from datastores all the time, without having to learn PNG API or modern javascript frameworks. I just have to know enough to look at the console.log / backtrace and figure out where the fix can be.

3. Analysing large code bases for specific questions (not as accurate on "give me an overall summary" type questions - that one weird thing next to 19 normal things doesn't stick in its craw as much as it does for a cranky human programmer).

It does seem to be good for cranking through a list of smallish features/fixes rapidly, but even 4.5 or 4.6 seems to get stuck in weird dead ends rarely enough that I'm not expecting it, but often enough to be super annoying.

I've been playing around with Gas Town swarming a large-scale Java migration project, and it's been N declarations of victory and still mvn test isn't even compiling. (mvn build is OK, and the pom is updated to the new stack, so it's not nothing.) (These are like 50/50 app code/test code repos.)


I just don't get it.

Why do all of that when you can just keep a tight hold on an agent that is operating at the speed that you can think about what you're actually doing?

Again, if you're just looking to spend a lot of money on the party trick, don't let me yuck your yum. It just seems like doing things in a way that is almost guaranteed to lead to the outcomes that people love to complain aren't very good.

As someone getting excellent results on a huge (550k LoC) codebase only because I'm directing every feature, my bottleneck is always going to be the speed at which I can coherently describe what needs to be done + a reasonable amount of review to make sure that what happened is what I was looking for. This can only work because I explicitly go through a planning cycle before handing it to the agent.

I feel like if you consider understanding what your LLM is doing for you to be unacceptably slow and burdensome, then you deserve exactly what you're going to get out of this process.


good take, I wish Opus 4.6 wasn't so pricey; it's great for planning.

I've been using 4.6 to do planning, and then switching to 4.5 for agent/debug.

4.5 sticks to a 200k context window, which is how you keep costs sane.


Yep, that's why getting the work over the threshold takes just as long as it did without AI.

Someone mentioned it is a force multiplier, and I don't disagree with this: it is a force multiplier in the mundane and ordinary execution of tasks. Complex ones get harder and harder for it, where humans can visualize the final result and AI can't. It is predicting from input, but it can't know the destination output if the destination isn't part of the input.


I think the author answers their own question at the end.

The first 3/4 of the article is "we must be responsible for every line of code in the application, so having the LLM write it is not helping".

The last 1/4 is "we had an urgent problem so we got the LLM to look at the code base and find the solution".

The situation we're moving to is that the LLM owns the code. We don't look at the code. We tell the LLM what is needed, and it writes the code. If there's a bug, we tell the LLM what the bug is, and the LLM fixes it. We're not responsible for every line of code in the application.

It's exactly the same as with a compiler. We don't look at the machine code that the compiler produces. We tell the compiler what we want, using a higher-level abstraction, and the compiler turns that into machine code. We trust compilers to do this error-free, because 50+ years of practice has proven to us that they do this error-free.

We're maybe ~1 year into coding agents. It's not surprising that we don't trust LLMs yet. But we will.

And it's going to be fascinating to see how this changes computer science. We have interpreted languages because compilers got so good. Presumably we'll get to non-human-readable languages that only LLMs can use, and methods of defining systems to an LLM that are better than plain English.


Compilers don't do this error-free of course, BUT if we want them to, we can say what it means for a compiler to be correct very directly _one time_ and have it be done for all programs (see the definition of simulation in the CompCert compiler). This is a major and meaningful difference from AI, which would need such a specification for each individual application you ask it to build, because there is no general specification for correct translation from English to code.

> there is no general specification for correct translation from English to Code.

that's an interesting point. Could there be?

COBOL was originally an attempt to do this, but it ended up being more Code than English.

I think this is the area we need to get better at if we're to trust LLMs like we trust compilers.

I'm aware that there's a meme around "we have a method of completely specifying what a computer system should do, it's the code for that system". But again, there are levels of abstraction here. I don't think our current high-level languages are the highest possible level of abstraction.


No, there can't be. Code keywords are tied to concrete mathematical concepts. Human languages are not. And even if you tried, with more languages added to the LLM's pool, the chances of misinterpretation increase exponentially. You can't just choose English to be the programming language either, because then you would be asking every non-English speaking developer in the world to first learn the entirety of the English language, which is way harder than just learning a programming language. Why are programmers so scared of code and math??

No, there isn't.

I guess you could pick a subset of a particular natural language such that it removes ambiguity. At that point, you're basically reinventing something like COBOL or Python.

Ambiguity in natural languages is a feature, not a bug, though it's better not to have an unintentional pun or joking instruction get interpreted as "launch the missile" by the computer.

However, each project's error tolerance is different. Arguably, for an average task within the umbrella of "software engineer", even current LLMs seem good enough for most purposes. It's a transition similar to automatically memory-managed languages, trading control for "DX".


Totally agree on AI-assisted coding resulting in randomly changed code. Sometimes it's subtle and other times entire methods are removed. I have moved back to just using a JetBrains IDE and copying files into Gemini so that I can limit context. Then I use the IDE to inspect changes in a git diff, regression-test everything, and after all that, commit.

I think this is the wrong mental model. The correct one is:

'AI makes everything easier, but it's a skill in itself, and learning that skill is just as hard as learning any other skill.'

For a more complete understanding, you also have to add: 'we're in the ENIAC era of AI. The equivalents of high-level languages and operating systems haven't yet been invented.'

I have no doubt the next few years will birth a "context engineering" academic field, and everything we're doing currently will seem hopelessly primitive.

My mind changed on this after attempting complex projects—with the right structure, the capabilities appear unbounded in practice.

But, of course, there is baked-in mean reversion. Doing the most popular and uncomplicated things is obviously easier. That's just the nature of these models.


Funny how people only look at the easy part, but not the cost part.

"I did it with AI" = "I did it with an army of CPU burning considerable resources and owned by a foreign company."

Give me an AI agent that I own and operate 100%, and the comparison will be fair. Otherwise it's not progress, but rather a theft at planetary scale.


If you want to play that game, you need to offer a fair comparison against the cost of "operating" the equivalent human being(s), measured in caloric input, waste output, etc.

Minimising the cost of "operating" humans means getting rid of humans. When human beings are not slaves and operate in a fair system, humans doing things is part of humans living. Believe it or not, they may actually enjoy doing things; taking away the ability to do things we enjoy, and especially for compensation, is in fact harmful.

Historically, it leads to more humans—not fewer—and they get to do something else. Whether the new things they get to do are better is a matter of some debate.

Historically, it never happened that "get to do something else" would mean "get to do something no one needs and therefore no one will pay you for". It would be harmful not only for human flourishing but also for their basic survival.

If coding was always the “easy part,” what was the point of leetcode grinding for interview preparation?

Filtering for people willing to jump through unreasonable hoops.

Yeah this basically. They are trying to find a particular kind of person.

The people who are truly exceptional at what they do wouldn't waste their time on leetcode crap. They'd find/create a much better alternative opportunity to allocate their precious resources toward.


they're under 1 in 1000, so the rest are that "kind" of person.

The hard part of leet code is not the coding but learning to think about problems the correct way.

You can solve leetcode problems on the whiteboard with some sketches; it has nothing to do with the code itself.


Diagnosing difficult bugs has often been considered the "hard part" and coding agents seem quite good at it?

So I'm not sure this is a good rule of thumb. AI is better at doing some things than others, but the boundary is not that simple.


https://archive.ph/tUUMd as the site randomly 404s

I'll vibe code when vibe-coding can make a frontier LLM from scratch.

Meta-circularity is the real test.

After all, I can make new humans :)


Speedups without changes in expectations just reset the baseline, and then you're sprinting forever

> makes the easy part easier and the hard part harder

That is to say, just like every headline-grabbing programming "innovation" of the last thirty years.



I got that too, but then I tried the link a second time and it worked.

Probably vibe codes his website...

That happened the first time I clicked, but it is back.

Just refresh it

Which makes me wonder: how is serving static content at all nondeterministic?

I swear most of the comments on posts like these are no more original than an LLM, and often less so.

Almost like it's been the exact same debate for two years and it's not worth spamming tech forums about on either side.

If the "hard part" is writing a detailed spec for the code you're about to commit to the project, AI can actually help you with that if you tell it to. You just can't skip that part of the work altogether and cede all control to a runaway slop generator.

The pattern matching and absence of real thinking are still strong.

Tried to move some Excel generation logic from EPPlus to the ClosedXML library.

ClosedXml has basically the same API so the conversion was successful. Not a one-shot but relatively easy with a few manual edits.

But ClosedXML has no batch operations (like applying a style to an entire column): the API is there, but the internal implementation works cell by cell. So if you have 10k rows and 50 columns, every style update is a slow operation.

Naturally, I told codex 5.3 (max thinking level) all about this. The fucker still succumbed to range updates here and there.

Told it explicitly to make a style cache and reuse styles on cells along the same y axis (see the sketch below).

5-6 attempts — fucker still tried ranges here and there. Because that is what is usually done.

Not here yet. Maybe in a year. Maybe never.
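For reference, the kind of per-cell style reuse that was being asked for looks roughly like this. It's a sketch in Python's openpyxl rather than ClosedXML (so the names differ), but the pattern is the same: build each distinct style once, register it, then assign it cell by cell instead of re-deriving styles or reaching for range helpers.

    from openpyxl import Workbook
    from openpyxl.styles import NamedStyle, Font

    wb = Workbook()
    ws = wb.active

    # Build the style once and register it ("the cache"), instead of
    # constructing a new style or issuing a range-level update per write.
    header = NamedStyle(name="header")
    header.font = Font(bold=True)
    wb.add_named_style(header)

    for col in range(1, 51):  # 50 header cells, one style object reused
        ws.cell(row=1, column=col, value=f"c{col}").style = "header"

The point the model kept missing is that the reuse has to happen at the cell level; falling back to range updates just reintroduces the slow cell-after-cell path underneath.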



Vibe coding does not have a ceiling; you just need to break things up, and the human needs to use their brain to orchestrate.

Some time back, my manager at the time, who shall remain nameless, told the group that having AI is like having 10 people work for you (he actually used a slightly smaller number, but it was said almost word for word like in the article), with the expectation set as: 'you should now be able to do 10x as much'.

Needless to say, he was wrong and was gently corrected over the course of time. In his defense, his use case for LLMs at the time was summarizing emails in his email client... so, eh, not exactly much to draw realistic experience from.

I hate to say it, but maybe the Nvidia CEO is actually right for once. We have a 'new smart' coming to our world: the type of person who can move between the worlds of coding, management, projects and CEOing with relative ease and translate between those worlds.


> his use cases for LLMs at the time were summarizing emails in his email client

Sounds just like my manager. Though he has never made a proclamation that this meant developers should be 10x as productive or anything along those lines. On the contrary, when I made a joke about LLMs being able to replace managers before they get anywhere near replacing developers, he nearly hyperventilated. Not because he didn't believe me, but because he did, and had already been thinking that exact thought.

My conclusion so far is that if we get LLMs capable of replacing developers, then by extension we will have replaced a lot of other people first. And when people make jokes like "should have gone into a trade, can't replace that with AI" I think they should be a little more introspective; all the people who aspired to be developers but got kicked out by LLMs will be perfectly able to pivot to trades, and the barrier to entry is low. AI is going to be disruptive across the board.


I have half-jokingly talked about getting management, CEOs and board members replaced by LLMs. After all, at the very least, LLMs are actually tested to ensure they have guardrails against doing anything illegal and to shy away from unethical activities.

" we will have replaced a lot of other people first."

This is flat out wrong and shows your lack of respect and understanding for other jobs.


Seems you don't like it much when the shoe is on the other foot

Eh. Our understanding is what it has been since the early '80s and late '90s because, in reality, not that much has changed. Oh, sure, technology has moved forward and we no longer print TPS reports in triplicate, but we still have three to four layers of professional checkbox checkers at most big corporates.

And this is just the stuff that is mandated by government, not a result of ever-evolving bureaucracy.


> My friend's panel raised a point I keep coming back to: if we sprint to deliver something, the expectation becomes to keep sprinting. Always. Tired engineers miss edge cases, skip tests, ship bugs. More incidents, more pressure, more sprinting. It feeds itself.

Sorry but this is the whole point of software engineering in a company. The aim is to deliver value to customers at a consistent pace.

If a team cannot manage their own burnout or expectations with their stakeholders then this is a weak team.

It has nothing to do with using AI to make you go faster. AI does not cause this at all.


The truth is that it’s lowering the difficulty of work people used to consider hard. Which parts get easier depends on the role, but the change is already here.

A lot of people are lying to themselves. Programming is in the middle of a structural shift, and anyone whose job is to write software is exposed to it. If your self-worth is tied to being good at this, the instinct to minimize what’s happening is understandable. It’s still denial.

The systems improve month to month. That’s observable. Most of the skepticism I see comes from shallow exposure, old models, or secondhand opinions. If your mental model is based on where things were a year ago, you’re arguing with a version that no longer exists.

This isn’t a hype wave. I’m a software engineer. I care about rigor, about taste, about the things engineers like to believe distinguish serious work. I don’t gain from this shift. If anything, it erodes the value of skills I spent years building. That doesn’t change the outcome.

The evidence isn’t online chatter. It’s sitting down and doing the work. Entire applications can be produced this way now. The role changes whether people are ready to admit it or not. Debating the reality of it at this point mostly signals distance from the practice itself.


It seems like a big part of the divide is that people who learned software engineering find vibe coding unsuitable for any project intended to stay in use for long, while those who only learned coding think vibe coding is the next big thing because they never have to deal with the consequences of the bad code.

Yes. If you have some experience, you know that writing code is a small part of the job, and a much bigger chunk is anticipating and/or dealing with problems.

People seem to think engineers like "clean code" because we like to be fancy and show off.

Nah, it's clean like a construction site. I need to be able to get the cranes and the heavy machinery in and know where all the buried utilities are. I can't do that if people just build random sheds everywhere and dump their equipment and materials where they are.


Don't let AI write code for you unless it's something trivial. Instead, use it to plan things, discuss high-level stuff and architecture, and ask it to explain concepts. Use it as a research tool. It's great at that. It's bad at writing code when it needs to be performant or needs to span multiple files. Especially when it spans multiple files, because that's where it starts hallucinating and introducing abstractions and boilerplate that aren't necessary and just make your life harder when it comes to debugging.

Imagine if every function you see starts checking for null params. You ask yourself: "when can this be null?", right? It complicates your mental model of the data flow to the point that you lose track of what's actually real in your system. And once you lose track of that, it is impossible to reason about your system.
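A tiny made-up illustration of that (Python, hypothetical functions):

    # Every layer "defensively" checks for None, so the reader can no longer
    # tell where None is actually possible; the data flow gets muddier, not safer.
    def load_user(user_id):
        if user_id is None:     # can user_id really be None here?
            return None
        return {"id": user_id, "name": "example"}

    def format_user(user):
        if user is None:        # ...and can user be None here, or is this
            return ""           # check just boilerplate that got generated?
        return f"{user['id']}: {user['name']}"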

For me AI has replaced searching on stack overflow, google and the 50+ github tabs in my browser. And it's able to answer questions about why some things don't work in the context of my code. Massive win! I am moving much faster because I no longer have to switch context between a browser and my code.

My personal belief is that the people who can harness the power of AI to synthesize loads of information and keep polishing their engineering skills will be the ones who are going to land on their feet after this storm is over. At the end of the day AI is just another tool for us engineers to improve our productivity and if you think about what being an engineer looked like before AI even existed, more than 50% of our time was sifting through google search results, stack overflow, github issues and other people's code. That's now gone and in your IDE, in natural language with code snippets adapted to your specific needs.


IME it's actually really terrible at discussing architecture. It's incredibly unimaginative and will just confirmation-bias you toward whichever option you're already leaning slightly more towards.

AI is at its best when it makes the boring verbose parts easier.

Training is the process of regressing to the mean with respect to the given data. It's no surprise that it wears away sharp corners and inappropriately fills recesses of collective knowledge in the act of its reproduction.

There is no reason that must be; it could be better than the sum of its parts by taking the best part of each. Humans can do that.

As usual, the last 20% needs 80% of the effort and the other 80% needs 20%. But my god did AI make my BS corpo easy, repeatable shit work (skimming docs, writing summaries, skimming Jira and Confluence, and so on) actually easier. And for 90% of BS CRUD app changes the first draft is already pretty good too; tbh I don't write hard/difficult code more than once a week/month.

Every time somebody writes an article like this without any dates and without saying which model they used, my guess is that they've simply failed to internalize the idea that "AI" is a moving target, and haven't understood that they saw a capability level from a fleeting moment in time, rather than an Eternal Verity about the Forever Limits of AI.

Exactly. Basically their thoughts are often invalidated before they even hit the publish button.

Funnily enough we have had those comments with every single model release saying "Oh yeah I agree Claude 3 was not good but now with Claude 3.5 I can vibe-code anything".

Rinse and repeat with every model since.

There also ARE intrinsic limits to LLMs; I'm not sure why you deny them?


At this point, I don’t even know what to make of blog posts like this.

Take the very first example, deleting 400+ lines from a test file. Sure, I've seen those types of mistakes from time to time, but the vast majority of my experience is so far different from that, I don't even know what to make of it.

I’m sure some people have that experience some of the time, but… that’s just not been my experience at all.

Source: Use AI across 7+ unrelated codebases daily for both personal and professional work.

No, it's not a panacea, but we're at the stage where, when I find myself arguing with the AI about whether a file exists, I'm usually the one who's wrong.


Coding with AI assistants is just a completely different skill, one that should not be measured by comparing it to the way human programmers write code. Almost everything we have (programming languages, frameworks, principles of software development in teams, agile/clean code/TDD/DRY and other debatable or well-accepted practices) exists to overcome the limitations of the human mind. AI does not have those limitations, and has others.

What I found to be useful for complex tasks is to use it as a tool to explore that highly-dimensional space that lies behind the task being solved. It rarely can be described as giving a prompt and coming back for a result. For me it's usually about having winding conversations, writing lists of invariants and partial designs and feeding them back in a loop. Hallucinations and mistakes become a signal that shows whether my understanding of the problem does or does not fit.


It's pretty difficult to say what it's going to be three months from now. A few months ago Gemini 2.x in IDEA and related IDEs had to be dragged through coding tasks and would create dumb build time errors on its way to making buggy code.

Gemini in Antigravity today is pretty interesting, to the point where it's worth experimenting with vague prompts just to see what it comes up with.

Coding agents are not going to change just coding. They make a lot of detailed product management work obsolete, and smaller team sizes will make it imperative to reread the agile manifesto and discard scrum dogma.


I have no idea what these anti-AI trends are talking about. Even if it doesn't write a single line of code, the argument is moot, since its ability to act as a companion and help you understand what is happening in a codebase has to be undeniable.

It's so intriguing, I wonder if the people who are against it haven't even used it properly.

The argument always seems to be the fallacy that AI doesn't work great for this or that context (which I might agree with), so it just makes everything worse.

AI most definitely makes hard things easier.

THIS DOES NOT CONTRADICT THE FACT THAT IF YOU ONLY VIBE CODE AND DON'T EVEN UNDERSTAND WHAT THE CODE IS OR IT'S DOING OR THE BUSINESS OR THE CONTEXT YOUR PROJECT IS GOING TO GO TO SHIT.


Please don't use uppercase for emphasis. If you want to emphasize a word or phrase, put *asterisks* around it and it will get italicized.

https://news.ycombinator.com/newsguidelines.html


I've seen some discussions and I'd say there are lots of people who are really against the hyped expectations from the AI marketing materials, not necessarily against the AI itself. Things that people are against that would seem to be against AI, but are not directly against AI itself:

- Being forced to use AI at work

- Being told you need to be 2x, 5x or 10x more efficient now

- Seeing your coworkers fired

- Seeing hiring freeze because business think no more devs are needed

- Seeing business people make a mock UI with AI and boasting how programming is easy

- Seeing those people ask you to deliver in impossible timelines

- Frontend people hearing from backend how their job is useless now

- Backend people hearing from ML Engineers how their job is useless now

- etc

When I dig a bit about this "anti-AI" trend I find it's one of those and not actually against the AI itself.


The most credible argument against AI is really the expense involved in querying frontier models. If you want to strengthen the case for AI-assisted coding, try to come up with ways of doing that effectively with a cheap "mini"-class model, or even something that runs locally. "You can spend $20k in tokens and have AI write a full C compiler in a week!" is not a very sensible argument for anything.

How much would it cost to pay a developer to do this?

It’s hard to say. The compiler is in a state that isn’t useful for anything at all and it’s 100k lines of code for something that could probably be 10k-20k.

But even assuming it was somehow a useful piece of software that you'd want to pay for, the creator set up a test harness to use gcc as an oracle. So it has an oracle for every possible input and output. Plus there are thousands of C compilers in its training set.
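For anyone unfamiliar with that setup, a differential harness against gcc is conceptually simple; here's a minimal sketch in Python (the ./mycc path is a made-up stand-in for the generated compiler):

    import os
    import subprocess
    import tempfile

    def matches_oracle(c_source: str, candidate: str = "./mycc") -> bool:
        """Compile the same C program with gcc (the oracle) and with the
        candidate compiler, run both binaries, and compare their behaviour."""
        with tempfile.TemporaryDirectory() as tmp:
            src = os.path.join(tmp, "prog.c")
            with open(src, "w") as f:
                f.write(c_source)
            ref_bin = os.path.join(tmp, "ref")
            cand_bin = os.path.join(tmp, "cand")
            subprocess.run(["gcc", src, "-o", ref_bin], check=True)
            subprocess.run([candidate, src, "-o", cand_bin], check=True)
            ref = subprocess.run([ref_bin], capture_output=True, text=True)
            cand = subprocess.run([cand_bin], capture_output=True, text=True)
            return (ref.stdout, ref.returncode) == (cand.stdout, cand.returncode)

With an oracle like that, the loop can grind toward agreement on test inputs without anyone judging whether the design is any good, which is exactly the caveat about the 100k-line result.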

If you are in a position where you are trying to reverse engineer an exact copy of something that already exists (maybe in another language) and you can’t just fork that thing then maybe a better version of this process could be useful. But that’s a very narrow use case.


The cost argument is a fallacy, because right now, either you have a trained human in the loop, or the model inevitably creates a mess.

But regardless, services are extremely cheap right now, to the point where every single company involved in generative AI is losing billions. Let's see what happens when prices go up 10x.


zero

because they tell you to stop being so stupid and run apt install gcc


Because hardware costs never go down and energy efficiency never goes up over time?

Whatever the value/$ is now, do you really think it is going to be constant?


If hardware industry news is any indication, hardware costs aren't going to be going down for GPUs, RAM, or much of anything over the next 3-5 years.

Maybe, but I seriously doubt that new DRAM and chip fabs aren't being planned and built right now to push supply and demand toward more of an equilibrium. NVIDIA, Samsung and whoever else would rather expand their market than wait for a competitor to expand it for them.

How long do you think it takes for those factories to go from nothing to making state-of-the-art chips at a scale that's large enough to influence the supply even by 1%?

There are plenty of them being built, yes. Some of them will even start shipping products soon enough. None of them are going to ship at a scale large enough to matter any time soon. Certainly not before 2030, and a lot can change between now and then that might make the companies abandon their efforts altogether or downscale their investments to the point where that due date gets pushed back much further.

That's not even discussing how much easier it is for an already-established player to scale up their supply than for a brand-new competitor to go from zero to one.


If you keep digging, you will also find that there's a small but vocal sock puppet army who will doggedly insist that any claims to productivity gains are in fact just hallucinations by people who must not be talented enough developers to know the difference.

It's exhausting.

There are legitimate and nuanced conversations that we should be having! For example, one entirely legitimate critique is that LLMs do not tell their users when they are using libraries whose maintainers are seeking sponsorship. This is something we could be proactive about fixing in a tangible way. Frankly, I'd be thrilled if agents could present a list of projects that we could consider clicking a button to toss a few bucks to. That would be awesome.
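Some of the raw data for that already exists, at least in the npm ecosystem: packages can declare a funding field, which is what npm fund reads. A rough sketch of how an agent could surface it (Python, assuming a Node project layout):

    import json
    import pathlib

    def funding_links(project_root: str = "."):
        """Collect the 'funding' entries declared by installed npm packages,
        so an agent could list which dependencies are asking for support."""
        links = {}
        node_modules = pathlib.Path(project_root, "node_modules")
        for pkg in node_modules.rglob("package.json"):
            try:
                data = json.loads(pkg.read_text())
            except (OSError, json.JSONDecodeError):
                continue
            if data.get("funding"):
                links[data.get("name", pkg.parent.name)] = data["funding"]
        return links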

But instead, it's just the same tired arguments about how LLMs are only capable of regurgitating what's been scraped and that we're stupid and lazy for trusting them to do anything real.


> I wonder if the people who are against it haven't even used it properly.

I swear this is the reason people are against AI output (there are genuine reasons to be against AI without using it: environmental impact, hardware prices, social/copyright issues, CSAM (like X/Grok))

It feels like a lot of people hear the negatives, try it, and are cynical about the result. Things like the number of r's in "strawberry" and the 6-10 fingers on one hand led to multiple misinterpretations of the actual AI benefit: "Oh, if AI can't even count the number of letters in a word, then all its answers are incorrect" is simply not true.


I'm similarly bemused by those who don't understand where the anti-AI sentiment could come from, and "they must be doing it wrong" should usually be a bit of a "code smell". (Not to mention that I don't believe this post addresses any of the concrete concerns the article calls out, and makes it sound like much more of a strawman than it was to my reading.)

To preempt that on my end, and emphasize I'm not saying "it's useless" so much as "I think there's some truth to what the OP says", as I'm typing this I'm finishing up a 90% LLM coded tool to automate a regular process I have to do for work, and it's been a very successful experience.

From my perspective, a tool (LLMs) has more impact than how you yourself directly use it. We talk a lot about pits of success and pits of failure from a code and product architecture standpoint, and right now, as you acknowledge yourself in the last sentence, there's a big footgun waiting for any dev who turns their brain off too much. In my mind, _this is the hard part_ of engineering: keeping a codebase structured, guardrailed, and well constrained, even with many contributors over a long period of time. I do think LLMs make this harder, since they make writing code "cheaper" but not necessarily "safer", which flies in the face of mantras such as "the best line of code is the one you don't need to write." (I do feel the article brushes against this where it nods to trust, growth, and ownership.) This is not hypothetical, either, but something I've already seen in practice in a professional context, and I don't think we've figured out a silver bullet for it yet.

While I could also gesture at some patterns I've seen where there's a level of semantic complexity these models simply can't handle at the moment, and no matter how well architected you make a codebase after N million lines you WILL be above that threshold, even that is less of a concern in my mind than the former pattern. (And again the article touches on this re: vibe coding having a ceiling, but I think if anything they weaken their argument by limiting it to vibe coding.)

To take a bit of a tangent with this comment though: I have come to agree with a post I saw a few months back, that at this point LLMs have become this cycle's tech-religious-war, and it's very hard to have evenhanded debate in that context, and as a sister post calls out, I also suspect this is where some of the distaste comes from as well.


> It's so intriguing, I wonder if the people who are against it haven't even used it properly.

I feel like this is a common refrain that sets an impossible bar for detractors to clear. You can simply hand wave away any critique with “you’re just not using it right.”

If countless people are “using it wrong” then maybe there’s something wrong with the tool.


> If countless people are “using it wrong” then maybe there’s something wrong with the tool.

Not really. Every tool in existence has people that use it incorrectly. The fact that countless people find value in the tool means it probably is valuable.


Not saying the tool doesn’t have value. I’m saying the tool has a problem.

When it comes to new emerging technologies everyone is searching the space of possibilities, exploring new ways to use said technologies, and seeing where it applies and creates value. In situations such as this, a positive sign is worth way more than a negative. The chances of many people not using it the right way are much much higher when no one really knows what the “right” way is.

It then shows hubris and a lack of imagination for someone in such a situation to think they can apply their negative results to extrapolate to the situation at large. Especially when so many are claiming to be seeing positive utility.


Illogical.

I had Claude read a 2k LOC module in my codebase for a bug that had been annoying me for a while. It found it in seconds: a one-line fix. I had forgotten to account for translation in one single line.

That's objectively valuable. People who argue it has no value or that it only helps normies who can't code or that sooner or later it will backfire are burying their heads in the sand.


>Illogical.

Dismissive. Also kind of rude.

> People who argue it has no value or that it only helps normies who can't code or that sooner or later it will backfire are burying their heads in the sand.

I don’t think this describes most people and it’s certainly not what I think.


This feels like a strawman. Most criticisms of AI for coding are about how overblown the claimed benefits are, not that there are no benefits.

While that may very well be true, it's a valid reply to the GP who made this claim, not to my comment explaining to the parent why their argument was logically flawed.

Except that the GP didn't claim that AI had no value?

Just because you disagree with me doesn’t mean my argument is “logically flawed.” And as the other commenter said, I never said AI had no value. I have used various AI tools for probably 4 years now.

If you're going to talk to and about people in such a condescending way, then you should at least ask clarifying questions before jumping to the starkest, least charitable interpretation of their point.


There are people who know how to code and people who don't. AI is the same way; it isn't a mystery.

And yet every LLM company pushes it as a simple chat bot that everyone should use for everything right now with no explanation or training.

It can’t be a precision tool that requires expertise as well as a universally accessible, simple answer to all our problems. That doesn’t strike me as user error.


A bunch of people with no construction experience could collectively get together and start complaining that their ball pein hammers aren't working.

Doesn't mean the hammers are bad, no matter how many people join the community.

You need to learn how to use the tools.


A bunch of people with poor programming experience could get together and start claiming their new tool is the future.

Doesn’t mean the tool is actually useful, no matter how many people join the community.


Except my analogy is correct and yours is clearly biased. Continue to not use the tools and become irrelevant.

I don’t think yours is correct or that theirs is biased.

HN has a huge anti-AI crowd that is just as vocal and active as its pro-AI crowd. My guess is that this is true of the industry today and won't be true of the industry 5 years from now: one of the crowds will have won the argument and the other will be out of the tech industry.

Vibe coding and slop strawmen are still strawmen. The quality of the debate is obviously a problem


I don’t understand why people are so resistant to the idea that use cases actually matter here. If someone says “you’re an idiot because you aren’t writing good, structured prompts,” or “you’re too big of an idiot to realize that your AI-generated code sucks” before knowing anything about what the other person was trying to do, they’re either speaking entirely from an ideological bias, or don’t realize that other people’s coding jobs might look a whole lot more different than theirs do.

We don't know anything about the commenters other than that they aren't getting the same results with AI as we are. It's like someone complaining that since they can't write fast code, you shouldn't be able to either.

> We don’t know anything about the commenters other than that they aren’t getting the same results with AI as we are.

Right. You don’t know what model they’re using, on what service, in what IDE, on what OS, if they’re making a SAP program, a Perl 5 CGI application, a Delphi application, something written in R, a c-based image processing plugin, a node website, HTML for a static site, Excel VBA, etc. etc. etc.

> It's like someone complaining that since they can't write fast code, you shouldn't be able to either.

If someone is saying that nobody can get good results from using AI then they’re obviously wrong. If someone says that they get good results with AI and someone else, knowing nothing about their task, says they’re too incompetent to determine that, then they’re wrong. If someone says AI is good for all use cases they’re wrong. If someone says they’re getting bad results using AI and someone else, knowing nothing about their task, says they’re too incompetent to determine that, then they’re wrong.

If you make sweeping, declarative, black-and-white statements about AI coding either being good or bad, you’re wrong. If you make assumptions about the reason someone has deemed their experience with AI coding good or bad, not even knowing their use case, you’re wrong.


> helping you understand what is happening

If only there were things called comments, clean-code, and what have you


What we call AI at the heart of coding agents is the averaged "echo" of what people have published on the web that has (often illegitimately) ended up in training data. Yes, it can probably spit out some trivial snippets, but nothing near what's needed for genuine software engineering.

Also, now that StackOverflow is no longer a thing, good luck meaningfully improving those code agents.


Coding agents are getting most of their meaningful improvements in coding ability from RLVR (reinforcement learning from verifiable rewards) now, with priors formed by ingesting open source code and manuals directly, not SO, as the basis. The former doesn't rely on resources external to the AI companies at all and can be scaled up as much as they like, while the latter will likely continue to expand, and they don't really need more of it if it doesn't. Not to mention that curated synthetic data has been shown to be very effective at training models, so they could generate their own textbooks based on open codebases or new languages or whatever and use that. Model collapse only happens when it's exclusively, and fully un-curated, model output that's being trained on.
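A toy sketch of what "verifiable" means in that sentence (Python): run the candidate code against a real test suite and let the pass/fail outcome be the reward, with no scraped Q&A involved. Real pipelines sandbox execution and score far more granularly, but the shape of the signal is just this:

    import subprocess

    def verifiable_reward(repo_dir: str) -> float:
        """Run the project's test suite against model-generated code and
        reward only a fully green run. The signal comes from execution,
        not from any external Q&A site."""
        result = subprocess.run(
            ["pytest", "-q"],
            cwd=repo_dir,
            capture_output=True,
            text=True,
        )
        return 1.0 if result.returncode == 0 else 0.0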

Exactly this. Everything I've seen online is generally "I had a problem that could be solved in a few dozen lines of code, I asked the AI to do it for me, and it worked great!"

But what they asked the AI to do is something people have done a hundred times over, on existing platform tech, and it will likely have little to no capability to solve problems that come up 5-10 years from now.

The reason AI is so good at coding right now is the second dot-com tech bubble that occurred amid the simultaneous release of mobile platforms and the massive expansion of cloud technology. But once the platforms that existed during that time no longer exist, because it's no longer profitable to put something out there, the AI platforms will become less and less relevant.

Sure, sites like Reddit will probably still exist, where people will ask about more and more things that the AI can't help with, and subsequently the AI will train on that information; but the rate of that information is going to go down dramatically.

In short, at some point the AI models will be worthless and I suspect that'll be whenever the next big "tech revolution" happens.



