More

cedws · 2026-02-12T15:34:07 1770910447

I watched a documentary a while ago and I think it said that Kim Jong Un's sister plays a big part in maintaining the regime. I wonder if she pulled some strings in this decision to select Kim Ju Ae as heir, perhaps because she's more pliable, or under Kim Yo Jong's thumb. A North Korean patriarchal society probably wouldn't be amenable to a woman telling a male heir what to do.

SirFatty · 2026-02-12T16:31:19 1770913879

Maybe she's the mother.

cedws · 2026-02-12T08:35:24 1770885324

Because being principled damages your social opportunities. Trust me. I resisted Instagram for years. When I finally gave in I instantly had access to more events, was able to connect with more people, felt less excluded. I realised all that I had missed out on.

I don't think asking people to abandon a platform works. We need to fight for open protocols.

cedws · 2026-02-12T08:33:00 1770885180

UK user here, it still shows my account as Unverified after running :(

extraduder_ire · 2026-02-12T09:44:31 1770889471

Check the issues/PRs on their github.

cedws · 2026-02-11T02:56:46 1770778606

This is a shame, I’ve been running Bazzite on my gaming PC since Christmas and have been pretty happy with it. To the honest I don’t really care who is at fault, I don’t want the maintainers of my OS to be a drama show, so I’ll probably switch to Cachy. Building my own image would be cool but it’s not high on my list of priorities.

cedws · 2026-02-10T18:31:19 1770748279

I’m in awe at the complete lack of critical thinking skills. Did people seriously believe LLMs were becoming self aware or something? Didn’t even consider the possibility it was all just a big show being puppeted by humans for hype and clicks? No wonder the AI hype has reached this level of hysteria.

cedws · 2026-02-10T11:46:54 1770724014

I’m finding that the code LLMs produce is just average. Not great, not terrible. Which makes sense, the model is basically a complex representation of the average of its training data right? If I want what I consider ‘good code’ I have to steer it.

So I wouldn’t use LLMs to produce significant chunks of code for something I care about. And publishing vibe coded projects under my own GitHub user feels like it devalues my own work, so for now I’m just not publishing vibe coded projects. Maybe I will eventually, under a ‘pen name.’

rybosworld · 2026-02-10T13:37:19 1770730639

We've gone from "it's glorified auto-complete" to "the quality of working, end-to-end features, is average", in just ~2 years.

I think it goes without saying that they will be writing "good code" in short time.

I also wonder how much of this "I don't trust them yet" viewpoint is coming from people who are using agents the least.

Is it rare that AI one-shots code that I would be willing to raise as a PR with my name on it? Yes, extremely so (almost never).

Can I write a more-specified prompt that improves the AI's output? Also yes. And the amount of time/effort I spend iterating on a prompt, to shape the feature I want, is decreasing as I learn to use the tools better.

I think the term prompt-engineering became loaded to mean "folks who can write very good one-shot prompts". But that's a silly way of thinking about it imo. Any feature with moderate complexity involves discovery. "Prompt iteration" is more descriptive/accurate imo.

cedws · 2026-02-10T17:50:06 1770745806

First you have to classify what “good code” is, something that programmers have still not settled on in the over half a century that the field has existed. I also think what the other reply said is true, going from average to “good code” is way harder because it implies a need for LLMs to self critique beyond what they do today. I don’t think just training on a set of hand picked samples is enough.

There’s also the knowledge cutoff aspect. I’ve found that LLMs often produce outdated Go code that doesn’t utilise the modern language features. Or for cases where it knows about a commonly used library, it uses deprecated methods. RAG/MCP can kind of paper over this problem but it’s still fundamental to LLMs until we have some kind of continuous training.

rybosworld · 2026-02-10T18:55:49 1770749749

AI's can self-critique via mechanisms like chain of thought or user specified guard rails like a hook that requires the test suite to pass before a task can be considered complete/ready for human review. These can and do result in higher quality code.

Agree that "good code" is vague - it probably always be. But we can still agree that code quality is going up over time without having a complete specification for what defines "good".

cedws · 2026-02-10T19:05:05 1770750305

Unfortunately I can only give anecdotes, but in my experience the LLM's 'thinking' does not lead to code quality improvements in the same way that a programmer thinking for a while would.

In my experience having LLMs write Go, it tends to factor code in not so great way from the start, probably due to lacking the mental model of pieces composing together. Furthermore, once a structure is in place, there doesn't seem to be a trigger point that causes the LLM to step back and think about reorganising the code, or how the code it wants to write could be better integrated into what's already there. It tends to be very biased by the structures that already exist and not really question them.

A programmer might write a function, notice it becoming too long or doing too much, and then decide break it down into smaller subroutines. I've never seen an LLM really do this, they seem biased towards being additive.

I believe good code comes from an intuition which is very hard to convey. Imprinting hard rules into the LLM like 'refactor long functions' will probably just lead to overcorrection and poor results. It needs to build its own taste for good code, and I'm not sure if that's possible with current technology.

simonw · 2026-02-10T21:28:00 1770758880

> Furthermore, once a structure is in place, there doesn't seem to be a trigger point that causes the LLM to step back and think about reorganising the code, or how the code it wants to write could be better integrated into what's already there.

Older models did do this, and it sucked. You'd ask for a change to your codebase and they would refactor a chunk of it and make a bunch of other unrelated "improvements" at the same time.

This was frustrating and made for code that was harder to review.

The latest generation of models appear to have been trained not to do that. You ask for a feature, they'll build that feature with the least changes possible to the code.

I much prefer this. If I want the code refactored I'll say to the model "look for opportunities to refactor this" and then it will start suggesting larger changes.

jmalicki · 2026-02-10T20:31:21 1770755481

> A programmer might write a function, notice it becoming too long or doing too much, and then decide break it down into smaller subroutines. I've never seen an LLM really do this, they seem biased towards being additive.

The nice thing is a programmer with an LLM just steps in here, and course-corrects, and still has that value add, without taking all the time to write the boilerplate in between.

And in general, the cleaner your codebase the cleaner LLM modifications will be, it does pick up on coding style.

cedws · 2026-02-10T20:54:49 1770756889

>The nice thing is a programmer with an LLM just steps in here, and course-corrects

This does not seem to be the direction things are going. People are talking about shipping code they haven't edited, most notably the author of Claude Code. Sometimes they haven't even read the code at all. With LLMs the path of least resistance is to take your hands off the wheel completely. Only programmers taking particular care are still playing an editorial role.

When the code is constructed by an LLM, the human in the driving seat doesn't get a chance to build the mental models that they usually would writing it manually. This stifles the ability to see opportunities to refactor. It is widely considered to be harder to read code than to write it.

>And in general, the cleaner your codebase the cleaner LLM modifications will be

Whilst true, this is a kind of "you're holding it wrong" argument. If LLMs had model of what differentiates good code from bad code, whatever they pull into their context should make no difference.

jmalicki · 2026-02-10T21:06:14 1770757574

> Whilst true, this is a kind of "you're holding it wrong" argument. If LLMs had model of what differentiates good code from bad code, whatever they pull into their context should make no difference.

Good code is in the eye of the beholder. What reviewers in one shop would consider good code is dramatically different than another.

Conforming to the existing code base style is good in and of itself, if the context it pulls in makes no difference that makes it useless.

jauntywundrkind · 2026-02-11T03:38:56 1770781136

> When the code is constructed by an LLM, the human in the driving seat doesn't get a chance to build the mental models that they usually would writing it manually

I'm asking the LLM for alternatives and options constantly, to test different models. It can give me a write-up description of options, or go spin up subagents to go try 4 different things at once.

> It is widely considered to be harder to read code than to write it

Even more than writing code, I think LLM's are exceptional at reading code. They can review huge amounts of code incredibly fast, to understand very complex systems. And then you can just ask it questions! Don't understand? Ask more questions!

I have mcp-neovim-server open, so I just ask it to open the relevant pieces of code at those lines, and it can then show me. CodeCompanion makes it easy to ask questions about a line. It's amazing how

Reading code was one of the extremely hard parts of programming, and the machine is far far better at it than us!

> When the code is constructed by an LLM, the human in the driving seat doesn't get a chance to build the mental models that they usually would writing it manually.

Here's one way to tell me you haven't tried the thing without saying you haven't tried the thing. The ability to do deep inquiry into topics & to test &btry different models is far far far better than it has ever been. We aren't stuck with what we right, we can keep iterating &b trying at vastly lower cost, to do the hard work to discover what is a good model. Programmers rarely have had the luxury of time and space to keep working on a problem again and again, to adjust and change and tweak until the architecture truly sings. Now you can try a weeks worth of architectures in an afternoon. There is no better time for those who want to understand to do so.

I feel like one thing missing from this thread is that most people adopting AI at a serious level are building really strong AGENTS.md files, that refine tastes and practices and forms. The AI is pretty tasteless, isnt deliberate. It is up to us to explore the possibility space when working on problems, and to create good context that steers towards good solutions. And our ability to get information out, to probe into systems, to asses, to test hypothesis, is vastly vastly higher, which we can keep using to become far better steersfolk.

thisisit · 2026-02-10T15:09:36 1770736176

Building expertise isn't a straight line. Going from a bad to average is much easier than going from average to good.

Herring · 2026-02-10T19:34:06 1770752046

Yeah Tesla and Waymo know this quite well. There's a reason we don't have moon bases yet.

ikr678 · 2026-02-10T13:45:53 1770731153

Is there a big enough dataset of 'good' code to train from though?

rybosworld · 2026-02-10T13:48:23 1770731303

I (and lots of people) used to think the models would run out of training data and it would halt progress.

They did run out of human-authored training data (depending on who you ask), in 2024/2025. And they still improve.

lelanthran · 2026-02-10T14:10:44 1770732644

> They did run out of human-authored training data (depending on who you ask), in 2024/2025. And they still improve.

It seemed to me that improvements due to training (i.e. the model) in 2025 were marginal. The biggest gains were in structuring how the conversation with the LLM goes.

eqvinox · 2026-02-10T14:05:29 1770732329

> And they still improve.

But what asymptote are they approaching? Average code? Good code? Great code?

rybosworld · 2026-02-10T19:07:08 1770750428

I'd argue that "good", or at least "good enough", is when they reach a point where it becomes preferable to spend your time prompting rather than reading and writing code. That the final output meets the feature specifications is more or less the goal.

A lot of developers are having a difficult time accepting that the code doesn't matter nearly as much anymore, myself included. The feedback cycles that made hot fixing, bug fixing, customer support, etc. so expensive, have shrunk by orders of magnitude. A codebase that can be maintained by humans is perhaps not a goal worth pursuing anymore.

To really see this and feel this, I think it's worthwhile to spend at least a weekend or two seeing what you can build without writing or reviewing any of the code. Use a frontier model. Opus 4.6 or Codex 5.3. Probably doesn't matter which one you choose.

If you give it an honest try, you'll see that a lot of the limitations are self-imposed. Said another way: the root problem is some flavor of the user under specifying a prompt, having inconsistent design docs, and not implementing guard rails to prevent the AI from reintroducing bugs you previously squashed.

It's a very new way of working and it feels foreign. But there are a lot of very smart, very successful people doing this. People who have written millions of lines of code over their lifetime, and who enjoyed doing it, are now fully delegating the task.

jmalicki · 2026-02-10T20:33:58 1770755638

They ran out of passively collected data. RLHF allows them to gather deeper more targeted data.

jmalicki · 2026-02-10T20:33:11 1770755591

There is a lot of RLHF effort around this.

co_king_3 · 2026-02-10T13:49:06 1770731346

AHEM

Let me repeat myself.

I think it goes without saying that they will be writing "good code" in short time.

sdf2erf · 2026-02-10T13:49:36 1770731376

I think your kind of missing the point.

Think about it from a resource (calorie) expenditure stand point.

Are you expending more resources on writing the prompts vs just doing without it? Thats the real question.

If you are expending more, which is what Simon is indicating at - are you really better off? Id argue not, given that this cant be sustained for hours on end. Yet the expectation from management might be that you should be able to sustain this for 8 hours.

So again, are you better off? Not in the slightest.

Many things in life are counter-intuitive and not so simple.

P.s. youre not getting paid more for increasing productivity if you are still expected to work 8 hrs a day... lmao. Thankfully im not a SWE.

notanastronaut · 2026-02-10T16:03:09 1770739389

I think something a lot of people miss out on is that we're not all the same. We all have different internal thought models, whether it is a biological difference (ADHD brain?), educational differences, and overall abilities. And it seems a lot of people have this idea everyone uses "AI" the same way. That's a lack of lateral thinking. Making assumptions we're all burning "calories" in the same way implies we all think, and work, alike.

We are not alike.

rybosworld · 2026-02-10T13:54:34 1770731674

I don't think I'm missing the point and respectfully, I think your reply is completely unrelated to anything that I said.

Whether you are "better off or not" is a separate topic, and I never suggested one way or the other.

Simon's point is that engineers can be so productive with these tools that it is tempting to work (much) longer.

sdf2erf · 2026-02-10T13:55:23 1770731723

Simon: "I'm frequently finding myself with work on two or three projects running parallel. I can get so much done, but after just an hour or two my mental energy for the day feels almost entirely depleted."

Youre a time waster, stop posting and creating noise.

rybosworld · 2026-02-10T14:02:28 1770732148

Time wasting would be not reading the comment I replied to, and then thinking I was replying to Simon/the article.

Does that sound familiar?

dcre · 2026-02-10T13:39:59 1770730799

People often describe the models as averaging their training data, but even for base models predicting the most likely next token this is imprecise and even misleading, because what is most likely is conditional on the input as well as what has been generated so far. So a strange input will produce a strange output — hardly an average or a reversion to the mean.

On top of that, the models people use have been heavily shaped by reinforcement learning, which rewards something quite different from the most likely next token. So I don’t think it’s clarifying to say “the model is basically a complex representation of the average of its training data.”

The average thing points to the real phenomenon of underspecified inputs leading to generic outputs, but modern agentic coding tools don’t have this problem the way the chat UIs did because they can take arbitrary input from the codebase.

falloutx · 2026-02-10T16:44:40 1770741880

I was literally creating a 2nd account on github today for this purpose.

cedws · 2026-02-09T23:37:02 1770680222

I understand the frustration towards Discord, especially because this is a global rollout of a policy they're only required to enforce in specific countries, but it's IMO misdirected. They're likely trying to get ahead of the legislation. The way the winds are blowing indicates the Western governments that haven't already passed legislation mandating ID verification soon will.

You can move to $ALT_PLATFORM but unless it's self hosted they'll eventually have to enforce the same policy.

Direct your anger at the geriatrics in government who don't understand the risks of these laws first. You only have to watch the TikTok CEO's hearing in Congress to see how American politicians don't understand technology.

heavyset_go · 2026-02-10T05:50:35 1770702635

Platforms want this, they're happily implementing it because they'll get a mountain of data to train on and sell, and they'll finally get to sell their userbase as real monetizable humans to their partners.

nondrool · 2026-02-11T11:36:53 1770809813

The more precise the profile the more money is fetched for you at market.

Lermatroid · 2026-02-10T00:07:07 1770682027

This. Still canning my nitro sub for now as I do think they should hold off until necessary, but people ignore that the root of this trend of ID verification is governments who are willfully ignorant to having staff who can accurately assess the technological landscape and enforce smart regulation.

rdm_blackhole · 2026-02-10T09:07:31 1770714451

> Direct your anger at the geriatrics in government who don't understand the risks of these laws first.

No offence but I think you are being extremely naive if you think that the people in power and the lobbyists who have spent the last 10 years relentlessly pushing for ID verification online and mass content scanning in the US and in the EU do not know what they are doing.

Here is the thing, most people are increasingly unhappy about the way things are going whether they are on the right or the left of the political spectrum. Governments can see that and don't want to see what happened in Nepal recently repeat itself. So they are getting ahead of the curve.

First require everyone to ID themselves online, then tie everything you say to your ID then use that against you one day if you decide that enough is enough.

The western countries are looking at what China is doing and simply iterating on it. They wrap it in a nit little bow to either "fight terrorism" TM or "protect the children" TM.

This is a pure power play meant to save their asses and the people who have been warning that this was always going to be going in that direction have been ridiculed and called conspiracy nuts but here we are.

Look at OFCOM in the UK. First it was to protect children form porn. Now they are looking to expand their powers to moderate speech online based on what THEY think is acceptable. If the EU gets it's way, you'll have client scanning in all messaging apps across the EU. And it won't stop.

This sort of thing is never about protecting kids, reducing harm or whatever they call it. It's about control about what you see, what you write, all done with the purpose to determine if you as an individual will become a problem for them in the future.

mock-possum · 2026-02-10T15:46:15 1770738375

> They're likely trying to get ahead of the legislation. The way the winds are blowing indicates the Western governments that haven't already passed legislation mandating ID verification soon will.

Isn’t that the first rule from On Tyranny? “Do not obey in advance"

budududuroiu · 2026-02-10T10:57:51 1770721071

Platforms want this because it means they can get rid of the mountains of money they were paying for moderators to keep "child unfriendly" content off their platform

"If your kid is on Discord, and sees something they shouldn't, it's their or your fault, not ours"

dmitrygr · 2026-02-10T04:10:01 1770696601

Who implements these idiotic policies? We do! Politicians could not code their way out of a paper bag! Giving up is not the solution. Refuse to do it. Make ID pass for a full-white jpeg.

cedws · 2026-02-09T16:34:56 1770654896

Git is like 10% of building software.

kgwxd · 2026-02-09T18:34:26 1770662066

If GitHub is doing 90% more than Git does, "GitHub" is a terrible name for it.

cedws · 2026-02-09T16:25:25 1770654325

Screw GitHub, seriously. This unreliability is not acceptable. If I’m in a position where I can influence what code forge we use in future I will do everything in my power to steer away from GitHub.

edoceo · 2026-02-09T16:36:38 1770654998

Forge feature parity is easy to find. But GH has that discover ability feature and the social queues from stars/forks.

One solution I see is (eg) internal forge (Gitlab/gitea/etc) and then mirrored to GH for those secondary features.

Which is funny. If GH was better we'd just buy their better plan. But as it stands we buy from elsewhere and just use GH free plans.

coffeebeqn · 2026-02-09T18:15:21 1770660921

Every company I’ve worked in the last 10 years used GH for the internal codebase hosting , PRs and sometimes CI. Discoverability doesn’t really come into picture for those users and you can still fork things from GitHub even if you don’t host your core code infra on it

anon7000 · 2026-02-10T07:58:29 1770710309

Yep. As someone in a similar position to influence this, I’ll also be pushing for at least significant discounts in our contract. The challenge with migrating off for enterprise is going to be integrations & compliance. There are dozens of options that replicate the core PR workflow that people use, and it’s probably fairly easy to migrate that. The hard part is the hundreds of things that hook into GitHub that don’t have a simple migration, even internal tooling. So it just turns into a pretty big project.

regularfry · 2026-02-09T16:44:55 1770655495

Stars are just noise. All they tell you is how online the demographics of that ecosystem are.

Mirroring is probably the way forward.

cedws · 2026-02-08T23:32:24 1770593544

Etiquette on GitHub has completely gone out the window, many issues I look at these days resemble reddit threads more than any serious technical discussion. My inbox is frequently polluted by "bump" comments. This is going to get worse as LLMs lower the bar.