So is Gemini tbh. It's the only agent I've used that gets itself stuck in ridiculous loops repeating "ok. I'm done. I'm ready to commit the changes. There are no bugs. I'm done."
Google somehow manages to fumble the easiest layups. I think Anthropic et al have a real chance here.
Google's product management and discipline are absolute horsesh*t. But they have a moat, and it's extreme technical competence. They own their infra from the hardware (custom ASICs, their own data centers, global intranet, etc.) all the way up to the models and the product platforms to deploy them in. To the extent that making LLMs work on real-world problems is a technical problem, landing Gemini is absolutely in Google's wheelhouse.
You are stating generalities when more specific information is easily available.
Google has AI infrastructure that it has created itself as well as competitive models, demonstrating technical competence in not-legacy-at-all areas, plus a track record of technical excellence in many areas both practical and research-heavy. So yes, technical competence is definitely an advantage for Google.
I use Claude every day. I cannot get Gemini to do anything useful, at all. Every time I've tried to use it, it has just failed to do what was required.
Three subthreads up you have someone saying Gemini did what Claude couldn't for them on a 14-year-old legacy code issue. It seems you can't really use people's prior success on their problem as an estimate of what your success will be like with your problem and a tool.
People and benchmarks are using pretty specific, narrow tests to judge the quality of LLMs. People have biases, benchmarks get gamed. In my own experience, Gemini seems to be lazy and scatter-brained compared to Claude, but shows higher general-purpose reasoning abilities. Anthropic is also obviously massively focusing on making their models good at coding.
So it is reasonable that Claude might show significantly better coding ability for most tasks, but the better general reasoning ability proves useful in coding tasks that are complicated and obscure.
Hard to bet against Hassabis + Google's resources. This is in their wheelhouse, and it's eating their search business and refactoring their cloud business. G+ seemed like a way to get more people to Google for login and tracking.
That's pretty telling: on web search and ad placement, where it matters, OpenAI has had no impact, or the impact is muted and offset by Google's continued market power and increased demand for its ad space on the web.
A couple of months ago things were different. Try their stronger models. Gemini recently saved me from a needle-in-a-haystack problem with buildpacks and Linux dependencies on a 14-year-old B2B SaaS app I was fixing a major problem for; after I had worked on it for hours with Claude Code, Gemini figured out the solution quickly. I know it's just one story where Gemini won, and I have really enjoyed using Claude Code, but Google is having some success with the serious effort they're putting into this fight.
I think they had no choice but to release that AI before it was ready for prime time. Their search traffic started dropping after ChatGPT came out, and they risked not looking like a serious player in AI.
They recently replaced “define: word” (or “word meaning”) results with an “ai summary” and it’s decidedly worse. It used to just give you the definition(s) and synonyms for each one. Now it gives some rambling paragraphs.
My Google gives me the OUP data for "word meaning" and doesn't show any AI. Searching "word meaning" plus a language opens up the translator. It's really fast and convenient.