> probably less will be needed and the exact work will be transformed a bit
My guess is the opposite: they'll throw 5–10x more work at developers and expect 10x more output, while the marginal cost is basically just a Claude subscription per dev.
> You can’t just tell an agent, Build me the code for a successful start-up. The agents work best when they’re being asked to perform one step at a time
That's also true for humans. If you sit down with an LLM and take the time to understand the problem you're trying to solve, it can guide you through it step by step. Even a non-technical person could build surprisingly solid software if, instead of immediately asking for new shiny features, they first asked questions, explored trade-offs, and got the model's opinion on design decisions.
LLMs are powerful tools in the hands of people who know they don't know everything. But in the hands of people who think they always know the best way, they can be much less useful (I'd say even dangerous)
I appreciate this sober take. If you hired a remote developer and the only thing you said to them was "build a program that does this, and make no mistakes," would you expect that to be successful? Are you certain you would get what you wanted?
That’s interesting, because that is one feature of Claude Code that I like. Given an overly broad problem statement, it goes into a planning loop where it asks clarifying questions. I think this probably has more to do with the harness than the model, but you see what I mean. From a user perspective that distinction doesn’t really matter.
I imagine how advantageous it would be to have something like llama.cpp encoded on a chip instead, allowing us to run more than a single model. It would be slower than Jimmy, for sure, but depending on the speed, it could be an acceptable trade-off.
I want to wash my car. The car wash is 50 meters from here. Should I walk or drive? Keep in mind that I am a little overweight and sedentary.
>My recommendation: Walk it. You’ll save a tiny bit of gas, spare your engine the "cold start" wear-and-tear, and get a sixty-second head start on your activity for the day.
I changed the prompt to 50 feet and poked Gemini a bit when it failed, and it gave me:
> In my defense, 50 feet is such a short trip that I went straight into "efficiency mode" without checking the logic gate for "does the car have legs?"
It's a bit of a dishonest question, because by giving it the option to walk you lead it to assume you're not washing the car there and are just getting supplies or something.
And in real life you'd get them to clarify a weird question like this before you answered. I wonder if LLMs have just been trained too much into always having to try and answer right away. Even for programming tasks, more clarifying questions would often be useful before diving in ("planning mode" does seem designed to help with this, but wouldn't be needed for a human partner).
It's a trick question, humans use these all the time. E.g. "A plane crashes right on the border between Austria and Switzerland. Where do you bury the survivors?"
This is not dishonest, it just tests a specific skill.
Trick questions test the skill of recognizing that you're being asked a trick question. You can also usually find a trick answer.
A good answer is "underground" - because that is the implication of the word bury.
The story implies the survivors have been buried (it isn't clear whether they lived a short time or a lifetime after the crash, and "a lifetime" would be tautological anyway).
Trick questions are all about the questioner trying to pretend they are smarter than you. That's often easy to detect and respond to - isn't it?
What’s funny is that it can answer that correctly, but it fails on “A plane crashes right on the border between Austria and Switzerland. Where do you bury the dead?”
For me when I asked this (but with respect to the border between Austria and Spain) Claude still thought I was asking the survivors riddle and ChatGPT thought I was asking about the logistics. Only Gemini caught the impossibility since there’s no shared border.