More often than not, when I inquire deeper, I find their prompting isn't very good at all.
"Garbage in, garbage out", as the saying goes.
Of course, it took a lot of trial and error for me to get to my current level of effectiveness with LLMs. It's probably our responsibility to teach those who are willing.
It seems hard to be bullish on LLMs as a generally useful tool if the solution to problems people have is "use trial and error to improve how you write your prompts, no, it's not obvious how to do so, yes, it depends heavily on the exact model you use."
A mitre saw is an amazing thing to have in a woodshop, but if you don't learn how to use it you're probably going to cut off a finger.
The problem is that LLMs are power tools that are sold as being so easy to use that you don't need to invest any effort in learning them at all. That's extremely misleading.
> Are legally liable for defects in design or manufacture that cause injury, death, or property damage
Except when you use them for purposes other than those declared by the manufacturer - then it's on you. Similarly, you get plenty of warnings about the limitations and suitability of LLMs from the major vendors, including warnings directly in the UI. The limitations of LLMs are common knowledge. Like almost everyone, you ignore them, but then the consequences are on you too.
> Provide manuals that instruct the operator how to effectively and safely use the power tool
LLMs come with manuals much, much more extensive than any power tool ever (or at least since 1960s or such, as back then hardware was user-serviceable and manuals weren't just generic boilerplate).
As for:
> Know how they work
That is a real difference between power tool manufacturers and LLM vendors, but if you switch to comparing against the pharmaceutical industry, they don't know how most of their products work either. So knowing how a product works isn't a prerequisite for it being useful and worth having available.
Using LLMs to write SQL is a fascinating case because there are so many traps you could fall into that aren't really the fault of the LLM.
My favorite example: you ask the LLM for "most recent restaurant opened in California", give it a schema and it tries "select * from restaurants where state = 'California' order by open_date desc" - but that returns 0 results, because it turns out the state column uses two-letter state abbreviations like CA instead.
There are tricks that can help here - I've tried sending the LLM an example row from each table, or you can set up a proper loop where the LLM gets to see the results and iterate on them - but it reflects the fact that interacting with databases can easily go wrong no matter how "smart" the model you are using is.
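A minimal sketch of the "example row" trick described above, using an in-memory SQLite database (table and column names here are illustrative, matching the restaurant example): include one sample row per table alongside the schema, so the model can see that the `state` column holds "CA" rather than "California".

```python
import sqlite3

# Set up a toy database that reproduces the trap: the state column
# stores two-letter abbreviations, not full state names.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE restaurants (name TEXT, state TEXT, open_date TEXT)")
conn.execute("INSERT INTO restaurants VALUES ('Chez Test', 'CA', '2024-01-15')")

# The naive query an LLM writes from the schema alone finds nothing:
naive = conn.execute(
    "SELECT * FROM restaurants WHERE state = 'California' "
    "ORDER BY open_date DESC"
).fetchall()
print(len(naive))  # 0 rows - the trap described above

# Build prompt context: each table's CREATE statement plus one example row,
# which is enough for the model to spot the 'CA' convention.
context_lines = []
tables = [r[0] for r in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table'")]
for table in tables:
    schema = conn.execute(
        "SELECT sql FROM sqlite_master WHERE name = ?", (table,)).fetchone()[0]
    example = conn.execute(f"SELECT * FROM {table} LIMIT 1").fetchone()
    context_lines.append(f"{schema}\n-- example row: {example}")

prompt_context = "\n\n".join(context_lines)
print(prompt_context)
```

The same idea extends to the feedback-loop variant: run the generated query, show the model the (empty) result set, and let it revise its guess.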
> that returns 0 results, because it turns out the state column uses two-letter state abbreviations like CA instead.
As you’ve identified, rather than just giving it the schema, you give it the schema and some data when you tell it what you want.
A human might make exactly the same error - based on a mistaken assumption - and would then look at the data to see why it was failing.
If we expect an LLM to magically realise that, when you ask it to find something based on an identifier you tell it is ‘California’, the query should actually use ‘CA’ rather than what you told it, then its failure to do so isn’t really the fault of the LLM.