this post was submitted on 02 Oct 2024
117 points (100.0% liked)
Technology
37720 readers
41 users here now
A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.
Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.
Subcommunities on Beehaw:
This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
That's the story people tell at least. The weasel phrase at the end is fun, I guess. Leaves a massive backdoor excuse when it doesn't actually work.
But in practice, LLMs are falling down even at this job. They seem to have some yse in academic qualitaruve coding, but for summarizing novel or extended bodies of text, they struggle to actually tell people what they want to know.
Most people do not give a shit if text contains a reference to X. And if they do, they can generally just CTRL+F "X".
Weasel phrase? You mean the fact that I don't treat them like their actual Ai, but just a tool that needs to be used properly, monitored, and verified?
There's a reason why I never call them AI, because they're not. They're just advanced machine learning tools, and just like I keep a steady hand when using a table saw, I only use LLMs for tasks that they can help me do something faster, but are easy to verify they did it right.
And as someone who has been using them very regularly, I feel confident in saying that. It's not a weasel phrase, I'm not trying to sell anyone snake oil about what they can actually do, and I acknowledge that they're an oversold and overhyped means of cooking the planet faster, so it's not like I would be mad if they were banned tomorrow, but until then, I will keep using them in ways that are actually fruitful.
But sure, if all you need to do is find one word in a single body of text, that's not really a good use of an LLM, but that wasn't what I was talking about.
If I need examples of various legal or ethical concerns documented in one, or multiple, pieces of writing, or other conceptual topics, I can give it a list, and then ask it to highlight all examples of those issues, and include the verbatim text where their present. I can then give that same task to a multiple different LLMs, with the same prompts, and a task that would have taken me hours to complete, takes me 30 to 45 minutes, including the time it takes me to give it quick read through see if anything was missed. But yeah, that requires a well crafted prompt, and it's not infallible.