The research from Purdue University, first spotted by news outlet Futurism, was presented earlier this month at the Computer-Human Interaction Conference in Hawaii and looked at 517 programming questions on Stack Overflow that were then fed to ChatGPT.
“Our analysis shows that 52% of ChatGPT answers contain incorrect information and 77% are verbose,” the new study explained. “Nonetheless, our user study participants still preferred ChatGPT answers 35% of the time due to their comprehensiveness and well-articulated language style.”
Disturbingly, programmers in the study didn’t always catch the mistakes being produced by the AI chatbot.
“However, they also overlooked the misinformation in the ChatGPT answers 39% of the time,” according to the study. “This implies the need to counter misinformation in ChatGPT answers to programming questions and raise awareness of the risks associated with seemingly correct answers.”
Billions and billions invested to produce accuracy slightly less than flipping a coin.
Yes, there are mistakes, but if you point it in the right direction, it can give you correct answers.
It can, but it also sometimes can’t unless you ask it, “could the answer be x?”
In my experience, if you have the skills needed to point it in the right direction, you didn’t need to use it in the first place.
Yesterday, I wrote all of this working JavaScript code: https://github.com/igorlogius/gather-from-tabs/discussions/8 And I don’t know a lick of JavaScript. I know other languages, but that barely mattered. I just gave it plain-language instructions and reported the errors until it worked.
It’s just a convenience, not a magic wand. Sure, relying on AI blindly and exclusively is a horrible idea (one that lots of people peddle and quite a few suckers buy), but there’s room for supervised, careful use of AI — the same way we started using Google instead of man pages and (grudgingly, for the older of us) tolerated the addition of syntax highlighting and even some code completion to all but the most basic text editors.
Yeah it’s wrong a lot but as a developer, damn it’s useful. I use Gemini for asking questions and Copilot in my IDE personally, and it’s really good at doing mundane text editing bullshit quickly and writing boilerplate, which is a massive time saver. Gemini has at least pointed me in the right direction with quite obscure issues or helped pinpoint the cause of hidden bugs many times. I treat it like an intelligent rubber duck rather than expecting it to just solve everything for me outright.
Same here. It’s good for writing your basic unit tests, and the explain feature is useful for getting your head wrapped around complex syntax, especially as bad as searching for useful documentation has gotten on Google and DDG.
Not a programmer by any means (haven’t done any since college) but I’ve asked it for help in writing Jira queries or Excel mess and it’s been pretty solid with that stuff.
Sounds low
Yes, and even if it was only right 1% of the time it would still be amazing
Also, hallucinations are not a universally bad thing.
Just like answers on the Internet, you have to read the output and not just paste it blindly. I find the answers are usually useful, even if they aren’t completely accurate. Figuring out the last bit is why we are paid as programmers.
“Major new Technology still in Infancy Needs Improvements”
– headline every fucking day
“Will this technology save us from ourselves, or are we just jerking off?”
Better than Jerry in the next cubicle over.
Who would have thought that an artificial intelligence trained on human intelligence would be just as dumb?
Hm. This is what I got.
I think about 90% of the screenshots we see of LLMs failing hilariously are doctored. Lemmy users really want to believe it’s that bad, though.
Edit:
Yesterday, someone posted a doctored one on here, showing that everyone eats it up even when the poorly doctored photo uses a ridiculous font. People who want to believe are quite easy to fool.
My experience with an AI coding tool today:
Me: Can you optimize this method?
AI: Okay, here’s an optimized method.
Me, seeing the AI completely removed a critical conditional check: Hey, you completely removed this check with variable xyz.
AI: Oops, you’re right, here you go, I fixed it.
It did this 3 times on 3 different optimization requests. It was 0 for 3, although there were some good suggestions once you got past the blatant first error.
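For anyone who hasn’t hit this failure mode: a toy sketch of what “optimizing away” a critical conditional looks like (the `average` function here is invented for illustration, not the actual code from that session):

```javascript
// Original method: averages an array, with a critical guard for empty input.
function average(values) {
  if (values.length === 0) return 0; // the critical conditional check
  return values.reduce((sum, v) => sum + v, 0) / values.length;
}

// The kind of "optimized" rewrite described above: shorter, but the
// empty-array guard is gone, so the behavior silently changes.
function averageOptimized(values) {
  return values.reduce((sum, v) => sum + v, 0) / values.length;
}

console.log(average([]));          // 0
console.log(averageOptimized([])); // NaN (0 / 0) -- silent behavior change
```

The diff looks like a harmless cleanup, which is exactly why it’s easy to overlook in review.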
My favorite is when I ask for something and it gets stuck in a loop, pasting the same comment over and over
I always thought of it as a tool to write boilerplate faster, so no surprises for me
We have to wait a bit before we get a truly useful assistant (though maybe something like Copilot or other more code-focused AIs are better).
People downvote me when I point this out in response to “AI will take our jobs” doomerism.
Well, I do it 99% of the time.
If you ask the wrong questions you get the wrong results. If you don’t check the response for accuracy, you get invalid answers.
It’s just a tool. Don’t use it wrong because you’re lazy.
Lemmy is trying really, really hard to convince you that coding is going to be a viable career in 5 years.
Lemmy is trying real hard to convince you that AI is going to do everyone’s job in 5 years—including yours