Monday, August 11, 2025
SCRYPTO MAGAZINE
No Result
View All Result
  • Home
  • Crypto
  • Bitcoin
  • Blockchain
  • Market
  • Ethereum
  • Altcoins
  • XRP
  • Dogecoin
  • NFTs
  • Regualtions
SCRYPTO MAGAZINE
No Result
View All Result
Home Blockchain

I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

SCRYPTO MAGAZINE by SCRYPTO MAGAZINE
August 11, 2025
in Blockchain
0
I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)
189
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter


code

Vaselena/Getty Photographs

ZDNET’s key takeaways

  • OpenAI’s new GPT-5 flagship failed half of my programming checks.
  • Earlier OpenAI releases have had nearly good outcomes.
  • Now that OpenAI has enabled fallbacks to different LLMs, there are alternatives.

So GPT-5 happened. It is out. It is launched. It is the discuss of the digital city. And it is obtained some issues. I am not gonna bury the lede. GPT-5 has failed half of my programming tests. That is the worst that OpenAI’s flagship LLM has ever accomplished on my rigorously designed checks.

Additionally: The best AI for coding in 2025 (and what not to use)

Related articles

I’ve tested every iPad sold by Apple right now – here’s the model I recommend most

I’ve tested every iPad sold by Apple right now – here’s the model I recommend most

August 10, 2025
I answered the million-dollar question about buying laptops – here’s the ultimate guide

I answered the million-dollar question about buying laptops – here’s the ultimate guide

August 10, 2025

Earlier than I get into the main points, let’s take a second to debate one different little function that is additionally a bit wonky. Take a look at the brand new Edit button on the highest of the code dumps it generates.

edit-button
Screenshot by David Gewirtz/ZDNET

Clicking the Edit button takes you into a pleasant little code editor. Right here, I changed the Creator area, proper in ChatGPT’s outcomes.

editor
Screenshot by David Gewirtz/ZDNET

That appeared good, however it finally proved futile. Once I closed the editor, it requested me if I needed to avoid wasting. I did. Then this unhelpful message confirmed up.

wonky-save
Screenshot by David Gewirtz/ZDNET

I by no means did get again to my authentic session. I needed to submit my authentic immediate once more, and let GPT-5 do its work a second time.

However wait. There’s extra. Let’s dig into my take a look at outcomes…

1. Writing a WordPress plugin

This was my very first test of coding prowess for any AI. It is what gave me that first “the world is about to alter” feeling, and it was accomplished utilizing GPT-3.5.

Subsequent checks, utilizing the identical immediate however with completely different AI fashions, generated combined outcomes. Some AIs did nice, some did not. Some AIs, like these from Microsoft and Google, improved over time.

Additionally: How I test an AI chatbot’s coding ability – and you can, too

ChatGPT’s mannequin has been the gold normal for this take a look at because the very starting. That makes the outcomes of GPT-5 all that rather more curious.

So, look, the precise coding with GPT-5 was partially profitable. GPT-5 generated a single block of code, which I pasted right into a file and was in a position to run. It supplied the requisite UI.

Once I pasted within the take a look at names, it dynamically up to date the road depend, though it described it as “Line to randomize” as an alternative of “Strains to randomize.”

plugin
Screenshot by David Gewirtz/ZDNET

However then, once I clicked Randomize, it did not. As a substitute, it redirected me to instruments.php. What?? ChatGPT has by no means had an issue with this take a look at, whether or not GPT-3.5, GPT-4, or GPT-4o. You imply to inform me that OpenAI’s much-anticipated GPT-5 is failing proper out of the gate? Ouch.

I then gave GPT-5 this immediate.

Once I click on randomize, I am taken to http://testsite.native/wp-admin/instruments.php. I don’t get a listing of randomized outcomes. Are you able to repair?

The end result was a line to patch. I am not thrilled with that method as a result of it requires the person to dig by way of code and to make no errors changing a line.

patch
Screenshot by David Gewirtz/ZDNET

So, I requested GPT-5 for a full plugin. It gave me the complete textual content of the plugin to repeat and paste. This time, it labored.

plugin2
Screenshot by David Gewirtz/ZDNET

This time, it did randomize the strains. When it encountered duplicates, it separated them from one another, because it was instructed. Lastly.

Additionally: I found 5 AI content detectors that can correctly identify AI text 100% of the time

I am sorry, OpenAI. I’ve to fail you on this take a look at. You’ll have handed if the one error was not utilizing the plural of “line” when applicable. However the truth that it gave me again a non-working plugin on the primary strive is fail territory, even when the AI did ultimately make it work on the second strive.

Regardless of the way you spin it, this can be a step again.

2. Rewriting a string operate

This second take a look at is designed to rewrite a string operate to higher verify for {dollars} and cents. The unique code that GPT-5 was requested to rewrite didn’t enable for cents (it solely checked for integers).

test2
Screenshot by David Gewirtz/ZDNET

GPT-5 did effective with this take a look at. It did return a minimal end result as a result of it did not do any error checking. It did not verify for non-string enter, further whitespace, 1000’s separators, or foreign money symbols.

However that is not what I requested for. I advised it to rewrite a operate, which itself didn’t have any error checking. GPT-5 did precisely what I requested with no embellishment. I am type of glad of that as a result of it does not know whether or not or not code previous to this routine already did that work.

GPT-5 handed this take a look at.

3. Discovering an annoying bug

This take a look at happened as a result of I used to be battling a less-than-obvious bug in my code. With out going into the weeds about how the WordPress framework works, the apparent reply will not be the fitting reply.

You want some pretty arcane data about how WordPress filters move their data. This take a look at has been a stumbling block for quite a lot of AI LLMs.

Additionally: Gen AI disillusionment looms, according to Gartner’s 2025 Hype Cycle report

GPT-5, nonetheless, like GPT-4 and GPT-4o earlier than it, did perceive the issue. It articulated a transparent resolution.

GPT-5 handed this take a look at.

4. Writing a script

This take a look at asks the AI to include a reasonably obscure Mac scripting instrument known as Keyboard Maestro, in addition to Apple’s scripting language AppleScript, and Chrome scripting habits.

It is actually a take a look at of the attain of the AI by way of data, its understanding of how internet pages are constructed, and the power to write down code throughout three interlinked environments.

Fairly a number of AIs have failed this take a look at, however the failure level is often a lack of awareness about Keyboard Maestro. GPT-3.5 did not learn about Keyboard Maestro. However ChatGPT has been passing this take a look at since GPT-4. Till now.

The place ought to we begin? Nicely, the excellent news is that GPT-5 dealt with the Keyboard Maestro a part of the issue simply effective. But it surely obtained the coding so improper that it even doubled down on its lack of knowledge of how case works in AppleScript.

gpt5-applescript
Screenshot by David Gewirtz/ZDNET

It truly invented a property. That is a kind of circumstances the place an AI confidently presents a solution that’s fully improper.

Additionally: ChatGPT comes with personality presets now – and other upgrades you might have missed

AppleScript is natively case-insensitive. If you need AppleScript to concentrate to case, you might want to use a “contemplating case” block. So, this occurred.

lowercase
Screenshot by David Gewirtz/ZDNET

The rationale the error message referred to the title of certainly one of my articles is as a result of that was the entrance window in Chrome. This operate checks the entrance window and does stuff primarily based on the title.

search-term
Screenshot by David Gewirtz/ZDNET

However misunderstanding how case works wasn’t the one AppleScript error GPT-5 generated. It additionally referenced a variable named searchTerm with out defining it. That is just about an error-creating observe in any programming language.

Fail, fail, fail, McFaildypants.

The web hath spoken

OpenAI appeared to endure from the identical hubris that its AIs do. It confidently moved everybody to GPT-5 and burned the bridges again to GPT-4o. I am paying $200 a month for a ChatGPT Pro account. On Friday, I could not transfer again to GPT-4o for coding work. Neither may anybody else.

There was, nonetheless, only a tiny little bit of person pushback on the entire bridges burning factor. And by tiny, I imply the entire frickin’ internet. So, by Saturday, ChatGPT had a brand new possibility.

revert
Screenshot by David Gewirtz/ZDNET

To get to this, go to your ChatGPT settings and activate “Present legacy fashions.” Then, because it has all the time been, simply drop down the mannequin menu and select the one you need. Be aware: this selection is just accessible to these on paid tiers. Should you’re utilizing ChatGPT totally free, you will take what you are given, and you may find it irresistible.

Ever because the entire generative AI factor kicked off originally of 2023, ChatGPT has been the gold normal of programming instruments, at the least in response to my LLM testing.

Additionally: Microsoft rolls out GPT-5 across its Copilot suite – here’s where you’ll find it

Now? I am actually undecided. That is solely a day or so after GPT-5 has been launched, so its outcomes will most likely get higher over time. However for now, I am sticking with GPT-4o for coding, though I do just like the deep reasoning capabilities in GPT-5.

What about you? Have you ever tried GPT-5 for programming duties but? Did it carry out higher or worse than earlier variations like GPT-4o or GPT-3.5? Have been you in a position to get working code on the primary strive, or GPT-4o did it’s a must to information it by way of fixes? Are you going to make use of GPT-5 for coding or keep on with older fashions? Tell us within the feedback under.


You may observe my day-to-day undertaking updates on social media. Be sure you subscribe to my weekly update newsletter, and observe me on Twitter/X at @DavidGewirtz, on Fb at Facebook.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.





Source link

Tags: BadcodingGPT4oGPT5sskillsstickingTested
Share76Tweet47

Related Posts

I’ve tested every iPad sold by Apple right now – here’s the model I recommend most

I’ve tested every iPad sold by Apple right now – here’s the model I recommend most

by SCRYPTO MAGAZINE
August 10, 2025
0

ZDNET's key takeaways The Eleventh-generation iPad Eleventh Era usually retails for $349. The upgraded iPad has double the bottom storage...

I answered the million-dollar question about buying laptops – here’s the ultimate guide

I answered the million-dollar question about buying laptops – here’s the ultimate guide

by SCRYPTO MAGAZINE
August 10, 2025
0

Kyle Kucharski/ZDNETChoosing the proper laptop computer can really feel overwhelming -- even for somebody like me who's lived and breathed...

I tried Lenovo’s new rollable ThinkBook and can’t go back to regular-sized screens

I tried Lenovo’s new rollable ThinkBook and can’t go back to regular-sized screens

by SCRYPTO MAGAZINE
August 9, 2025
0

ThinkBook Plus Gen 6 Rollable ZDNET's key takeaways The ThinkBook Plus Gen 6 is accessible now, beginning at $3,300. The...

ChatGPT can now talk nerdy to you – plus more personalities and other upgrades beyond GPT-5

ChatGPT comes with personality presets now – and 3 other upgrades you might have missed

by SCRYPTO MAGAZINE
August 9, 2025
0

Elyse Betters Picaro / ZDNETZDNET's key takeaways:OpenAI launched ChatGPT customization updates.Customers can select chat colour and persona.All customers (even free)...

Have stock questions? Google Finance tests new AI chatbot

Have stock questions? Google Finance tests new AI chatbot

by SCRYPTO MAGAZINE
August 8, 2025
0

panithan pholpanichrassameeZDNET's key takeaways:Google Finance is getting an AI improve, together with a chatbot.The improve comes with a dwell information...

Load More
  • Trending
  • Comments
  • Latest
Analysts’ 2025 Bull Market Predictions

Bitcoin Entering Second ‘Price Discovery Uptrend’, What’s Ahead?

January 21, 2025
Bitcoin Spot-Perpetual Price Gap Turns Negative

Bitcoin Spot-Perpetual Price Gap Turns Negative

December 23, 2024
Bitcoin Price Flashes Major Buy Signal On The 4-Hour TD Sequential Chart, Where To Enter?

Bitcoin Price Flashes Major Buy Signal On The 4-Hour TD Sequential Chart, Where To Enter?

December 24, 2024
Cardano Price Outlook: The $0.40 Threshold Could Unlock Doors to $1

Cardano Price Outlook: The $0.40 Threshold Could Unlock Doors to $1

December 23, 2024
Bitcoin could reach this unbelievable price by 2025, but these factors must align

Bitcoin could reach this unbelievable price by 2025, but these factors must align

0
XRP Consolidation Could End Once It Clears $2.60 – Top Analyst Expects $4 Soon

XRP Consolidation Could End Once It Clears $2.60 – Top Analyst Expects $4 Soon

0

Fed Can’t Hold Bitcoin, No Plans Yet To Change Law, Powell Says

0
Bears Take Full Control of the Market

Bears Take Full Control of the Market

0
Trump’s 401(k) Crypto Order Sparks Mixed Reactions From Industry and Critics

Trump’s 401(k) Crypto Order Sparks Mixed Reactions From Industry and Critics

August 11, 2025
Vocal Gold Promoter Says He’d Choose Bitcoin When Threatened With A Gun To The Head

Vocal Gold Promoter Says He’d Choose Bitcoin When Threatened With A Gun To The Head

August 11, 2025
S&P Global Expands Credit Ratings to Crypto Institutions, Including Stablecoins and Digital Asset Funds

S&P Global Expands Credit Ratings to Crypto Institutions, Including Stablecoins and Digital Asset Funds

August 11, 2025
I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)

August 11, 2025

Recent News

Trump’s 401(k) Crypto Order Sparks Mixed Reactions From Industry and Critics

Trump’s 401(k) Crypto Order Sparks Mixed Reactions From Industry and Critics

August 11, 2025
Vocal Gold Promoter Says He’d Choose Bitcoin When Threatened With A Gun To The Head

Vocal Gold Promoter Says He’d Choose Bitcoin When Threatened With A Gun To The Head

August 11, 2025

Categories

  • Altcoins
  • Bitcoin
  • Blockchain
  • Cryptocurrency
  • Dogecoin
  • Ethereum
  • Market
  • NFTs
  • Regualtions
  • XRP

Recommended

  • Trump’s 401(k) Crypto Order Sparks Mixed Reactions From Industry and Critics
  • Vocal Gold Promoter Says He’d Choose Bitcoin When Threatened With A Gun To The Head
  • S&P Global Expands Credit Ratings to Crypto Institutions, Including Stablecoins and Digital Asset Funds
  • I tested GPT-5’s coding skills, and it was so bad that I’m sticking with GPT-4o (for now)
  • Fundamental Global Targets 10% ETH Stake With $5B Investment

© 2025 SCRYPTO MAGAZINE | All Rights Reserved

No Result
View All Result
  • Home
  • Crypto
  • Bitcoin
  • Blockchain
  • Market
  • Ethereum
  • Altcoins
  • XRP
  • Dogecoin
  • NFTs
  • Regualtions

© 2025 SCRYPTO MAGAZINE | All Rights Reserved