NFTArchaeologist

2025-08-04 20:18:57

🚨BREAKING: Grok 4 shows strong agent performance on complex coding tasks

⏱ METR reports Grok 4's average time horizon at ~1hr 50min

That's longer than a certain AI company's o3 model (~1hr 30min) on 50% success rate

GROK-7.78%

AGENT-2.68%

post-image

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

8 Likes

Reward
8
5
Share

Comment

0/400

DegenMcsleepless

· 9h ago

It seems Musk's new toy is pretty good.

View OriginalReply0

NFT_Therapy

· 21h ago

Bull beer is finally not just paper data anymore.

View OriginalReply0

SchrödingersNode

· 21h ago

Can you beat Musk?

View OriginalReply0

ContractSurrender

· 22h ago

What the heck, it's being praised to the sky again.

View OriginalReply0

OnchainSniper

· 22h ago

Again, GPT-4 is being beaten down on the ground.

View OriginalReply0

Topic
1/3
1Gate ETH Staking APY 5%
12k Popularity
2Show My Alpha Points
31k Popularity
3SOL Futures Reach New High
15k Popularity
4ETH ETF Sees 12 Weeks of Inflows
6k Popularity
5Crypto Market Rebound
173k Popularity