DeepSeek AI Sends Tech Stocks Tumbling as Cost Claims Spark Debate

Chinese AI startup DeepSeek sends tech stocks tumbling after claiming it trained competitive AI models using fraction of typical computing resources.

Al Landes Avatar
Al Landes Avatar

By

Our editorial process is built on human expertise, ensuring that every article is reliable and trustworthy. AI helps us shape our content to be as accurate and engaging as possible.
Learn more about our commitment to integrity in our Code of Ethics.

Image Credit: Deep Seek

Key Takeaways

  • DeepSeek claims to match Western AI performance with just 2,000 specialized chips and $5.6 million
  • Stock market reaction suggests fundamental shift in perception of AI infrastructure requirements
  • Open-source release under MIT license enables independent verification of company’s claims

Nvidia shares dropped more than 12% today after Chinese AI company DeepSeek claimed it trained its latest models using just 2,000 specialized chips, challenging assumptions about the massive computing infrastructure needed for advanced AI development.

Why it matters: DeepSeek’s assertion that it spent only $5.6 million training its models fundamentally challenges the AI industry’s conventional wisdom that developing competitive models requires billions in computing infrastructure, potentially threatening U.S. technological dominance.

Industry Impact: The dramatic market response reflects growing uncertainty about the future of AI infrastructure investments. DeepSeek’s claims suggest that sophisticated AI models can be developed with far fewer resources than previously thought, raising questions about planned investments like the $500 billion Stargate Project.

  • Major tech stocks decline across sector
  • Nvidia leads losses with 12% drop
  • Data center investments questioned

Jeremie Harris, CEO of Gladstone: “DeepSeek only has access to a few thousand GPUs, and yet they’re pulling this off. So this raises the obvious question: what happens when they get an allocation from the Chinese Communist Party to proceed at full speed?”, Harris said via Time.

Technical Achievement: DeepSeek’s approach combines innovative software techniques with efficient hardware utilization. The company claims its models match or exceed Western counterparts through advanced reinforcement learning strategies and novel attention mechanisms that maximize computational efficiency.

  • Mixture-of-Experts architecture
  • Multi-head Latent Attention
  • Efficient resource optimization

Market Response: While some experts question DeepSeek’s claims, the company’s rapid rise on app store rankings and strong benchmark performance have lent credibility to its assertions:

  • Top downloaded app in U.S. App Store
  • Competitive performance on key benchmarks
  • MIT license enables verification

Looking Forward: As the industry grapples with these claims, the implications could reshape how companies approach AI development, potentially shifting focus from hardware scaling to software optimization. 

Share this

At Gadget Review, our guides, reviews, and news are driven by thorough human expertise and use our Trust Rating system and the True Score. AI assists in refining our editorial process, ensuring that every article is engaging, clear and succinct. See how we write our content here →