Best in Technology

How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI

By News Room, 25 January 2025

Today, DeepSeek is one of the few leading AI firms in China that doesn’t rely on funding from tech giants like Baidu, Alibaba, or ByteDance.

A Young Group of Geniuses Eager to Prove Themselves

According to Liang, when he put together DeepSeek’s research team, he was not looking for experienced engineers to build a consumer-facing product. Instead, he focused on PhD students from China’s top universities, including Peking University and Tsinghua University, who were eager to prove themselves. Many had been published in top journals and won awards at international academic conferences, but lacked industry experience, according to the Chinese tech publication QBitAI.

“Our core technical positions are mostly filled by people who graduated this year or in the past one or two years,” Liang told 36Kr in 2023. The hiring strategy helped create a collaborative company culture where people were free to use ample computing resources to pursue unorthodox research projects. It’s a starkly different way of operating from established internet companies in China, where teams are often competing for resources. (A recent example: ByteDance accused a former intern—a prestigious academic award winner, no less—of sabotaging his colleagues’ work in order to hoard more computing resources for his team.)

Liang said that students can be a better fit for high-investment, low-profit research. “Most people, when they are young, can devote themselves completely to a mission without utilitarian considerations,” he explained. His pitch to prospective hires is that DeepSeek was created to “solve the hardest questions in the world.”

The fact that these young researchers are almost entirely educated in China adds to their drive, experts say. “This younger generation also embodies a sense of patriotism, particularly as they navigate US restrictions and choke points in critical hardware and software technologies,” explains Zhang. “Their determination to overcome these barriers reflects not only personal ambition but also a broader commitment to advancing China’s position as a global innovation leader.”

Innovation Born out of a Crisis

In October 2022, the US government started putting together export controls that severely restricted Chinese AI companies from accessing cutting-edge chips like Nvidia’s H100. The move presented a problem for DeepSeek. The firm had started out with a stockpile of 10,000 A100s, but it needed more to compete with firms like OpenAI and Meta. “The problem we are facing has never been funding, but the export control on advanced chips,” Liang told 36Kr in a second interview in 2024.

DeepSeek had to come up with more efficient methods to train its models. “They optimized their model architecture using a battery of engineering tricks—custom communication schemes between chips, reducing the size of fields to save memory, and innovative use of the mix-of-models approach,” says Wendy Chang, a software engineer turned policy analyst at the Mercator Institute for China Studies. “Many of these approaches aren’t new ideas, but combining them successfully to produce a cutting-edge model is a remarkable feat.”

DeepSeek has also made significant progress on Multi-head Latent Attention (MLA) and Mixture-of-Experts, two technical designs that make DeepSeek models more cost-effective by requiring fewer computing resources to train. In fact, DeepSeek’s latest model is so efficient that it required one-tenth the computing power of Meta’s comparable Llama 3.1 model to train, according to the research institution Epoch AI.
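
To give a rough sense of why Mixture-of-Experts can cut computing costs, here is a minimal sketch of top-k expert routing in plain Python. It is not DeepSeek’s implementation: the toy “experts”, the gate weights, and the names `make_expert` and `moe_forward` are all hypothetical, chosen only for illustration. The idea it shows is that a small gating step scores every expert for a given input, but only the few highest-scoring experts actually run, so most of the model’s parameters sit idle for any single input.

```python
import math


def softmax(scores):
    """Turn raw gate scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(scale):
    """Hypothetical toy expert: multiplies every input value by `scale`."""
    return lambda x: [scale * v for v in x]


def moe_forward(x, gate_weights, experts, top_k=2):
    """Route input x to the top_k experts and blend their outputs.

    gate_weights holds one row of weights per expert (an assumption for
    this sketch); the dot product of a row with x is that expert's score.
    """
    logits = [sum(w * v for w, v in zip(row, x)) for row in gate_weights]
    probs = softmax(logits)

    # Pick the indices of the top_k highest-probability experts.
    ranked = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:top_k]

    # Renormalize the chosen experts' probabilities so they sum to 1.
    norm = sum(probs[i] for i in chosen)

    out = [0.0] * len(x)
    for i in chosen:
        weight = probs[i] / norm
        y = experts[i](x)  # only the chosen experts are ever evaluated
        out = [o + weight * v for o, v in zip(out, y)]
    return out, chosen


# Four toy experts, of which only two run for this input.
experts = [make_expert(s) for s in (1.0, 2.0, 3.0, 4.0)]
gate_weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
output, used = moe_forward([1.0, 2.0], gate_weights, experts, top_k=2)
```

In this toy setup half of the experts are skipped for the input shown. In a real model each expert is a large neural network layer, which is where skipping the unselected ones would save computation.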

DeepSeek’s willingness to share these innovations with the public has earned it considerable goodwill within the global AI research community. For many Chinese AI companies, developing open source models is the only way to play catch-up with their Western counterparts, because it attracts more users and contributors, which in turn help the models grow. “They’ve now demonstrated that cutting-edge models can be built using less, though still a lot of, money and that the current norms of model-building leave plenty of room for optimization,” Chang says. “We are sure to see a lot more attempts in this direction going forward.”

The news could spell trouble for the current US export controls that focus on creating computing resource bottlenecks. “Existing estimates of how much AI computing power China has, and what they can achieve with it, could be upended,” Chang says.
