DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI



Have you heard about the latest development in the AI industry? Chinese AI startup DeepSeek has quietly introduced a groundbreaking new large language model called DeepSeek-V3-0324. This 641-gigabyte model is causing a stir not only for its capabilities but also for its unique deployment approach.

What sets this launch apart is the model’s MIT license, making it freely available for commercial use. Early reports suggest that it can operate on consumer-grade hardware like Apple’s Mac Studio with the M3 Ultra chip.

The ability to run such a massive model locally on consumer hardware is a significant departure from the data-center infrastructure that cutting-edge AI models typically require.

DeepSeek’s Stealth Launch Strategy Disrupts AI Market Expectations

DeepSeek’s latest model, DeepSeek-V3-0324, arrived on the scene without any fanfare. No whitepaper, blog post, or marketing push accompanied its release — just the model weights themselves. This unconventional approach challenges the traditional hype-filled product launches seen in the Western AI industry.

Early testers are already reporting significant improvements over the previous version. AI researcher Xeophon declared, “Tested the new DeepSeek V3 on my internal bench and it has a huge jump in all metrics on all tests. It is now the best non-reasoning model, dethroning Sonnet 3.5.”

These bold claims position DeepSeek's new model above industry leaders like Anthropic's Claude 3.5 Sonnet, setting a new standard for non-reasoning model performance.

How DeepSeek V3-0324’s Breakthrough Architecture Achieves Unmatched Efficiency

DeepSeek-V3-0324 utilizes a revolutionary mixture-of-experts (MoE) architecture that transforms the way large language models function. Unlike traditional models that activate all parameters for every task, DeepSeek’s approach selectively activates only about 37 billion of its 685 billion parameters for specific tasks.

This selective activation marks a significant shift in model efficiency. By activating only the most relevant parameters for each task, DeepSeek achieves performance comparable to larger models while reducing computational demands.
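
The routing idea behind selective activation can be sketched in a few lines. This is a minimal, illustrative top-k gating example, not DeepSeek's actual implementation; the expert count, dimensions, and function names here are invented for the demo.

```python
import numpy as np

def moe_forward(x, gate_w, experts, k=2):
    """Route one token through only the top-k experts (MoE routing sketch).

    x: (d,) token hidden state; gate_w: (d, n_experts) router weights;
    experts: list of callables, each standing in for an expert network.
    """
    logits = x @ gate_w                       # router score per expert
    topk = np.argsort(logits)[-k:]            # indices of the k best experts
    weights = np.exp(logits[topk])
    weights /= weights.sum()                  # softmax over selected experts only
    # Only k of n experts actually run; the rest stay idle, saving compute.
    return sum(w * experts[i](x) for w, i in zip(weights, topk))

# Toy demo: 8 experts, 2 active per token
rng = np.random.default_rng(0)
d, n = 16, 8
experts = [lambda v, W=rng.normal(size=(d, d)): v @ W for _ in range(n)]
gate_w = rng.normal(size=(d, n))
out = moe_forward(rng.normal(size=d), gate_w, experts)
print(out.shape)  # (16,)
```

At DeepSeek's scale the same principle means roughly 37 billion of 685 billion parameters participate in any one forward pass, while the remainder sit idle.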

The model also incorporates two groundbreaking technologies: Multi-Head Latent Attention (MLA) and Multi-Token Prediction (MTP). MLA enhances the model’s contextual understanding, while MTP generates multiple tokens per step, increasing output speed by nearly 80%.

Moreover, a 4-bit quantized version reduces the storage footprint to 352GB, enabling high-end consumer hardware like the Mac Studio with the M3 Ultra chip to run the model efficiently.
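
The storage figure follows from simple arithmetic. A back-of-envelope sketch (the reported 352GB is slightly above the raw estimate, presumably because some layers are kept at higher precision, which is an assumption on our part):

```python
params = 685e9                 # total parameter count (685 billion)

# Full-precision (16-bit) baseline vs. 4-bit quantized storage
fp16_bytes = params * 16 / 8   # 2 bytes per weight
q4_bytes = params * 4 / 8      # 0.5 bytes per weight

print(f"fp16: ~{fp16_bytes / 1e12:.2f} TB")   # ~1.37 TB
print(f"4-bit: ~{q4_bytes / 1e9:.1f} GB")     # ~342.5 GB raw weights
```

A roughly 4x reduction is what brings the model within reach of a 512GB Mac Studio.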

This shift towards more efficient AI deployment challenges the conventional infrastructure requirements associated with top-tier model performance, signaling a potential transformation in the industry.

China’s Open-Source AI Revolution Challenges Silicon Valley’s Closed Garden Model

DeepSeek’s release strategy underscores a fundamental difference in AI business philosophy between Chinese and Western companies. While U.S. leaders tend to keep their models behind paywalls, Chinese AI firms are embracing open-source licensing, catalyzing a revolution in China’s AI ecosystem.

The open availability of cutting-edge models in China accelerates innovation, enabling startups, researchers, and developers to leverage advanced AI technology without significant financial barriers. This rapid progress has caught Western observers off guard.

Even established Chinese tech giants like Baidu, Alibaba, and Tencent are joining the open-source movement, releasing sophisticated AI models with specialized capabilities. This shift contrasts with the API-centric approach favored by Western leaders.

The open-source strategy also addresses unique challenges faced by Chinese AI companies, such as limited access to advanced Nvidia chips. This focus on efficiency and optimization has become a potential competitive advantage, driving innovation in resource-constrained environments.

DeepSeek V3-0324: The Foundation for an AI Reasoning Revolution

The characteristics and timing of DeepSeek-V3-0324 hint at the upcoming DeepSeek-R2 model, an enhanced reasoning-focused model expected in the next few months. This sequential release pattern suggests that DeepSeek’s base models pave the way for specialized reasoning models.

If DeepSeek-R2 follows the trajectory of its predecessor, it could pose a significant challenge to OpenAI's anticipated GPT-5. The contrast between DeepSeek's open, resource-efficient strategy and OpenAI's closed, heavily funded approach reflects two competing visions for the future of AI.

The democratization of AI technology through open-source models has the potential to reshape the global AI landscape, enabling broader access to advanced AI systems and fostering innovation on a scale previously unseen.

How to Experience DeepSeek V3-0324: A Complete Guide for Developers and Users

Excited to explore DeepSeek-V3-0324? Whether you’re a developer or a curious user, there are various ways to engage with this cutting-edge AI model.

For those with technical resources, downloading the complete model weights from Hugging Face is an option. However, cloud-based solutions offer a more accessible entry point for most users.

Platforms like OpenRouter provide API access to the model as well as a user-friendly chat interface. DeepSeek's own chat interface at chat.deepseek.com may also offer access to the new version.

Developers seeking to integrate the model into applications can utilize various inference providers like Hyperbolic Labs, which offers API access compatible with the OpenAI SDK.
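
Because these providers expose OpenAI-compatible endpoints, integration follows the familiar chat-completions pattern. A hedged sketch: the base URL and model identifier below are assumptions, so check your provider's documentation for the actual values.

```python
def build_chat_request(model, prompt):
    """Build an OpenAI-style chat completion payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

# Model identifier is illustrative; providers may name it differently.
payload = build_chat_request("deepseek-ai/DeepSeek-V3-0324", "Hello!")

# With the OpenAI SDK, point the client at the provider's base URL, e.g.:
#   from openai import OpenAI
#   client = OpenAI(base_url="https://api.hyperbolic.xyz/v1", api_key=KEY)
#   resp = client.chat.completions.create(**payload)
#   print(resp.choices[0].message.content)
print(payload["model"])
```

Swapping providers then amounts to changing the base URL and API key, since the request shape stays the same.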

DeepSeek’s New Model Prioritizes Technical Precision Over Conversational Warmth

Users have noticed a shift in DeepSeek’s communication style with the introduction of V3-0324. While previous models had a conversational tone, the latest version adopts a more formal, technically-oriented persona.

This deliberate design choice aligns the model for professional and technical applications, catering to specialized use cases that benefit from a precise communication style.

How DeepSeek’s Open Source Strategy Is Redrawing the Global AI Landscape

DeepSeek’s open-source approach not only represents a technical milestone but also embodies a vision for widespread technology dissemination. By making advanced AI freely available, DeepSeek accelerates innovation and challenges the dominance of closed AI models.

This philosophy is narrowing the AI gap between China and the U.S., signaling a shift in how AI technology is accessed and utilized globally.

As DeepSeek-V3-0324 spreads across research labs and developer communities worldwide, the competition in AI is no longer just about building powerful models. It’s about empowering more people to leverage AI technology. DeepSeek’s open-source strategy hints at a future where technology is shared freely, influencing how AI shapes our world.
