Skip to Content

Mac Studio With M3 Ultra Runs Massive DeepSeek R1 AI Model Locally

YouTuber Dave Lee of Dave2D fame has demonstrated how Apple's new Mac Studio equipped with an M3 Ultra chip can efficiently run a huge version of the DeepSeek R1 AI model locally, provided that users spec the machine with the maximum 512GB of memory.

Mac Studio 2025
According to Lee's testing, the 671 billion parameter AI model can be executed directly on Apple's high-end workstation, but it requires substantial memory resources, consuming 404GB of storage and requiring the manual allocation of 448GB of video RAM through Terminal commands.

The M3 Ultra's unified memory architecture is key to this performance, allowing the system to handle a 4-bit quantized version of DeepSeek R1 efficiently. The quantization slightly reduces accuracy, but it maintains all parameters and delivers approximately 17-18 tokens per second, which is sufficient for many practical applications.

Perhaps most impressively, the Mac Studio accomplishes this while consuming under 200 watts of power. Comparable performance on traditional PC hardware would require multiple GPUs drawing approximately ten times more electricity.

The capability to run such advanced AI models locally offers privacy advantages for sensitive applications like healthcare data analysis, where sending information to cloud services raises security concerns.


However, this performance doesn't come cheap – a Mac Studio configured with M3 Ultra and 512GB of RAM starts at around $10,000. Fully maxed out, an M3 Ultra Mac Studio with 16TB of SSD storage and an Apple M3 Ultra chip with 32-core CPU, 80-core GPU, and 32-core Neural Engine costs a cool $14,099. Of course, for organizations requiring local AI processing of sensitive data, the Mac Studio offers a relatively power-efficient solution compared to alternative hardware configurations.

Apple says the M3 Ultra is the fastest Mac chip it has ever released, thanks to its strategy of fusing two M3 Max chips together using the company's "UltraFusion" technology. This makes the chip's specs double that of the M3 Max.

Related Roundup: Mac Studio
Buyer's Guide: Mac Studio (Neutral)
Related Forum: Mac Studio

Popular Stories

tim cook data privacy day

Tim Cook Warned by CIA That China Could Move on Taiwan by 2027

Tuesday February 24, 2026 4:03 am PST by
Apple CEO Tim Cook was among a handful of top tech executives who attended a classified CIA briefing warning that China could attack Taiwan by 2027, according to a sweeping investigative report by The New York Times ($). The previously unreported briefing was apparently held in a secure room in Silicon Valley in July 2023. The meeting is said to have been arranged at the request of the...
iphone fold text

iPhone Fold Crease Measurements Revealed as Device Hits Production

Wednesday February 25, 2026 5:37 am PST by
Apple has submitted production line orders for its upcoming foldable iPhone, effectively confirming that the device will launch this year, claims a Chinese leaker. According to the Weibo account "Fixed Focus Digital," assembly lines recently received the orders from Apple, which has apparently allowed the leaker to learn the crease measurements for the device's 7.8-inch inner display....
Apple Announces Special Event in New York Feature 1

Apple Reportedly Plans to Unveil at Least Five New Products Next Week

Sunday February 22, 2026 9:48 am PST by
In his Power On newsletter today, Bloomberg's Mark Gurman said Apple will have a three-day stretch of product announcements from Monday, March 2 through Wednesday, March 4. In total, he expects Apple to introduce "at least five products." Subscribe to the MacRumors YouTube channel for more videos. A week ago, Apple invited selected journalists and content creators to an "Apple Experience" in...

Top Rated Comments

FSMBP Avatar
13 months ago
Cool - but how fast can it load MacRumors.com in Safari??? I want real-world cases for myself before I plunk down $15K.
Score: 27 Votes (Like | Disagree)
13 months ago
"Perhaps most impressively, the Mac Studio accomplishes this while consuming under 200 watts of power. Comparable performance on traditional PC hardware would require multiple GPUs drawing approximately ten times more electricity."

Assuming 1 hour of LLM use per day at $0.18 per kWh, that's about $11 per month to run a "comparable" PC versus about $1 for the Mac Studio. The Mac Studio is cheap to run.

That was just an estimation in the video about how much more electricity it would use.

The reality is worse for the non-Mac solution, not even counting the fact that you'd need to buy many GPUs to balance out the available RAM on the Mac Studio. You'd need 16 RTX 5090s to get 512 GB of RAM. Each of those idles at maybe 50 W. Let's say 400 W under load. Add in the draw of the CPU and more.

Using that estimate, 1 hour of LLM use per day would be about $40 per month in electricity costs versus $1 for the Mac Studio (using an $0.18 per kWh cost). Add to that the cost of the GPUs (let's be generous and say only $32,000 for 16 5090s), etc. and you're looking at something that costs about 4x the Mac Studio and is about 40x more electricity to run for similar performance.

Scale that up. Let's say this is a business running an LLM for 10 hours per day. That's $10 per month for the Mac and $400 per month for the non-Mac solution. It doesn't take long for the Mac to pay for itself just in electricity savings alone. Although, if you live somewhere cold and want to use your 16 5090s as your heating, maybe you can offset some heating costs that way. Just be careful you don't end up with too much heat ('https://www.theverge.com/news/609207/nvidia-rtx-5090-power-connector-melting-burning-issues')!

The 512 GB Studio is not for everyone, but it's a terrific value for some people and uses.

Although, I’ll add that with an nvidia card in a Linux box, you can offload into RAM too with various software managing LLMs. So you will not strictly need 16 5090s to reach the total RAM capacity of the Mac Studio solution. Performance of LLMs offloaded into RAM will not be great, however. This means there’s not much quite like the new Studio for local LLMs.
Score: 18 Votes (Like | Disagree)
surfzen21 Avatar
13 months ago
If LLMs are a significant part of the future of computing and privacy is going to be a huge part of that, then Apple has a huge advantage from a hardware perspective. This is the real unspoken hero of what Apple is doing.

While everyone is focused on a delayed "AI" end user roll out, and some absolutely losing their stuff over it, Apple has created the hardware that is blowing away all the competition. Once the software side catches up, Apple will be lightyears ahead.

Keep in mind, big players like META and Google are absolutely pirating any data they can get their hands on. Unfortunately, they are too big to fail like thepiratebay is.

Try to buy a Nvidia 5090 with a measly 32GB of Vram that needs a disgusting amount of power to run. The fake MSRP is $2,000 and are being sold on eBay for $7,000.

With all the crying going on I think Apple is doing this exactly right.
Score: 16 Votes (Like | Disagree)
13 months ago
Would it be safe to say that in 5-10 years a smartphone will be able to run a model like this internally and without the internet?
Score: 14 Votes (Like | Disagree)
AusMness Avatar
13 months ago
This new Mac Studio is the king of local LLMs
Score: 12 Votes (Like | Disagree)
13 months ago

If LLMs are a significant part of the future of computing and privacy is going to be a huge part of that, then Apple has a huge advantage from a hardware perspective. This is the real unspoken hero of what Apple is doing.

While everyone is focused on a delayed "AI" end user roll out, and some absolutely losing their stuff over it, Apple is created the hardware that is blowing away all the competition. Once the software side catches up, Apple will be lightyears ahead.

Keep in mind big players like META and Google are absolutely pirating any data they can get their hands on. Unfortunately, they are too big to fail like thepiratebay is.

Try to buy a Nvidia 5090 with a measly 32GB of Vram that needs a disgusting amount of power to run. The fake MSRP is $2,000 and are being sold on eBay for $7,000.

Will all the crying going on I think Apple is doing this exactly right.
But isn't it easier to just complain rather than trying to understand stuff?
Score: 12 Votes (Like | Disagree)