Mac Studio With M3 Ultra Runs Massive DeepSeek R1 AI Model Locally

YouTuber Dave Lee of Dave2D fame has demonstrated how Apple's new Mac Studio equipped with an M3 Ultra chip can efficiently run a huge version of the DeepSeek R1 AI model locally, provided that users spec the machine with the maximum 512GB of memory.

Mac Studio 2025
According to Lee's testing, the 671 billion parameter AI model can be executed directly on Apple's high-end workstation, but it requires substantial memory resources, consuming 404GB of storage and requiring the manual allocation of 448GB of video RAM through Terminal commands.

The M3 Ultra's unified memory architecture is key to this performance, allowing the system to handle a 4-bit quantized version of DeepSeek R1 efficiently. The quantization slightly reduces accuracy, but it maintains all parameters and delivers approximately 17-18 tokens per second, which is sufficient for many practical applications.

Perhaps most impressively, the Mac Studio accomplishes this while consuming under 200 watts of power. Comparable performance on traditional PC hardware would require multiple GPUs drawing approximately ten times more electricity.

The capability to run such advanced AI models locally offers privacy advantages for sensitive applications like healthcare data analysis, where sending information to cloud services raises security concerns.


However, this performance doesn't come cheap – a Mac Studio configured with M3 Ultra and 512GB of RAM starts at around $10,000. Fully maxed out, an M3 Ultra Mac Studio with 16TB of SSD storage and an Apple M3 Ultra chip with 32-core CPU, 80-core GPU, and 32-core Neural Engine costs a cool $14,099. Of course, for organizations requiring local AI processing of sensitive data, the Mac Studio offers a relatively power-efficient solution compared to alternative hardware configurations.

Apple says the M3 Ultra is the fastest Mac chip it has ever released, thanks to its strategy of fusing two M3 Max chips together using the company's "UltraFusion" technology. This makes the chip's specs double that of the M3 Max.

Related Roundup: Mac Studio
Buyer's Guide: Mac Studio (Buy Now)
Related Forum: Mac Studio

Popular Stories

apple watch ultra yellow

What's Next for the Apple Watch Ultra 3 and Apple Watch SE 3

Friday April 25, 2025 2:44 pm PDT by
This week marks the 10th anniversary of the Apple Watch, which launched on April 24, 2015. Yesterday, we recapped features rumored for the Apple Watch Series 11, but since 2015, the Apple Watch has also branched out into the Apple Watch Ultra and the Apple Watch SE, so we thought we'd take a look at what's next for those product lines, too. 2025 Apple Watch Ultra 3 Apple didn't update the...
iphone 16 display

iPhone 17's Scratch Resistant Anti-Reflective Display Coating Canceled

Monday April 28, 2025 12:48 pm PDT by
Apple may have canceled the super scratch resistant anti-reflective display coating that it planned to use for the iPhone 17 Pro models, according to a source with reliable information that spoke to MacRumors. Last spring, Weibo leaker Instant Digital suggested Apple was working on a new anti-reflective display layer that was more scratch resistant than the Ceramic Shield. We haven't heard...
iPhone 17 Air Pastel Feature

iPhone 17 Reaches Key Milestone Ahead of Mass Production

Monday April 28, 2025 8:44 am PDT by
Apple has completed Engineering Validation Testing (EVT) for at least one iPhone 17 model, according to a paywalled preview of an upcoming DigiTimes report. iPhone 17 Air mockup based on rumored design The EVT stage involves Apple testing iPhone 17 prototypes to ensure the hardware works as expected. There are still DVT (Design Validation Test) and PVT (Production Validation Test) stages to...
Beyond iPhone 13 Better Blue

20th Anniversary iPhone Likely to Be Made in China Due to 'Extraordinarily Complex' Design

Monday April 28, 2025 4:29 am PDT by
Apple will likely manufacture its 20th anniversary iPhone models in China, despite broader efforts to shift production to India, according to Bloomberg's Mark Gurman. In 2027, Apple is planning a "major shake-up" for the iPhone lineup to mark two decades since the original model launched. Gurman's previous reporting indicates the company will introduce a foldable iPhone alongside a "bold"...
iPhone 17 Air Pastel Feature

iPhone 17 Air Launching Later This Year With These 16 New Features

Thursday April 24, 2025 8:24 am PDT by
While the so-called "iPhone 17 Air" is not expected to launch until September, there are already plenty of rumors about the ultra-thin device. Overall, the iPhone 17 Air sounds like a mixed bag. While the device is expected to have an impressively thin and light design, rumors indicate it will have some compromises compared to iPhone 17 Pro models, including only a single rear camera, a...
iPhone 17 Pro Blue Feature Tighter Crop

iPhone 17 Pro Launching Later This Year With These 13 New Features

Wednesday April 23, 2025 8:31 am PDT by
While the iPhone 17 Pro and iPhone 17 Pro Max are not expected to launch until September, there are already plenty of rumors about the devices. Below, we recap key changes rumored for the iPhone 17 Pro models as of April 2025: Aluminum frame: iPhone 17 Pro models are rumored to have an aluminum frame, whereas the iPhone 15 Pro and iPhone 16 Pro models have a titanium frame, and the iPhone ...

Top Rated Comments

FSMBP Avatar
6 weeks ago
Cool - but how fast can it load MacRumors.com in Safari??? I want real-world cases for myself before I plunk down $15K.
Score: 27 Votes (Like | Disagree)
neuropsychguy Avatar
6 weeks ago
"Perhaps most impressively, the Mac Studio accomplishes this while consuming under 200 watts of power. Comparable performance on traditional PC hardware would require multiple GPUs drawing approximately ten times more electricity."

Assuming 1 hour of LLM use per day at $0.18 per kWh, that's about $11 per month to run a "comparable" PC versus about $1 for the Mac Studio. The Mac Studio is cheap to run.

That was just an estimation in the video about how much more electricity it would use.

The reality is worse for the non-Mac solution, not even counting the fact that you'd need to buy many GPUs to balance out the available RAM on the Mac Studio. You'd need 16 RTX 5090s to get 512 GB of RAM. Each of those idles at maybe 50 W. Let's say 400 W under load. Add in the draw of the CPU and more.

Using that estimate, 1 hour of LLM use per day would be about $40 per month in electricity costs versus $1 for the Mac Studio (using an $0.18 per kWh cost). Add to that the cost of the GPUs (let's be generous and say only $32,000 for 16 5090s), etc. and you're looking at something that costs about 4x the Mac Studio and is about 40x more electricity to run for similar performance.

Scale that up. Let's say this is a business running an LLM for 10 hours per day. That's $10 per month for the Mac and $400 per month for the non-Mac solution. It doesn't take long for the Mac to pay for itself just in electricity savings alone. Although, if you live somewhere cold and want to use your 16 5090s as your heating, maybe you can offset some heating costs that way. Just be careful you don't end up with too much heat ('https://www.theverge.com/news/609207/nvidia-rtx-5090-power-connector-melting-burning-issues')!

The 512 GB Studio is not for everyone, but it's a terrific value for some people and uses.

Although, I’ll add that with an nvidia card in a Linux box, you can offload into RAM too with various software managing LLMs. So you will not strictly need 16 5090s to reach the total RAM capacity of the Mac Studio solution. Performance of LLMs offloaded into RAM will not be great, however. This means there’s not much quite like the new Studio for local LLMs.
Score: 18 Votes (Like | Disagree)
surfzen21 Avatar
6 weeks ago
If LLMs are a significant part of the future of computing and privacy is going to be a huge part of that, then Apple has a huge advantage from a hardware perspective. This is the real unspoken hero of what Apple is doing.

While everyone is focused on a delayed "AI" end user roll out, and some absolutely losing their stuff over it, Apple has created the hardware that is blowing away all the competition. Once the software side catches up, Apple will be lightyears ahead.

Keep in mind, big players like META and Google are absolutely pirating any data they can get their hands on. Unfortunately, they are too big to fail like thepiratebay is.

Try to buy a Nvidia 5090 with a measly 32GB of Vram that needs a disgusting amount of power to run. The fake MSRP is $2,000 and are being sold on eBay for $7,000.

With all the crying going on I think Apple is doing this exactly right.
Score: 16 Votes (Like | Disagree)
bunce66 Avatar
6 weeks ago
Would it be safe to say that in 5-10 years a smartphone will be able to run a model like this internally and without the internet?
Score: 14 Votes (Like | Disagree)
AusMness Avatar
6 weeks ago
This new Mac Studio is the king of local LLMs
Score: 12 Votes (Like | Disagree)
Howard2k Avatar
6 weeks ago

If LLMs are a significant part of the future of computing and privacy is going to be a huge part of that, then Apple has a huge advantage from a hardware perspective. This is the real unspoken hero of what Apple is doing.

While everyone is focused on a delayed "AI" end user roll out, and some absolutely losing their stuff over it, Apple is created the hardware that is blowing away all the competition. Once the software side catches up, Apple will be lightyears ahead.

Keep in mind big players like META and Google are absolutely pirating any data they can get their hands on. Unfortunately, they are too big to fail like thepiratebay is.

Try to buy a Nvidia 5090 with a measly 32GB of Vram that needs a disgusting amount of power to run. The fake MSRP is $2,000 and are being sold on eBay for $7,000.

Will all the crying going on I think Apple is doing this exactly right.
But isn't it easier to just complain rather than trying to understand stuff?
Score: 12 Votes (Like | Disagree)