Intel finally launched the next generation Core Ultra 200V processors, codenamed Lunar Lake at Berlin, ahead of IFA. Intel’s new Lunar Lake architecture aims to provide competitive performance at ultra-low power for thin and compact laptops. After all, Qualcomm’s ARM-based Snapdragon X Elite has entered the Windows PC ecosystem, and it’s already grabbing headlines for its efficiency. Thus, let’s learn more about Intel’s Lunar Lake architecture and how it has been redesigned for efficiency.
Intel Lunar Lake Architecture
With Meteor Lake last year, Intel moved from its age-old monolithic design to tile-based chip design. And Lunar Lake takes it even further. Unlike Meteor Lake where the Compute tile only housed the CPU and cache, the Compute tile on Lunar Lake processors houses the CPU, cache, GPU, and NPU as well.
It means that the Compute tile is the largest tile on the die, and the best part this year is that it’s fabricated on TSMC’s N3B process node. Sure, TSMC’s N3B has a lower yield than the latest and much-improved N3E node, but Intel is finally moving away from Intel foundry to TSMC’s advanced 3nm process node, which is great.
As for the platform controller tile that provides I/O and connectivity, it’s built on TSMC’s 6nm (N6) node, like last year’s Meteor Lake. This is the first time Intel is designing its processor, but TSMC is manufacturing it. Finally, Intel is packaging the whole chipset using its own Foveros 3D tech.
Not just that, Intel has also moved the memory to the processor. It means that unified memory, similar to Apple M-series chips, will be available on Lunar Lake chips. The on-package LPDDR5X-8533 RAM is available in 16GB or 32GB capacity, offering memory speed up to 8,533 MT/s.
Overall, the Lunar Lake architecture has undergone significant changes. The CPU, GPU, NPU, and cache are now part of the Compute tile and it’s manufactured on TSMC’s 3nm (N3B) process node, which should lead to much better efficiency. And finally, memory is available directly on the SoC to reduce power consumption and improve bandwidth.
During the Computex event, Michelle Holthaus, executive VP and GM at Intel, said, “We’re going to bust the myth that [x86] can’t be as efficient.” Intel claims that x86-based Lunar Lake processors drops power consumption by a whopping 50% when compared to Meteor Lake chips.
It appears that Intel has made all the right moves to improve efficiency with Lunar Lake processors. Now, let’s learn about the new Lunar Lake CPU cores.
Intel Lunar Lake CPU
All Lunar Lake processors have 8 CPU cores — 4 performance (P) cores named Lion Cove, and 4 efficiency (E) cores named Skymont. As I mentioned above, the CPU is part of the Compute tile. Intel claims the P-core Lion Cove on Lunar Lake brings a 14% IPC gain over Meteor’s Lake Redwood Cove P core.
Intel has done something very different this time. The chipmaker has completely removed SMT (Simultaneous Multi-threading) after more than two decades from its processor. SMT, popularly known as HyperThreading, allows a core to perform two tasks in parallel. Intel says removing SMT helps in improving performance-per-watt by 15%.
To compensate for the lack of HyperThreading, Intel argues that Lunar Lake processors can execute more instructions per cycle instead of relying on parallel execution. This allows the processor to perform better in single-threaded tasks.
Now coming to the E core, I think Skymont is the headline feature of Lunar Lake processors. Intel says Skymont offers a massive 68% IPC improvement over Meteor’s Lake Crestmont E core. The 4-core Skymont cluster remains separate in a ‘Low Power Island’ from the P-core cluster with access to its own L3 cache.
As a result, Skymont consumes one-third of the power to match Crestmont’s peak performance. So overall, Skymont offers 2x more performance than the Crestmont core in single-threaded tasks.
On top of that, Intel has brought granularity to clock speed boosts with Lunar Lake. Instead of ramping up the clock speed by 100MHz, which consumes more power, Lunar Lake architecture can increment the clock speed by 16.67MHz to manage the power budget of any task.
The reduced frequency interval will lead to less power consumption. Overall, Intel says the Lunar Lake CPU can match the single-threaded performance of Meteor Lake at just half the power, which is quite impressive.
As for the competition, Intel says Lunar Lake processors not only match Snapdragon X Elite’s perf/watt figure, but performs 20% better in the same power envelope which is a big claim. As a result, it offers around 14 hours of battery life on office productivity tasks whereas X Elite can muster up to 9.5 hours. AMD HX 370 lasts up to 10.1 hours. It appears Intel has completely re-engineered the Lunar Lake chipset to offer great battery life, and we are excited to test it out.
Lunar Lake Geekbench Score (Leaked)
While Lunar Lake is scheduled to go on sale on 24th September, some Geekbench scores have already leaked. While running the lowest-end SKU (Core Ultra 5 228V), the 8-core CPU scored 2,621 in the single-core test and 10,072 in the multi-core test. The SKU clocks up to 4.5GHz with a default TDP of 17W (37W Maximum Turbo Power).
And the highest-end SKU (Core Ultra 9 288V) of Lunar Lake manages to score 2,790 in the single-core test and 11,048 in the multi-core test. In some other runs, it even managed to cross the 2,900 mark in single-threaded tasks. This particular SKU goes up to 5.1GHz and has a default TDP of 30W (Max 37W).
Intel Lunar Lake: New Xe2 GPU
The integrated GPU on Lunar Lake is built on the Battlemage graphics architecture and it features 8 second-gen Intel Xe cores. It also features larger ray tracing units for improved gaming performance and real-time ray tracing. Not only that, for AI tasks, the new Lunar Lake GPU can alone perform 67 trillion operations per second (TOPS). That’s pretty impressive, right?
In comparison to the Meteor Lake GPU, the Lunar Lake GPU is 31% faster while consuming half the power, and offers up to 60% faster XeSS AI upscaling as well. The Lunar Lake GPU absolutely decimates the Adreno GPU on the Snapdragon X Elite. It’s 68% faster than SD X Elite’s GPU and 16% faster than AMD’s latest HX 370 GPU.
Further, its display engine can handle three 4K HDR screens at 60Hz and a single 8K HDR screen at 60Hz. Finally, Lunar Lake processors support AV1 encoding and decoding as well.
Intel Lunar Lake NPU
Much was said about Meteor Lake’s weak NPU that could only execute up to 10 TOPS, but with Lunar Lake, Intel will be powering a range of Copilot+ PCs for local AI workloads (update coming in November). The new Lunar Lake NPU 4 can perform up to 48 TOPS alone, higher than Microsoft’s 40 TOPS eligibility ceiling for Copilot+ PCs.
Considering all the compute units, the processor can perform up to a massive 120 TOPS. The GPU can perform up to 67 TOPS, CPU up to 5 TOPS, and the NPU up to 48 TOPS — totaling 120 TOPS. This is even higher than Qualcomm’s total 75 TOPS processing capability on the Snapdragon X Elite. Keep in mind, the TOPS figure is based on INT8 data type.
Intel Lunar Lake SKUs
Below, you can check out all the SKUs of the Core Ultra (Series 2) processors based on the Lunar Lake architecture. There are nine different SKUs featuring eight CPU cores on all of them. The distinctive factors are memory, CPU/GPU clock speed, and NPU’s capability.
Lunar Lake SKUs | Cores/Threads | Memory | Max CPU Frequency | Max GPU Frequency | NPU (TOPS) | TDP Range (Min/Base/Turbo) |
---|---|---|---|---|---|---|
Core Ultra 9 288V | 8C/8T | 32 GB | 5.1 GHz | 2.05 GHz | 48 | 17W – 30W – 37W |
Core Ultra 7 268V | 8C/8T | 32 GB | 5.0 GHz | 2.00 GHz | 48 | 8W – 17W – 37W |
Core Ultra 7 266V | 8C/8T | 16 GB | 5.0 GHz | 2.00 GHz | 48 | 8W – 17W – 37W |
Core Ultra 7 258V | 8C/8T | 32 GB | 4.8 GHz | 1.95 GHz | 47 | 8W – 17W – 37W |
Core Ultra 7 256V | 8C/8T | 16 GB | 4.8 GHz | 1.95 GHz | 47 | 8W – 17W – 37W |
Core Ultra 5 238V | 8C/8T | 32 GB | 4.7 GHz | 1.85 GHz | 40 | 8W – 17W – 37W |
Core Ultra 5 236V | 8C/8T | 16 GB | 4.7 GHz | 1.85 GHz | 40 | 8W – 17W – 37W |
Core Ultra 5 228V | 8C/8T | 32 GB | 4.5 GHz | 1.85 GHz | 40 | 8W – 17W – 37W |
Core Ultra 5 226V | 8C/8T | 16 GB | 4.5 GHz | 1.85 GHz | 40 | 8W – 17W – 37W |
Intel Lunar Lake: Additional Improvements
As mentioned above, RAM is now part of the SoC. This means the CPU, GPU or NPU can access the memory quickly. Since the memory is physically closer to the Compute tile, the bandwidth improves with reduced latency and leads to about a 40% reduction in power consumption.
Of course, with the on-package memory, users won’t be able to upgrade or replace memory which some may not like. Apart from that, Intel says Thread Director has been improved to allocate tasks to suitable cores. Intel is further using machine learning to indicate the system scheduler for better guidance of tasks.
Finally, the TDP range of Lunar Lake processors is between 17W and 37W. Overall, I am very excited about the Lunar Lake processors which are scheduled to go on sale from September 24. It is going to be a thrilling time for consumers as Intel takes on Qualcomm and AMD in the AI PC race. We may finally see improved battery life on x86-powered Windows laptops.
The Boys Season 4 Episode 8 Revealed Shapeshifter’s Other Powers
NYT Strands Today: Hints, Answers & Spangram For August 17
Is Penguin A Total Bad Guy In DC? These 5 Instances Tell Us Otherwise