Ryzen AI MAX+ 395: AI in Ultra-Thin Laptops | en

Introduction: A New Era of AI Performance in Ultra-Thin Laptops

The AMD Ryzen™ AI MAX+ 395 processor represents a significant leap forward in the capabilities of ultra-thin and light laptops. This isn’t just an incremental upgrade; it’s a fundamental shift in how these devices can handle demanding workloads, particularly in the rapidly evolving field of artificial intelligence. By combining cutting-edge CPU architecture, a dedicated AI engine, and powerful integrated graphics, the Ryzen AI MAX+ 395 redefines what’s possible in a premium, portable form factor. This processor is not just about faster speeds; it’s about enabling entirely new experiences and workflows that were previously unimaginable on such compact devices.

Architectural Innovations: ‘Zen 5’, XDNA 2, and RDNA 3.5

The exceptional performance of the Ryzen AI MAX+ 395 is built upon a foundation of three key architectural innovations:

‘Zen 5’ CPU Cores: The processor utilizes AMD’s latest ‘Zen 5’ CPU cores, delivering exceptional processing power and efficiency. These cores are designed to handle a wide range of tasks, from everyday productivity to demanding content creation, ensuring a smooth and responsive user experience. The improvements in instructions per clock (IPC) and power efficiency contribute significantly to the overall performance uplift.
XDNA 2 NPU (Neural Processing Unit): The heart of the AI capabilities lies in the XDNA 2 NPU, boasting over 50 peak AI TOPS (Tera Operations Per Second). This dedicated AI engine is specifically designed to accelerate AI workloads, such as those found in Large Language Models (LLMs) and vision models. The XDNA 2 architecture provides a significant boost in AI inference performance, enabling faster response times and smoother operation of AI-powered applications.
AMD RDNA 3.5 Compute Units (Integrated Graphics): The Ryzen AI MAX+ 395 features a remarkably robust integrated GPU powered by 40 AMD RDNA 3.5 Compute Units. This powerful graphics engine not only delivers excellent gaming performance but also plays a crucial role in accelerating AI workloads, particularly those involving visual data. The RDNA 3.5 architecture provides significant improvements in graphics performance and efficiency compared to previous generations.

This combination of a powerful CPU, a dedicated NPU, and a high-performance integrated GPU creates a synergistic effect, allowing the Ryzen AI MAX+ 395 to excel in a wide range of tasks, especially those involving AI.

Unified Memory and Variable Graphics Memory (VGM)

Another key feature of the Ryzen AI MAX+ 395 is its support for unified memory, with options ranging from 32GB up to a staggering 128GB. Unified memory allows the CPU, GPU, and NPU to access the same pool of memory, reducing latency and improving overall system performance. This is particularly beneficial for AI workloads, which often involve large datasets and complex calculations.

Furthermore, AMD’s innovative Variable Graphics Memory (VGM) technology allows users to dynamically allocate a portion of the system memory as VRAM (Video RAM) for the integrated GPU, up to 96GB. This flexibility is crucial for running larger and more demanding AI models, particularly vision models, which often require significant amounts of VRAM. VGM allows the system to adapt to the specific needs of the workload, maximizing performance and efficiency.

LM Studio: The Benchmark for Client-Side LLM Performance

LM Studio, a llama.cpp-powered application, has emerged as the leading platform for running Large Language Models (LLMs) locally on client devices. It provides a user-friendly interface and eliminates the need for specialized technical expertise, making it easy for users to deploy and interact with the latest language models. LM Studio’s popularity and widespread adoption make it an ideal benchmark for evaluating the performance of processors in real-world AI workloads.

The ‘Strix Halo’ platform, featuring the AMD Ryzen AI MAX+ series processors, significantly extends AMD’s performance leadership in the LM Studio environment. This platform is designed to take full advantage of the architectural innovations of the Ryzen AI MAX+ 395, delivering unparalleled performance in LLM tasks.

Benchmarking Results: Text Language Models

Extensive benchmarking within LM Studio demonstrates the significant performance advantage of the AMD Ryzen AI MAX+ 395. When compared to the Intel Arc 140V in a device like the ASUS ROG Flow Z13, the Ryzen AI MAX+ 395 achieves up to 2.2 times the token throughput. This means that the processor can generate text significantly faster, resulting in a more responsive and interactive experience when working with LLMs.

This performance advantage is not limited to a specific model size or type; it remains consistently high across various models. The Ryzen AI MAX+ 395 consistently outperforms the competition, regardless of the complexity of the LLM.

The advantage becomes even more pronounced when considering the time-to-first-token metric. This metric measures the time it takes for the model to generate the first token of output after receiving a prompt. A faster time-to-first-token translates to a more immediate and responsive interaction with the LLM.

In this crucial metric, the AMD Ryzen AI MAX+ 395 demonstrates even more impressive gains:

Up to 4 times faster with smaller models like Llama 3.2 3b Instruct.
Up to 9.1 times faster with 7 billion and 8 billion parameter models like DeepSeek R1 Distill Qwen 7b and DeepSeek R1 Distill Llama 8b.
Up to 12.2 times faster with 14 billion parameter models. This comparison is against a laptop equipped with the Intel Core Ultra 258V, showcasing a performance difference exceeding an order of magnitude.

These results clearly demonstrate that the larger the LLM, the greater the performance advantage of the AMD Ryzen AI MAX+ 395. This scaling advantage is crucial for users who want to work with the most advanced and capable LLMs available. Whether it’s engaging in interactive conversations, summarizing large documents, or generating creative content, the AMD-powered machine delivers significantly faster response times, enhancing productivity and user experience.

Benchmarking Results: Vision Language Models

The evolution of AI is rapidly moving beyond text-only LLMs. Multi-modal models, which incorporate vision adapters and visual reasoning capabilities, are becoming increasingly prevalent. These models can understand and process both text and images, opening up new possibilities for AI applications.

Examples of these advanced vision models include:

IBM Granite Vision: A family of models developed by IBM, offering advanced vision capabilities.
Google Gemma 3: A recently launched family of models from Google, also featuring advanced vision capabilities.

The AMD Ryzen AI MAX+ 395 processor is designed to excel in these multi-modal workloads. It delivers exceptional performance when running these vision models, showcasing its versatility and adaptability to the evolving landscape of AI.

In the context of vision models, the time-to-first-token metric represents the time required for the model to analyze a provided image. A faster time-to-first-token means that the model can process and understand images more quickly, leading to a more responsive and interactive experience.

The Ryzen AI MAX+ 395 demonstrates commanding leadership in this area as well:

Up to 7 times faster in IBM Granite Vision 3.2 3b.
Up to 4.6 times faster in Google Gemma 3 4b.
Up to 6 times faster in Google Gemma 3 12b.

Furthermore, the ASUS ROG Flow Z13, equipped with a 64GB memory option, can effortlessly handle the Google Gemma 3 27B Vision model, which is currently recognized as the state-of-the-art (SOTA) vision model. This demonstrates the ability of the Ryzen AI MAX+ 395 to handle even the most demanding vision models, pushing the boundaries of what’s possible on a thin and light laptop.

Another compelling demonstration involves running the DeepSeek R1 Distill Qwen 32b in 6-bit precision. This configuration enables users to code a classic game in a remarkably short timeframe, approximately 5 minutes, showcasing the practical application of these AI capabilities.

Optimizing Settings for Maximum LLM Performance

To fully unlock the potential of the AMD Ryzen AI MAX+ 395 processor for LLM workloads, it’s crucial to optimize the system settings. This involves ensuring that the latest drivers are installed and that the appropriate configurations are selected within the AMD Software: Adrenalin Edition.

Here are the key recommendations:

Install the Latest AMD Software: Adrenalin Edition Driver: Ensure that your system is running the latest driver to benefit from the latest performance optimizations and bug fixes.
Enable Variable Graphics Memory (VGM): AMD laptops powered by AMD Ryzen AI 300 series processors feature VGM. AMD strongly recommends enabling VGM for all LLM workloads to enhance token throughput and facilitate the execution of larger models.
Set VGM to ‘High’: A ‘High’ VGM setting is recommended for optimal performance. This allocates a larger portion of system memory to the integrated GPU, providing the necessary resources for demanding AI workloads. The VGM options are accessible through the Performance > Tuning tab within AMD Software: Adrenalin Edition.
Manually Select Parameters in LM Studio: When running LLMs in LM Studio, check the ‘manually select parameters’ option.
Set GPU Offload to ‘MAX’: Within the manually selected parameters, set the GPU Offload setting to ‘MAX’. This ensures that the maximum amount of processing is offloaded to the GPU, maximizing performance.
Choose Appropriate Quantization: AMD recommends using Q4 K M quantization for everyday use and Q6 or Q8 quantization for coding tasks. Quantization reduces the precision of the model’s weights and activations, reducing memory usage and improving performance.

By following these recommendations, users can ensure that their system is configured to deliver the best possible performance when running LLMs on the AMD Ryzen AI MAX+ 395 processor.

The Future of Mobile AI: Portability and Power Combined

The AMD Ryzen AI MAX+ 395 processor represents a paradigm shift in the capabilities of thin and light laptops. It empowers users to experience cutting-edge AI models locally, without sacrificing portability or versatility. This combination of power and portability makes these devices ideal for a wide range of applications, from gaming and content creation to productivity and research.

The Ryzen AI MAX+ 395 is more than just a processor; it’s a gateway to a new era of AI-powered experiences. It enables complex operations to be performed with ease, setting a new standard for what users should expect from their mobile devices. The ability to run sophisticated LLMs and vision models locally opens up a world of possibilities, from enhanced productivity and creativity to new forms of entertainment and interaction. The future of mobile AI is here, and it’s powered by AMD. The ability to run these models offline also provides enhanced privacy and security, as the data does not need to be sent to the cloud for processing. This is a significant advantage for users who are concerned about data privacy.

updated at 2025-03-20

# AIGC # AMD # Llama