MediaTek Announces On-Device Generative AI Powered by Meta’s Llama 2

The News: MediaTek has announced that it is working closely with Meta’s Llama 2, the company’s next-generation open-source large language model (LLM), to build a complete edge computing ecosystem designed to accelerate AI application development on smartphones, IoT, vehicles, smart home, and other edge devices. Meta’s LLM will work alongside MediaTek’s latest APUs and NeuroPilot AI Platform. You can read the full press release on the MediaTek website.

Analyst Take: As with most transformative technologies these days, mobile is where innovation first learns to walk and run. MediaTek’s partnership announcement with Meta comes as demand for on-device/edge generative AI capabilities is reaching a fever pitch. By folding Llama 2, Meta’s next-generation open-source large language model (LLM), into its latest APUs and NeuroPilot AI platform, MediaTek is looking to build an edge computing ecosystem that will enable it to accelerate AI application development in smartphones, IoT, vehicles, smart home, and other edge devices powered by its SoCs.

MediaTek, the fabless semiconductor company powering more than 2 billion connected edge devices every year, needs a strong on-device/edge AI story to take to the market at a time when device SoC rivals like Qualcomm and Apple have already begun to prioritize on-device generative AI features in their premium mobile handset tiers as high-value market differentiators.

To better understand why on-device generative AI capabilities matter so much to the industry, consider that for now, most generative AI processing is still performed in the cloud. This is not ideal for a plethora of reasons, ranging from cost (a lot of generative AI processing can be handled more cheaply on a handheld device than in a data center) to connectivity (losing connection to cloud computing services means lag or no generative AI processing for your device at all, which is both a productivity killer and a UX nightmare) to privacy (prompts and generative AI work products are better protected from snooping and third-party interference when they stay on the device). Enabling generative AI applications to run directly on devices will lower latency, reduce operational costs, deliver more seamless performance, provide users with the ability to work despite having limited to no connectivity, and enhance privacy and security.

Device OEMs therefore need leading-edge SoCs and platforms capable of delivering on these capabilities, and this is a challenge for MediaTek, whose success in mobile can be defined by a single, enviable characteristic: consistent ability to put out attractively priced, high-performance products that fit well within the boundaries of budget-minded handset tiers. That strategy has served the company well, but it has struggled to penetrate premium tier SoC markets currently led by rivals Qualcomm and Apple. While MediaTek still occupies a strong 31% global market share of smartphone application processors (against Qualcomm’s 28% and Apple’s 26% market shares for Q1-2023), the Taiwan-based company has yet to gain traction in premium tiers, particularly in the coveted North American market.

To remedy this problem, MediaTek needs more than just capable CPUs, GPUs, and connectivity solutions now. It also needs a strong, credible, competitive on-device/edge generative AI story to take to handset OEMs. MediaTek’s Llama 2 announcement is the kernel of that story, and adding a credible partner like Meta is a check in the plus column. According to JC Hsu, Corporate Senior Vice President and General Manager of Wireless Communications Business Unit at MediaTek, “the increasing popularity of generative AI is a significant trend in digital transformation, and our vision is to provide the exciting community of Llama 2 developers and users with the tools needed to fully innovate in the AI space.”

The timing of the announcement is also interesting, as MediaTek’s next-generation flagship chipset, which will be introduced in the fall, is expected to feature a software stack optimized to run Llama 2, as well as an upgraded APU with Transformer backbone acceleration, reduced footprint access, and use of DRAM bandwidth to enhance LLM and AI-generated content (AIGC) performance. This helps highlight the importance of this otherwise easy-to-miss announcement. MediaTek expects Llama 2-based AI applications to become available for smartphones powered by the next-generation flagship SoC, scheduled to hit the market by the end of the year.

In the Android space, will MediaTek finally give Qualcomm a run for its money in the premium and flagship tiers? That remains to be seen. Snapdragon enjoys a significant head start in those tiers, to say nothing of its custom silicon advantage, but it will be interesting to see how MediaTek leverages this new opportunity to disrupt the status quo. Mobile aside, what I will be especially interested in over the coming 18 to 24 months is how MediaTek will expand these on-device generative AI capabilities to its ecosystem of mobile-adjacent SoCs and use cases. Mobile is only part of this new wave of AI-capable digital transformation.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Other insights from The Futurum Group:

The Future of AI is Hybrid: Look No Further than Your Devices to Scale Generative AI

Snapdragon Chip Outperforms MediaTek in Xiaomi Phone Cameras

5G Factor: AI Rising! Qualcomm’s Hybrid AI Vision, Nokia Puts the AI in AirScale, and NVIDIA Softbank Pair Up for Gen AI & 5G Apps

Author Information

Olivier Blanchard

Olivier Blanchard is Research Director, Intelligent Devices. He covers edge semiconductors and intelligent AI-capable devices for Futurum. In addition to having co-authored several books about digital transformation and AI with Futurum Group CEO Daniel Newman, Blanchard brings considerable experience demystifying new and emerging technologies, advising clients on how best to future-proof their organizations, and helping maximize the positive impacts of technology disruption while mitigating their potentially negative effects. Follow his extended analysis on X and LinkedIn.
