Top Trends in AI This Week: August 25, 2023

Introduction: Generative AI is widely considered the fastest-moving technology innovation in history. It has captured the imagination of consumers and enterprises across the globe, spawning incredible innovation and, along with it, a mutating market ecosystem. Generative AI has also generated a copious amount of news and hype. To avoid AI FOMO and find the right path, the wise will pay attention to trends and not be distracted by every announcement and news bite.

AI Compute Is Shifting to Meet Demand for Cheaper, Better Outcomes

The News: Recent announcements from semiconductor companies around AI compute include the following.

  • Groq’s language processing unit (LPU) breaks LLM performance record. On August 8, startup AI chipset provider Groq announced that it now runs the Llama-2 LLM at more than 100 tokens per second per user on a Groq LPU. Tokenization is the process LLMs use to break text into smaller, manageable units called tokens. It allows LLMs to process text more efficiently by reducing memory requirements and compute complexity. The more tokens per second per user an LLM can process, the faster the LLM will load results for users and the less AI compute the application will require.
  • Qualcomm Cloud AI 100 chip leads in power efficiency testing. As AI and ML workloads grow in size and complexity, they demand more computing resources and more energy. This poses a challenge for both providers and end users who want to deliver and access high-quality services at a reasonable cost. Therefore, it is essential to optimize the performance and efficiency of the solutions that support these workloads. Qualcomm Technologies offers a wide range of power-efficient AI accelerators to meet these performance and total cost of ownership (TCO) requirements.
  • AMD introduces chip designed specifically for generative AI workloads. On June 13, AMD introduced the AMD Instinct MI300X chip to provide the compute and memory efficiency needed for LLM training and inference. AMD says its graphics processing unit (GPU) is highly efficient and that AI workloads running on it require fewer GPUs than they would on competitors' products.
  • Kneron launches latest version of its neural processing unit (NPU). According to the company, the chip tackles one of the largest bottlenecks to widespread AI adoption: the high costs driven by prevailing energy-inefficient hardware. The KL730 yields a 3 to 4 times leap in energy efficiency compared to previous Kneron models and is 150% to 200% more energy efficient than major industry peers.
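
To make the tokens-per-second figure in the Groq item more concrete, the sketch below shows a toy version of tokenization and the arithmetic linking throughput to wait time. It is illustrative only: the regex splitter is a simplified stand-in for the subword tokenizers real LLMs use, and the function names, the 500-token answer length, and the 25 tokens-per-second comparison rate are assumptions made for this example, not figures from any vendor. Only the 100 tokens-per-second rate comes from the announcement above.

```python
import re


def toy_tokenize(text: str) -> list[str]:
    """Split text into word and punctuation tokens.

    A toy stand-in: production LLMs use learned subword tokenizers,
    so real token counts will differ.
    """
    return re.findall(r"\w+|[^\w\s]", text)


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to produce num_tokens at a steady per-user rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


tokens = toy_tokenize("Generative AI is moving fast, and compute costs matter.")
print(len(tokens), tokens)

# Hypothetical 500-token answer at an assumed slower rate (25) and at
# the 100 tokens per second per user cited in the Groq announcement.
for rate in (25, 100):
    seconds = generation_seconds(500, rate)
    print(f"{rate} tokens/s -> {seconds:.1f} s for a 500-token answer")
```

Under these assumptions, quadrupling the per-user token rate cuts the wait for the same answer to a quarter, which is why vendors report throughput per user rather than raw chip speed alone.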

Analyst Take: The current state of AI compute is a prime example of the perfectly timed workaround. GPUs were designed for computer graphics and image processing, primarily for high-end computer gaming graphics cards. They were then found to be useful for multitasking and running programs in parallel. Consequently, they can execute more mathematical calculations with greater efficiency than central processing units (CPUs), which makes them a good solution for AI compute, particularly for AI training. However, AI workloads are massive and therefore expensive. Generative AI workloads are even bigger than legacy AI workloads, and to make generative AI applications viable, compute cost has to come down. Semiconductors have long product cycles, but fortunately, there has been significant innovation and progress in developing purpose-built AI chips. Chip wars will heat up, and the winners will be enterprises that are willing to experiment with AI compute at competitive prices.

AI Models/LLMs Are Mutating and One Size Does Not Fit All

The News: AI models/LLMs are mutating. How? They are getting smaller and more specialized.

Analyst Take: First-generation LLMs are trained on massive amounts of public data. Numerous challenges are inherent to that approach – the quality of the data causes bias, inaccuracy, hallucinations, and more. Next-generation LLMs are taking a more measured approach to address these issues – enabling companies to train AI models on their own data sets, building industry-specific models, and creating smaller models that can be impactful but less expensive. AI model development will continue to evolve. Savvy enterprises will look for multiple options and design their architectures so that models can be swapped in and out more easily.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Other insights from The Futurum Group:

Cohere Launches Coral, a New AI-Powered Knowledge Assistant

Generative AI Investment Accelerating: $1.3 Billion for LLM Inflection

Generative AI War? ChatGPT Rival Anthropic Gains Allies, Investors

Author Information

Based in Tampa, Florida, Mark is a veteran market research analyst with 25 years of experience interpreting technology business and holds a Bachelor of Science from the University of Florida.
