10.1 — Near-Term Predictions: 2026–2028

Learning Objectives

Identify the near-term AI trends most supported by current evidence
Explain why model cost collapse matters for AI adoption across the economy
Distinguish between high-confidence near-term predictions and more speculative medium-term trajectories

Calibrating Confidence

Before making predictions about AI's future, it is worth calibrating what "near-term" means and how confident we should be.

High confidence: Trends already clearly underway, with strong momentum from multiple independent directions — technology, economics, regulatory, and behavioral.

Medium confidence: Trends with clear directional evidence but meaningful uncertainty about pace and scale.

Low confidence / speculative: Possibilities that could occur but depend on breakthroughs or conditions that may or may not materialize.

This section focuses on high-confidence near-term predictions — things we can see clearly from where we stand in early 2026. The further out you predict in AI, the lower your confidence should be.

📝Note

Why AI prediction is hard: The history of AI is littered with confident predictions that were wrong in both directions — "AGI is 5 years away" (repeatedly wrong) and "AI will never match humans at X" (repeatedly wrong in the other direction). This section aims to present well-grounded projections, not prophecy.

Agentic AI Becomes Standard Knowledge Worker Tooling

The clearest near-term trend is the mainstreaming of agentic AI — systems that don't just respond to questions but autonomously execute multi-step tasks.

In 2023–2024, AI assistance primarily meant: ask a question, get an answer. In 2025–2028, AI assistance increasingly means: describe a goal, come back when it's done.

What this looks like in practice:

An AI agent that reads your email, schedules meetings, drafts responses, and flags items requiring your attention — running continuously in the background
A coding agent that receives a GitHub issue, implements a fix, writes tests, and opens a pull request — with a human reviewing the result
A research agent that synthesizes information from dozens of sources into a briefing document you review before a meeting
An AI sales development representative that handles initial outreach, qualifies leads, and hands off to human account executives

Salesforce's Agentforce has already deployed AI SDRs at enterprise scale, reaching 12,000 customers and $500 million ARR by early 2026. OpenAI's Operator handles web-based tasks autonomously. Anthropic's Claude Code — with Voice Mode, Computer Use, Agent SDK, and 1 million token context — represents the coding agent paradigm becoming accessible to non-experts. GitHub Copilot's Coding Agent now autonomously implements issues and opens pull requests. Frameworks like LangGraph v1.0, CrewAI (processing 12 million+ agent executions daily), and AG2 (the community fork of AutoGen) provide the infrastructure layer for building custom agents.

Confidence level: High. This is not a prediction — it is already happening. The question is pace of adoption, not direction.

Multimodal Becomes the Universal Default

In 2026, users already expect AI systems to handle text, images, and increasingly audio and video. By 2028, the expectation will be that AI handles all modalities natively — as a baseline, not a premium feature.

What this means:

You photograph a document and ask AI to summarize it — standard capability
You upload a video of a machine problem and ask for diagnosis — routine
You speak naturally to an AI assistant that understands context, emotion, and interruptions — normal
AI natively analyzes charts, graphs, and visualizations without requiring text transcription first

Current state: GPT-5.5, Gemini 3.1 Pro, Claude Opus 4.7 (with 1 million token context), and Grok 4.1 already have strong multimodal capabilities. The 2026–2028 transition is about these capabilities becoming the expected baseline rather than a differentiating feature.

Confidence level: High. The technology exists; the question is UX refinement and deployment scale.

AI Embedded in Every Significant Software Product

By 2028, any software product that does not have AI capabilities will feel dated — similar to how a software product today that doesn't have a mobile app feels dated.

Areas of rapid embedding:

Productivity suites: Microsoft 365 Copilot, Google Workspace AI — AI writing, summarizing, analyzing in Word, Excel, Gmail, Docs
CRMs: Salesforce, HubSpot — AI drafting emails, summarizing deals, predicting churn
Developer tools: GitHub Copilot (Coding Agent, CLI, Mission Control), Cursor ($2 billion+ ARR, $29.3 billion valuation), AWS Kiro (spec-driven development), JetBrains AI — code completion, agentic development, and review as standard
Design tools: Figma AI, Adobe Creative Cloud AI, Canva AI — generating assets, editing, style transfer
Communications: Zoom AI, Slack AI, Teams Copilot — meeting summaries, action items, search across conversations

The scale of investment behind this embedding is staggering: the Stargate Project alone represents $400 billion+ in AI infrastructure across five US sites, with the Abilene, Texas facility already operational. Custom AI chips — OpenAI's Titan, Amazon's Trainium3 (3nm), and Google's TPU Ironwood v7 — are being designed specifically to power this next generation of embedded AI.

The strategic implication: AI fluency becomes a prerequisite for using the tools professionals use every day — not a separate AI skill, but embedded in the tools themselves.

Confidence level: High. This is already happening across enterprise software.

Vertical Integration: The Terafab Wave

The 2026–2028 window is also when the largest AI buyers start to own their own silicon supply — not just design chips, but own the foundry. The flagship example is Terafab, the $20-25 billion joint venture announced March 21, 2026 by Tesla, SpaceX, and xAI, with Intel joining as the process-technology partner on April 7, 2026. Terafab targets one terawatt of AI compute output per year — roughly 50× current global AI chip production — on Intel's 18A and 2 nm nodes.

Terafab will produce Tesla's AI5 and AI6 chips (taped out April 2026 for FSD and Optimus), restart Tesla's Dojo3 program now repositioned for space-based AI compute, xAI training accelerators for the post-Colossus 2 era, and SpaceX's radiation-tolerant orbital inference silicon. The pilot line targets 3,000 wafers per month by 2029, scaling toward 1 million wafer starts per month long-term.

Why this matters: No single customer-owned foundry of this scale has been attempted. If Terafab delivers, it creates the first credible non-NVIDIA frontier training platform at scale. If it slips — and AI5 is already roughly two years late, while Dojo has pivoted twice in eight months — NVIDIA's moat holds and the custom-silicon thesis is weakened industry-wide.

Confidence level: Medium. The announcement and Intel partnership are confirmed, but Tesla's lack of foundry experience makes timelines uncertain. What is near-certain is that vertical-integration attempts from several frontier AI buyers (Terafab, Stargate's chip program, Amazon Trainium, Google TPU) will reshape the semiconductor industry over the 2026-2028 window.

The EU AI Act and International Governance Come Into Force

The EU AI Act — passed in 2024 — is being implemented in phases that run through 2028, with its obligations landing on different categories of AI at different times:

Prohibited AI systems: Social scoring, subliminal manipulation, real-time biometric surveillance in public spaces — banned since February 2025
General-purpose AI models: Large models must document capabilities, risks, and copyright compliance, and assess systemic risk — applicable since August 2025
High-risk AI systems: AI in hiring, education, critical infrastructure, biometrics, and migration and asylum faces transparency, data governance, and human oversight requirements — from December 2, 2027 for the standalone systems listed in Annex III, and August 2, 2028 for AI embedded in regulated products under Annex I

Beyond the EU, multiple other jurisdictions are developing AI governance frameworks — the UK's AI Safety Institute (renamed the AI Security Institute in 2025), Japan's $135 billion AI investment with a soft regulatory approach, South Korea's $735 billion AI commitment, and ongoing discussions at the United Nations about international AI safety standards. What changes on August 2, 2026 is who holds the reins rather than what compliance requires: the Commission's AI Office and national authorities take over implementing, supervising, and enforcing the Act, and transparency obligations for certain AI systems become enforceable. The high-risk deadlines sit later than the Act's original schedule because the Digital Omnibus, Regulation (EU) 2026/1744, moved them back — without deferring the general-purpose AI obligations already in force.

Penalties are substantial: up to EUR 35 million or 7% of global turnover for prohibited practices. Each EU member state is also required to establish at least one AI regulatory sandbox.

The more reliable near-term prediction is not that the US passes a federal AI law, but that US states keep setting the operative rules. California's AI Transparency Act became binding on the same August 2, 2026 date, deliberately aligned to the EU's, and requires generative AI providers above one million monthly users in the state to embed machine-readable content provenance and publish a tool that verifies it. Expect the practical compliance baseline through 2028 to be assembled from state statutes and the EU Act together rather than handed down from Washington — which means the operative question for most companies is which state thresholds they cross, not whether Congress acts.

Confidence level: High. The regulatory direction is clear; specific implementation timelines may shift.

Model Cost Collapse: Frontier Intelligence Becomes 100-Times Cheaper

One of the most profound near-term changes is happening in AI economics.

The cost to run a frontier-class AI query has been falling approximately 10-times per year since 2020. DeepSeek's efficiency breakthrough in early 2025 accelerated this trend. By 2027–2028:

What costs $100 per million tokens today will cost $1 or less
Tasks currently too expensive to automate (reviewing every customer email, analyzing every transaction in real time) will become economically viable
AI adoption barriers will shift from cost to organizational change management

Historical context: GPT-3 API access in 2021 cost approximately $60 per million tokens. By early 2026, frontier-class intelligence costs $3–15 per million tokens — and pricing drops have been dramatic: Claude Opus 4.7 dropped from $15/$75 to $5/$25 per million tokens, while DeepSeek's efficiency breakthroughs pushed costs even lower for open-weight models. The trajectory continues downward.

Confidence level: High. Hardware efficiency, software optimization, and competition all point in the same direction.

Physical AI: Robots and Autonomous Vehicles at Scale

Autonomous vehicles are not a future technology — they are a present technology in specific contexts. By 2028:

Waymo is operating fully autonomous robotaxis across 10 US cities, having completed 170 million+ autonomous miles with no safety driver
Zoox (Amazon) launched its commercial robotaxi service in San Francisco and Las Vegas, with expansion to Austin and Miami announced in March 2026, and a partnership with Uber to make Zoox robotaxis available on the Uber network
Aurora Innovation has commercially deployed driverless trucking across 10 routes connecting Dallas, Houston, Phoenix, El Paso, and more — 250,000+ driverless miles with zero Aurora Driver-attributed collisions, targeting 200+ trucks by end of 2026
Tesla FSD continues expanding, with the dedicated Cybercab entering production in April 2026
Robotaxis are operating in 20+ cities globally, with multiple operators scaling rapidly

Humanoid robots are at an earlier but real commercial stage:

Figure AI (valued at $39 billion) has progressed to Figure 03, with humanoid robots deployed in factory settings with BMW and others
Tesla Optimus Gen 3 has over 1,000 units deployed across Tesla factories, with mass production started January 2026, 50 actuators per hand (25 per hand), and a target production cost of $20,000 per unit — external customer deliveries anticipated late 2026
Boston Dynamics Atlas (electric) is now commercially deployed in warehouse and logistics tasks
The first commercial-scale humanoid deployments are underway in controlled industrial environments, with multiple companies scaling production

What is NOT happening yet: Humanoid robots in homes, general-purpose manipulation in unstructured environments, or humanoid robots meaningfully displacing manufacturing employment at scale. These are medium-term possibilities, not near-term certainties.

Confidence level: Medium-high. Autonomous vehicles are certain to continue expanding; humanoid robot timelines are harder to predict.

Orbital AI Data Centers Emerge as a Real Category

What was science fiction in 2024 moved into FCC filings — and onto an IPO roadshow — by 2026. On January 31, 2026, SpaceX filed for authorization to deploy up to one million satellites functioning as orbital AI data centers, with a long-term target of 100 GW per year of space-based power generation. Each Starlink V3 satellite — launching 60 per Starship starting 2026, with 1 Tbps download capacity and terabit-class laser mesh networking — is designed to serve as a distributed orbital compute node.

The bet became central to SpaceX's identity over the course of 2026. In February, SpaceX absorbed Elon Musk's AI lab xAI (maker of the Grok models) in a roughly $250 billion all-stock deal, folding frontier-model software into the same company that builds the rockets and satellites. By June, as SpaceX headed toward one of the largest IPOs in history — valuing the combined company at roughly $1.5 trillion — Musk made orbital compute the centerpiece of his pitch. At a JPMorgan investor roadshow, he argued that solar-powered data centers in space could become "the primary means by which AI can be expanded," sidestepping the power-generation and permitting bottlenecks that increasingly constrain data centers on the ground. SpaceX told investors it plans to spend about $12.7 billion on AI in 2026 alone.

The Musk ecosystem's silicon roadmap reflects the bet: Tesla's restarted Dojo3 program was explicitly repositioned in January 2026 for "space-based AI compute," and Terafab will produce the radiation-tolerant inference silicon needed for orbital deployment.

The compute business is already real on the ground. Even before a single orbital data center comes online, SpaceX has become a major compute landlord. In June 2026 it agreed to rent roughly 110,000 Nvidia chips to Google for about $920 million a month — close to $30 billion over three years — as "bridge capacity" while Google's own data centers scale, on top of an even larger compute arrangement with Anthropic. The terrestrial compute-leasing revenue is what is meant to fund the orbital ambition.

What this actually enables (near-term): distributed edge inference for remote/mobile/underserved regions, reduced terrestrial data-center energy draw, and a physically diversified compute layer for defense and disaster-resilience scenarios. What it does NOT enable yet: frontier model training in orbit (the power, thermal, and data-throughput requirements are far beyond current satellite capabilities).

Confidence level: Medium. The FCC filing, satellite launches, silicon repositioning, and multi-billion-dollar compute deals are all confirmed, but the economics of orbital compute versus terrestrial data centers remain unproven. Analysts who watched the IPO roadshow noted that Musk's space-data-center pitch was long on vision and short on operational detail. Skeptics add that ground-based solar plus batteries plus Stargate-scale buildouts may undercut orbital compute on cost per FLOP for most workloads through the end of the decade.

Real-Time AI Glasses Go Mainstream

Smart glasses with AI assistants are transitioning from novelty to mainstream:

Ray-Ban Meta smart glasses have sold 7 million+ units (2025), with Llama 4 AI integration, live translation, and image recognition. The premium Ray-Ban Meta Display ($799) adds a built-in teleprompter and Meta Neural Band (EMG wristband) for gesture control — demand so high that international expansion was paused
Samsung Galaxy XR headset launched October 2025 ($1,800), the first device on Android XR (Google's new platform) with Gemini AI built in. Samsung and Google are developing lightweight AR smart glasses for 2026 with audio prompts, notifications, and live translation
Apple Vision Pro has struggled commercially (~390,000 units shipped in 2024, production paused in early 2025), and Apple has redirected effort toward a lighter smart glasses form factor. Bloomberg reported in July 2026 that the glasses, code-named N50, are now targeted at a June 2027 developer-conference reveal with a release later that year — roughly six months later than planned, partly because Apple is still deciding whether to include cameras at all, or to restrict them to feeding Siri rather than user-initiated photos and video
The ability to look at something and ask AI a question — without looking at a phone — is becoming a natural interaction, with at least five Android XR devices expected in 2026

Confidence level: Medium-high. Ray-Ban Meta's commercial success validates the form factor; the ecosystem is expanding rapidly even as Apple's premium headset approach struggles.

Key Takeaways

Agentic AI (goal-directed, autonomous) is already mainstream in coding and research; it will be table-stakes for knowledge work by 2028
Multimodal AI becomes the universal baseline — text, images, audio, video handled natively by all major models
Vertical-integration chip plays like Terafab ($20-25 billion Tesla/SpaceX/xAI/Intel foundry) reshape the semiconductor industry — if they execute
Orbital AI data centers move from sci-fi to FCC filings — Starlink V3 and Dojo3's space-based compute pivot mark the start of a new deployment category
Model cost collapse continues: frontier-class AI will be 10-to-100-times cheaper by 2028, making new applications economically viable
The EU AI Act creates binding regulatory requirements affecting high-risk AI globally — enforcement passes to the AI Office and national authorities in August 2026, and the strict high-risk obligations land in December 2027 and August 2028; international governance frameworks are emerging
Physical AI (autonomous vehicles, humanoid robots) is arriving in specific industrial contexts — not yet general-purpose

Near-Term Predictions: 2026–2028

Audio & video lessons are paid features