AWS re:Invent 2025: Day 1 - CEO Keynote & Opening Day


8:00AM CEO Keynote - Matt Garman, CEO (AWS)

Matt Garman, CEO of Amazon Web Services, delivered the opening keynote on how AWS is innovating across every aspect of its cloud. The central theme was a question, “Why can’t developers focus on building?”, and AWS’s answer is billions of AI agents.

Key Themes

  • Resilient Infrastructure - More Availability Zones coming globally
  • Developer Focus - Vision for billions of AI agents handling operational toil
  • Four Pillars for AI Agents:
    1. AI Infrastructure (GPUs, custom silicon)
    2. Inference Platform (Bedrock)
    3. Your Data (Nova Forge)
    4. Tools to Build Agents (AgentCore)

Announcements

AI Infrastructure & Custom Silicon

AWS AI Factories

Dedicated customer-specific AI infrastructure that integrates with existing hardware, data center, and networking investments.

Trainium3 UltraServers - First 3nm AWS AI Chip

The 4th generation of AWS’s custom AI silicon delivers breakthrough performance:

Spec | Trainium3 | vs Trainium2
FP8 Compute | 2.52 PFLOPs | 4.4x higher
Memory | 144 GB HBM3e | 3.9x bandwidth
Energy Efficiency | 5x Tokens/MegaWatt | 4x better

Trn3 UltraServer Aggregate Specs:

  • Up to 20.7 TB HBM3e memory
  • 706 TB/s memory bandwidth
  • 362 FP8 PFLOPs compute
  • Supports up to 144 Trainium3 chips
  • NeuronSwitch-v1 fabric (2TB/s per chip)
  • Native PyTorch, JAX, Hugging Face support
  • Advanced data types: FP32, BF16, MXFP8, MXFP4
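The aggregate memory and compute figures follow from the per-chip numbers quoted earlier. As a quick sanity check, the sketch below multiplies them out. It assumes a fully populated 144-chip UltraServer and decimal units (1 TB = 1000 GB); the constant and function names are made up for illustration and are not part of any AWS SDK.

```python
# Per-chip figures quoted for Trainium3 in this post.
CHIPS_PER_ULTRASERVER = 144
HBM_PER_CHIP_GB = 144
FP8_PFLOPS_PER_CHIP = 2.52
# Aggregate memory bandwidth quoted for a Trn3 UltraServer.
AGGREGATE_BANDWIDTH_TBPS = 706


def ultraserver_totals(chips=CHIPS_PER_ULTRASERVER):
    """Derive UltraServer-level numbers from the per-chip figures."""
    return {
        # Assumption: decimal units, 1 TB = 1000 GB.
        "hbm_tb": chips * HBM_PER_CHIP_GB / 1000,
        "fp8_pflops": chips * FP8_PFLOPS_PER_CHIP,
        # Implied per-chip share of the quoted aggregate bandwidth.
        "bandwidth_per_chip_tbps": AGGREGATE_BANDWIDTH_TBPS / chips,
    }


totals = ultraserver_totals()
for name, value in totals.items():
    print(f"{name}: {value:.2f}")
```

Under those assumptions the totals come to roughly 20.7 TB of HBM3e and just under 363 FP8 PFLOPs, in line with the listed figures, and the quoted 706 TB/s works out to about 4.9 TB/s of memory bandwidth per chip.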

Optimized for: agentic/reasoning workloads, video generation, reinforcement learning, Mixture-of-Experts models.

Trainium4 - Coming Soon

Next generation already announced, continuing AWS’s aggressive custom silicon roadmap.

Amazon Bedrock & Foundation Models

Amazon Bedrock - Now Powers 100,000+ Organizations

  • Guardrails: Blocks up to 88% of harmful content with 99% accuracy
  • Model Distillation: Up to 500% faster, up to 75% lower cost
  • Intelligent Prompt Routing: Auto-routes queries to the optimal model
  • New Models: Mistral Large 3, Ministral 3
  • Compliance: ISO, SOC, GDPR, FedRAMP High, HIPAA

Amazon Nova 2 Family

Understanding Models (Text, Image, Video → Text):

  • Nova Micro - Entry-level
  • Nova Lite - Fast, cost-effective reasoning
  • Nova Pro - Higher capability
  • Nova 2 Lite - Powers Nova Act browser automation

Creative Content Models (Text, Image → Image/Video):

  • Nova Canvas - Image generation
  • Nova Reel - Video generation
  • Nova 2 Sonic - Speech-to-speech for conversational AI

Built on AI technologies from Amazon internal systems (Alexa+, Amazon Ads).

Amazon Nova Forge

Build custom frontier models that deeply embed domain expertise without the traditional barriers of cost, compute, and time.

Amazon Nova Act

Build agents that automate browser-based UI workflows. Powered by a custom Nova 2 Lite model for reliable production UI automation.

Agent Platform

Amazon Bedrock AgentCore

An agentic platform for building, deploying, and operating AI agents securely at scale—no infrastructure management needed.

Build:

  • Persistent memory that learns from interactions
  • Secure browser runtime
  • Code interpreter for complex tasks
  • Framework agnostic—any framework, model, or tool
  • Semantic tool discovery

Deploy:

  • Session isolation
  • Long-running workloads up to 8 hours
  • Native identity provider integration
  • Fine-grained access policies
  • Serverless deployment

Monitor:

  • Real-time CloudWatch metrics
  • OpenTelemetry integration
  • Agent quality evaluation (correctness, safety, goal success)

AWS Agents & Applications

Amazon Q - Agentic Business Intelligence

  • Deep Research - AI-powered research capabilities
  • Analyze and Visualize - Data analysis and visualization
  • Q Flows - Automated workflows for business processes

Kiro Autonomous Agent

An autonomous development agent that:

  • Plans and executes multi-step coding tasks across multiple repositories
  • Learns from your reviews and maintains context over time
  • Runs work in isolated sandboxes
  • Opens pull requests for review
  • Integrates with GitHub and Jira

AWS Security Agent (Preview)

A frontier agent that proactively secures applications throughout the development lifecycle:

  • Automated application security reviews tailored to organizational requirements
  • Context-aware penetration testing on demand
  • Continuously validates security from design to deployment

AWS DevOps Agent (Preview)

An autonomous on-call engineer that:

  • Analyzes data across CloudWatch, GitHub, ServiceNow, and other tools
  • Identifies root causes
  • Coordinates incident response

Compute

New EC2 Instance Classes:

  • P6e - NVIDIA GB200 & GB300
  • X8i instances
  • X8aedz instances
  • C8a & C8ine instances
  • M8azn instances
  • EC2 Mac instances (M3 Ultra and M4 Max)

Storage

S3 Vectors (GA)

First cloud object store with native vector storage and querying:

  • Up to 2 billion vectors per index
  • 10,000 indexes per bucket (up to 20 trillion vectors)
  • 100ms warm query latency
  • 90% cost reduction in vector storage/querying
  • Native integration with Bedrock Knowledge Bases
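The per-bucket ceiling is simply the per-index limit times the index limit. The short sketch below works through that arithmetic and shows how one might estimate the number of indexes a corpus needs. It uses only the limits quoted above; the helper name `indexes_needed` is hypothetical, and nothing here calls the S3 Vectors API.

```python
# Limits quoted in this post for S3 Vectors.
VECTORS_PER_INDEX = 2_000_000_000
INDEXES_PER_BUCKET = 10_000

# Upper bound on vectors in a single vector bucket.
max_vectors_per_bucket = VECTORS_PER_INDEX * INDEXES_PER_BUCKET


def indexes_needed(total_vectors, per_index=VECTORS_PER_INDEX):
    """Smallest number of indexes that can hold total_vectors (ceiling division)."""
    return -(-total_vectors // per_index)


print(f"max vectors per bucket: {max_vectors_per_bucket:,}")
print(f"indexes for 5 billion vectors: {indexes_needed(5_000_000_000)}")
```

Two billion vectors across 10,000 indexes gives the 20 trillion figure, and a 5-billion-vector corpus would need at least three indexes under the quoted per-index limit.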

Additional S3 Updates:

Database

  • Storage capacity improvements for RDS for Oracle and RDS for SQL Server
  • Optimize CPUs for RDS for SQL Server
  • SQL Server Developer Edition support
  • Database savings plans

Serverless

Lambda Durable Functions (GA)

Long-running workflows with built-in state persistence:

  • Execute for up to one year
  • Checkpoint/replay mechanism—skips completed operations on restart
  • Pay only for actual processing time (no charges during suspension)
  • SDK support: JavaScript, TypeScript, Python

Use Cases: Payment workflows, order fulfillment, AI coordination, multi-step processes.
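To make the checkpoint/replay idea concrete, here is a toy, local simulation of the pattern. It is not the Lambda durable functions SDK: the `DurableRun` class, the `step` method, the JSON checkpoint file, and the order workflow are all hypothetical names invented for this sketch. It only illustrates how completed steps can be skipped when a workflow restarts.

```python
import json
import tempfile
from pathlib import Path


class DurableRun:
    """Toy checkpoint/replay runner: each named step executes at most once."""

    def __init__(self, checkpoint_path):
        self.path = Path(checkpoint_path)
        # Load results of steps completed by earlier attempts, if any.
        self.results = json.loads(self.path.read_text()) if self.path.exists() else {}
        self.executed = []  # steps actually run by this attempt

    def step(self, name, fn):
        if name in self.results:
            return self.results[name]  # replay: reuse the stored result
        result = fn()
        self.results[name] = result
        self.path.write_text(json.dumps(self.results))  # checkpoint after success
        self.executed.append(name)
        return result


def order_workflow(run, charge_fn):
    order = run.step("reserve_inventory", lambda: {"sku": "A1", "qty": 2})
    charge = run.step("charge_card", charge_fn)
    return run.step(
        "ship",
        lambda: f"shipped {order['qty']} x {order['sku']}, charged {charge['amount']}",
    )


calls = {"charge": 0}


def flaky_charge():
    """Fails on the first call to simulate a crash mid-workflow."""
    calls["charge"] += 1
    if calls["charge"] == 1:
        raise RuntimeError("simulated crash")
    return {"amount": 20}


checkpoint = Path(tempfile.mkdtemp()) / "run.json"

first = DurableRun(checkpoint)
try:
    order_workflow(first, flaky_charge)
except RuntimeError:
    pass  # the first attempt dies after reserving inventory

second = DurableRun(checkpoint)  # restart from the checkpoint file
result = order_workflow(second, flaky_charge)

print("first attempt ran:", first.executed)
print("second attempt ran:", second.executed)
print(result)
```

The first attempt runs only `reserve_inventory` before failing. The second attempt replays that step from the checkpoint without re-executing it, then runs `charge_card` and `ship`. A managed service would persist checkpoints durably and handle suspension and billing, none of which this sketch attempts.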

Security

  • GuardDuty extended for ECS and EC2 (no additional cost)
  • Security Hub GA
  • Unified data store in CloudWatch - Unified log management across operational, security, and compliance use cases

Innovation Talks - Day 1

11:30 AM - INV211: Amazon’s AI Innovations

How Amazon leaders across Zoox, Prime Video, and Amazon Stores are leveraging AI to power their next-generation innovations with AWS.

Zoox - Autonomous Ride-Hailing

Zoox is an autonomous ride-hailing service focused on comfort, control, and safety:

  • Booking: Through the app, a vehicle arrives with lounge-like “carriage” seating
  • Customization: Set music, temperature, and lighting per seat
  • Comfort: Zoned climate control, ample legroom, quiet space to work
  • Support: Live support available in-app

Current Markets: Las Vegas, San Francisco

Coming Soon: Austin, Miami

1:00 PM - INV201: Harnessing Analytics

Speaker: Mai-Lan Tomsen Bukovec, VP of AWS Technology

This session explores emerging trends, from Open Table Formats (OTF) to agentic infrastructure, and how to future-proof your data foundation for analytics at scale.

Key Topics:

  • SageMaker Notebooks - Launch fully managed JupyterLab from Amazon SageMaker Studio in seconds
  • Open Table Formats evolution
  • Agentic infrastructure for analytics

2:30 PM - INV215: AWS Storage Innovations

This session unveils breakthrough innovations like S3 Tables for analytics optimization, S3 Vectors for AI/ML acceleration, and seamless SAN migration pathways that eliminate traditional infrastructure constraints.

4:00 PM - INV202: AI Agents in Action

Speaker: Shaown Nandi - Director Technology, AWS

New AWS capabilities empower builders to design secure, reasoning-driven agents that orchestrate data, code, and tools at scale, with an emphasis on governance, reliability, and cost efficiency.


Continue Reading: Day 2 - Agentic AI Keynote →

Kevin Duane

Cloud architect and developer sharing practical solutions.