Part of the AWS re:Invent 2025 Series: Overview | Day 1 | Day 2 | Day 3
8:00AM CEO Keynote - Matt Garman, CEO (AWS)
Matt Garman, CEO of Amazon Web Services, delivered the opening keynote on how AWS is innovating across every aspect of its cloud. The central theme was a question, “Why can’t developers focus on building?”, and AWS’s answer is billions of AI agents.
Key Themes
- Resilient Infrastructure - More Availability Zones coming globally
- Developer Focus - Vision for billions of AI agents handling operational toil
- Four Pillars for AI Agents:
  - AI Infrastructure (GPUs, custom silicon)
  - Inference Platform (Bedrock)
  - Your Data (Nova Forge)
  - Tools to Build Agents (AgentCore)

Announcements
AI Infrastructure & Custom Silicon
AWS AI Factories
Dedicated customer-specific AI infrastructure that integrates with existing hardware, data center, and networking investments.
Trainium3 UltraServers - First 3nm AWS AI Chip
The 4th generation of AWS’s custom AI silicon delivers breakthrough performance:
| Spec | Trainium3 | vs Trainium2 |
|---|---|---|
| FP8 Compute | 2.52 PFLOPs | 4.4x higher |
| Memory | 144 GB HBM3e | 3.9x bandwidth |
| Energy Efficiency | 5x tokens per megawatt | 4x better |
Trn3 UltraServer Aggregate Specs:
- Up to 20.7 TB HBM3e memory
- 706 TB/s memory bandwidth
- 362 FP8 PFLOPs compute
- Supports up to 144 Trainium3 chips
- NeuronSwitch-v1 fabric (2 TB/s per chip)
- Native PyTorch, JAX, Hugging Face support
- Advanced data types: FP32, BF16, MXFP8, MXFP4
Optimized for: agentic/reasoning workloads, video generation, reinforcement learning, Mixture-of-Experts models.
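The aggregate UltraServer figures above follow from the per-chip numbers multiplied by the 144-chip maximum. The short check below is a sketch that uses only values quoted in this post; the per-chip bandwidth it prints is derived from the 706 TB/s aggregate and is not a figure stated in the keynote recap.

```python
# Per-chip and aggregate figures copied from the table and list above.
CHIPS_PER_ULTRASERVER = 144
HBM_PER_CHIP_GB = 144
FP8_PFLOPS_PER_CHIP = 2.52
AGGREGATE_BANDWIDTH_TBPS = 706


def aggregate_specs(chips: int = CHIPS_PER_ULTRASERVER) -> dict:
    """Scale the per-chip figures to a fully populated UltraServer."""
    return {
        "hbm_tb": chips * HBM_PER_CHIP_GB / 1000,  # decimal terabytes
        "fp8_pflops": chips * FP8_PFLOPS_PER_CHIP,
        # Derived value (an assumption of even distribution across chips).
        "bandwidth_per_chip_tbps": AGGREGATE_BANDWIDTH_TBPS / chips,
    }


specs = aggregate_specs()
print(f"HBM3e: {specs['hbm_tb']:.1f} TB")            # HBM3e: 20.7 TB
print(f"FP8 compute: {specs['fp8_pflops']:.1f} PFLOPs")  # FP8 compute: 362.9 PFLOPs
print(f"Implied bandwidth per chip: {specs['bandwidth_per_chip_tbps']:.1f} TB/s")
```

The computed 362.88 PFLOPs matches the quoted 362 once truncated, and 144 x 144 GB gives the quoted 20.7 TB.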
Trainium4 - Coming Soon
Next generation already announced, continuing AWS’s aggressive custom silicon roadmap.
Amazon Bedrock & Foundation Models
Amazon Bedrock - Now Powers 100,000+ Organizations
- Guardrails: Blocks up to 88% of harmful content with 99% accuracy
- Model Distillation: Up to 500% faster at up to 75% lower cost
- Intelligent Prompt Routing: Auto-routes queries to optimal model
- New Models: Mistral Large 3, Ministral 3
- Compliance: ISO, SOC, GDPR, FedRAMP High, HIPAA
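To make the prompt-routing bullet above concrete, here is a toy sketch of one way a router could trade quality against cost: choose the cheapest model whose predicted quality is close enough to the best candidate. This is not the Bedrock API or its actual routing logic. The `Candidate` class, the `route` function, the quality scores, and the prices are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str                  # hypothetical model label
    predicted_quality: float   # hypothetical score in [0, 1]
    cost_per_1k_tokens: float  # hypothetical price


def route(candidates: list[Candidate], tolerance: float = 0.05) -> Candidate:
    """Return the cheapest candidate whose predicted quality is within
    `tolerance` of the best predicted quality."""
    if not candidates:
        raise ValueError("at least one candidate is required")
    best = max(c.predicted_quality for c in candidates)
    good_enough = [c for c in candidates if best - c.predicted_quality <= tolerance]
    return min(good_enough, key=lambda c: c.cost_per_1k_tokens)


# Made-up predictions for an easy query and a hard one.
simple_query = [Candidate("small", 0.90, 0.1), Candidate("large", 0.93, 1.0)]
hard_query = [Candidate("small", 0.55, 0.1), Candidate("large", 0.91, 1.0)]

print(route(simple_query).name)  # small
print(route(hard_query).name)    # large
```

The easy query goes to the cheaper model because the predicted quality gap is small, while the hard query goes to the larger model.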
Amazon Nova 2 Family
Understanding Models (Text, Image, Video → Text):
- Nova Micro - Entry-level
- Nova Lite - Fast, cost-effective reasoning
- Nova Pro - Higher capability
- Nova 2 Lite - Powers Nova Act browser automation
Creative Content Models (Text, Image → Image/Video):
- Nova Canvas - Image generation
- Nova Reel - Video generation
- Nova 2 Sonic - Speech-to-speech for conversational AI
Built on AI technologies from Amazon internal systems (Alexa+, Amazon Ads).
Amazon Nova Forge
Build custom frontier models that deeply embed domain expertise without the traditional barriers of cost, compute, and time.
Amazon Nova Act
Build agents that automate browser-based UI workflows. Powered by a custom Nova 2 Lite model for reliable production UI automation.
Agent Platform

Amazon Bedrock AgentCore
An agentic platform for building, deploying, and operating AI agents securely at scale—no infrastructure management needed.
Build:
- Persistent memory that learns from interactions
- Secure browser runtime
- Code interpreter for complex tasks
- Framework agnostic—any framework, model, or tool
- Semantic tool discovery
Deploy:
- Session isolation
- Long-running workloads up to 8 hours
- Native identity provider integration
- Fine-grained access policies
- Serverless deployment
Monitor:
- Real-time CloudWatch metrics
- OpenTelemetry integration
- Agent quality evaluation (correctness, safety, goal success)
AWS Agents & Applications
Amazon Q - Agentic Business Intelligence
- Deep Research - AI-powered research capabilities
- Analyze and Visualize - Data analysis and visualization
- Q Flows - Automated workflows for business processes
Kiro Autonomous Agent
An autonomous development agent that:
- Plans and executes multi-step coding tasks across multiple repositories
- Learns from your reviews and maintains context over time
- Runs work in isolated sandboxes
- Opens pull requests for review
- Integrates with GitHub and Jira
AWS Security Agent (Preview)
A frontier agent that proactively secures applications throughout the development lifecycle:
- Automated application security reviews tailored to organizational requirements
- Context-aware penetration testing on demand
- Continuously validates security from design to deployment

AWS DevOps Agent (Preview)
An autonomous on-call engineer that:
- Analyzes data across CloudWatch, GitHub, ServiceNow, and other tools
- Identifies root causes
- Coordinates incident response
Compute
New EC2 Instance Classes:
- P6e - NVIDIA GB200 & GB300
- X8i instances
- X8aedz instances
- C8a & C8ine instances
- M8azn instances
- EC2 Mac instances (M3 and M4)
Storage
S3 Vectors (GA)
First cloud object store with native vector storage and querying:
- Up to 2 billion vectors per index
- 10,000 indexes per bucket (up to 20 trillion vectors)
- 100 ms warm query latency
- 90% cost reduction in vector storage/querying
- Native integration with Bedrock Knowledge Bases
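The per-bucket ceiling above is the product of the two limits, and "vector querying" means nearest-neighbor search over stored embeddings. The sketch below checks the arithmetic and shows a brute-force cosine-similarity lookup in plain Python. It does not use the S3 Vectors API; the `top_k` helper, the in-memory `index`, and the sample vectors are hypothetical.

```python
import math

# Limits quoted in the list above.
VECTORS_PER_INDEX = 2_000_000_000
INDEXES_PER_BUCKET = 10_000
max_vectors_per_bucket = VECTORS_PER_INDEX * INDEXES_PER_BUCKET
print(f"{max_vectors_per_bucket:,}")  # 20,000,000,000,000 (20 trillion)


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query: list[float], index: dict[str, list[float]], k: int = 2) -> list[str]:
    """Brute-force search: score every stored vector, return the k closest keys."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query, item[1]),
        reverse=True,
    )
    return [key for key, _ in ranked[:k]]


# Hypothetical two-dimensional embeddings keyed by document name.
index = {"doc-a": [1.0, 0.0], "doc-b": [0.7, 0.7], "doc-c": [0.0, 1.0]}
print(top_k([1.0, 0.1], index))  # ['doc-a', 'doc-b']
```

A managed vector store replaces the brute-force scan with an index structure, which is what makes queries over billions of vectors practical.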
Additional S3 Updates:
- Object size limit increased to 50 TB
- Batch operations 10x faster
- S3 Tables Intelligent Tiering
- S3 Table replication across accounts
- S3 access for FSx for NetApp ONTAP
- GPU index search in OpenSearch - Serverless GPU acceleration for vector indexes
Database
- Storage capacity improvements for RDS for Oracle and RDS for SQL Server
- Optimize CPUs for RDS for SQL Server
- SQL Server Developer Edition support
- Database savings plans
Serverless
Lambda Durable Functions (GA)
Long-running workflows with built-in state persistence:
- Execute for up to one year
- Checkpoint/replay mechanism—skips completed operations on restart
- Pay only for actual processing time (no charges during suspension)
- SDK support: JavaScript, TypeScript, Python
Use Cases: Payment workflows, order fulfillment, AI coordination, multi-step processes.
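The checkpoint/replay idea above can be illustrated with a toy runner: each named step stores its result, and a restarted run returns the stored result instead of executing the step again. This is a conceptual sketch only and does not use the Lambda Durable Functions SDK. `DurableRun`, `step`, the JSON checkpoint file, and the payment and shipping functions are all hypothetical names.

```python
import json
import tempfile
from pathlib import Path


class DurableRun:
    """Toy checkpoint/replay runner (illustrative, not the Lambda SDK)."""

    def __init__(self, checkpoint_file: Path):
        self.checkpoint_file = checkpoint_file
        # Load results of steps that completed in an earlier run, if any.
        self.results = (
            json.loads(checkpoint_file.read_text()) if checkpoint_file.exists() else {}
        )

    def step(self, name: str, fn):
        if name in self.results:
            return self.results[name]  # replay: skip the completed step
        result = fn()
        self.results[name] = result
        self.checkpoint_file.write_text(json.dumps(self.results))  # checkpoint
        return result


calls = []  # records which side effects actually ran


def charge_card():
    calls.append("charge")
    return {"charge_id": "ch_1"}


def ship_order():
    calls.append("ship")
    return {"tracking": "trk_1"}


def workflow(run: DurableRun) -> dict:
    charge = run.step("charge", charge_card)
    shipment = run.step("ship", ship_order)
    return {**charge, **shipment}


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "checkpoints.json"
    first = workflow(DurableRun(path))
    second = workflow(DurableRun(path))  # simulated restart: both steps replay

print(calls)            # ['charge', 'ship']
print(first == second)  # True
```

The card is charged once even though the workflow ran twice, which is why the pattern suits payment and order-fulfillment flows.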
Security
- GuardDuty extended for ECS and EC2 (no additional cost)
- Security Hub GA
- Unified data store in CloudWatch - Unified log management across operational, security, and compliance use cases
Innovation Talks - Day 1
11:30 AM - INV211: Amazon’s AI Innovations
How Amazon leaders across Zoox, Prime Video, and Amazon Stores are leveraging AI to power their next-generation innovations with AWS.
Zoox - Autonomous Ride-Hailing
Zoox is an autonomous ride-hailing service focused on comfort, control, and safety:
- Booking: Through the app, a vehicle arrives with lounge-like “carriage” seating
- Customization: Set music, temperature, and lighting per seat
- Comfort: Zoned climate control, ample legroom, quiet space to work
- Support: Live support available in-app
Current Markets: Las Vegas, San Francisco
Coming Soon: Austin, Miami
1:00 PM - INV201: Harnessing Analytics
Speaker: Mai-Lan Tomsen Bukovec - VP of Technology, AWS
This session explores emerging trends, from Open Table Formats (OTF) to agentic infrastructure, and how to future-proof your data foundation for analytics at scale.
Key Topics:
- SageMaker Notebooks - Launch fully managed JupyterLab from Amazon SageMaker Studio in seconds
- Open Table Formats evolution
- Agentic infrastructure for analytics
2:30 PM - INV215: AWS Storage Innovations
This session unveils breakthrough innovations like S3 Tables for analytics optimization, S3 Vectors for AI/ML acceleration, and seamless SAN migration pathways that eliminate traditional infrastructure constraints.
4:00 PM - INV202: AI Agents in Action
Speaker: Shaown Nandi - Director of Technology, AWS
New AWS capabilities empower builders to design secure, reasoning-driven agents that orchestrate data, code, and tools at scale, with an emphasis on governance, reliability, and cost efficiency.
Continue Reading: Day 2 - Agentic AI Keynote →