<?xml version="1.0" encoding="UTF-8" standalone="no"?><rss xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" version="2.0">

<channel>
	<title>AWS News Blog</title>
	<atom:link href="https://aws.amazon.com/blogs/aws/feed/" rel="self" type="application/rss+xml"/>
	<link>https://aws.amazon.com/blogs/aws/</link>
	<description>Announcements, Updates, and Launches</description>
	<lastBuildDate>Mon, 29 Jun 2026 16:30:30 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	
	<item>
		<title>AWS Weekly Roundup, Agentic CX designer for Amazon Connect Customer, EC2 AMI Watermarks, Open Governance for MySQL, and more (June 29, 2026)</title>
		<link>https://aws.amazon.com/blogs/aws/aws-weekly-roundup-agentic-cx-designer-for-amazon-connect-customer-ec2-ami-watermarks-open-governance-for-mysql-and-more-june-29-2026/</link>
					
		
		<dc:creator><![CDATA[Micah Walter]]></dc:creator>
		<pubDate>Mon, 29 Jun 2026 16:30:30 +0000</pubDate>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Week in Review]]></category>
		<guid isPermaLink="false">5f1578e8308028213ee25d84f88e1be66f65bc77</guid>

					<description>It has been a busy stretch on the AWS Summit circuit. At the New York City Summit, I delivered a workshop called Building AI architectures with AWS Serverless, and it was a lot of fun watching builders wire up agents and serverless services to solve real problems in a single afternoon. This week I am […]</description>
										<content:encoded>&lt;p&gt;&lt;img loading="lazy" class="alignright size-medium wp-image-104911" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/28/IMG_5013-225x300.jpg" alt="" width="225" height="300"&gt;It has been a busy stretch on the AWS Summit circuit. At the &lt;a href="https://aws.amazon.com/events/summits/new-york/"&gt;New York City Summit,&lt;/a&gt; I delivered a workshop called Building AI architectures with AWS Serverless, and it was a lot of fun watching builders wire up agents and serverless services to solve real problems in a single afternoon. This week I am heading down to the &lt;a href="https://aws.amazon.com/events/summits/washington-dc/"&gt;Washington, DC Summit&lt;/a&gt;, which always puts a spotlight on innovation in the public sector. If you are going to be there, come say hello.&lt;/p&gt; 
&lt;p&gt;A question I hear a lot at these events is how teams can put AI to work without waiting on a long engineering backlog, and this week’s biggest launch speaks directly to that, with Amazon Connect Customer introducing a no-code way for business teams to design AI powered customer experiences themselves. Now, let’s get into this week’s AWS news.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;ins&gt;Headlines&lt;/ins&gt;&lt;/strong&gt;&lt;/p&gt; 
&lt;p&gt;Amazon Connect Customer launched the Agentic CX designer (NLX) in preview, a no-code canvas for designing and deploying AI powered self service experiences. Business teams can build and launch voice and digital experiences that bring agentic and deterministic AI together in one governed flow, going from design to testing and simulation to production ready experiences in weeks rather than months. The launch also includes Live Sync in preview, a patented technology that drives a customer’s web or mobile experience in real time as they speak or type. A caller can complete a form or pull up the right product page without ever leaving the conversation. To see how this reshapes who designs customer experience, read the blog post on how the &lt;a href="https://aws.amazon.com/blogs/contact-center/business-user-is-the-new-architect-of-customer-experience/"&gt;business user is the new architect of customer experience&lt;/a&gt; and visit the &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-connect-customer-agentic-cx-preview/"&gt;Amazon Connect Customer&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;ins&gt;Last week’s launches&lt;/ins&gt;&lt;/strong&gt;&lt;/p&gt; 
&lt;p&gt;Here are some launches and updates from this past week that caught my attention:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-lambda-microvms/"&gt;AWS Lambda MicroVMs&lt;/a&gt;&lt;/strong&gt; – A new serverless compute primitive that gives each user or job VM level isolation with near instant launch and resume speeds, plus the ability to suspend and resume execution for up to 8 hours. Built on Firecracker, it is made for running user or AI generated code in multi-tenant applications without managing virtualization infrastructure or trading off isolation, speed, and state.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/ec2-image-watermarks-allowed-images/"&gt;Amazon EC2 AMI Watermarks&lt;/a&gt;&lt;/strong&gt; – Lets you embed custom identifiers in your private AMIs that automatically carry forward to every derived AMI across copies, Regions, and account shares. You can combine watermarks with Allowed AMIs and Declarative Policies to restrict launches to approved images, available at no additional cost in all AWS Regions.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-outposts-self-service-lifecycle-management"&gt;AWS Outposts self-service lifecycle management&lt;/a&gt;&lt;/strong&gt; – Adds self service configuration, quoting, ordering, subscription management, renewal, and decommissioning directly from the console, CLI, and API. A new quoting tool generates real time cost estimates in seconds and surfaces account and regional constraints before you submit an order.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-msk-ai-agent-skills"&gt;Amazon MSK AI Agent Skills&lt;/a&gt;&lt;/strong&gt; – Gives AI coding assistants like Kiro, Claude Code, and Cursor expert, up-to-date guidance for operating Amazon MSK, covering troubleshooting, sizing, configuring, monitoring, and migrating external Kafka clusters to MSK Express. Tasks that once required specialized knowledge become a guided experience developers can complete on their own.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-opensearch-service-ai-migrations"&gt;Amazon OpenSearch Service AI-assisted migrations&lt;/a&gt;&lt;/strong&gt; – Migration Assistant now includes an agent guided experience that helps you move self managed Apache Solr, Elasticsearch, or OpenSearch deployments to OpenSearch Serverless or Managed Clusters using tools like Kiro and Claude Code, with new live traffic capture and replay support for Solr.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-guardduty/"&gt;Amazon GuardDuty AI-powered investigations (preview)&lt;/a&gt;&lt;/strong&gt; – Automatically analyzes findings and accounts to help you separate true threats from benign activity, examining context and related activity from the last 90 days with knowledge graphs and threat intelligence. Each investigation returns a disposition assessment with confidence scoring, MITRE ATT&amp;amp;CK classification, and actionable recommendations in minutes.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;For a full list of AWS announcements, be sure to keep an eye on the &lt;a href="https://aws.amazon.com/new/"&gt;What’s New with AWS&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;ins&gt;Other AWS news&lt;/ins&gt;&lt;/strong&gt;&lt;/p&gt; 
&lt;p&gt;Here are some additional posts and resources that you might find interesting:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/blogs/opensource/open-governance-for-mysql-a-step-forward-for-the-community/"&gt;Open Governance for MySQL&lt;/a&gt;&lt;/strong&gt; – Oracle announced a community governance model for MySQL that gives organizations outside Oracle a defined role in the project, including four non Oracle seats on a new Steering Committee and a public GitHub presence. AWS holds a seat and shares why it supports the move and how it already contributes fixes upstream for everyone running MySQL.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/blogs/training-and-certification/a-new-way-to-keep-your-aws-certification-current/"&gt;A new way to keep your AWS Certification current&lt;/a&gt;&lt;/strong&gt; -You can now maintain an eligible AWS Certification for an additional year by completing curated training and hands on labs on AWS Skill Builder instead of retaking a full exam. The option is available today in open beta for several Associate and Professional certifications, with more coming later this year.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;&lt;a href="https://builder.aws.com/content/3FGF2bDqKm6xKGmyJqJMrcwRhE8/the-aws-all-builders-welcome-grant-an-insiders-guide-for-2026-applicants"&gt;The All Builders Welcome Grant insider’s guide for 2026 applicants&lt;/a&gt;&lt;/strong&gt; – A community guide on AWS Builder Center that walks early career builders through applying for the grant, which covers a full conference pass, airfare, and hotel for AWS re:Invent 2026. Applications are open now and close on July 14.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;For a full list of AWS blog posts, be sure to keep an eye on the &lt;a href="https://aws.amazon.com/blogs/"&gt;AWS Blogs&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;Looking for ways to connect with builders in person? Check out the &lt;a href="https://aws.amazon.com/events/summits/"&gt;AWS Summits&lt;/a&gt; coming to a city near you, find a local &lt;a href="https://aws.amazon.com/developer/community/community-days/"&gt;AWS Community Day&lt;/a&gt; led by user groups around the world, and explore tutorials, community content, and ways to grow your skills over at the &lt;a href="https://builder.aws.com/"&gt;AWS Builder Center&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;That’s all for this week. Check back next Monday for another Weekly Roundup!&lt;/p&gt; 
&lt;p&gt;-Micah&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Run isolated sandboxes with full lifecycle control: AWS Lambda introduces MicroVMs</title>
		<link>https://aws.amazon.com/blogs/aws/run-isolated-sandboxes-with-full-lifecycle-control-aws-lambda-introduces-microvms/</link>
					
		
		<dc:creator><![CDATA[Micah Walter]]></dc:creator>
		<pubDate>Mon, 22 Jun 2026 22:40:07 +0000</pubDate>
				<category><![CDATA[AWS Lambda]]></category>
		<category><![CDATA[Compute]]></category>
		<category><![CDATA[Firecracker]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Serverless]]></category>
		<guid isPermaLink="false">28b0196fbc8cdde62bdcc235123c0f1926871df6</guid>

					<description>AWS launches a new serverless compute primitive, AWS Lambda MicroVMs. VM-level, isolated sandboxes with no shared kernel or resources between sessions. Rapid launch and resume, full lifecycle control, state preservation up to 8 hours, no infrastructure to manage.</description>
										<content:encoded>&lt;p&gt;Today, we are announcing AWS Lambda MicroVMs, a new serverless compute primitive within &lt;a href="https://aws.amazon.com/lambda/"&gt;AWS Lambda&lt;/a&gt; that lets you run code generated by users or AI in isolated, stateful execution environments. You get virtual machine level isolation, near-instant launch and resume, and direct control over environment lifecycle and state, all without managing infrastructure or building expertise in complex virtualization technologies. Lambda MicroVMs are powered by &lt;a href="https://firecracker-microvm.github.io/"&gt;Firecracker&lt;/a&gt;, the same lightweight virtualization technology that has powered over 15 trillions of monthly Lambda function invocations.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;Why customers need this&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;Over the past few years a new class of multi-tenant applications has emerged that all share the need to hand each end user their own dedicated execution environment in which to safely run code that the application developer did not write. AI coding assistants, interactive code environments, data analytics platforms, vulnerability scanners, and game servers that run user-supplied scripts all fit this pattern. Building that capability today means making a difficult choice. Virtual machines deliver strong isolation but take minutes to start. Containers launch in seconds, yet their shared-kernel architecture requires significant custom hardening to safely contain untrusted code. Functions as a service are optimized for event-driven, request-response workloads, but are not designed for long-running interactive sessions that need to retain environment state across user interactions. That leaves developers either accepting tradeoffs between performance and isolation, or investing significant engineering resources to build and operate custom virtualization infrastructure to achieve isolated execution while delivering low-latency experiences to end-users. This presents an effort that demands deep expertise and pulls engineering time away from the product they are actually trying to build.&lt;/p&gt; 
&lt;p&gt;Lambda MicroVMs is purpose-built for exactly this gap. Each MicroVM gives a single end user or session its own isolated environment that launches rapidly, retains memory and disk state for the length of the session, and pauses to a low idle cost when the user steps away. Because the same Firecracker technology already underpins AWS Lambda Functions, you inherit the operational maturity of a service that has been running this stack at scale.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;Let’s try it out&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;To get started, I navigated to the AWS Lambda console, where Lambda MicroVMs now appears in the left-hand navigation menu. I first need to create a MicroVM Image.&lt;/p&gt; 
&lt;p&gt;I packaged a Flask web app and its Dockerfile into a zip file, uploaded it to an &lt;a href="https://aws.amazon.com/s3/"&gt;Amazon Simple Storage Service (Amazon S3)&lt;/a&gt; bucket.&lt;/p&gt; 
&lt;p&gt;My Flask API – app.py&lt;/p&gt; 
&lt;pre class="unlimited-height-code"&gt;&lt;code class="lang-python"&gt;import logging

from flask import Flask, jsonify

app = Flask(__name__)
logging.basicConfig(level=logging.INFO)


@app.route("/")
def hello():
    app.logger.info("Received request to hello world endpoint")
    return jsonify(message="Hello, World!")


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;My Dockerfile&lt;/p&gt; 
&lt;pre class="unlimited-height-code"&gt;&lt;code&gt;
FROM public.ecr.aws/lambda/microvms:al2023-minimal
RUN dnf install -y python3 python3-pip &amp;amp;&amp;amp; dnf clean all

WORKDIR /app

COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY app.py .

EXPOSE 5000

CMD ["gunicorn", "--bind", "0.0.0.0:5000", "app:app"]

&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;I used the following command to create my MicroVM Image.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws lambda-microvms create-microvm-image \
--code-artifact uri=&amp;lt;path/to/s3/artifact.zip&amp;gt; --name &amp;lt;VM_image_name&amp;gt; \
--base-image-arn arn:aws:lambda:us-east-1:aws:microvm-image:al2023-1 \
--build-role-arn &amp;lt;IAM role ARN&amp;gt;&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104847 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/22/Screenshot-2026-06-22-at-10.49.45 AM-1024x577.png" alt="" width="1024" height="577"&gt;&lt;/p&gt; 
&lt;p&gt;You can also create the MicroVM Image in the AWS Console as in the image above. Once I ran the command, Lambda retrieved the zip, ran the Dockerfile, initialized the application, and took a Firecracker snapshot of the running disk and memory state. Build logs streamed in real time to &lt;a href="https://aws.amazon.com/cloudwatch/"&gt;Amazon CloudWatch&lt;/a&gt; under &lt;code&gt;/aws/lambda/microvms/&amp;lt;image-name&amp;gt;&lt;/code&gt;, and when the image was ready it appeared in the console with its &lt;a href="https://docs.aws.amazon.com/IAM/latest/UserGuide/reference-arns.html"&gt;Amazon Resource Name (ARN)&lt;/a&gt; and version number.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws lambda-microvms run-microvm \
--image-identifier arn:aws:lambda:&amp;lt;region&amp;gt;:&amp;lt;acct&amp;gt;:microvm-image:my-image \
--execution-role-arn arn:aws:iam::&amp;lt;acct&amp;gt;:role/MicroVMExecutionRole \
--idle-policy '{"maxIdleDurationSeconds":900,"suspendedDurationSeconds":300,"autoResumeEnabled":true}'
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Launching can also be done via the AWS Console or the CLI. I passed the image ARN and an idle policy configured to auto-suspend after 15 minutes of inactivity and auto-resume on the next incoming request. No networking setup was required. Lambda assigned the MicroVM a unique ID, returned a dedicated endpoint URL, and started a new MicroVM with my Flask app already running, since it was resumed from a snapshot. My Flask app was already running the moment the launch completed. One API call to get a fully initialized, bootstrapped compute environment.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone size-large wp-image-104756" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/19/image-04-1024x729.png" alt="" width="1024" height="729"&gt;&lt;/p&gt; 
&lt;p&gt;To send traffic, I generated a short-lived auth token with the CLI and attached it to a plain HTTPS request using the &lt;code&gt;X-aws-proxy-auth&lt;/code&gt; header. The request landed on my Flask app immediately. I then let the MicroVM sit idle past the suspend threshold, at which point the MicroVM was suspended, with its memory and disk state snapshotted and stored. I then sent another request, and it resumed with the application state fully intact. From the client side, the pause never happened.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone size-large wp-image-104757" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/19/image-05-1024x229.png" alt="" width="1024" height="229"&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;How it works&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;Under the covers, Lambda MicroVMs delivers three capabilities that, until today, no single AWS compute service offered together. The first is virtual machine level isolation, which comes from Firecracker. Each session runs in its own dedicated MicroVM with no shared kernel and no shared resources between users, so untrusted code supplied by one user is contained to their execution environment, without access to other environments or the underlying system. The second is rapid launch and resume. The model is image-then-launch: you create a MicroVM Image by supplying a Dockerfile and code packaged as a zip artifact in Amazon S3, and Lambda runs your Dockerfile, initializes your application, and takes a Firecracker snapshot of the running environment’s memory and disk state. Every subsequent MicroVM launched from that image resumes from the pre-initialized snapshot rather than booting cold, which means launches and idle resumes both achieve near-instant startup latency. Even a multi-gigabyte interactive session comes back online quickly enough to feel responsive to the end user. The third is stateful execution. A running MicroVM retains memory, disk, and running processes across the user’s session. During idle periods, a MicroVM can be suspended – with memory and disk state intact – and resumed when traffic arrives. Installed packages, loaded models, and working ﬁlesets are readily available when the user resumes their session. MicroVMs support up to 8 hours of total runtime and can be suspended automatically after a configurable idle window, which makes it straightforward to build products as varied as software vulnerability scans that complete in minutes, data analytics applications that run for hours, and interactive coding sessions with extended idle periods. As Lambda MicroVMs are started from pre-initialized snapshots, applications generating unique content, establishing network connections, or loading ephemeral data during initialization may need to integrate with service-provided hooks for compatibility.&lt;/p&gt; 
&lt;p&gt;Lambda MicroVMs is a new resource within AWS Lambda, with a distinct API surface. Lambda Functions remain the right choice for event-driven, request-response workloads, and Lambda MicroVMs is purpose-built for multi-tenant applications that need to hand each end user or session their own isolated environment to execute user- or AI-generated code. The two complement each other. An application using Lambda Functions for its event-driven backbone can call into Lambda MicroVMs for the steps that need to run untrusted code in isolation. You bring the application, and the service delivers the execution environment.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;Now available&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;AWS Lambda MicroVMs is available today in the US East (N. Virginia, Ohio), US West (Oregon), Europe (Ireland) and Asia Pacific (Tokyo) &lt;a href="https://aws.amazon.com/about-aws/global-infrastructure/regions_az/"&gt;Regions&lt;/a&gt;, on the ARM64 architecture, with up to 16 vCPUs, 32 GB of memory, and 32 GB of disk per MicroVM. Idle MicroVMs can be suspended explicitly through an API call or automatically through a lifecycle policy, which reduces the running cost while preserving full state for fast resume. Pricing details can be found on the &lt;a href="https://aws.amazon.com/lambda/pricing/"&gt;AWS Lambda pricing page&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;To get started, visit the &lt;a href="https://console.aws.amazon.com/lambda/"&gt;AWS Lambda console&lt;/a&gt;, or learn more on the &lt;a href="https://aws.amazon.com/lambda/lambda-microvms"&gt;Lambda MicroVMs product page&lt;/a&gt;. For documentation, see the &lt;a href="https://docs.aws.amazon.com/lambda/latest/dg/lambda-microvms-guide.html"&gt;Lambda MicroVMs Developer Guide&lt;/a&gt;.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS Weekly Roundup: NY Summit recap, Local Zone in Hanoi, Grok 4.3 in Bedrock, price reductions, and more (June 22, 2026)</title>
		<link>https://aws.amazon.com/blogs/aws/aws-weekly-roundup-ny-summit-recap-local-zone-in-hanoi-grok-4-3-in-bedrock-price-reductions-and-more-june-22-2026/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Mon, 22 Jun 2026 14:46:17 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Elastic Container Service]]></category>
		<category><![CDATA[Amazon GameLift]]></category>
		<category><![CDATA[Amazon Simple Storage Service (S3)]]></category>
		<category><![CDATA[AWS Local Zones]]></category>
		<category><![CDATA[AWS Management Console]]></category>
		<category><![CDATA[AWS Marketplace]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Price Reduction]]></category>
		<category><![CDATA[Strands Agents]]></category>
		<category><![CDATA[Week in Review]]></category>
		<guid isPermaLink="false">48f31f6fe142a62eba1b6ad460759328491f8672</guid>

					<description>Last week AWS Summit New York City brought together thousands of customers, partners, and builders for a free, one-day event showcasing the latest in cloud and AI innovation. Dr. Swami Sivasubramanian, VP of Agentic AI at AWS unveiled a stack of AI launches in his keynote, all built around one thesis: agents that compound value […]</description>
										<content:encoded>&lt;p&gt;Last week &lt;a href="https://aws.amazon.com/events/summits/new-york/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Summit New York City&lt;/a&gt; brought together thousands of customers, partners, and builders for a free, one-day event showcasing the latest in cloud and AI innovation. Dr. Swami Sivasubramanian, VP of Agentic AI at AWS unveiled a stack of AI launches in &lt;a href="https://www.youtube.com/watch?v=T25Fn3FvF6I"&gt;his keynote&lt;/a&gt;, all built around one thesis: agents that compound value over time.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104713" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/17/2026-aws-ny-summit-keynote.jpg" alt="" width="1600" height="905"&gt;&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Agents for working&lt;/strong&gt; – You can launch autonomous agents and access a smarter activity feed with &lt;a href="https://aws.amazon.com/blogs/machine-learning/get-back-hours-every-day-with-autonomous-agents-in-amazon-quick/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;new Amazon Quick features&lt;/a&gt;, which now let you create and run multi-step agents directly in the desktop app and consolidates email, Slack, calendar, and tasks into a single prioritized view with personalized rules.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Agents for securing&lt;/strong&gt; – You can shift from reactive to proactive security with AWS Continuum, a new AI-native security service that reasons, validates, and acts at machine speed across the &lt;a href="https://aws.amazon.com/blogs/security/introducing-aws-continuum-security-at-machine-speed/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;full code vulnerability lifecycle&lt;/a&gt;. AWS Security Agent (now part of AWS Continuum) adds &lt;a href="https://aws.amazon.com/blogs/aws/aws-security-agent-adds-threat-modeling-kiro-power-and-claude-code-plugin-and-more?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;new features&lt;/a&gt;: threat modeling; pull request code scanning with remediation across major Git platforms; and IDE integrations via Kiro power, Claude Code plugin, and MCP.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Agents for building&lt;/strong&gt; – You can write, ship, and modernize code in one continuous loop with Kiro, AWS DevOps Agent, and AWS Transform. Kiro introduces a &lt;a href="https://kiro.dev/blog/introducing-kiro-for-ios/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;native iOS app&lt;/a&gt;; AWS DevOps Agent adds &lt;a href="https://aws.amazon.com/blogs/aws/aws-devops-agent-adds-release-management-capabilities-to-assess-code-changes-before-production-preview?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;release management capabilities&lt;/a&gt; to assess code changes before production; and &lt;a href="https://aws.amazon.com/blogs/aws/proactively-reduce-tech-debt-autonomously-with-aws-transform-continuous-modernization-preview?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Transform continuous modernization&lt;/a&gt; reduces tech debt autonomously.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Agents customers create&lt;/strong&gt; – You can go from agent idea to production in minutes with Amazon Bedrock AgentCore, which now includes a &lt;a href="https://aws.amazon.com/blogs/machine-learning/amazon-bedrock-agentcore-harness-is-now-generally-available-go-from-idea-to-production-grade-agent-in-minutes/"&gt;GA harness&lt;/a&gt; for infrastructure and orchestration, &lt;a href="https://aws.amazon.com/blogs/aws/announcing-web-search-on-amazon-bedrock-agentcore-ground-your-ai-agents-in-current-accurate-web-knowledge?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Web Search&lt;/a&gt;, &lt;a href="https://aws.amazon.com/blogs/aws/introducing-amazon-bedrock-managed-knowledge-base-for-faster-more-accurate-enterprise-ai-applications?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Managed Knowledge Base&lt;/a&gt;, &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-agentcore-policy-guardrails-generally-available?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;policy integrations with Guardrails&lt;/a&gt;, and the new &lt;a href="https://aws.amazon.com/blogs/machine-learning/context-intelligence-for-your-data-and-ai-agents-at-scale/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Context service&lt;/a&gt; for mapping organizational data relationships.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;To learn more, visit the Summit recap from our &lt;a href="https://aws.amazon.com/blogs/aws/top-announcements-of-the-aws-summit-in-new-york-2026/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;top announcements blog post&lt;/a&gt; and &lt;a href="https://www.aboutamazon.com/news/aws/aws-summit-nyc-2026-ai-agents?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon News post&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Last week’s launches&lt;/strong&gt;&lt;br&gt; Here are last week’s launches that caught my attention:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-local-zones-hanoi-vietnam/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Local Zone in Hanoi, Vietnam&lt;/a&gt; – This new Local Zone is one of the first AWS Local Zones in the Asia Pacific with support for Amazon S3 and Amazon EBS Local Snapshots, enabling customers to meet data residency requirements by storing and backing up data locally. To get started, enable the Hanoi Local Zone (&lt;code&gt;ap-southeast-1-han-1a&lt;/code&gt;) from the Regions and Zones tab in the AWS Global View or by using the ModifyAvailabilityZoneGroup API.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-blocks-preview/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Blocks, an open-source TypeScript framework for application developers (preview)&lt;/a&gt; – AWS Blocks runs a fully functional local environment with Postgres, authentication, and real-time messaging, no AWS account required. When you’re ready to deploy, the same application code runs on production AWS services with zero changes, and you can drop into AWS CDK at any point for direct resource configuration.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/grok-amazon-bedrock/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Grok 4.3 from xAI in Amazon Bedrock&lt;/a&gt; – You can use the Grok 4.3 model on Amazon Bedrock, giving you even more choice as you build generative AI applications across reasoning, agentic, and enterprise workflows. Grok 4.3 runs on a new inference engine in Bedrock designed for price performance, with support for tool calling, structured output, and response streaming.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/amazon-s3-annotations-attach-rich-queryable-context-directly-to-your-objects/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon S3 annotations: attach rich, queryable context directly to your objects&lt;/a&gt; – Amazon S3 now lets you attach up to 1 GB of rich, mutable, and queryable context directly to your objects using annotations, purpose-built for AI agents and autonomous workflows that need to discover, understand, and act on data at scale without maintaining separate metadata systems.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/amazon-ecs-introduces-new-high-resolution-metrics-for-faster-service-auto-scaling/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon ECS announces faster service auto scaling&lt;/a&gt; – Amazon ECS service auto scaling now detects and responds to load changes faster with support for high resolution (20-second) metrics and metric publishing optimizations. In AWS benchmarking tests, time to trigger scale-out improved from 363 seconds to 86 seconds (76% faster), and total time to scale and provision new tasks improved from 386 seconds to 109 seconds (72% faster).&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-g7-instances-accelerated-by-nvidia-rtx-pro-4500-blackwell-server-edition-gpus/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon EC2 G7 instances accelerated by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs&lt;/a&gt; – AWS is the first major cloud provider to support NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs. G7 instances are accelerated by these GPUs with custom sixth-generation Intel Xeon Scalable processors, delivering up to 4.6x AI inference performance and up to 2.1x graphics performance compared to G6 instances.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://strandsagents.com/blog/reduced-cost-better-isolation-more-resilience/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Strands Agents introduces new capabilities&lt;/a&gt; – Strands is an open source toolkit for building production agents. You can now use better context management in Harness SDK, a new isolated execution environment with Strands Shell, and chaos testing and red teaming in Strands Evals.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-management-console-private/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Management Console Private Access&lt;/a&gt; – You can access the AWS Console from VPCs without internet connectivity, allowing enterprises to manage their AWS infrastructure through the console while maintaining strict network security controls in air-gapped environments.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-marketplace-storefront/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Marketplace Storefront is now generally available&lt;/a&gt; – AWS Partners can create and deploy their own branded catalog of solutions and services on their website or application in hours. Channel Partners and Independent Software Vendors can now simplify how they manage their cloud marketplace business and make it easier for customers to discover and purchase their solutions from AWS Marketplace.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-route-53-resolver-dns/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt; Palo Alto Networks (PANW) Advanced DNS Security on Amazon Route 53 Resolver DNS Firewall (preview)&lt;/a&gt; – You can now enforce DNS threat protections from Palo Alto Networks directly on Route 53 DNS Firewall rules, without deploying separate firewalls or modifying VPC configurations — by subscribing to PANW from the DNS Firewall console through the embedded AWS Marketplace widget.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;For a full list of AWS announcements, be sure to keep an eye on the &lt;a href="https://aws.amazon.com/new/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;What’s New with AWS&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Price reductions&amp;nbsp;&lt;/strong&gt;&lt;br&gt; AWS continues to look for ways to increase performance and lower prices for our customers. I noticed a few such efforts last week, so I’d like to share them:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/s3-vectors-reduces-query-charges-80-percent-large-indexes/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon S3 Vectors reduces query charges by up to 80% for large vector indexes&lt;/a&gt; – This reduction lowers costs for customers running similarity search across large-scale AI, RAG, and semantic search workloads. The new pricing applies automatically with no application changes required.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-gamelift-servers-free-network-bandwidth/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt; Amazon GameLift Servers introduces free network bandwidth&lt;/a&gt; – Amazon GameLift Servers provides network bandwidth in and out of AWS at no additional charge for all instance types from generation 6 and later, including On-Demand and Spot, with no commitment required. You now pay only for your Amazon GameLift Servers instance hours; all network bandwidth is free.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/reduce-listing-fee-professional-services-aws-marketplace/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Marketplace reduces listing fee for professional services to 0.5% from 2.5%&lt;/a&gt; – This reduction makes it more cost-effective for consulting partners, systems integrators, managed services providers and independent software vendors to transact their services through AWS Marketplace, while retaining the procurement and billing benefits that come with it.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;Learn more about AWS, browse and join upcoming &lt;a href="https://aws.amazon.com/events/explore-aws-events/?refid=e61dee65-4ce8-4738-84db-75305c9cd4fe"&gt;AWS-led in-person and virtual events&lt;/a&gt;, &lt;a href="https://aws.amazon.com/startups/events?tab=upcoming?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;startup events&lt;/a&gt;, and &lt;a href="https://builder.aws.com/connect/events?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;amp;sc_channel=el"&gt;developer-focused events&lt;/a&gt; as well as &lt;a href="https://aws.amazon.com/events/summits/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Summits&lt;/a&gt; and &lt;a href="https://aws.amazon.com/events/community-day/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Community Days&lt;/a&gt;. Join the &lt;a href="https://builder.aws.com/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Builder Center&lt;/a&gt; to connect with builders, share solutions, and access content that supports your development.&lt;/p&gt; 
&lt;p&gt;That’s all for this week. Check back next Monday for another &lt;a href="https://aws.amazon.com/blogs/aws/tag/week-in-review/?trk=39d9c26c-b157-46ae-bde6-9cf598f5c9e0&amp;amp;sc_channel=el"&gt;Weekly Roundup&lt;/a&gt;!&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://linkedin.com/in/channy/"&gt;Channy&lt;/a&gt;&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Announcing Amazon EC2 G7 instances accelerated by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs</title>
		<link>https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-g7-instances-accelerated-by-nvidia-rtx-pro-4500-blackwell-server-edition-gpus/</link>
					
		
		<dc:creator><![CDATA[Daniel Abib]]></dc:creator>
		<pubDate>Thu, 18 Jun 2026 21:22:10 +0000</pubDate>
				<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Compute]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">9c5d4385212a06a322e30ee7f64a694ffd756b6d</guid>

					<description>Announcing the general availability of Amazon Elastic Compute Cloud (Amazon EC2) G7 instances, delivering high performance GPU acceleration for AI inference, graphics, and data analytics workloads.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing the general availability of &lt;a href="https://aws.amazon.com/ec2/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Elastic Compute Cloud (Amazon EC2)&lt;/a&gt; G7 instances, delivering high performance GPU acceleration for AI inference, graphics, and data analytics workloads.&lt;/p&gt; 
&lt;p&gt;AWS is the first major cloud provider to support NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs. G7 instances are accelerated by these GPUs with custom sixth-generation Intel Xeon Scalable processors, delivering up to 4.6x AI inference performance and up to 2.1x graphics performance compared to &lt;a href="https://aws.amazon.com/ec2/instance-types/g6/"&gt;G6 instances&lt;/a&gt;. G7 instances also deliver faster performance for GPU-accelerated analytics on &lt;a href="https://aws.amazon.com/emr/"&gt;Amazon EMR&lt;/a&gt; on &lt;a href="https://aws.amazon.com/eks/"&gt;Amazon Elastic Kubernetes Service (Amazon EKS)&lt;/a&gt;. G7 instances are well suited for a broad range of GPU-enabled workloads including AI inference, graphics rendering, video transcoding and analytics, spatial computing, virtual desktop infrastructure (VDI), and data analytics.&lt;/p&gt; 
&lt;p&gt;Here are improvements of G7 instances compared to previous generation:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Faster GPU memory&lt;/strong&gt;: NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs offer 1.33 times the GPU memory capacity and 2.45 times the GPU memory bandwidth compared to G6 instances. With 32 GB of GPU memory per GPU, 5th Gen Tensor Cores, and 4th Gen RT Cores, G7 instances deliver enhanced AI inference and graphics performance.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;High performance networking and storage&lt;/strong&gt;: G7 instances come with 700 Gbps of EFA-enabled networking throughput (7x compared to G6) enabling the low-latency, high-bandwidth connectivity that AI inference, graphics-intensive applications, and GPU-accelerated data analytics workloads need to perform at their best. G7 instances support up to 7.6 TB local NVMe SSD storage, enabling you to keep large models and datasets close to compute, reduce data transfer overhead, and improve throughput.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Advanced video encoding and decoding engines&lt;/strong&gt;: Ninth-generation NVENC and sixth-generation NVDEC engines support 4:2:2 encoding and decoding for high-resolution video workflows, delivering 1.5x concurrent video streams compared to previous-generation G6 instances.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;EC2 G7 instance specifications&lt;/strong&gt;&lt;br&gt; G7 instances feature up to 8 NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs with up to 256 GB of total GPU memory (32 GB of memory per GPU) and custom Intel Xeon Scalable processors. They also are available in 7 sizes and support up to 192 vCPUs, up to 700 Gbps of network bandwidth, up to 768 GiB of system memory, and up to 7.6 TB of local NVMe SSD storage.&lt;/p&gt; 
&lt;p&gt;Here are the specs:&lt;/p&gt; 
&lt;table style="border: 2px solid black;border-collapse: collapse;margin-left: auto;margin-right: auto"&gt; 
 &lt;tbody&gt; 
  &lt;tr style="border-bottom: 1px solid black;background-color: #e0e0e0"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Instance name&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;GPUs&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;GPU memory (GB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;vCPUs&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Memory (GiB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Storage&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;EBS bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Network bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.2xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 600&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 60&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.4xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 600&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 100&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.8xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;128&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 950&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 100&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.12xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;48&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 1900&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;20&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;175&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.24xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;4&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;128&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;96&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;384&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 3800&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;40&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;350&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.48xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;256&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2 x 3800&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;80&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;700&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;g7.metal*&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;256&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2 x 3800&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;80&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;700&lt;/td&gt; 
  &lt;/tr&gt; 
 &lt;/tbody&gt; 
&lt;/table&gt; 
&lt;p&gt;* Coming soon&lt;/p&gt; 
&lt;p&gt;G7 instances support NVIDIA GPUDirect P2P for multi-GPU sizes, NVIDIA GPUDirect RDMA with EFA, and GPUDirect RDMA with EFA for &lt;a href="https://aws.amazon.com/fsx/lustre/"&gt;Amazon FSx for Lustre&lt;/a&gt;, enabling low-latency GPU-to-GPU communication for multi-GPU and multi-node workloads.&lt;/p&gt; 
&lt;p&gt;To get started with G7 instances, you can use the &lt;a href="https://aws.amazon.com/ai/machine-learning/amis/"&gt;AWS Deep Learning AMIs (DLAMI)&lt;/a&gt; or &lt;a href="https://aws.amazon.com/marketplace/pp/prodview-z4aq5h62z2nv6"&gt;NVIDIA Workstation AMIs&lt;/a&gt; with prepackaged GPU drivers for your AI inference and graphics workloads. To use G7 instances with Amazon EKS, build EKS AMIs with NVIDIA driver version R595 with &lt;a href="https://docs.aws.amazon.com/eks/latest/userguide/eks-ami-build-scripts.html"&gt;EKS-provided automation&lt;/a&gt;&lt;strong&gt;.&lt;/strong&gt; G7 instances support multiple operating systems including Amazon Linux, Ubuntu, RHEL, and Windows Server, with comprehensive NVIDIA driver integration providing compatibility with industry-standard graphics libraries including DirectX, Vulkan, and OpenGL.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Get started today&lt;/strong&gt;&lt;br&gt; You can start using Amazon EC2 G7 instances today in two AWS regions: US East (Ohio) and US West (Oregon). To check future Regional expansion plans, look up the instance type in the &lt;a href="https://aws.amazon.com/pt/cloudformation/"&gt;&lt;strong&gt;CloudFormation&lt;/strong&gt;&lt;/a&gt; resources tab on the &lt;a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/"&gt;AWS Capabilities by Region&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;G7 instances are offered through multiple purchasing options, including &lt;a href="https://aws.amazon.com/ec2/pricing/on-demand/"&gt;On-Demand&lt;/a&gt;, &lt;a href="https://aws.amazon.com/savingsplans/compute-pricing/"&gt;Savings Plans&lt;/a&gt;, and &lt;a href="https://aws.amazon.com/ec2/spot/pricing/"&gt;Spot Instances&lt;/a&gt;. &lt;a href="https://aws.amazon.com/ec2/pricing/dedicated-instances/"&gt;Dedicated Instances&lt;/a&gt; are also supported for the &lt;code&gt;12xlarge&lt;/code&gt;, &lt;code&gt;24xlarge&lt;/code&gt;, and &lt;code&gt;48xlarge&lt;/code&gt; sizes. For detailed pricing, visit the &lt;a href="https://aws.amazon.com/ec2/pricing/"&gt;Amazon EC2 Pricing&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;Ready to get started? Launch G7 instances from the &lt;a href="https://console.aws.amazon.com/ec2/"&gt;Amazon EC2 console&lt;/a&gt;. For more details, head over to the &lt;a href="https://aws.amazon.com/ec2/instance-types/g7/"&gt;Amazon EC2 G7 instances&lt;/a&gt; page. We’d love to hear your feedback. Share it on &lt;a href="https://repost.aws/tags/TAO-wqN9fYRoyrpdULLa5y7g/amazon-ec-2?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for EC2&lt;/a&gt; or reach out through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;– Daniel Abib&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Amazon ECS introduces new high-resolution metrics for faster service auto scaling</title>
		<link>https://aws.amazon.com/blogs/aws/amazon-ecs-introduces-new-high-resolution-metrics-for-faster-service-auto-scaling/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Thu, 18 Jun 2026 21:06:38 +0000</pubDate>
				<category><![CDATA[Amazon Elastic Container Service]]></category>
		<category><![CDATA[Auto Scaling]]></category>
		<category><![CDATA[Compute]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">abebfba02b3b7c0c3c3ed08a23aeb25f8830da3f</guid>

					<description>Amazon Elastic Container Service (Amazon ECS) service auto scaling automatically adjusts task counts to meet workload demand with comprehensive scaling policies, including predictive scaling for recurring traffic patterns, scheduled scaling for planned events, and target tracking to scale dynamically on real-time metrics. You can choose proactive scaling by using predictive scaling (automatic) and scheduled scaling […]</description>
										<content:encoded>&lt;p&gt;&lt;a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/service-auto-scaling.html"&gt;Amazon Elastic Container Service (Amazon ECS) service auto scaling&lt;/a&gt; automatically adjusts task counts to meet workload demand with comprehensive scaling policies, including predictive scaling for recurring traffic patterns, scheduled scaling for planned events, and target tracking to scale dynamically on real-time metrics.&lt;/p&gt; 
&lt;p&gt;You can choose proactive scaling by using &lt;a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/predictive-auto-scaling.html"&gt;predictive scaling&lt;/a&gt; (automatic) and &lt;a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/service-autoscaling-schedulescaling.html"&gt;scheduled scaling&lt;/a&gt; (customer-defined), or reactive scaling by using &lt;a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/service-autoscaling-targettracking.html"&gt;target tracking&lt;/a&gt; with just a target to scale on. Amazon ECS service auto scaling adjusts the number of tasks in an ECS service based on &lt;a href="https://aws.amazon.com/cloudwatch/"&gt;Amazon CloudWatch&lt;/a&gt; metrics, such as average CPU/Memory usage, request count per target, a custom metric such as queue depth, or demand surges by using advanced machine learning (ML) algorithms.&lt;/p&gt; 
&lt;p&gt;With today’s launch, Amazon ECS service auto scaling now detects and responds to load changes faster with support for high resolution (20-second) metrics and metric publishing optimizations. In AWS benchmarking tests, time to trigger scale-out improved from 363 seconds to 86 seconds (76% faster, 4.2x), and total time to scale and provision new tasks improved from 386 seconds to 109 seconds (72% faster, 3.5x)&lt;/p&gt; 
&lt;p&gt;This launch delivers three key benefits for your applications:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Improved performance and reliability&lt;/strong&gt;: Faster scaling means, your application responds faster to demand surges, reducing latencies or failures for end users during demand surges.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Right-size without compromise&lt;/strong&gt;: Depending on the workload, you can reduce baseline task counts because scale-out now happens fast enough to handle traffic spikes without preemptive capacity padding. This directly reduces compute costs while maintaining application performance and availability.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Simpler scaling configuration&lt;/strong&gt;: Target tracking with high-resolution metrics delivers the aggressive scaling behavior that previously required custom scaling configurations, such as usage of step-scaling policies. One configuration change replaces custom engineering work.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;How it works&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; To use ECS faster service auto scaling, first enable high-resolution metrics for your ECS service, and then configure a target tracking scaling policy which uses high-resolution metrics. ECS faster service autoscaling works across all compute options on ECS: &lt;a href="https://aws.amazon.com/fargate/"&gt;AWS Fargate&lt;/a&gt;, &lt;a href="https://aws.amazon.com/ecs/managed-instances/"&gt;ECS Managed Instances&lt;/a&gt;, and &lt;a href="https://aws.amazon.com/ec2"&gt;Amazon Elastic Compute Cloud (Amazon EC2)&lt;/a&gt;. You can enable these metrics when you create or update your ECS service in the &lt;a href="https://console.aws.amazon.com/ecs"&gt;Amazon ECS console&lt;/a&gt;, or using &lt;a href="https://docs.aws.amazon.com/sdkref/latest/guide/version-support-matrix.html"&gt;AWS SDKs and tools&lt;/a&gt;, and &lt;a href="https://aws.amazon.com/cloudformation/"&gt;AWS CloudFormation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;When you create a service in the console, add 20-seconds resolution metrics in the &lt;strong&gt;Monitoring configuration&lt;/strong&gt; section. These metrics incur additional CloudWatch costs while the standard resolution (60-seconds) is free.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104695 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/17/2026-ecs-fast-autoscaling-metrics-create-service-1.jpg" alt="" width="1661" height="2560"&gt;&lt;/p&gt; 
&lt;p&gt;In the &lt;strong&gt;Service auto scaling&lt;/strong&gt; section, check &lt;strong&gt;Use service auto scaling&lt;/strong&gt; and choose &lt;strong&gt;Target Tracking&lt;/strong&gt; for the scaling policy type to use real-time data to scale the number of tasks that your service runs based on demand.&lt;/p&gt; 
&lt;p&gt;Then, choose a&lt;strong&gt; Scaling policy type&lt;/strong&gt; for the target tracking. You can select &lt;code&gt;ECSServiceAverageCPUUtilizationHighResolution&lt;/code&gt; or &lt;code&gt;ECSServiceAverageMemoryUtilizationHighResolution&lt;/code&gt; as new metrics.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104696 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/17/2026-ecs-fast-autoscaling-metrics-create-service-2.jpg" alt="" width="1800" height="2380"&gt;&lt;/p&gt; 
&lt;p&gt;That’s it. Your ECS service will use high resolution metrics for auto scaling.&lt;/p&gt; 
&lt;p&gt;To update an existing ECS service to use faster auto scaling, you first need to configure high resolution metrics via &lt;strong&gt;Update Service&lt;/strong&gt;. Once deployment completes, your service will generate high-resolution metrics. You can then go to the &lt;strong&gt;Service and auto scaling&lt;/strong&gt; tab from your service details to update scaling policy to use higher resolution metrics.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104200 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/03/2026-ecs-fast-autoscaling-metrics-1.png" alt="" width="2007" height="1685"&gt;&lt;/p&gt; 
&lt;p&gt;That’s all you need. Your ECS service now evaluates scaling decisions at 20-second intervals.&lt;/p&gt; 
&lt;p&gt;You can also use the &lt;a href="https://aws.amazon.com/cli"&gt;AWS Command Line Interface (AWS CLI)&lt;/a&gt; to enable new metrics in your ECS service through Application Auto Scaling. To learn more, visit the &lt;a href="https://docs.aws.amazon.com/AmazonECS/latest/developerguide/target-tracking-faster-auto-scaling.html"&gt;faster auto scaling documentation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Faster service autoscaling with high-resolution metrics for Amazon ECS is available today. The feature itself has no additional cost, but high-resolution CloudWatch metrics introduce a new pricing dimension. For details, see the &lt;a href="https://aws.amazon.com/cloudwatch/pricing/"&gt;CloudWatch pricing&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;Give it a try today&amp;nbsp;and send feedback to &lt;a href="https://repost.aws/tags/TAefn4YSprR-uCBYmbofOpHw/amazon-elastic-container-service?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for ECS&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://linkedin.com/in/channy"&gt;Channy&lt;/a&gt;&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Top announcements of the AWS Summit in New York, 2026</title>
		<link>https://aws.amazon.com/blogs/aws/top-announcements-of-the-aws-summit-in-new-york-2026/</link>
					
		
		<dc:creator><![CDATA[AWS News Blog Team]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 16:36:08 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Bedrock AgentCore]]></category>
		<category><![CDATA[Amazon Simple Storage Service (S3)]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[AWS Transform]]></category>
		<category><![CDATA[AWS WAF]]></category>
		<category><![CDATA[Kiro]]></category>
		<category><![CDATA[Strands Agents]]></category>
		<guid isPermaLink="false">5c0395519cc4bb7cb222916f6a9f1cc174957d0f</guid>

					<description>A recap of the top announcements from AWS's New York Summit 2026</description>
										<content:encoded>&lt;p&gt;Today at the &lt;a href="https://aws.amazon.com/events/summits/new-york/"&gt;AWS Summit in New York City&lt;/a&gt;, Swami Sivasubramanian, AWS VP of Agentic AI, provided the day’s keynote.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104713" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/17/2026-aws-ny-summit-keynote.jpg" alt="" width="1600" height="905"&gt;&lt;/p&gt; 
&lt;p style="text-align: left"&gt;Here’s our roundup of the biggest announcements from the event:&lt;/p&gt; 
&lt;p&gt;&lt;iframe loading="lazy" title="AWS Summit New York City - Keynote | Amazon Web Services" width="500" height="281" src="https://www.youtube-nocookie.com/embed/T25Fn3FvF6I?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen sandbox="allow-scripts allow-same-origin"&gt;&lt;/iframe&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;New in agents customers create&lt;br&gt; &lt;/strong&gt;We’re introducing new capabilities on Amazon Bedrock AgentCore: connecting AI agents to organizational, web, and paid knowledge, helping teams find and fix what’s going wrong in production, and enforcing controls that scale as agents grow more capable.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-133707 size-full" src="https://d2908q01vomqb2.cloudfront.net/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59/2026/06/16/knowledge-layers.png" alt="" width="3200" height="1800"&gt;&lt;/p&gt; 
&lt;p&gt;Together, these capabilities help you build more capable agents faster, govern those agents with controls that scale, and improve them continuously. To learn more, read our &lt;a href="https://aws.amazon.com/blogs/machine-learning/new-in-amazon-bedrock-agentcore-build-agents-with-broader-knowledge-and-continuous-learning/"&gt;blog post&lt;/a&gt; covering all the new features.&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/introducing-amazon-bedrock-managed-knowledge-base-for-faster-more-accurate-enterprise-ai-applications"&gt;Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications&lt;/a&gt; – You can build enterprise RAG pipelines with the managed Knowledge Base on Bedrock. It provides native data connectors, Smart Parsing for automatic multi-format data preparation, and an Agentic Retriever for complex multi-step queries, all integrated with AgentCore Gateway so developers can focus on business outcomes rather than infrastructure management.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/announcing-web-search-on-amazon-bedrock-agentcore-ground-your-ai-agents-in-current-accurate-web-knowledge"&gt;Announcing Web Search on Amazon Bedrock AgentCore: Ground your AI agents in current, accurate web knowledge&lt;/a&gt; – You can use a fully managed web search tool that enables agents to ground responses in current, cited web knowledge with zero data egress from customer’ secured AWS environment. You can focus on building agents instead of manually adding web search to agents on Bedrock AgentCore and managing its infrastructure.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/aws-waf-adds-ai-traffic-monetization-capability-to-help-content-owners-charge-ai-bots-for-content-access/" rel="bookmark"&gt;AWS WAF adds AI traffic monetization capability to help content owners charge AI bots for content access&lt;/a&gt; – You can use a new Bot Control capability that enables content providers and publishers price, meter, and collect payment from AI bots and agents accessing their content and APIs. AWS WAF now lets you set a price for that access, accept payment through third-party providers, and grant scoped access directly at the edge.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/machine-learning/amazon-bedrock-agentcore-harness-is-now-generally-available-go-from-idea-to-production-grade-agent-in-minutes/"&gt;Amazon Bedrock AgentCore harness in now generally available&lt;/a&gt; – You can do building and running production-grade AI agents in minutes, without coding orchestration loops, by defining your agent’s model, tools, skills, and instructions in configuration, with Bedrock AgentCore harness.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/machine-learning/context-intelligence-for-your-data-and-ai-agents-at-scale/"&gt;Coming soon: AWS Context&lt;/a&gt; – This is a new service that automatically maps the relationships across your existing data into a knowledge graph and provides agentic search so AI agents in the organization can access governed data relationships, business rules, and domain knowledge at runtime.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;New in agents for securing&lt;/strong&gt;&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/security/introducing-aws-continuum-security-at-machine-speed/"&gt;Introducing AWS Continuum: Security at machine speed&lt;/a&gt; – AWS Continuum for code vulnerabilities, available in a gated preview, takes findings from across your environment, prioritizes by business impact, proves which are exploitable, and drives a fix through your own process.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/aws-security-agent-adds-threat-modeling-kiro-power-and-claude-code-plugin-and-more"&gt;AWS Security Agent (now part of AWS Continuum) adds threat modeling, Kiro power and Claude Code plugin, and more&lt;/a&gt; – You can generate the new threat modeling (preview) to understand the full context of your application and identify threats with recommended mitigations using the STRIDE framework. You can also use pull request code scanning with remediation across major Git platforms, and IDE integrations via Kiro power, Claude Code plugin, and MCP, letting developers run security reviews and fix issues without context switching.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;New in agents for building&lt;/strong&gt;&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://kiro.dev/blog/introducing-kiro-for-ios/"&gt;Introducing Kiro for iOS&lt;/a&gt; – Kiro introduces a native iOS app, available in a gated preview, built for real engineering work that gives developers a new surface to kick off, monitor, steer, and interact with their Kiro sessions directly from their phone. That means you can now start sessions, check back when they’re done, review diffs, and approve changes all while staying connected to your work with no laptop running.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/aws-devops-agent-adds-release-management-capabilities-to-assess-code-changes-before-production-preview"&gt;AWS DevOps Agent adds release management capabilities to assess code changes before production&lt;/a&gt; – You can use a new release readiness review of code changes and autonomous release testing. These new features verify every change against the natural language standards you give to the DevOps Agent and run change-specific tests in production-like environments.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/proactively-reduce-tech-debt-autonomously-with-aws-transform-continuous-modernization-preview"&gt;Proactively reduce tech debt autonomously with AWS Transform – continuous modernization&lt;/a&gt; – You can use continuous analysis (preview) to automatically scan your code repositories against configurable baselines and generates findings in hours, not weeks. Once you’ve identified and prioritized findings, you can configure autonomous remediations that generate pull requests for affected repositories automatically.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;New in agents for works&lt;/strong&gt;&lt;/p&gt; 
&lt;p&gt;With the launch of Amazon Quick’s new autonomous agents, you can create agents that work in the background with specific expertise, tone, and access to tools. You can create a finance agent to process orders as they come in, or a sales agent monitoring interactions across your CRM, emails, and Slacks to proactively draft follow-ups, flag risks, or recommend next steps.&lt;/p&gt; 
&lt;p&gt;We are also releasing a new activity feed that is tailored to how you work. It consolidates email, messaging, calendar, and tasks into a single prioritized view, learns which messages you always answer fast, which threads you skip, and what topics drive your week.&lt;/p&gt; 
&lt;p&gt;To learn more, look the &lt;a href="https://aws.amazon.com/blogs/machine-learning/get-back-hours-every-day-with-autonomous-agents-in-amazon-quick/"&gt;demo of Amazon Quick – AI Assistant&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;In addition to the keynote announcements, we have other important launches this week:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/amazon-s3-annotations-attach-rich-queryable-context-directly-to-your-objects/"&gt;Amazon S3 annotations: attach rich, queryable context directly to your objects&lt;/a&gt; – Amazon S3 now lets you attach up to 1 GB of rich, mutable, and queryable context directly to your objects using annotations, purpose-built for AI agents and autonomous workflows that need to discover, understand, and act on data at scale without maintaining separate metadata systems.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/amazon-ecs-introduces-new-high-resolution-metrics-for-faster-service-auto-scaling/"&gt;Amazon ECS announces faster service auto scaling&lt;/a&gt; – Amazon ECS service auto scaling now detects and responds to load changes faster with support for high resolution (20-second) metrics and metric publishing optimizations. In AWS benchmarking tests, time to trigger scale-out improved from 363 seconds to 86 seconds (76% faster, 4.2x), and total time to scale and provision new tasks improved from 386 seconds to 109 seconds (72% faster, 3.5x).&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/announcing-amazon-ec2-g7-instances-accelerated-by-nvidia-rtx-pro-4500-blackwell-server-edition-gpus/"&gt;Amazon EC2 G7 instances accelerated by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs&lt;/a&gt; – AWS is the first major cloud provider to support NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs. G7 instances are accelerated by these GPUs with custom sixth-generation Intel Xeon Scalable processors, delivering up to 4.6x AI inference performance and up to 2.1x graphics performance compared to G6 instances.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://strandsagents.com/blog/reduced-cost-better-isolation-more-resilience/"&gt;Strands Agents introduces new capabilities&lt;/a&gt; – Strands is an open source toolkit for building production agents. You can use now better context management in the &lt;a href="https://github.com/strands-agents/harness-sdk"&gt;Harness SDK&lt;/a&gt;, a new isolated execution environment with &lt;a href="https://github.com/strands-agents/shell"&gt;Strands Shell&lt;/a&gt;, and chaos testing and red teaming in &lt;a href="https://github.com/strands-agents/evals"&gt;Strands Evals&lt;/a&gt;.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June 18, 2026&lt;/strong&gt; — Added new important launches on June 18.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications</title>
		<link>https://aws.amazon.com/blogs/aws/introducing-amazon-bedrock-managed-knowledge-base-for-faster-more-accurate-enterprise-ai-applications/</link>
					
		
		<dc:creator><![CDATA[Daniel Abib]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 15:09:20 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock AgentCore]]></category>
		<category><![CDATA[Amazon Bedrock Knowledge Bases]]></category>
		<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Generative AI]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">4c24fcdc8f84279cf19adcd9af7c745709c8ab27</guid>

					<description>Amazon Bedrock's new Fully Managed Knowledge Bases simplifies building enterprise RAG pipelines by providing native data connectors Smart Parsing for automatic multi-format data preparation, and an Agentic Retriever for complex multi-step queries—all integrated with AgentCore Gateway so developers can focus on business outcomes rather than infrastructure management.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing &lt;a href="https://aws.amazon.com/bedrock/knowledge-bases/"&gt;Amazon Bedrock Managed Knowledge Base&lt;/a&gt;, a new set of capabilities that enables developers to build enterprise-grade generative AI applications with their proprietary data in minutes. Organizations building agentic AI applications need secure, reliable, and up-to-date access to enterprise-wide data to deliver accurate, fast, and trusted outcomes. Managed Knowledge Base abstracts away the complexity of building and managing retrieval-augmented generation (RAG) pipelines, allowing developers to focus on business outcomes rather than infrastructure management.&lt;/p&gt; 
&lt;p&gt;Developers building knowledge bases for their agents face three key challenges today:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Connecting to enterprise data&lt;/strong&gt;: Enterprise knowledge lives across disparate systems with different content types, access control lists, and document formats. Building and maintaining custom connectors for each source adds complexity that slows down development.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Optimizing RAG accuracy&lt;/strong&gt;: Best practices for retrieval-augmented generation keep evolving. Developers need to experiment with different parsing strategies, chunking approaches, embedding models, and agentic retrieval behaviors to get accurate answers from their data.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Managing infrastructure at scale&lt;/strong&gt;: Organizations need to serve large knowledge bases with millions of documents, or manage thousands of smaller knowledge bases across teams. Both patterns require reliable infrastructure, security enforcement, and cost control.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;These challenges require developers to repeatedly perform undifferentiated work instead of focusing on their applications.&lt;/p&gt; 
&lt;p&gt;Amazon Bedrock Managed Knowledge Base addresses these challenges by abstracting away the multiple infrastructure components developers traditionally have to assemble and maintain themselves (storage, retrieval, embeddings, re-ranking, and foundation model selection) into a single managed primitive. By default, the service automatically selects and manages a default embeddings model, re-ranker model, and foundational model on your behalf, so you can get up to speed quickly without needing to pick or maintain one yourself. On top of this managed foundation, three core innovations further improve ease of use and accuracy:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Native data connectors&lt;/strong&gt;: Six pre-built ingestion connectors that natively pull enterprise data and permissions from SaaS applications, eliminating the overhead developers face in managing application-specific requirements. At launch, we support Amazon S3, SharePoint, Confluence, Web Crawler, Google Drive, and OneDrive.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Smart Parsing&lt;/strong&gt;: Different content types and sources require different approaches to achieve accurate retrieval. Smart Parsing handles this complexity automatically, selecting the right parsing strategy for each data type and connector to provide the highest accuracy for your agents.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Agentic Retriever&lt;/strong&gt;: Optimized for complex queries that require multiturn, multihop retrieval within a single knowledge base or across multiple knowledge bases. Agentic Retriever automatically infers end-user intent and draws relevant context from institutional knowledge spread across data sources and modalities.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;With just a few lines of code, Amazon Bedrock Managed Knowledge Base automatically manages and scales the end-to-end RAG pipeline that powers your enterprise knowledge agents. For agent builders, it’s available as a pre-built target type in &lt;a href="https://aws.amazon.com/bedrock/agentcore/"&gt;Amazon Bedrock AgentCore Gateway&lt;/a&gt;, reducing integration to a few lines of code, auto-generating role-based permissions, and providing observability and evaluation metrics in the AgentCore Observability dashboard.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Getting started with Amazon Bedrock Managed Knowledge Base&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Creating a Managed Knowledge Base is straightforward. Navigate to the &lt;a href="https://console.aws.amazon.com/bedrock-agentcore/"&gt;Amazon Bedrock AgentCore console&lt;/a&gt; or the &lt;a href="https://console.aws.amazon.com/bedrock/"&gt;Amazon Bedrock console&lt;/a&gt;, open the &lt;strong&gt;Knowledge Bases&lt;/strong&gt; page, and choose &lt;strong&gt;Create Managed KB&lt;/strong&gt;. The experience is the same in both consoles.&lt;/p&gt; 
&lt;div id="attachment_104776" style="width: 1810px" class="wp-caption aligncenter"&gt;
 &lt;img aria-describedby="caption-attachment-104776" loading="lazy" class="wp-image-104776 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/19/2026-bedrock-fmkb-1-1.jpg" alt="" width="1800" height="1141"&gt;
 &lt;p id="caption-attachment-104776" class="wp-caption-text"&gt;Picture 1 – Knowledge Bases list page in the Amazon Bedrock AgentCore console showing the Type column with different KB types and the Create Managed KB button&lt;/p&gt;
&lt;/div&gt; 
&lt;p&gt;When creating a new Knowledge Bases, you can connect to your enterprise data sources by choosing from the list of supported connectors directly from a dropdown. &lt;a href="https://aws.amazon.com/iam"&gt;AWS Identity and Access Management (IAM)&lt;/a&gt; roles are automatically created, and you can choose to edit these permissions if needed:&lt;/p&gt; 
&lt;div id="attachment_104777" style="width: 1810px" class="wp-caption aligncenter"&gt;
 &lt;img aria-describedby="caption-attachment-104777" loading="lazy" class="wp-image-104777 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/19/2026-bedrock-fmkb-2-1.jpg" alt="" width="1800" height="1297"&gt;
 &lt;p id="caption-attachment-104777" class="wp-caption-text"&gt;Picture 2 – Create Knowledge Base page showing the Data source dropdown expanded with all supported connectors: Amazon S3, Confluence, Custom, Google Drive, One Drive, SharePoint, and Web Crawler&lt;/p&gt;
&lt;/div&gt; 
&lt;p&gt;An optimized set of defaults will be presented, allowing you to create your knowledge base in just a few clicks. Once the data is synced, you can integrate the knowledge base with your agent or provide it as a tool for your foundation model and start querying.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Smart Parsing for accurate data ingestion&lt;/strong&gt;&lt;br&gt; One of the key challenges in building knowledge bases is preparing diverse data types for accurate retrieval. Once you point Managed Knowledge Base at your data sources, Smart Parsing automatically determines the optimal parsing strategy for each data type and connector, no extra configuration is required.&lt;/p&gt; 
&lt;p&gt;Smart Parsing combines multiple techniques:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Connector-specific data models&lt;/strong&gt;: Optimized handling for each data source. For example, the Web Crawler connector preserves HTML structure including embedded images and tables, ensuring rich content is not dropped during ingestion. SharePoint connectors maintain document hierarchy and relationships between files.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Multimodal processing&lt;/strong&gt;: Automatic detection and processing of different content types within documents. The system identifies bounding boxes in documents, then sends them to foundation models for data extraction, captioning, and scene description in video files.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Optimized chunking&lt;/strong&gt;: Smart Parsing leverages foundation models to understand document structure and extract meaningful content, ensuring that complex documents with mixed formats are properly indexed. Intelligent defaults balance retrieval accuracy with performance based on document type and content structure, while advanced users can customize chunking strategies when needed.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;This automated approach eliminates weeks of experimentation typically required to achieve production-quality retrieval accuracy, while still preserving the flexibility to customize when needed.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Using Agentic Retriever for complex queries&lt;/strong&gt;&lt;br&gt; After your data is ingested, you can start querying your knowledge base. Generative AI applications often struggle with complex user queries that require reasoning, recursive multi-step retrieval, and intermediate evaluations of results. Consider a user asking two related questions: “What is the cloud infrastructure budget for the ML platform team?” and “Does our expense policy allow prepaying annual commitments?” A single retrieval step might surface documents about the ML platform team but fail to connect the budget information with the expense policy needed to fully answer the question.&lt;/p&gt; 
&lt;div id="attachment_104253" style="width: 3117px" class="wp-caption aligncenter"&gt;
 &lt;img aria-describedby="caption-attachment-104253" loading="lazy" class="wp-image-104253 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/Infographics.png" alt="" width="3107" height="1081"&gt;
 &lt;p id="caption-attachment-104253" class="wp-caption-text"&gt;Picture 3 – Agentic Retriever decomposes complex user queries into a step-by-step plan, performing multi-hop retrieval across multiple knowledge bases and combining results to deliver accurate, grounded responses&lt;/p&gt;
&lt;/div&gt; 
&lt;p&gt;Agentic Retriever solves this by creating a step-by-step query plan: 1. Which team owns the ML platform, and what is their cloud infrastructure budget? 2. What does the expense policy say about prepaying annual commitments? 3. Does the policy allow the ML platform team to prepay against this budget?&lt;/p&gt; 
&lt;p&gt;The system performs multi-hop retrieval and reasoning at each step, and once it has gathered sufficient relevant passages, it stops the search process and returns the top results. By abstracting away the complexity of building a separate multi-hop reasoning pipeline, this approach dramatically improves accuracy for complex queries while letting developers focus on their agentic search applications instead of orchestration logic.&lt;/p&gt; 
&lt;p&gt;You can try Agentic Retriever directly from the test panel of your knowledge base in the Amazon Bedrock AgentCore console. Select &lt;strong&gt;Agentic retrieval only&lt;/strong&gt; as the retrieval type to let the system automatically plan and execute multi-step queries across your knowledge bases:&lt;/p&gt; 
&lt;div id="attachment_104603" style="width: 1810px" class="wp-caption aligncenter"&gt;
 &lt;img aria-describedby="caption-attachment-104603" loading="lazy" class="wp-image-104603 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/15/2026-bedrock-fmkb-3.jpg" alt="" width="1800" height="991"&gt;
 &lt;p id="caption-attachment-104603" class="wp-caption-text"&gt;Picture 4 – Test Knowledge Base panel showing Agentic retrieval with answer generation selected as the retrieval type, with model selection and maximum agentic iterations options&lt;/p&gt;
&lt;/div&gt; 
&lt;p&gt;&lt;strong&gt;Enabling MCP with Bedrock AgentCore&lt;/strong&gt;&lt;br&gt; Amazon Bedrock Managed Knowledge Base seamlessly integrates with AgentCore Gateway as a native target type. This integration eliminates the need for manual integration and provides built-in observability, policy enforcement, and automatic permission management.&lt;/p&gt; 
&lt;p&gt;You can navigate to the Amazon Bedrock AgentCore console or SDK and create an AgentCore Gateway or select an existing one. When adding targets to your gateway, you will find &lt;strong&gt;Knowledge Base&lt;/strong&gt; as a new pre-built target type alongside other options such as MCP server, Lambda ARN, REST API, and other integrations. Simply select your knowledge base ID to expose it through the gateway:&lt;/p&gt; 
&lt;div id="attachment_104604" style="width: 1810px" class="wp-caption aligncenter"&gt;
 &lt;img aria-describedby="caption-attachment-104604" loading="lazy" class="wp-image-104604 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/15/2026-bedrock-fmkb-4.jpg" alt="" width="1800" height="1365"&gt;
 &lt;p id="caption-attachment-104604" class="wp-caption-text"&gt;Picture 5 – Add targets page in AgentCore Gateway showing Knowledge Base as a new pre-built target type, with the knowledge base ID selector and runtime retrieval mode options&lt;/p&gt;
&lt;/div&gt; 
&lt;p&gt;Add targets page in AgentCore Gateway showing Knowledge Base as a new pre-built target type, with the knowledge base ID selector and runtime retrieval mode options&lt;/p&gt; 
&lt;p&gt;Gateway exposes the standard Model Context Protocol (MCP), so the knowledge base tools are automatically discovered by clients from any MCP-compatible framework, including &lt;a href="https://strandsagents.com/"&gt;Strands Agents&lt;/a&gt;, &lt;a href="https://github.com/langchain-ai/langchain"&gt;LangChain&lt;/a&gt;, &lt;a href="https://crewai.com/"&gt;CrewAI&lt;/a&gt;, &lt;a href="https://www.llamaindex.ai/"&gt;LlamaIndex&lt;/a&gt;, and &lt;a href="https://github.com/langchain-ai/langgraph"&gt;LangGraph&lt;/a&gt;. No custom integration code is required.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Model choice and flexibility&lt;/strong&gt;&lt;br&gt; Amazon Bedrock Managed Knowledge Base preserves the flexibility developers expect from Amazon Bedrock. Every foundation model available on Bedrock can power the generation step, and developers can select from different embedding and re-ranking models to optimize retrieval for their specific use case, enabling teams to fine-tune accuracy and cost-performance without changing infrastructure.&lt;/p&gt; 
&lt;p&gt;Unlike managed solutions that lock you into specific model providers, Amazon Bedrock Managed Knowledge Base separates the infrastructure management (connectors, parsing, storage, retrieval orchestration) from model selection. This means you can:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Take advantage of the latest models&lt;/strong&gt;: Adopt the latest embedding, re-ranking, and foundation models as they become available to improve accuracy, latency, and cost for your application without rebuilding your RAG pipeline.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Optimize for price-performance&lt;/strong&gt;: Choose smaller, faster models for simple queries and more capable models for complex reasoning tasks, all using the same knowledge base infrastructure.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Use Bedrock embedding models&lt;/strong&gt;: While Smart Parsing provides optimized defaults, you can configure Bedrock embedding models when your domain requires specialized semantic understanding.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Maintain consistency with existing applications&lt;/strong&gt;: If you’re already using Bedrock Knowledge Bases APIs (&lt;code&gt;Retrieve&lt;/code&gt;, &lt;code&gt;StartIngest&lt;/code&gt;, &lt;code&gt;StopIngest&lt;/code&gt;, &lt;code&gt;IngestKnowledgeBaseDocuments&lt;/code&gt;), Managed Knowledge Base uses the same APIs, so migration requires no code changes, just point to the new knowledge base ID.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;This approach ensures you can spend time on your generative AI application without losing the ability to change models based on evolving requirements or new model capabilities.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Get started today&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Amazon Bedrock Managed Knowledge Base is available today in the US East (N. Virginia), US West (Oregon), Asia Pacific (Sydney, Tokyo), Europe (Dublin, Frankfurt, London), and AWS GovCloud (US-West) Regions. For Regional availability and future roadmap, visit &lt;a href="https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/"&gt;AWS Capabilities by Region&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;With Bedrock Managed Knowledge Base, you pay for what you use with no upfront commitments. Pricing is based on two dimensions: the size of indexed data stored and the number of retrievals performed (on-demand). For detailed pricing information, visit the &lt;a href="https://aws.amazon.com/bedrock/pricing/"&gt;Amazon Bedrock pricing page&lt;/a&gt;. Bedrock is also a part of the &lt;a href="https://aws.amazon.com/free/"&gt;AWS Free Tier&lt;/a&gt; that new AWS customers can use to get started at no cost and explore key AWS services.&lt;/p&gt; 
&lt;p&gt;These capabilities work with any open source framework such as CrewAI, LangGraph, LlamaIndex, and Strands Agents, and with any foundation model. Bedrock services can be used together or independently, and you can get started using your favorite AI-assisted development environment with the &lt;a href="https://awslabs.github.io/mcp/servers/amazon-bedrock-agentcore-mcp-server"&gt;AgentCore open source MCP server&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;To learn more and get started quickly, visit the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/knowledge-base.html"&gt;Bedrock Knowledge Bases Developer Guide&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;Daniel Abib&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&amp;nbsp;Updated on June 19, 2026 &lt;/strong&gt;— Fixed correct screenshots to create a new Managed KB.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Announcing Web Search on Amazon Bedrock AgentCore: Ground your AI agents in current, accurate web knowledge</title>
		<link>https://aws.amazon.com/blogs/aws/announcing-web-search-on-amazon-bedrock-agentcore-ground-your-ai-agents-in-current-accurate-web-knowledge/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 15:00:11 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Bedrock AgentCore]]></category>
		<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">1e4fc761b5a706e81f4b43e635181511d9ca0099</guid>

					<description>AWS introduces Web Search on Amazon Bedrock AgentCore, a fully managed tool that enables agents to ground responses in current, cited web knowledge with zero data egress from customer's secured AWS environment. You can focus on building agents instead of manually adding web search to agents on Bedrock AgentCore and managing its infrastructure.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing the general availability of Web Search on &lt;a href="https://aws.amazon.com/bedrock/agentcore/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock AgentCore&lt;/a&gt;, a fully managed tool that enables agents to ground responses in current, cited web knowledge with zero data egress from customer’s secured AWS environment.&lt;/p&gt; 
&lt;p&gt;Web Search uses a built-in connector target on Bedrock AgentCore Gateway using the Model Context Protocol (MCP). Your agent sends a natural-language query, and Web Search returns most relevant snippets, source URLs, titles, and publication dates that the model can reason over to produce a grounded response.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104652 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/2026-agentcore-websearch-diagram-3.jpg" alt="" width="2532" height="604"&gt;&lt;/p&gt; 
&lt;p&gt;It is built on Amazon’s search infrastructure, informed by years of experience powering agentic search experiences across &lt;a href="https://www.amazon.com/alexaplus/dp/B0CXRRF584?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Alexa+&lt;/a&gt;, &lt;a href="https://aws.amazon.com/quick/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Quick&lt;/a&gt;, and &lt;a href="https://kiro.dev/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Kiro&lt;/a&gt;. It uses a multi-source grounding approach that combines Amazon’s web index with structured knowledge graph data. Beyond standard web results, this gives agents access to Amazon Knowledge Graph with verified facts, helping them retrieve more relevant and accurate responses than traditional web search alone.&lt;/p&gt; 
&lt;p&gt;With this launch, you can focus on building agents instead of manually adding web search to agents on Bedrock AgentCore and managing its infrastructure. Your AI agent looks at user question, retrieves the latest facts, and then takes any necessary action grounded in current developments beyond a model’s training data. You can also meet enterprise governance policies without sending user prompts and retrieval queries to external search API providers outside of AWS.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Web Search on Bedrock AgentCore in action&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; To get started, create the Bedrock AgentCore Gateway with Web Search tool target in the &lt;a href="https://console.aws.amazon.com/bedrock-agentcore/home?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Bedrock AgentCore console&lt;/a&gt;. When the Gateway URL is created, you can interact with API call, Command Line Interface (CLI), or MCP Inspector.&lt;/p&gt; 
&lt;p&gt;To add Web Search tool target when creating the Gateway, choose &lt;strong&gt;MCP target&lt;/strong&gt; as a target protocol and &lt;strong&gt;Connectors&lt;/strong&gt; as a target type. You can select the &lt;strong&gt;Web Search tool&lt;/strong&gt; as a preconfigured target to retrieve most relevant web search results including links, snippets, and metadata.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104633 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/15/2026-agentcore-websearch-add-gateway-target.jpg" alt="" width="1800" height="1970"&gt;&lt;/p&gt; 
&lt;p&gt;After creating your gateway, you can find the Web Search tool target on the detail page of your gateway. You can also add a new Web Search tool target to an existing gateway.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104274" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-agentcore-websearch-add-gateway-detail.png" alt="" width="2406" height="2618"&gt;&lt;/p&gt; 
&lt;p&gt;To interact with Web Search tool, use the sample invocation code in the &lt;strong&gt;View invocation code&lt;/strong&gt; section. You can use code snippets through Python codes with API requests, MCP Python SDK, Strands MCP Client, and MCP Inspector.&lt;/p&gt; 
&lt;p&gt;For example, you can interact with the &lt;a href="https://modelcontextprotocol.io/docs/tools/inspector"&gt;MCP Inspector&lt;/a&gt;, an interactive developer tool for testing and debugging MCP servers.&amp;nbsp;When you connect to the MCP server through the &lt;strong&gt;Gateway resource URL&lt;/strong&gt;, you will find a Web Search tool for each connector target on the Gateway. Enter input the web search query and choose &lt;strong&gt;Run Tool&lt;/strong&gt; to get the results.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104703 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/17/2026-agentcore-websearch-mcp-inspector-1.jpg" alt="" width="1800" height="1258"&gt;&lt;/p&gt; 
&lt;p&gt;To learn more about how to use Web Search on Bedrock AgentCore, visit the &lt;a href="https://docs.aws.amazon.com/bedrock-agentcore/latest/devguide/gateway.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Bedrock AgentCore Gateway documentation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Customer voices&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Some of our customers had early access to this new feature. This is what they shared with us:&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://aws.amazon.com/marketplace/seller-profile?id=seller-2xeeqw6omnqeq&amp;amp;trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;Benchling&lt;/a&gt; helps scientists accelerate R&amp;amp;D, making it easy to centralize scientific data, collaborate across teams, and access insights. Nicholas Larus-Stone, Head of AI Agents at Benchling shared “Scientists using Benchling AI can now ask about a target they’re actively working on and get answers grounded in both their institutional data in Benchling and published literature. The result is more complete science, and hypothesis generation done right. Because we’re using the Web Search tool on Amazon Bedrock AgentCore, customers have a secure, governed environment to bring that high quality published data into their workflows without compromising how they manage their data.”&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://aws.amazon.com/solutions/case-studies/gen-digital-video-case-study/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;Gen Digital&lt;/a&gt; leads consumer and small business cyber safety, offering antivirus, antimalware, identity and privacy protection, virtual private networks, and cloud backup. Iskander Sanchez-Rola, Senior Director of AI &amp;amp; Innovation, Gen Digital shared “With the Web Search tool on Amazon Bedrock AgentCore, Norton Revamp helps professionals build their online reputation with current, grounded content ideas shaped by what’s actually happening in the world today. What we value most is that AWS uses its own search index and keep queries within our trusted AWS environment.”&lt;/p&gt; 
&lt;p&gt;To read more customer stories, visit the &lt;a href="https://aws.amazon.com/bedrock/customers/" target="_blank" rel="noopener noreferrer"&gt;Amazon Bedrock Customers&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Web Search on Amazon Bedrock AgentCore is generally available today in the US East (N. Virginia) Region. For Regional availability and a future roadmap, visit the &lt;a class="c-link" href="https://builder.aws.com/build/capabilities/explore?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer" data-stringify-link="https://builder.aws.com/capabilities/" data-sk="tooltip_parent"&gt;AWS Capabilities by Region&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;You can get started with Web Search on Bedrock AgentCore with no upfront commitments. Pricing is simple and usage-based. You are charged based on the number of search queries your agents submit to the web search. Web Search is priced at $7 per 1,000 queries. New AWS customers also receive up to $200 in Free Tier credits. To learn more, visit the &lt;a href="https://aws.amazon.com/bedrock/agentcore/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock AgentCore pricing&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;Try it in the &lt;a href="https://console.aws.amazon.com/bedrock-agentcore/home?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock AgentCore console&lt;/a&gt;&amp;nbsp;and send feedback to &lt;a href="https://repost.aws/tags/TAaysfWwGaS3SNb1O0i1GkOg/amazon-bedrock-agentcore?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for Amazon Bedrock AgentCore&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://twitter.com/channyun"&gt;Channy&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June 18, 2026&lt;/strong&gt; —&amp;nbsp;Added a clear pricing statement for Web Search in Bedrock AgentCore.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Proactively reduce tech debt autonomously with AWS Transform – continuous modernization (preview)</title>
		<link>https://aws.amazon.com/blogs/aws/proactively-reduce-tech-debt-autonomously-with-aws-transform-continuous-modernization-preview/</link>
					
		
		<dc:creator><![CDATA[Micah Walter]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 14:58:00 +0000</pubDate>
				<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[AWS Transform]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Generative AI]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">de6487c2a98aa63becca18998e2eaf30133784c1</guid>

					<description>AWS Transform – continuous modernization (preview) automatically scans code repositories to detect, prioritize, and remediate technical debt at scale.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing &lt;a href="http://aws.amazon.com/transform/continuous-modernization"&gt;AWS Transform – continuous modernization (preview)&lt;/a&gt;, a new capability of &lt;a href="https://aws.amazon.com/transform/"&gt;AWS Transform&lt;/a&gt; for continuous, autonomous tech debt analysis and remediation at scale. AWS Transform already helps enterprises migrate out of data centers, modernize mainframe and Windows applications, and handle the undifferentiated work of software maintenance: upgrading Java versions, swapping deprecated frameworks, and updating &lt;a href="https://aws.amazon.com/lambda/"&gt;AWS Lambda&lt;/a&gt; runtimes before they reach end of life. This new experience builds on this. Customers get full visibility into the state of their codebase across thousands of repositories, prioritized findings, and the pull requests that make the fixes.&lt;/p&gt; 
&lt;p&gt;Engineering organizations typically consume up to 30% of IT budgets. Customers stitch together point tools: one to detect dependency issues, another to flag vulnerabilities, another for code quality. But no existing tool detects, prioritizes, and remediates tech debt continuously and at scale. The result is a manual, app-by-app cycle that drains engineering capacity. Leaders fall back on self-reported team status that lags reality and hides regressions. AI-assisted development makes this worse: as coding agents accelerate the pace of change, tech debt accumulates faster than developers can keep up. Customers need a capability that detects, prioritizes, and remediates tech debt continuously, autonomously, and at scale.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Continuous analysis&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;To address the visibility challenge, this new capability within AWS Transform automatically scans your code repositories against configurable baselines and generates findings in hours, not weeks. Out of the box, AWS Transform – continuous modernization includes policies for detecting end of life dependencies, deprecated frameworks, and other common sources of technical debt. You can also extend these with your own remediation patterns specific to your organization, including approved libraries, internal coding standards, or tech debt policies your platform team already enforces. For example, if your team has deprecated an internal library or prefers a particular logging pattern, you can codify that as a policy and run it across all your repositories continuously.&lt;/p&gt; 
&lt;p&gt;Unlike periodic manual efforts, continuous analysis provides ground truth directly from your code. When a repository falls behind your baseline, you know immediately, showing which components are behind and by how much, regardless of how the team chooses to address it. This eliminates the need for status check-ins and manual compliance tracking, giving platform teams an always current view of their technical debt landscape.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Autonomous remediation at scale&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Once you’ve identified and prioritized findings, you can configure autonomous remediations that generate pull requests for affected repositories automatically. This new AWS Transform capability provides out-of-the-box remediation transformations for common scenarios such as Java version upgrades, SDK migrations, and library updates. You can also create custom transformations for organization-specific patterns.&lt;/p&gt; 
&lt;p&gt;When you launch a remediation, the continuous modernization capability creates pull requests for each affected repository, notifying the owning team with a message like: “This repository is behind on your organization’s baseline for this dependency. Here’s a PR that resolves it.” Teams can review and merge the PR, or choose to remediate using their own approach. Either way, continuous analysis detects when the fix is in place, providing ground truth without requiring manual confirmation.&lt;/p&gt; 
&lt;p&gt;AWS Transform – continuous modernization integrates with &lt;a href="https://aws.amazon.com/security-agent"&gt;AWS Security Agent&lt;/a&gt; to detect and remediate security vulnerabilities at the source-code level, so security findings flow into the same prioritized list and pull-request workflow as other tech debt.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Let’s try it out&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;To get started with, I navigated to the AWS Transform web application. From the dashboard, I can see an overview of my organization’s repositories and their current status against my configured baselines.&lt;/p&gt; 
&lt;p&gt;First, I connected my source control system and initiated an analysis against my specified policies. Within hours, the analysis returned findings across my repositories, showing which ones were behind the baseline and by how much. I could see the severity, the number of affected files, and the specific tech debt patterns detected.&lt;/p&gt; 
&lt;p&gt;From here, I selected a group of high-priority findings and launched a remediation campaign. AWS Transform – continuous modernization generated pull requests for each affected repository. I could monitor the campaign’s progress in real time, seeing which PRs were created, which were merged, and which repositories returned to compliance.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104654 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/image-01-1024x557.png" alt="" width="1024" height="557"&gt;&lt;/p&gt; 
&lt;p&gt;Image 1: AWS Transform – continuous modernization dashboard showing a portfolio overview of your technical debt findings across all connected repositories.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104655 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/image-02-1024x572.png" alt="" width="1024" height="572"&gt;&lt;/p&gt; 
&lt;p&gt;Image 2: The detailed findings view listing individual tech debt items by severity, category, and repository with their available remediation options.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104656 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/image-03-1024x573.png" alt="" width="1024" height="573"&gt;&lt;/p&gt; 
&lt;p&gt;Image 3: The sources view showing connected repositories from GitHub and local environments that continuous modernization is tracking for analysis.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Faster ways to modernize&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;These capabilities support two distinct approaches to code modernization. In continuous mode, you can use continuous modernization to keep your codebases current as baselines evolve. Think of this as the day-to-day work of upgrading libraries, applying security patches, and enforcing coding standards across your organization.&lt;/p&gt; 
&lt;p&gt;For larger modernization projects, such as migrating from one framework to another or upgrading a major runtime version across hundreds of applications, you can use campaign mode for targeted, project-based modernization. AWS Transform custom continues to provide the flexible primitive for these larger efforts. AWS Transform – continuous modernization is purpose-built for the recurring, high-volume work that platform teams manage every day.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Now available&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;AWS Transform – continuous modernization (preview) is available today. You can get started through the AWS Transform web application, via the AWS Transform Kiro Power, or through MCP and skills for integration with your existing coding agents. To learn more, visit the &lt;a href="https://docs.aws.amazon.com/transform/"&gt;AWS Transform documentation&lt;/a&gt;.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS DevOps Agent adds release management capabilities to assess code changes before production (preview)</title>
		<link>https://aws.amazon.com/blogs/aws/aws-devops-agent-adds-release-management-capabilities-to-assess-code-changes-before-production-preview/</link>
					
		
		<dc:creator><![CDATA[Esra Kayabali]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 14:57:34 +0000</pubDate>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[DevOps]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">6261b6e3f0038f8b71e95bf2f91f996b1505a1e9</guid>

					<description>AWS DevOps Agent now offers release management capability in preview, reviewing code changes for release readiness and running autonomous release testing to help you ship code to production safely and with confidence.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing a new release management capability in &lt;a href="https://aws.amazon.com/devops-agent/"&gt;AWS DevOps Agent&lt;/a&gt; that is now available in preview. AWS DevOps Agent is your always-available teammate that spans software changes and operations across AWS, multicloud, and on-premises environments. The practice of DevOps aims to make software change and operations smooth and increasingly autonomous, and AWS DevOps Agent delivers on both by leveraging its deep understanding of your environment, your services, their dependencies, and how they behave in production. Already generally available for post-deployment operations, it autonomously investigates incidents, provides root cause analysis and mitigation steps, and delivers targeted recommendations to prevent recurring issues. With today’s preview, AWS DevOps Agent adds release readiness review of code changes and autonomous release testing. These new features verify every change against the natural language standards you give to the DevOps Agent and run change-specific tests in production-like environments. AWS DevOps Agent now supports teams from code creation to production, helping reviewers and testers keep pace with the volume of AI-generated code.&lt;/p&gt; 
&lt;p&gt;As development teams adopt AI coding tools, the volume of pull requests moving through delivery pipelines has increased faster than review and testing processes can handle. When teams are under pressure to keep up, reviews are approved without thorough examination, and test environments drift from production. The value that coding agents generate sits waiting in review queues instead of reaching end users. At the same time, AI models are increasingly capable of catching functional and security issues that human reviewers might miss under time pressure, making speedy and safe delivery a requirement rather than a tradeoff.&lt;/p&gt; 
&lt;p&gt;The release readiness review feature evaluates every code change against production requirements, dependency safety, and the standards and best practices you provide to the DevOps Agent. The agent checks cross-repository dependency risks that could affect other services, access control changes against AWS Well-Architected Framework best practices, and compliance with any standards you have defined. When no standards are provided, the agent applies general best practices. As part of the review, the agent also runs your software in an AWS-managed isolated environment, executing lightweight user journey tests to verify the software builds, runs, and passes basic functional checks before the change enters the pipeline. Findings appear in the AWS DevOps Agent console and as comments on pull requests in GitHub or GitLab. You can also invoke reviews directly from your IDE through the Kiro power or Claude Code plugin, so developers can identify and fix dependency risks, standards violations, and access control issues before the change is committed to version control.&lt;/p&gt; 
&lt;p&gt;The autonomous release testing feature goes further, generating and running change-specific test plans for web and API-based applications in customer-provisioned, production-like environments before the change merges. Rather than running a static test suite, the agent reasons about what the change does and constructs tests tailored to it, covering functional correctness, behavioral regressions, and integration scenarios that a manually maintained test plan might not anticipate. Every test run produces structured artifacts including metrics, logs, traces, and an execution summary, giving reviewers a consistent record of what was tested and what the results were.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Getting started with AWS DevOps Agent release management&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;This walkthrough shows how to run an on-demand release readiness review using the AWS DevOps Agent web app. Before you begin, confirm that you have at least one &lt;a href="https://docs.aws.amazon.com/devopsagent/latest/userguide/connecting-to-cicd-pipelines-connecting-github.html"&gt;GitHub&lt;/a&gt; or &lt;a href="https://docs.aws.amazon.com/devopsagent/latest/userguide/connecting-to-cicd-pipelines-connecting-gitlab.html"&gt;GitLab&lt;/a&gt; repository connected to your Agent Space. Once your repositories are connected, AWS DevOps Agent will index your code and build a knowledge graph of cross-repository and cloud dependencies.&lt;/p&gt; 
&lt;p&gt;To open the web app, navigate to the &lt;a href="https://console.aws.amazon.com/aidevops/"&gt;AWS DevOps Agent console&lt;/a&gt;, select your &lt;strong&gt;Agent Space&lt;/strong&gt;, and choose the &lt;strong&gt;Web app&lt;/strong&gt; tab. Choose &lt;strong&gt;Operator access&lt;/strong&gt; to open the web app.&lt;/p&gt; 
&lt;p&gt;Without standards configured, the agent applies general best practices. To tailor reviews to your internal standards, navigate to&amp;nbsp;&lt;strong&gt;Knowledge&lt;/strong&gt;, then choose the&amp;nbsp;&lt;strong&gt;Instructions&lt;/strong&gt;&amp;nbsp;tab. You will see a list of instruction sets, each scoped to a specific agent or task. Choose&amp;nbsp;&lt;strong&gt;View&lt;/strong&gt;&amp;nbsp;next to&amp;nbsp;&lt;strong&gt;Release readiness review&lt;/strong&gt;&amp;nbsp;to edit the instructions for production-readiness change review. Write your internal standards in plain English. For example, you can define infrastructure and data standards on encryption or network access rules, best practices that warn without blocking such as logging and observability requirements, and sensitive data classification best practices that identify applications or resources requiring higher security measures. To apply instructions across all agents in your space, choose&amp;nbsp;&lt;strong&gt;View&lt;/strong&gt;&amp;nbsp;next to&amp;nbsp;&lt;strong&gt;All agents&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104568 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/1214659328598974-4.png" alt="" width="1924" height="967"&gt;&lt;/p&gt; 
&lt;p&gt;You can trigger a release readiness review in two ways: by submitting a pull request to a connected repository, or by entering an on-demand query in the chat interface. To run an on-demand review from chat, choose&lt;strong&gt; New chat&lt;/strong&gt; and enter a request such as:&lt;/p&gt; 
&lt;p&gt;&lt;code&gt;Perform a production risk analysis on my repository branch&lt;/code&gt;&lt;/p&gt; 
&lt;p&gt;The agent will ask for the repository and branch you want to analyze. You can provide a branch name, a pull request number, or a commit SHA. Once you confirm your selection, the agent queues the review and analyzes the change for production risks, including infrastructure impacts, configuration changes, and potential issues.&lt;/p&gt; 
&lt;p&gt;After the review completes, you can ask follow-up questions directly in the chat to explore the findings in more detail. For example, you can ask which downstream consumers a change affects, and the agent will return a structured breakdown of in-repository and cross-repository consumers that will break, the specific files and line numbers affected, and the recommended steps to resolve the issue before deployment.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104569 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/1214659328598974-5.png" alt="" width="1168" height="968"&gt;&lt;/p&gt; 
&lt;p&gt;After submitting a review request, navigate to&amp;nbsp;&lt;strong&gt;Changes&lt;/strong&gt;&amp;nbsp;in the left navigation pane. The&amp;nbsp;&lt;strong&gt;Proposed changes&lt;/strong&gt; table shows each review that has run, including the proposed change description, its source, category, status, and when it was created. You can filter by category or status to find specific reviews, or search by name using the search bar. Choose any entry to open the full execution detail.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104571 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/1214659328598974-6.png" alt="" width="1641" height="516"&gt;&lt;/p&gt; 
&lt;p&gt;The&amp;nbsp;&lt;strong&gt;Timeline&lt;/strong&gt;&amp;nbsp;tab shows the agent’s step-by-step reasoning process, including the tools it called, the dependencies it consulted, and the observations it made at each step. Each entry is timestamped, giving you a complete record of how the agent built its understanding of the change and reached its conclusion.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104572 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/1214659328598974-8.png" alt="" width="1283" height="901"&gt;&lt;/p&gt; 
&lt;p&gt;Choose the&amp;nbsp;&lt;strong&gt;Report&lt;/strong&gt;&amp;nbsp;tab to see the final recommendation. The report opens with a summary header showing the recommended action, the number of critical issues found, the commit revision, and the number of files changed. The recommended action is either&amp;nbsp;&lt;strong&gt;BLOCK&lt;/strong&gt;,&amp;nbsp;&lt;strong&gt;Proceed with Caution&lt;/strong&gt;, or&amp;nbsp;&lt;strong&gt;Safe to Release&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;Below the summary header, the&amp;nbsp;&lt;strong&gt;Analysis&lt;/strong&gt;&amp;nbsp;section explains why the recommendation was made, citing specific risks and the evidence the agent found to support its conclusion. The&amp;nbsp;&lt;strong&gt;Issues&lt;/strong&gt;&amp;nbsp;section lists each finding by severity, giving you a prioritized view of what needs to be addressed before the change can proceed. The&amp;nbsp;&lt;strong&gt;Recommendations&lt;/strong&gt;&amp;nbsp;section provides specific, actionable steps the developer can take to resolve each issue. Finally, the&amp;nbsp;&lt;strong&gt;Changes&lt;/strong&gt;&amp;nbsp;section lists each file that was modified, with the type of change, the category it falls under, and a description of what was changed, so reviewers have a complete picture of what the change does before it merges.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104573 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/1214659328598974-7.png" alt="" width="1289" height="1072"&gt;&lt;/p&gt; 
&lt;p&gt;You can also invoke the autonomous release testing feature directly from the chat interface. To run an autonomous release test on a web or API-based application, choose &lt;strong&gt;New chat&lt;/strong&gt; and enter a query such as:&lt;/p&gt; 
&lt;p&gt;&lt;code&gt;Run a release test on my application deployed at [application URL]&lt;/code&gt;&lt;/p&gt; 
&lt;p&gt;The agent generates a change-specific test plan and executes it in your provisioned environment. Results appear in &lt;b&gt;Changes&lt;/b&gt;, where you can review the execution steps and a structured summary of what was tested.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Get started today&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; The release readiness review and autonomous release testing features for AWS DevOps Agent are available in preview. These features are available at no additional cost during preview in the US East (N. Virginia) Region. For pricing information on other AWS DevOps Agent features, visit the &lt;a href="https://aws.amazon.com/devops-agent/pricing/"&gt;AWS DevOps Agent pricing&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;For configuration details, visit the &lt;a href="https://docs.aws.amazon.com/devopsagent/latest/userguide/getting-started-with-aws-devops-agent.html"&gt;AWS DevOps Agent user guide&lt;/a&gt;.&lt;/p&gt; 
&lt;a href="https://www.linkedin.com/in/esrakayabali/"&gt;— Esra&lt;/a&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS Security Agent adds threat modeling, Kiro power and Claude Code plugin, and more</title>
		<link>https://aws.amazon.com/blogs/aws/aws-security-agent-adds-threat-modeling-kiro-power-and-claude-code-plugin-and-more/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 14:54:55 +0000</pubDate>
				<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Security, Identity, & Compliance]]></category>
		<guid isPermaLink="false">c95df14c0d340b8b3462b012a73af27313033e7a</guid>

					<description>AWS Security Agent now adds STRIDE-based threat modeling, full repo and PR code scanning with remediation across major Git platforms, and IDE integrations via Kiro power, Claude Code plugin, and MCP — letting developers run security reviews and fix issues without context switching.</description>
										<content:encoded>&lt;p&gt;At re:Invent 2025, we &lt;a href="https://aws.amazon.com/blogs/aws/new-aws-security-agent-secures-applications-proactively-from-design-to-deployment-preview/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;previewed&lt;/a&gt; &lt;a href="https://aws.amazon.com/security-agent/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Security Agent&lt;/a&gt; (now part of &lt;a href="http://aws.amazon.com/continuum/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Continuum&lt;/a&gt;), a frontier agent that proactively secures your applications throughout the development lifecycle across all your environments. You can perform on-demand penetration testing customized to your application, discovering and reporting security risks verified through exploitability testing.&lt;/p&gt; 
&lt;p&gt;Since the preview, we announced general availability for &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/03/aws-security-agent-ondemand-penetration/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;on-demand penetration testing&lt;/a&gt; and the preview of &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/05/aws-security-agent-full-repository-code-review/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;full repository code review&lt;/a&gt; that performs deep, context-aware security analysis of your entire codebase.&lt;/p&gt; 
&lt;p&gt;Today, we’re introducing more features based on customer feedback:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Code review updates (Preview)&lt;/strong&gt;: You can now use pull request scanning with remediation, security requirements packs, and simulated validation. New integrations support GitHub, GitLab, Bitbucket, and Confluence.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Threat modeling (Preview)&lt;/strong&gt; : AWS Security Agent analyzes your design documents or application source code, understands the full context of your application architecture and identifies threats with recommended mitigations using the STRIDE framework.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Kiro power, Claude Code plugin, and MCP integration&lt;/strong&gt;: You can run code reviews, generate threat models, and remediate findings directly from your IDE, CLI, or any AI-powered IDE through an open MCP integration, with results surfacing inline without any context switching.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;Let’s take a closer look at each launch!&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104509" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/11/2026-security-agent-updates-home.jpg" alt="" width="1800" height="994"&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Code review updates&lt;/strong&gt;&lt;br&gt; You can now connect to GitLab and Bitbucket in addition to GitHub— supporting both SaaS and self-hosted versions, so you can trigger scans regardless of where code lives. You can also integrate Confluence to reference your existing documentation as context for reviews.&lt;/p&gt; 
&lt;p&gt;To get started, choose &lt;strong&gt;Enable code review&lt;/strong&gt; or update your code review setting in the &lt;a href="https://console.aws.amazon.com/securityagent/agents/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Security Agent console&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104510" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/11/2026-security-agent-updates-1-codereivew-repo.jpg" alt="" width="1800" height="1018"&gt;&lt;/p&gt; 
&lt;p&gt;AWS Security Agent introduces deep, reasoning-based analysis on every pull request as well as full repository to identify complex vulnerabilities that go beyond pattern-matching. It checks against your organizational security requirements and common security risks to catch what other tools can’t. To get started, access the Security Agent web application and run your code review.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104511" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/11/2026-security-agent-updates-1-codereivew-results.jpg" alt="" width="1800" height="1025"&gt;&lt;/p&gt; 
&lt;p&gt;You’ll receive fix commits and remediation guidance directly in your GitHub, GitLab, or Bitbucket workflow, while your security teams configure the repositories to be monitored and intervene on critical issues. AWS Security Agent validates findings in simulated environments to demonstrate proof of exploitability. This embeds security expertise across all repositories, reducing security-related delays in the development pipeline.&lt;/p&gt; 
&lt;p&gt;To learn more about new code review features, visit &lt;a href="https://docs.aws.amazon.com/securityagent/latest/userguide/perform-code-review-scan.html"&gt;Create a code review&lt;/a&gt; in the AWS Security Agent User Guide.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Design review updates&lt;br&gt; &lt;/strong&gt;You can continuously validate your security requirements across every design and code review with managed compliance packs: &lt;a href="https://aws.amazon.com/architecture/well-architected/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Well Architected Framework&lt;/a&gt;, &lt;a href="https://docs.aws.amazon.com/config/latest/developerguide/operational-best-practices-for-nist-csf.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;NIST CSF&lt;/a&gt;, &lt;a href="https://aws.amazon.com/compliance/pci-faqs/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;PCI DSS&lt;/a&gt;, and AWS best practices, or import your own organizational requirements directly from internal documents or Confluence. Every finding maps back to your compliance posture, so teams stay audit-ready as they build.&lt;/p&gt; 
&lt;p class="jss491" data-pm-slice="1 1 []"&gt;To learn more, visit the &lt;a href="https://docs.aws.amazon.com/securityagent/latest/userguide/perform-design-review.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;design review documentation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Threat modeling&lt;/strong&gt;&lt;br&gt; AWS Security Agent generates threat models based on your design documentation or code repository, creates and build context about the application, including data flows, architecture, and trust boundaries. It maps out all components of your application, identifies potential threat actors and attack vectors, determines where weaknesses may exist, and prioritizes threats so you know what to address first.&lt;/p&gt; 
&lt;p&gt;To get started, choose &lt;strong&gt;Enable threat model&lt;/strong&gt; and Connect source code repository in the &lt;a href="https://console.aws.amazon.com/securityagent/agents/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Security Agent console&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104640 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/2026-security-agent-updates-2-thread-model.png" alt="" width="1275" height="827"&gt;&lt;/p&gt; 
&lt;p&gt;To learn more, visit the &lt;a href="https://docs.aws.amazon.com/securityagent/latest/userguide/perform-threat-model.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;threat modeling documentation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Kiro power and Claude Code plugin for Security Agent&lt;/strong&gt;&lt;br&gt; AWS Security Agent introduces a new &lt;a href="https://github.com/AWS-Security-Agent/aws-security-agent-kiro-power"&gt;Kiro power&lt;/a&gt; and &lt;a href="https://claude.com/plugins/aws-agents-for-devsecops"&gt;Claude Code plugin&lt;/a&gt; and can be integrated with any AI IDE through an open MCP integration to secure your applications. You can trigger threat models and code reviews directly from your IDE, with results surfacing inline without any context switching.&lt;/p&gt; 
&lt;p&gt;To get started, install the &lt;a href="https://github.com/AWS-Security-Agent/aws-security-agent-kiro-power"&gt;Kiro power&lt;/a&gt;, and run your prompts. &lt;span class="TextRun SCXW112479336 BCX2" lang="EN-US" xml:lang="EN-US" data-contrast="auto"&gt;&lt;span class="NormalTextRun SCXW112479336 BCX2"&gt;The Kiro power uses the &lt;/span&gt;&lt;/span&gt;&lt;a class="Hyperlink SCXW112479336 BCX2" href="https://github.com/awslabs/mcp/tree/main/src/security-agent-mcp-server" target="_blank" rel="noreferrer noopener"&gt;&lt;span class="TextRun Underlined SCXW112479336 BCX2" lang="EN-US" xml:lang="EN-US" data-contrast="none"&gt;&lt;span class="NormalTextRun SCXW112479336 BCX2" data-ccp-charstyle="Hyperlink"&gt;AWS Security Agent &lt;/span&gt;&lt;span class="FindHit SCXW112479336 BCX2" data-ccp-charstyle="Hyperlink"&gt;MCP&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="TextRun SCXW112479336 BCX2" lang="EN-US" xml:lang="EN-US" data-contrast="auto"&gt;&lt;span class="NormalTextRun SCXW112479336 BCX2"&gt; server&lt;/span&gt;&lt;span class="NormalTextRun SCXW112479336 BCX2"&gt;. You can get started with the power by asking “&lt;/span&gt;&lt;/span&gt;&lt;code&gt;&lt;span data-contrast="auto"&gt;Set up AWS Security Agent&lt;/span&gt;&lt;/code&gt;&lt;span data-ccp-props="{&amp;quot;335559685&amp;quot;:360}"&gt;“. &lt;/span&gt;&lt;span data-contrast="auto"&gt;Kiro will check if you have an Agent Space and ask if you would like to use the existing one or create a new one.&lt;/span&gt;&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104642 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/2026-security-agent-updates-3-kiro-power-1.jpg" alt="" width="1700" height="1057"&gt;&lt;/p&gt; 
&lt;p&gt;&lt;span data-contrast="auto"&gt;With the Kiro power for Security Agent, y&lt;/span&gt;ou can catch vulnerabilities on every pull request as you build and scan an entire repository to surface accumulated risk by asking “&lt;code&gt;&lt;span data-contrast="auto"&gt;Run a full security scan on this repo&lt;/span&gt;&lt;/code&gt;“. The Security Agent power includes an Agent hook to evaluate if a code review diff scan should be started after the Kiro agent has completed its turn. Before deploying to production, you can run a penetration test from your CLI to find what most scanners miss. Security Agent closes the loop by validating every finding and generating ready-to-implement code fixes.&lt;/p&gt; 
&lt;p class="jss491" data-pm-slice="1 1 []"&gt;You can pull the findings back into your development environment by asking “&lt;code&gt;help me remediate my findings&lt;/code&gt;“. The Kiro power for AWS Security Agent will download findings to your local workspace, prioritize the most critical finding, and offer to start a bugfix spec session. You can iterate on fixing the findings using their familiar IDE with their existing tooling, steering, powers, and MCP servers.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104643 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/16/2026-security-agent-updates-3-kiro-power-code-review.jpg" alt="" width="1700" height="1056"&gt;&lt;/p&gt; 
&lt;p&gt;&lt;span data-contrast="auto"&gt;You can also run threat models through the Kiro power in the IDE by asking “&lt;/span&gt;&lt;code&gt;&lt;span data-contrast="auto"&gt;Build a threat model for this application&lt;/span&gt;&lt;/code&gt;&lt;span data-ccp-props="{&amp;quot;335559685&amp;quot;:360}"&gt;“. &lt;/span&gt;&lt;span data-contrast="auto"&gt;The generated threat model is saved to &lt;/span&gt;&lt;code&gt;&lt;span data-contrast="auto"&gt;.security-agent/threat_model.md&lt;/span&gt;&lt;/code&gt;&lt;span data-contrast="auto"&gt;.&amp;nbsp;&lt;/span&gt;&lt;/p&gt; 
&lt;p&gt;To learn more, visit the &lt;a href="https://github.com/AWS-Security-Agent/aws-security-agent-kiro-power"&gt;Kiro power for Security Agent&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; AWS Security Agent understands the full security context across your software development lifecycle by covering design-time security (design reviews and threat modeling in preview), development-time security (code review in preview), and deployment-time security (penetration testing in GA), in a single, unified agentic offering. To learn more, visit the &lt;a href="https://aws.amazon.com/security-agent/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS Security Agent product page&lt;/a&gt; and the &lt;a href="https://docs.aws.amazon.com/securityagent/latest/userguide/what-is.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;technical documentation&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;These features are now available in AWS commercial Regions where AWS Security Agent is available. For Regional availability and the future roadmap, visit the &lt;a class="c-link" href="https://builder.aws.com/build/capabilities/explore?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer" data-stringify-link="https://builder.aws.com/capabilities/" data-sk="tooltip_parent"&gt;AWS Capabilities by Region&lt;/a&gt;. For detailed pricing information and to access our 2-month free trial offer, please visit the &lt;a id="link-self:r8l:" class="awsui_link_4c84z_1hknd_145 awsui_variant-primary_4c84z_1hknd_280 awsui_font-size-body-m_4c84z_1hknd_478" href="https://aws.amazon.com/security-agent/pricing/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer" data-analytics-type="eventDetail" data-analytics="link-ga-pricing-page" data-awsui-analytics="{&amp;quot;action&amp;quot;:&amp;quot;click&amp;quot;,&amp;quot;detail&amp;quot;:{&amp;quot;label&amp;quot;:{&amp;quot;root&amp;quot;:&amp;quot;self&amp;quot;},&amp;quot;external&amp;quot;:&amp;quot;true&amp;quot;,&amp;quot;href&amp;quot;:&amp;quot;https://aws.amazon.com/security-agent/pricing/&amp;quot;},&amp;quot;component&amp;quot;:{&amp;quot;name&amp;quot;:&amp;quot;awsui.Link&amp;quot;,&amp;quot;label&amp;quot;:{&amp;quot;root&amp;quot;:&amp;quot;self&amp;quot;},&amp;quot;properties&amp;quot;:{&amp;quot;variant&amp;quot;:&amp;quot;primary&amp;quot;}}}" aria-label="AWS Security Agent pricing page (Opens in a new tab)" data-analytics-funnel-value="link:r8k:"&gt;AWS Security Agent pricing page&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;Give it a try in the &lt;a href="https://console.aws.amazon.com/securityagent/agents/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Security Agent console&lt;/a&gt; and send feedback to &lt;a href="https://repost.aws/tags/TAujmCOJj0TSykKSuMsqRwZQ/aws-security-agent?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for Security Agent&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://linkedin.com/in/channy"&gt;Channy&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June 18, 2026&lt;/strong&gt; –&amp;nbsp;&lt;a href="https://claude.com/plugins/aws-agents-for-devsecops"&gt;AWS Agents for DevSecOps&lt;/a&gt;, the Claude Code plugin for AWS DevOps Agent and AWS Security Agent is launched.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Amazon S3 annotations: attach rich, queryable context directly to your objects</title>
		<link>https://aws.amazon.com/blogs/aws/amazon-s3-annotations-attach-rich-queryable-context-directly-to-your-objects/</link>
					
		
		<dc:creator><![CDATA[Daniel Abib]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 23:13:10 +0000</pubDate>
				<category><![CDATA[Amazon Simple Storage Service (S3)]]></category>
		<category><![CDATA[AWS Summit New York]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Storage]]></category>
		<guid isPermaLink="false">d825f9d469e21e76b22344398328d967de0264d2</guid>

					<description>Amazon S3 now lets you attach up to 1 GB of rich, mutable, and queryable context directly to your objects using annotations, purpose-built for AI agents and autonomous workflows that need to discover, understand, and act on data at scale without maintaining separate metadata systems.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing a new metadata capability for &lt;a href="https://aws.amazon.com/s3/"&gt;Amazon Simple Storage Service (Amazon S3)&lt;/a&gt; called annotations, enabling you to attach rich, large-scale business context directly to your objects. You can store up to 1,000 named annotations per object, each up to 1 MB in size, totaling up to 1 GB per object, in flexible formats like JSON, XML, YAML, or plain text. You can modify or delete an annotation at any time, without re-writing your objects, making it easy to keep your object context current.&lt;/p&gt; 
&lt;p&gt;Organizations are building AI agents and autonomous workflows that need to find, understand, and act on data without human intervention. To support these agentic workflows, you need metadata that can evolve alongside the data, scale to petabytes of objects, and remain queryable without expensive retrieval.&lt;/p&gt; 
&lt;p&gt;With S3 annotations, you can store context such as AI-generated transcripts, content ratings, or technical specifications directly alongside your objects. Your context moves automatically with the object during copy, replication, and cross-region transfers, and S3 removes it when you delete the object. When you enable &lt;a href="https://aws.amazon.com/s3/features/metadata/"&gt;S3 Metadata&lt;/a&gt;, annotations automatically flow into fully managed annotation tables that you can query with &lt;a href="https://aws.amazon.com/athena/"&gt;Amazon Athena&lt;/a&gt; and other analytics engines.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Common use cases&lt;/strong&gt;&lt;br&gt; Annotations solve complex metadata challenges across industries:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Media &amp;amp; Entertainment&lt;/strong&gt;: Track transcripts, content moderation results, subtitle files, and licensing metadata as separate annotations on video assets, eliminating the need to synchronize metadata across multiple media asset management systems.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Financial Services&lt;/strong&gt;: Attach AI-generated investment summaries and sentiment analysis to research documents, enabling autonomous research agents to discover relevant datasets through natural-language queries without maintaining separate metadata databases.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Life Sciences&lt;/strong&gt;: Annotate clinical trial data with regulatory status, patient cohort details, and approval chains, making compliance audits faster while keeping full context accessible for archived data in Amazon S3 Glacier storage classes without retrieval charges.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;How annotations address metadata challenges&lt;/strong&gt;&lt;br&gt; Amazon S3 already supports several ways to describe your objects. System-defined metadata captures properties like size and storage class. Object tags support operational tasks like access control and lifecycle management. User-defined metadata lets you add small amounts of custom information at upload time.&lt;/p&gt; 
&lt;p&gt;While these capabilities work well for their intended purposes, they have limitations when you need to attach much richer context without building and maintaining separate metadata systems. Annotations address these needs by providing metadata capabilities at a fundamentally different scale and flexibility, offering mutable, queryable context per object compared to 10 immutable tags or 2 KB of headers.&lt;/p&gt; 
&lt;table style="border: 2px solid black;border-collapse: collapse;margin-left: auto;margin-right: auto;width: 100%"&gt; 
 &lt;tbody&gt; 
  &lt;tr style="border-bottom: 1px solid black;background-color: #e0e0e0"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Capability&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Max size&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Mutable?&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="padding: 4px;text-align: center"&gt;&lt;strong&gt;Best for&lt;/strong&gt;&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;System-defined metadata&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Fixed&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;No&lt;/td&gt; 
   &lt;td style="padding: 4px;text-align: center"&gt;Object properties (size, storage class, creation time)&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;User-defined metadata&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2 KB&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;No (set at upload)&lt;/td&gt; 
   &lt;td style="padding: 4px;text-align: center"&gt;Small custom key-value pairs&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Object tags&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;10 tags, 128/256 characters per key/value&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Yes&lt;/td&gt; 
   &lt;td style="padding: 4px;text-align: center"&gt;Access control, lifecycle rules, cost allocation&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Annotations&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;1 GB (1,000 × 1 MB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;Yes&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="padding: 4px;text-align: center"&gt;&lt;strong&gt;Rich business context (JSON, XML, YAML, plain text)&lt;/strong&gt;&lt;/td&gt; 
  &lt;/tr&gt; 
 &lt;/tbody&gt; 
&lt;/table&gt; 
&lt;p&gt;Today, metadata describing S3 objects often lives in separate databases or sidecar files, requiring complex synchronization workflows that can exceed data storage costs. When you enable S3 Metadata annotation tables, this context becomes queryable at scale through Amazon Athena. AI agents can discover your data through natural language with the &lt;a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/storage-lens-s3-tables-ai-tools.html"&gt;S3 Tables MCP server&lt;/a&gt;, which provides a standardized interface for AI models to query your annotations. You can query annotations for objects in any storage class, without restoring the objects or paying retrieval charges.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Getting started with annotations&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; To start using annotations, make sure your &lt;a href="https://aws.amazon.com/iam/"&gt;AWS Identity and Access Management (IAM)&lt;/a&gt; policy or bucket policy grants permissions for the &lt;code&gt;s3:PutObjectAnnotation&lt;/code&gt; and &lt;code&gt;s3:GetObjectAnnotation&lt;/code&gt; actions. You can then add annotations to any existing or new S3 object using the &lt;code&gt;PutObjectAnnotation&lt;/code&gt; API.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104564 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/13/2026-s3-annotations-add.png" alt="" width="1244" height="650"&gt;&lt;/p&gt; 
&lt;p&gt;For example, a media company can attach technical specifications and AI-produced summaries to a video asset using the &lt;a href="https://aws.amazon.com/cli/"&gt;AWS Command Line Interface (AWS CLI)&lt;/a&gt;:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;# Create a JSON file with technical metadata
cat &amp;gt; mediainfo.json &amp;lt;&amp;lt; 'EOF'
{"codec":"H.265","resolution":"3840x2160","audio_tracks":8,"frame_rate":29.97}
EOF

# Attach it as an annotation
aws s3api put-object-annotation \
  --bucket my-media-bucket \
  --key videos/documentary-2026.mp4 \
  --annotation-name mediainfo \
  --annotation-payload ./mediainfo.json
&lt;/code&gt;&lt;/pre&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;# Attach a plain-text AI-generated summary as a separate annotation
echo "A 90-minute nature documentary covering wildlife migration patterns across three continents, featuring aerial footage and underwater sequences. Languages: English, Spanish, Portuguese." &amp;gt; ai_summary.txt

aws s3api put-object-annotation \
  --bucket my-media-bucket \
  --key videos/documentary-2026.mp4 \
  --annotation-name ai_summary \
  --annotation-payload ./ai_summary.txt
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;These commands attach two separate annotations to the same video object. The &lt;code&gt;mediainfo&lt;/code&gt; annotation stores structured technical specifications as JSON, while the &lt;code&gt;ai_summary&lt;/code&gt; annotation stores a text description. Each annotation is identified by a unique name, and you can read and modify each one independently. With unique names for each annotation, you can use different annotations to support multiple concurrent enrichment workflows, for example, one team adding technical metadata while another team adds content classifications, without interfering with each other.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104565 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/13/2026-s3-annotations-list.png" alt="" width="1169" height="604"&gt;&lt;/p&gt; 
&lt;p&gt;Retrieve a specific annotation using the &lt;code&gt;GetObjectAnnotation&lt;/code&gt; API:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws s3api get-object-annotation \
  --bucket my-media-bucket \
  --key videos/documentary-2026.mp4 \
  --annotation-name mediainfo \
  ./mediainfo-output.json
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;To see all annotations attached to an object, use the &lt;code&gt;ListObjectAnnotations&lt;/code&gt; API:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws s3api list-object-annotations \
  --bucket my-media-bucket \
  --key videos/documentary-2026.mp4
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;When you no longer need a specific annotation, remove it using the &lt;code&gt;DeleteObjectAnnotation&lt;/code&gt; API:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws s3api delete-object-annotation \
  --bucket my-media-bucket \
  --key videos/documentary-2026.mp4 \
  --annotation-name mediainfo
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;You can update an existing annotation at any time by calling &lt;code&gt;PutObjectAnnotation&lt;/code&gt; again with the same annotation name. For large objects uploaded using multipart upload, attach annotations after completing the multipart upload using the &lt;code&gt;PutObjectAnnotation&lt;/code&gt; API.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Querying annotations at scale with S3 Metadata tables&lt;/strong&gt;&lt;br&gt; Attaching annotations to individual objects is useful, but the real power comes when you query across all your annotations at scale. When you enable S3 Metadata annotation tables on your bucket, S3 automatically indexes your annotations into a fully managed &lt;a href="https://iceberg.apache.org/"&gt;Apache Iceberg&lt;/a&gt; table, called an annotation table. You can query annotation tables with Amazon Athena or any Iceberg-compatible engine.&lt;/p&gt; 
&lt;p&gt;To enable annotation tables, use the S3 console or the &lt;code&gt;CreateBucketMetadataConfiguration&lt;/code&gt; API. The following example creates a new metadata configuration with annotation tables enabled while keeping journal tables for change tracking and disabling the live inventory table:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-json"&gt;{
  "JournalTableConfiguration": {
    "RecordExpiration": { "Expiration": "DISABLED" }
  },
  "InventoryTableConfiguration": { "ConfigurationState": "DISABLED" },
  "AnnotationTableConfiguration": {
    "ConfigurationState": "ENABLED",
    "Role": "arn:aws:iam::123456789012:role/S3MetadataAnnotationRole"
  }
}
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;This configuration tells S3 to automatically capture all your annotations in a queryable table. Once applied, any annotation you attach to objects in this bucket will appear in the table within approximately one hour.&lt;/p&gt; 
&lt;p&gt;If the bucket already has a metadata configuration, use the &lt;code&gt;UpdateBucketMetadataAnnotationTableConfiguration&lt;/code&gt; API:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;aws s3api update-bucket-metadata-annotation-table-configuration \
  --bucket my-media-bucket \
  --annotation-table-configuration '{"ConfigurationState":"ENABLED","Role":"arn:aws:iam::123456789012:role/S3MetadataAnnotationRole"}'
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Once enabled, your annotations automatically flow into the annotation table. Journal tables update in near real time, while annotation tables refresh within an hour. Unlike traditional metadata tables that require predefined schemas, annotation tables automatically adapt to any JSON, XML, or YAML structure you write. Each annotation becomes a row in the table with its content stored in a &lt;code&gt;text_value&lt;/code&gt; column, letting you query across all annotations without schema migrations.&lt;/p&gt; 
&lt;p&gt;If you enable annotation tables on a bucket that already has annotated objects, S3 automatically backfills existing annotations into the table. The backfill process runs in the background and can take several hours to days depending on the number of objects.&lt;/p&gt; 
&lt;p&gt;For example, to find all video assets with more than 8 audio tracks across your entire bucket using Amazon Athena:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-sql"&gt;SELECT DISTINCT bucket, object_key
FROM "s3tablescatalog/aws-s3"."b_my_media_bucket"."annotation"
WHERE name = 'mediainfo'
AND CAST(json_extract_scalar(text_value, '$.audio_tracks') AS INTEGER) &amp;gt; 8
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;This query scans the annotation table for all annotations named &lt;code&gt;mediainfo&lt;/code&gt;, extracts the &lt;code&gt;audio_tracks&lt;/code&gt; field from the JSON content, and returns objects where the count exceeds 8.&lt;/p&gt; 
&lt;p&gt;Or to find all objects that received new annotations in the last 24 hours through the journal table:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-sql"&gt;SELECT bucket, key, version_id, record_timestamp, annotation.name
FROM "s3tablescatalog/aws-s3"."b_my_media_bucket"."journal"
WHERE record_timestamp &amp;gt;= (current_date - interval '1' day)
AND annotation.name IS NOT NULL
AND record_type IN ('CREATE_ANNOTATION', 'DELETE_ANNOTATION')
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;This query uses the journal table to track annotation changes in near real time, which is ideal for building event-driven workflows that respond to new or deleted annotations.&lt;/p&gt; 
&lt;p&gt;You can also use natural language to search objects by their annotations using agents in &lt;a href="https://aws.amazon.com/sagemaker/unified-studio/"&gt;Amazon SageMaker Unified Studio&lt;/a&gt; or any IDE with the S3 Tables MCP server. For example, asking “find all PG-rated movies with Spanish subtitles from 2023” returns results in seconds instead of the hours it would take querying multiple disconnected systems.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Get started today&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; You can start using Amazon S3 annotations today in all AWS Regions, including the AWS China Regions. Annotation tables are available in all AWS Regions where S3 Metadata is available.&lt;/p&gt; 
&lt;p&gt;Whether you’re building AI agents that need to discover data autonomously, managing petabytes of media assets with complex metadata, or tracking compliance context for archived datasets, annotations give you the scale and flexibility to attach rich metadata directly to your objects without managing separate systems.&lt;/p&gt; 
&lt;p&gt;Annotation storage is always billed at S3 Standard rates, even if the parent object is in S3 Glacier or another storage class. For full pricing details, visit the &lt;a href="https://aws.amazon.com/s3/pricing/"&gt;Amazon S3 pricing page&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;To learn more and get started, visit the &lt;a href="https://aws.amazon.com/s3/features/metadata/"&gt;Amazon S3 Metadata overview page&lt;/a&gt;&amp;nbsp;and the &lt;a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/annotations.html"&gt;Amazon S3 documentation&lt;/a&gt;. Send feedback to &lt;a href="https://repost.aws/tags/TADSTjraA0Q4-a1dxk6eUYaw/amazon-simple-storage-service"&gt;AWS re:Post for S3&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;Daniel Abib&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS WAF adds AI traffic monetization capability to help content owners charge AI bots for content access</title>
		<link>https://aws.amazon.com/blogs/aws/aws-waf-adds-ai-traffic-monetization-capability-to-help-content-owners-charge-ai-bots-for-content-access/</link>
					
		
		<dc:creator><![CDATA[Esra Kayabali]]></dc:creator>
		<pubDate>Mon, 15 Jun 2026 20:42:13 +0000</pubDate>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[AWS WAF]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Security, Identity, & Compliance]]></category>
		<guid isPermaLink="false">b888b8d60a4fd57cdcb13b3f7ef4d08b578e7e5d</guid>

					<description>AWS WAF launches AI traffic monetization, a new Bot Control capability that enables content providers and publishers price, meter, and collect payment from AI bots and agents accessing their content and APIs. AWS WAF now lets you set a price for that access, accept payment through third-party providers, and grant scoped access directly at the edge.</description>
										<content:encoded>&lt;p&gt;AWS WAF now includes AI traffic monetization capability that gives digital content owners and publishers a way to charge AI bots and agents for access to protected web content directly at the network edge. The capability helps content owners and publishers set per-request pricing by content path, bot category, or verification tier without modifying their origin infrastructure or writing application code. Content owners can define granular access policies per agent type, collect payments in stablecoins to their preferred wallet, and monitor revenue and bot activity from a single dashboard.&lt;/p&gt; 
&lt;p&gt;AI bot traffic now accounts for more than 50% of web traffic for many content providers, with AI-specific crawlers growing more than 300% year-over-year. Unlike traditional search engine crawlers, which index content and return measurable referral traffic back to publisher websites, AI bots consume the same content to generate summaries and responses in AI interfaces, with little to no traffic sent back to the original source. Publishers bear the infrastructure costs of serving that traffic without the page views, ad impressions, or subscription conversions that typically offset those costs. &lt;a href="https://docs.aws.amazon.com/waf/latest/developerguide/waf-bot-control.html"&gt;AWS WAF Bot Control&lt;/a&gt; already gives customers visibility into bot activity and the ability to block or rate-limit traffic, but setting pricing and collecting payment from AI agents has not been possible until now. AI traffic monetization is a new Bot Control capability that closes that gap, giving content owners and publishers a way to configure pricing rules directly through the AWS WAF console and collect payments from AI agents through third-party payment integrations, without building custom payment infrastructure or negotiating individual licensing agreements. Payment settlement and verification flows are provided by Coinbase’s x402 Facilitator. Integration with Stripe for direct account payments and Machine Payments Protocol (MPP) support is coming soon.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Getting Started with AI Traffic Monetization&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Before configuring monetization, confirm that AWS WAF Bot Control is enabled at Common or Targeted level on the web ACL associated with your CloudFront distribution. Bot Control provides the agent classification that monetization rules depend on. If you have not set this up yet, visit &lt;a href="https://docs.aws.amazon.com/waf/latest/developerguide/waf-bot-control-rg-using.html"&gt;Adding the AWS WAF Bot Control managed rule group to your web ACL&lt;/a&gt; documentation. In the AWS Management Console, go to&lt;strong&gt; WAF &amp;amp; Shield&lt;/strong&gt; and choose &lt;strong&gt;Protection packs (web ACLs)&lt;/strong&gt; in the left navigation pane to get started.&lt;/p&gt; 
&lt;p&gt;A protection pack is the core configuration unit for AI traffic monetization. It defines which content paths are monetized, what each agent verification tier is charged, which payment methods you accept, and what license terms apply. To create one, choose &lt;strong&gt;Create protection pack (web ACL)&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104464 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/1213987938308918-0e.png" alt="" width="1924" height="2626"&gt;&lt;/p&gt; 
&lt;p&gt;In&amp;nbsp;&lt;strong&gt;Tell us about your app&lt;/strong&gt;, select one or more app categories that describe your content (for example, Content &amp;amp; publishing systems, E-commerce &amp;amp; transaction platforms, or Enterprise &amp;amp; business applications), and choose an &lt;strong&gt;App focus&lt;/strong&gt;. AWS WAF uses these selections to recommend suitable security protections for your configuration.&lt;/p&gt; 
&lt;p&gt;In&amp;nbsp;&lt;strong&gt;Select resources to protect&lt;/strong&gt;, choose&amp;nbsp;&lt;strong&gt;Add resources&lt;/strong&gt;&amp;nbsp;to associate regional or global resources such as CloudFront distributions with this protection pack. You can skip this step and add resources later.&lt;/p&gt; 
&lt;p&gt;In&amp;nbsp;&lt;strong&gt;Choose initial protections&lt;/strong&gt;, select from AWS WAF managed rule packages based on your app category and resource selections. You can also choose individual rules instead of packages.&lt;/p&gt; 
&lt;p&gt;In&amp;nbsp;&lt;strong&gt;Name and describe&lt;/strong&gt;, provide a name and optional description for the protection pack.&lt;/p&gt; 
&lt;p&gt;Optionally, expand&amp;nbsp;&lt;strong&gt;Customize protection pack (web ACL)&amp;nbsp;&lt;/strong&gt;to configure additional settings including pricing tiers, payment methods, content scope, and license terms.&lt;/p&gt; 
&lt;p&gt;When finished, choose&amp;nbsp;&lt;strong&gt;Create protection pack (web ACL)&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;Once your protection pack is in place, review the AI traffic analysis dashboard to understand the impact of AI bot traffic on your content before setting your pricing strategy. In the WAF &amp;amp; Shield console, go to &lt;strong&gt;AI traffic analysis &lt;/strong&gt;in the left navigation pane. Select your protection pack (web ACL) from the dropdown to populate the dashboard.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104466 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/1213987938308918-1a.png" alt="" width="1928" height="1787"&gt;&lt;/p&gt; 
&lt;p&gt;The AI traffic analysis dashboard breaks down traffic into four categories visible in the bot traffic overview panel: &lt;strong&gt;All bot requests&lt;/strong&gt;, &lt;strong&gt;AI bot requests&lt;/strong&gt;, &lt;strong&gt;Verified AI bot traffic&lt;/strong&gt;, and &lt;strong&gt;Unverified AI bot traffic&lt;/strong&gt;. The dashboard surfaces infrastructure impact metrics including bandwidth consumed, estimated monthly cost, and peak request rates. A per-path heatmap shows which content paths receive the most AI bot activity by hour, giving you the data you need to make informed pricing decisions.&lt;/p&gt; 
&lt;p&gt;AWS WAF Bot Control classifies over 650 distinct AI bot and agent types including GPTBot, Claude-Web, and Perplexity-Bot, and assigns each a verification tier:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Verified&lt;/strong&gt;: Agent identity confirmed through Web Bot Auth (WBA) Ed25519 cryptographic signature, or sourced from a documented IP range with a known set of user-agents and domain names.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Unverified&lt;/strong&gt;: Agent recognized through user-agent matching, behavioral fingerprinting, and IP reputation, but identity not cryptographically confirmed.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;Once you have reviewed your traffic patterns, return to&amp;nbsp;&lt;strong&gt;Protection packs (web ACLs)&lt;/strong&gt;, select your protection pack from the list, and choose&amp;nbsp;&lt;strong&gt;Configure AI monetization&lt;/strong&gt;&amp;nbsp;from the right panel to set pricing and access policies. Each protection pack defines the pricing, agent policies, accepted payment methods, and license terms that apply to a defined set of content paths. You can create multiple protection packs and apply different pricing to different content zones within the same distribution. Once created, associate the protection pack with your web ACL by opening the web ACL and choosing&amp;nbsp;&lt;strong&gt;Add protection pack&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;For each agent verification tier within the pack, you can assign one of six actions: &lt;strong&gt;Monetize&lt;/strong&gt;&amp;nbsp;(return a 402 with pricing),&amp;nbsp;&lt;strong&gt;Allow&lt;/strong&gt;&amp;nbsp;(grant free access),&amp;nbsp;&lt;strong&gt;Block&lt;/strong&gt; (deny access entirely), &lt;strong&gt;Count&lt;/strong&gt; (log without charging), &lt;strong&gt;CAPTCHA &lt;/strong&gt;(present a puzzle to verify a human sender), or &lt;strong&gt;Challenge &lt;/strong&gt;(run a silent check to verify the client is a browser, not a bot).&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104528 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/12/1213987938308918-3a.png" alt="" width="1279" height="1531"&gt;&lt;/p&gt; 
&lt;p&gt;In the&amp;nbsp;&lt;strong&gt;Edit monetization configuration&lt;/strong&gt;&amp;nbsp;page, configure the following:&lt;/p&gt; 
&lt;p&gt;Under&lt;strong&gt;&amp;nbsp;Payment settlement&lt;/strong&gt;, select one or more blockchain networks for stablecoin payments. Any wallet address on the supported networks is accepted, whether self-managed or hosted by a wallet provider such as Coinbase. For each network, provide your wallet address and set a&amp;nbsp;&lt;strong&gt;Base price per page&lt;/strong&gt;&amp;nbsp;in USDC. You can add multiple networks using&amp;nbsp;&lt;strong&gt;Add network&lt;/strong&gt;. AWS does not process payments or take a fee on content revenue; disbursement is self-managed or managed by your wallet provider.&lt;/p&gt; 
&lt;p&gt;When a &lt;strong&gt;Monetize&lt;/strong&gt; rule matches an incoming request, AWS WAF returns an HTTP 402 Payment Required response. The response body contains a machine-readable price manifest in JSON format using the x402 open protocol for machine-to-machine payments. The manifest includes the content price in USDC, accepted blockchain networks such as Base and Solana, the destination wallet address, the maximum payment timeout, and the payment scheme.&lt;/p&gt; 
&lt;p&gt;Any x402-compatible agent runtime can complete this flow autonomously. The client submits a signed payment authorization on their payment network of choice. AWS WAF verifies it, fetches the content, integrates with third-party facilitator services for settling the payment on-chain, and serves the response.&lt;/p&gt; 
&lt;p&gt;Note that the &lt;strong&gt;Monetize&lt;/strong&gt; action is supported exclusively for web ACLs associated with Amazon CloudFront distributions. Adding a &lt;strong&gt;Monetize&lt;/strong&gt; rule to a regional web ACL is not supported.&lt;/p&gt; 
&lt;p&gt;Since the&amp;nbsp;&lt;strong&gt;Currency mode&lt;/strong&gt;&amp;nbsp;toggle is available directly in the monetization configuration page, you can switch between &lt;strong&gt;Real&lt;/strong&gt; and &lt;strong&gt;Test&lt;/strong&gt; mode at any time. Before going live, use test mode on non-production traffic to validate pricing, wallet configuration, and x402 payment flows. Note that test mode still enforces x402 payments, but those payments can be made on testnets such as Base Sepolia or Solana Devnet using test funds obtained from faucets such as faucet.circle.com. To activate test mode, toggle &lt;strong&gt;Currency mode&lt;/strong&gt;&amp;nbsp;to&amp;nbsp;&lt;strong&gt;Test&lt;/strong&gt;&amp;nbsp;in your protection pack configuration. AWS WAF returns real price manifests and runs the full payment flow identically to production on the configured test chain. All events are logged with&amp;nbsp;&lt;code&gt;CurrencyMode: TEST&lt;/code&gt;. When satisfied with the configuration, toggle&amp;nbsp;Currency mode&amp;nbsp;back to&amp;nbsp;Real&amp;nbsp;to begin processing real payments.&lt;/p&gt; 
&lt;p&gt;Once you have switched&amp;nbsp;&lt;strong&gt;Currency mode&lt;/strong&gt;&amp;nbsp;to&amp;nbsp;&lt;strong&gt;Real&lt;/strong&gt;, navigate to &lt;strong&gt;AI access monetization&lt;/strong&gt; in the left navigation pane to track monetization outcomes in real time. Note that the &lt;strong&gt;AI access monetization&lt;/strong&gt; dashboard only reflects activity from real currency mode and does not display test transactions.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104468 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/1213987938308918-2b.png" alt="" width="1924" height="1906"&gt;&lt;/p&gt; 
&lt;p&gt;The&amp;nbsp;Revenue&amp;nbsp;dashboard shows&amp;nbsp;&lt;strong&gt;Total revenue&lt;/strong&gt;, revenue broken down by&amp;nbsp;&lt;strong&gt;Verified bots&lt;/strong&gt;&amp;nbsp;and&amp;nbsp;&lt;strong&gt;Unverified bots&lt;/strong&gt;, and&amp;nbsp;&lt;strong&gt;Avg. per request. &lt;/strong&gt;The&lt;strong&gt;&amp;nbsp;Top revenue sources&lt;/strong&gt;&amp;nbsp;panel groups earnings by bot category, and the&amp;nbsp;AI access patterns&amp;nbsp;panel ranks content paths by revenue generated. Use the&amp;nbsp;&lt;strong&gt;Settlements&lt;/strong&gt;&amp;nbsp;tab to reconcile payments by provider and review payment method distribution and failed payment attempts.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Now Available&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; AI traffic monetization is available now for Amazon CloudFront customers at no additional charge beyond standard AWS WAF pricing. The capability is available in all edge locations where AWS WAF web ACLs are associated with Amazon CloudFront distributions.&lt;/p&gt; 
&lt;p&gt;To learn more about AI traffic monetization, see the&amp;nbsp;&lt;a href="https://docs.aws.amazon.com/waf/latest/developerguide/waf-ai-traffic-monetization.html"&gt;AWS WAF Developer Guide&lt;/a&gt;.&lt;/p&gt; 
&lt;a href="https://www.linkedin.com/in/esrakayabali/"&gt;— Esra&lt;/a&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS Weekly Roundup: AWS FinOps Agent in preview, Gemma 4 on Bedrock, Kiro Pro Max, and more (June 15, 2026)</title>
		<link>https://aws.amazon.com/blogs/aws/aws-weekly-roundup-aws-finops-agent-in-preview-gemma-4-on-bedrock-kiro-pro-max-and-more-june-15-2026/</link>
					
		
		<dc:creator><![CDATA[Esra Kayabali]]></dc:creator>
		<pubDate>Mon, 15 Jun 2026 11:41:48 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Graviton]]></category>
		<category><![CDATA[Kiro]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Week in Review]]></category>
		<guid isPermaLink="false">d4a483b5504b40c977cc5d689275cac967902536</guid>

					<description>This week, New York City is hosting AWS Summit, bringing together builders, customers, and AWS teams for a full day of announcements, demos, and technical sessions at the Javits Center. I wrote blog posts for some of the Summit launches, so I am excited to see them go live this week. I just won’t be […]</description>
										<content:encoded>&lt;p&gt;This week, New York City is hosting &lt;a href="https://aws.amazon.com/events/summits/new-york/"&gt;AWS Summit&lt;/a&gt;, bringing together builders, customers, and AWS teams for a full day of announcements, demos, and technical sessions at the Javits Center. I wrote blog posts for some of the Summit launches, so I am excited to see them go live this week. I just won’t be watching from the Javits Center. I’ll be at a four-day music festival, following the launches on my phone while trying to figure out how to put up a tent. If you weren’t able to attend in person like me, the keynote &lt;a href="https://pages.awscloud.com/aws-summit-nyc-livestream-2026-registration.html"&gt;livestream&lt;/a&gt; is available on June 17, with Dr. Swami Sivasubramanian, VP of Agentic AI, and Chet Kapoor, VP of Security Services and Observability, covering new capabilities across developer tools, AI infrastructure, and security.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="alignnone wp-image-104579 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/14/WIR-Why1.5d9838d88ff23b99b4fe14be6598b68ef4493215.png" alt="" width="800" height="533"&gt;&lt;/p&gt; 
&lt;p&gt;Here’s what happened this week.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Headlines&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; &lt;a href="https://aws.amazon.com/blogs/machine-learning/how-frontier-teams-are-reinventing-ai-native-development/"&gt;How frontier teams are reinventing AI-native development&lt;/a&gt; – Swami published a detailed post this week drawing on data from experiments across hundreds of Amazon engineering teams. The findings are worth reading carefully if you are thinking about how to structure AI adoption on your own team.&lt;/p&gt; 
&lt;p&gt;A six-engineer team rebuilt the Amazon Bedrock inference engine in 76 days, a project originally scoped for 30 developers over 12 to 18 months. The median productivity gain across structured pilots with Amazon Stores teams was 4.5x in normalized deployment velocity, with some teams exceeding 10x. Perfect Order Experience went from a two-week feature cycle to shipping in an afternoon. WW Grocery cut design document creation from five days to a few hours.&lt;/p&gt; 
&lt;p&gt;The post distills these results into five practices for becoming a frontier team. First, invest in agent context: build steering files, coding standards, and structured repositories before writing production code. Second, expect an initial slowdown while workflows are restructured, and push through it. Third, maintain a steady backlog of well-scoped tasks so agents can run in parallel without constant supervision. Fourth, make intent explicit through structured specifications before code generation begins. Fifth, shift testing left so agents can self-correct before code reaches the pipeline.&lt;/p&gt; 
&lt;p&gt;The post closes with a note that commit velocity is only part of the picture, and that a follow-up will cover release management, operations, security operations, and EOL upgrades.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-finops-agent-preview/"&gt;AWS FinOps Agent is now available in preview&lt;/a&gt; – AWS FinOps Agent is a new agent for FinOps practitioners and engineering teams that answers cost questions, surfaces optimization opportunities, investigates cost anomalies, and runs recurring FinOps workflows on a defined schedule. You can use it to query your AWS costs, generate cost reports for finance and engineering teams, and surface rightsizing, idle resource, and Savings Plans recommendations from AWS Cost Optimization Hub and AWS Compute Optimizer. The agent can open Jira tickets on your behalf based on those recommendations. When a cost anomaly is detected, FinOps Agent can automatically investigate the root cause and post findings to a Slack channel.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Last week’s launches&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; I’ll start with one I wrote this week, then cover the other launches that caught my attention:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/now-available-amazon-ec2-m9g-and-m9gd-instances-powered-by-new-aws-graviton5-processors/"&gt;Amazon EC2 M9g and M9gd instances are now generally available&lt;/a&gt; – Powered by AWS Graviton5 processors and built on the sixth-generation AWS Nitro System, M9g instances deliver up to 25% better compute performance compared to Graviton4-based instances, with up to 35% faster performance for web applications, up to 35% for machine learning inference, and up to 30% for databases. Graviton5 is the first processor in the AWS fleet to support PCIe Gen6 and DDR5-8800 memory, and includes a 5x larger L3 cache compared to the previous generation. M9g and M9gd instances offer up to 15% higher network bandwidth and 20% higher Amazon EBS bandwidth on average across sizes compared to M8g. This release also introduces the Nitro Isolation Engine, an enhancement to the Nitro System that uses formal verification to provide mathematically proven isolation between virtual machines — establishing Nitro as the first formally verified cloud hypervisor. M9gd instances add up to 11.4 TB of NVMe SSD local storage with 30% higher IOPS compared to M8gd. Both instance types support Instance Bandwidth Configuration (IBC) for adjusting bandwidth allocation between EBS and VPC networking by up to 25%.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/anthropic-claude-fable-5-on-aws-mythos-class-capabilities-with-built-in-safeguards-now-available/"&gt;Anthropic Claude Fable 5 on Amazon Bedrock&lt;/a&gt; – Claude Fable 5 launched on Amazon Bedrock on June 9, bringing extended asynchronous task execution, advanced vision capabilities across diagrams, charts, and PDFs, and proactive self-verification. Access requires opting into data sharing via the Data Retention API before invoking the model; Anthropic requires 30-day retention of inputs and outputs for Mythos-class models. &lt;strong&gt;Important note on availability:&lt;/strong&gt; On June 12, Anthropic asked AWS to revoke access to Claude Fable 5 and Claude Mythos 5 for all users to support compliance with a US Government export control directive. All other models, including Opus 4.8, are unaffected. Read the &lt;a href="https://www.anthropic.com/news/fable-mythos-access"&gt;Anthropic statement&lt;/a&gt; for details. AWS will share further updates as they become available.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/gemma-4-amazon-bedrock/"&gt;Gemma 4 models are now available on Amazon Bedrock&lt;/a&gt; – The Gemma 4 family from Google DeepMind is now available on Amazon Bedrock across three variants: Gemma 4 31B (dense, 256K-token context window, suited for reasoning and coding workloads), Gemma 4 26B-A4B (mixture-of-experts architecture, targeting cost- and latency-sensitive workloads), and Gemma 4 E2B (smallest variant, designed for low-latency interactive use cases). All three support native function calling, structured output, reasoning, response streaming, multimodal input across text, image, video, and audio, and more than 35 languages.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/opensearch-agentic-observability-mcp-app/"&gt;Amazon OpenSearch Service launches MCP Apps for agentic observability&lt;/a&gt; – Amazon OpenSearch Service now supports MCP Apps, enabling observability workflows inside compatible agentic IDEs including Claude Desktop and VS Code. An AI agent in your local environment can investigate incidents using logs, traces, metrics, and alerts stored in OpenSearch domains, collections, and Amazon Managed Service for Prometheus. Each MCP App tool call returns a dual response: a text summary for the agent to reason over and an interactive visualization rendered in the same conversation thread. Available MCP App tools cover log, metrics, and trace investigation; service performance; topology; dynamic visualizations; agent health; cluster health; and instrumentation scoring.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Other AWS news&lt;/strong&gt;&lt;/span&gt;&lt;br&gt; Here are some additional posts and updates you may find useful:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/developer/aws-cli-v1-maintenance-mode-announcing-changes-to-dependency-updates/"&gt;AWS CLI v1 enters maintenance mode&lt;/a&gt; – When CLI v1 enters maintenance mode, the botocore and s3transfer dependencies will be vendored directly into the CLI v1 codebase rather than installed as separate packages. This means upgrading CLI v1 will no longer update the standalone botocore or s3transfer packages, and installing those packages independently will have no effect on the versions used by CLI v1. Environments with both CLI v1 and boto3 installed will contain separate copies of these libraries. New CLI v1 releases will be limited to critical bug fixes and security issues. The recommended path is to migrate to AWS CLI v2.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-workload-credentials-provider/"&gt;AWS Workload Credentials Provider is now available&lt;/a&gt; – AWS Workload Credentials Provider is a lightweight client-side provider that automates deployment of exported certificates from AWS Certificate Manager (ACM) and local caching of secrets from AWS Secrets Manager across AWS and non-AWS workloads. Previously, customers had to build custom EventBridge automation to detect certificate renewals and deploy updates, which becomes harder to maintain as public certificate lifetimes decrease per the CA/Browser Forum mandate. The provider handles this automatically, runs on Windows and Linux, and supports Apache and NGINX. For secrets, it maintains full backwards compatibility with the AWS Secrets Manager Agent.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://kiro.dev/blog/kiro-pro-max/"&gt;Kiro Pro Max is now available&lt;/a&gt; – Kiro has introduced a new Pro Max tier, adding higher usage limits, access to the latest frontier models, and additional agentic capabilities for development teams. Kiro Pro Max is designed for professional developers who need sustained, high-volume use across coding, specification generation, and agent-driven tasks.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Upcoming AWS events&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Check your calendar and sign up for upcoming AWS events:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/events/summits/"&gt;AWS Summits&lt;/a&gt;: AWS Summits are free in-person events covering cloud and AI. Coming up: &lt;a href="https://aws.amazon.com/events/summits/new-york/"&gt;New York City&lt;/a&gt; (June 17), &lt;a href="https://aws.amazon.com/events/summits/hongkong/"&gt;Hong Kong&lt;/a&gt; (June 17), &lt;a href="https://aws.amazon.com/events/summits/shanghai/"&gt;Shanghai&lt;/a&gt; (June 23-24), &lt;a href="https://aws.amazon.com/jp/events/summits/japan/"&gt;Japan&lt;/a&gt; (June 25), &lt;a href="https://aws.amazon.com/events/summits/washington-dc/"&gt;Washington, D.C.&lt;/a&gt; (June 30 – July 1), &lt;a href="https://aws.amazon.com/tw/events/summits/taipei/"&gt;Taipei&lt;/a&gt; (July 15), and &lt;a href="https://aws.amazon.com/es/events/summits/bogota/"&gt;Bogotá&lt;/a&gt; (July 30).&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/events/community-day/"&gt;AWS Community Days&lt;/a&gt;: Community-led conferences planned and delivered by community leaders. Upcoming events include &lt;a href="https://awscommunitydayeast.ca/"&gt;Montreal, Canada&lt;/a&gt; (June 20), &lt;a href="https://www.midwestcommunityday.com/"&gt;Indianapolis, USA&lt;/a&gt; (June 24), &lt;a href="https://2026-summer.awscommunityday.cn/"&gt;Hangzhou, China&lt;/a&gt; (June 28), &lt;a href="https://acd.awsugblr.in/"&gt;Bengaluru, India&lt;/a&gt; (July 11), and &lt;a href="https://communityday.awscmr.com/en"&gt;Yaoundé, Cameroon&lt;/a&gt; (July 25).&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;Visit the &lt;a href="https://builder.aws.com/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;amp;sc_channel=el"&gt;AWS Builder Center&lt;/a&gt; to meet other builders, contribute solutions, and find resources that help you keep building. You can also browse upcoming &lt;a href="https://aws.amazon.com/events/explore-aws-events/?refid=e61dee65-4ce8-4738-84db-75305c9cd4fe"&gt;AWS-led in-person and virtual events&lt;/a&gt;, plus &lt;a href="https://builder.aws.com/connect/events?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;amp;sc_channel=el"&gt;developer-focused sessions&lt;/a&gt;.&lt;/p&gt; 
&lt;a href="https://www.linkedin.com/in/esrakayabali/"&gt;— Esra&lt;/a&gt; 
&lt;p&gt;&lt;em&gt;This post is part of our Weekly Roundup series. Check back each week for a quick roundup of interesting news and announcements from AWS!&lt;/em&gt;&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors</title>
		<link>https://aws.amazon.com/blogs/aws/now-available-amazon-ec2-m9g-and-m9gd-instances-powered-by-new-aws-graviton5-processors/</link>
					
		
		<dc:creator><![CDATA[Esra Kayabali]]></dc:creator>
		<pubDate>Wed, 10 Jun 2026 15:04:30 +0000</pubDate>
				<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Compute]]></category>
		<category><![CDATA[Graviton]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">33118acc6752a7a014784d7b663da10b02d2d1aa</guid>

					<description>AWS launches Amazon EC2 M9g and M9gd instances, powered by AWS Graviton5 processors. AWS Graviton5 is most powerful, and most energy eﬃcient processor AWS has ever built, and oﬀers up to 25% better compute performance compared to Graviton4-based instances.</description>
										<content:encoded>&lt;p&gt;AWS Graviton processors have improved steadily across generations, with each iteration delivering advances in compute performance, price-performance, and energy efficiency. At re:Invent 2025, we &lt;a href="https://aws.amazon.com/about-aws/whats-new/2025/12/ec2-m9g-instances-graviton5-processors-preview/"&gt;announced&lt;/a&gt; Amazon EC2 M9g, the first Graviton5-powered instances, in preview. Since then, customers have tested M9g across a wide range of workloads and shared their results. &lt;a href="https://clickhouse.com/"&gt;ClickHouse&lt;/a&gt; saw a 36% performance boost compared to M8g, with zero code changes. &lt;a href="https://www.honeycomb.io/blog/graviton5-honeycomb-per-service-results-m8g-m9g-migration"&gt;Honeycomb&lt;/a&gt; achieved 36% better throughput per core compared to Graviton4, across a 6-month A/B test of production observability workloads.&amp;nbsp;&lt;a href="https://www.hubspot.com/"&gt;HubSpot&lt;/a&gt; deployed M9g for MySQL databases and saw query duration drop by up to 60%.&lt;/p&gt; 
&lt;p&gt;Today, M9g instances are generally available, alongside the new M9gd instances for customers who need high-speed, low-latency local NVMe SSD storage. Both are powered by &lt;a href="https://aws.amazon.com/ec2/graviton/"&gt;Graviton5&lt;/a&gt;, the most powerful and most energy efficient processor AWS has ever built.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/1213311671976503-0-G5_Angled.jpg"&gt;&lt;img loading="lazy" class="alignnone size-full wp-image-104435" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/1213311671976503-0-G5_Angled.jpg" alt="" width="2560" height="1710"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;While many Arm-based instances have been introduced across the industry, no one comes close to the breadth and depth of the AWS Graviton footprint. After five generations of custom silicon and eight years of continuous investment, Graviton powers over 350 instance types serving more than 120,000 customers, from startups to large enterprises, a robust ISV partner ecosystem, and a broad set of managed services.&lt;/p&gt; 
&lt;p&gt;You can use Graviton for a broad variety of workloads, including web applications, microservices, analytics, databases, machine learning (ML) inference, electronic design automation (EDA), gaming, and video encoding. As workloads grow more compute-intensive and data-driven, many have asked for more processing power, along with greater network and storage bandwidth to move more data and complete workloads faster. We’ve also designed these instances to efficiently package compute, memory, and I/O to maximize energy utilization.&lt;/p&gt; 
&lt;p&gt;As AI shifts from answering questions to taking actions, running code, using tools, evaluating results, and orchestrating multi-step tasks, the demand for CPU compute is growing rapidly. Graviton5 is built for this shift. With 192 cores, a 5x larger L3 cache, up to 33% lower inter-core latency, and DDR5 memory delivering high bandwidth, Graviton5 helps agents spend less time waiting on CPU-bound steps, processing more instructions, handling large numbers of concurrent environments, and keeping accelerators moving.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://www.aboutamazon.com/news/aws/meta-aws-graviton-ai-partnership"&gt;Meta&lt;/a&gt; is deploying Graviton at scale starting with tens of millions of cores to support its agentic AI efforts, making Meta one of the largest Graviton customers in the world. Agentic AI workloads, including real-time reasoning, code generation, and the orchestration of multi-step tasks, are CPU-intensive and benefit from the higher compute performance, larger caches, higher memory bandwidth, and core density in Graviton5.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;What’s new in M9g&amp;nbsp;and M9gd&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;Built on the sixth-generation &lt;a href="https://aws.amazon.com/ec2/nitro/"&gt;AWS Nitro System&lt;/a&gt;, M9g instances are powered by AWS Graviton5 processors that deliver higher compute performance, larger caches, and improved memory and I/O scalability compared to Graviton4 processors. Graviton5 offers up to 25% better compute performance compared to Graviton4-based instances, with up to 35% faster performance for web applications, up to 35% for machine learning inference, and up to 30% for databases. As the first CPU in the AWS fleet to support the latest generation of PCIe Gen6 and DDR5-8800 memory, AWS Graviton5 instances deliver the fastest memory of any processor instances in the cloud, and 5 times more L3 cache compared to the previous generation.&amp;nbsp;These improvements also come with better energy efficiency, helping you meet sustainability targets without compromising capability.&lt;/p&gt; 
&lt;p&gt;Networking and storage bandwidth have been expanded to keep pace with compute growth. M9g and M9gd instances offer up to 15% higher network bandwidth and 20% higher &lt;a href="https://aws.amazon.com/ebs/?nc2=type_a"&gt;Amazon Elastic Block Store (Amazon EBS)&lt;/a&gt; bandwidth on average across sizes, with up to twice the network bandwidth for the largest instance size. M9g and M9gd instances also support &lt;a href="https://docs.aws.amazon.com/ebs/latest/userguide/instance-bandwidth-configuration.html"&gt;Instance Bandwidth Configuration (IBC)&lt;/a&gt;, a feature that helps you adjust the allocation of bandwidth between Amazon EBS and &lt;a href="https://aws.amazon.com/vpc/"&gt;Amazon Virtual Private Cloud (Amazon VPC)&lt;/a&gt; networking for an Amazon EC2 instance by up to 25%. IBC can help optimize performance for workloads with specific bandwidth requirements, such as database read and write performance, query processing, and logging. These enhancements support faster data movement and improved throughput for workloads that rely on high I/O performance.&lt;/p&gt; 
&lt;p&gt;Security and isolation are foundational requirements for running workloads in the cloud. Within the Nitro System, the AWS Nitro Hypervisor is designed to isolate instances from each other as well as AWS operators. With M9g and M9gd instances we are raising the bar on security even further with the introduction of Nitro Isolation Engine. Nitro Isolation Engine is an enhancement to the Nitro System, which enforces isolation of instances and harnesses formal verification to provide assurances of isolation with mathematical precision. Nitro Isolation Engine is a purpose-built component that is responsible for enforcing isolation between virtual machines, including mediation of all access to virtual machine memory, CPU register state, and I/O devices through a minimal set of APIs. Nitro Isolation Engine leverages formal verification, a technique to mathematically demonstrate that the hardware or software behaves as intended, and not just in specific test cases. This intensive verification technique establishes Nitro as the first formally verified cloud hypervisor, pioneering a new standard for mathematically proven cloud security. To learn more about the Nitro Isolation Engine, &lt;a href="https://aws.amazon.com/blogs/compute/aws-nitro-isolation-engine-formally-verifying-the-hypervisor-in-the-aws-nitro-system/"&gt;visit the blog post here&lt;/a&gt;. For details on the formal verification results, including scope and assumptions, &lt;a href="https://d1.awsstatic.com/onedam/marketing-channels/website/aws/en_US/whitepapers/compliance/nitro-isolation-engine-whitepaper.pdf"&gt;see our technical white paper&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;M9g instances provide one vCPU for every four GiB of memory and are well suited for a broad range of general-purpose workloads, including application servers, microservices, midsize data stores, gaming servers, caching fleets, containerized applications, large-scale Java applications, code repositories, web applications, and agentic AI.&lt;/p&gt; 
&lt;p&gt;For workloads that need high-speed, low-latency local storage, M9gd instances provide up to 11.4 TB of NVMe SSD storage and 30% higher IOPS and storage performance compared to Graviton4-based M8gd instances. M9gd instances are well suited for general-purpose workloads that require a balance of compute and memory with high-speed, low-latency local storage, including application servers, microservices, gaming servers, midsize key-value data stores, caching fleets, data logging, media processing, batch and log processing, and applications that need temporary storage such as caches and scratch files.&lt;/p&gt; 
&lt;p&gt;Here are the key specifications across the family:&lt;/p&gt; 
&lt;table style="border: 2px solid black;border-collapse: collapse;margin-left: auto;margin-right: auto"&gt; 
 &lt;tbody&gt; 
  &lt;tr style="border-bottom: 1px solid black;background-color: #e0e0e0"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;M9g&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;vCPUs&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;Memory (GiB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;Network bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td&gt;&lt;strong&gt;EBS bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;medium&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;4&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;large&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;4&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;2xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;4xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;8xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;128&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;12xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;48&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;25&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;18&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;16xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;256&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;34&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;24&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;24xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;96&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;384&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;50&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;36&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;48xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;100&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;72&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;metal-48xl&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;100&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;72&lt;/td&gt; 
  &lt;/tr&gt; 
 &lt;/tbody&gt; 
&lt;/table&gt; 
&lt;p&gt;M9gd instances include local NVMe SSD storage. The table below shows the instance storage for each size. Compute, memory, network, and EBS bandwidth specifications are the same as M9g.&lt;/p&gt; 
&lt;table style="border: 2px solid black;border-collapse: collapse;margin-left: auto;margin-right: auto"&gt; 
 &lt;tbody&gt; 
  &lt;tr style="border-bottom: 1px solid black;background-color: #e0e0e0"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;M9gd&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;vCPUs&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;Memory (GiB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;Instance storage (GB)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px"&gt;&lt;strong&gt;Network bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
   &lt;td&gt;&lt;strong&gt;EBS bandwidth (Gbps)&lt;/strong&gt;&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;medium&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;4&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 59 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;large&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;2&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 118 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;4&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 237 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 15&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;2xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;8&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 475 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;4xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;16&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 950 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;Up to 12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;8xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;32&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;128&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 1900 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;17&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;12&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;12xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;48&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;3 x 950 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;25&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;18&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;16xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;64&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;256&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;1 x 3800 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;34&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;24&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;24xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;96&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;384&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;3 x 1900 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;50&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;36&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;48xlarge&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;3 x 3800 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;100&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;72&lt;/td&gt; 
  &lt;/tr&gt; 
  &lt;tr style="border-bottom: 1px solid black"&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;&lt;strong&gt;metal-48xl&lt;/strong&gt;&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;192&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;768&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;3 x 3800 NVMe SSD&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;100&lt;/td&gt; 
   &lt;td style="border-right: 1px solid black;padding: 4px;text-align: center"&gt;72&lt;/td&gt; 
  &lt;/tr&gt; 
 &lt;/tbody&gt; 
&lt;/table&gt; 
&lt;p&gt;&lt;strong&gt;&lt;span style="text-decoration: underline"&gt;Now available&lt;/span&gt;&lt;br&gt; &lt;/strong&gt;M9g and M9gd instances are available in the US East (N. Virginia), US East (Ohio), US West (Oregon), and Europe (Frankfurt) Regions. M9g and M9gd instances are available for purchase through &lt;a href="https://aws.amazon.com/savingsplans/"&gt;Savings Plans&lt;/a&gt;, On-Demand, Spot Instances, Dedicated Instances, or Dedicated Hosts. For more information, visit &lt;a href="https://aws.amazon.com/ec2/pricing/"&gt;Amazon EC2 pricing&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;To get started with M9g and M9gd instances, several resources are available. The&amp;nbsp;&lt;a href="https://github.com/aws/aws-graviton-getting-started"&gt;AWS Graviton Getting Started Guide&lt;/a&gt;&amp;nbsp;is a technical guide covering how to build, run, and optimize workloads on Graviton-based instances. The&amp;nbsp;&lt;a href="https://docs.aws.amazon.com/guidance/latest/cloud-intelligence-dashboards/graviton-savings-dashboard.html"&gt;Graviton Savings Dashboard&lt;/a&gt; helps you track and measure the cost savings from running workloads on Graviton-based instances. &lt;a href="https://aws.amazon.com/blogs/compute/migrating-your-java-applications-to-aws-graviton-using-aws-transform-custom/"&gt;AWS Transform&lt;/a&gt;&amp;nbsp;is an AI-powered service that automates code transformations for migrating Java applications from x86 to Graviton-based Amazon EC2 instances, handling compatibility analysis, automated recompilation, dependency updates, and validation.&lt;/p&gt; 
&lt;p&gt;To learn more about Graviton-based instances, visit&amp;nbsp;&lt;a href="https://aws.amazon.com/ec2/graviton/"&gt;AWS Graviton Processors&lt;/a&gt;&amp;nbsp;or &lt;a href="https://aws.amazon.com/ec2/graviton/level-up-with-graviton/"&gt;Level up your compute with AWS Graviton&lt;/a&gt;.&lt;/p&gt; 
&lt;a href="https://www.linkedin.com/in/esrakayabali/"&gt;— Esra&lt;/a&gt; 
&lt;p&gt;Editor’s Note 6/12/26: Added information about Nitro Isolation Engine&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Anthropic Claude Fable 5 on AWS: Mythos-class capabilities with built-in safeguards now available</title>
		<link>https://aws.amazon.com/blogs/aws/anthropic-claude-fable-5-on-aws-mythos-class-capabilities-with-built-in-safeguards-now-available/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Tue, 09 Jun 2026 17:40:31 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">d81a50b183e178e2f0a8e9fa40d25b0b0fb5b92d</guid>

					<description>AWS announces the availability of Claude Fable 5 on Amazon Bedrock and Claude Platform on AWS. Claude Fable 5 delivers Mythos-level capabilities available to all customers, with strong safeguards designed to make it safe for broader use.</description>
										<content:encoded>&lt;p&gt;&lt;b&gt;Updated on June, 12, 2026 – Claude Fable 5 and Claude Mythos 5 on Amazon Bedrock access unavailable&lt;/b&gt;&lt;br&gt; To support compliance with the US Government export control directive, Anthropic has asked AWS to revoke access to Claude Fable 5 and Claude Mythos 5 for all users. All other models, including Opus4.8, are not affected and you can continue using them in full confidence. Please view the &lt;a href="https://www.anthropic.com/news/fable-mythos-access" target="_blank" rel="noopener noreferrer"&gt;Anthropic statement&lt;/a&gt; for further details.&lt;/p&gt; 
&lt;hr&gt; 
&lt;p&gt;Today, we’re announcing the availability of &lt;a href="https://www.anthropic.com/news/claude-fable-5-mythos-5"&gt;Claude Fable 5&lt;/a&gt; on &lt;a href="https://aws.amazon.com/bedrock/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock&lt;/a&gt; and &lt;a href="https://aws.amazon.com/claude-platform/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Claude Platform on AWS&lt;/a&gt;. Claude Fable 5 makes Mythos-level capabilities available to customers, with strong safeguards designed to make it safe for broader use. Fable 5 is state-of-the-art on nearly all tested benchmarks and delivers exceptional performance in software engineering, knowledge work tasks, and vision – built for ambitious, long running work.&lt;/p&gt; 
&lt;p&gt;With Claude Fable 5 on Bedrock, you can build within your existing AWS environment and scale inference workloads. You can also use Claude Fable 5 through the Claude Platform on AWS, giving you Anthropic’s native platform experience.&lt;/p&gt; 
&lt;p&gt;According to Anthropic, Claude Fable 5 represents a step-change in what you can accomplish with AI models. Here is what makes this model different:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Long-running, asynchronous execution&lt;/strong&gt;: Claude Fable 5 handles complex tasks that previous models could not sustain, executing coding and knowledge work tasks for extended periods without intervention.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Advanced vision capabilities&lt;/strong&gt;: Claude Fable 5 understands diagrams, charts, and tables nested in files and PDFs. This opens up research and document-heavy work in finance, legal, analytics, architecture, and gaming. In coding, the model implements designs with high fidelity and uses vision to critique its output against goals.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Proactive self-verification&lt;/strong&gt;: The model updates its own skills based on learnings and develops its own harnesses and evaluations.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;Claude Fable 5 includes safeguards that limit its performance in specific areas where misuse risk is elevated. Harmful prompts related to cybersecurity, biology, chemistry, and health fall back to receive a response from Opus 4.8 instead. Anthropic is able to expand access to nearly all of Claude Fable 5’s state-of-the-art capabilities by developing more powerful safeguards. The same model without these limits is &lt;a href="https://www.anthropic.com/news/claude-fable-5-mythos-5"&gt;Claude Mythos 5&lt;/a&gt; and it will only be available to a small group of vetted customers.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Claude Fable 5 model in action&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; You can use Claude Fable 5 in both Amazon Bedrock and Claude Platform on AWS. This post covers guidance on how to access and use on Amazon Bedrock. For guidance on the Claude Platform on AWS, visit the &lt;a href="https://docs.aws.amazon.com/claude-platform/latest/userguide/welcome.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;documentation&lt;/a&gt; to learn more.&lt;/p&gt; 
&lt;p&gt;To get started with Amazon Bedrock, you can access the model programmatically now using the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-anthropic-claude-messages.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;Anthropic Messages API&lt;/a&gt; to call the &lt;code&gt;bedrock-runtime&lt;/code&gt; or &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoints through Anthropic SDK. You can also keep using the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/inference-api.html" target="_blank" rel="noopener noreferrer"&gt;Invoke&lt;/a&gt; and &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;Converse API&lt;/a&gt; on &lt;code&gt;bedrock-runtime&lt;/code&gt; through the &lt;a href="https://aws.amazon.com/cli/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;AWS Command Line Interface (AWS CLI)&lt;/a&gt; and &lt;a href="https://aws.amazon.com/developer/tools/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;AWS SDK&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Configure data retention setting&lt;br&gt; &lt;/strong&gt;In order to access Claude Fable 5 model, you must opt into data sharing by using the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/data-retention.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Data Retention API&lt;/a&gt; and setting &lt;code&gt;provider_data_share&lt;/code&gt; before you can invoke the models. There is no console user interface for this setting at launch.&lt;/p&gt; 
&lt;p&gt;This mode allows Amazon Bedrock to retain and share your inference data with model providers per their requirements. Anthropic requires 30-day inputs and outputs retention, as well as human review. To learn more, visit the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/abuse-detection.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock abuse detection&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;Here is a sample script to set data retention for the &lt;code&gt;bedrock-mantle&lt;/code&gt; engine.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;curl -X PUT https://bedrock-mantle.us-east-1.api.aws/v1/data_retention \
  -H "x-api-key: &amp;lt;your-bedrock-api-key&amp;gt;" \ 
  -H "Content-Type: application/json" \
  -d '{ "mode": "provider_data_share" }'&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;If you want to use the &lt;code&gt;bedrock-runtime&lt;/code&gt; engine, run this sample script.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;curl -X PUT https://bedrock.us-east-1.amazonaws.com/data-retention \
  -H "Authorization: Bearer &amp;lt;your_bearer_token&amp;gt;" \
  -H "Content-Type: application/json" \
  -d '{ "mode": "provider_data_share" }'&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;&lt;strong&gt;Updated on Jun 10, 2026&lt;/strong&gt; — You can also use AWS SigV4 (Signature Version 4) to call the data retention API. Configure your AWS CLI or AWS SDK using environment variables.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;export AWS_ACCESS_KEY_ID=your_access_key_id
export AWS_SECRET_ACCESS_KEY=your_secret_access_key
export AWS_SESSION_TOKEN=your_session_token&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;First, retrieve your current Bedrock data retention settings.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;curl -s https://bedrock.us-east-1.amazonaws.com/data-retention \
  --aws-sigv4 "aws:amz:us-east-1:bedrock" \
  --user "$AWS_ACCESS_KEY_ID:$AWS_SECRET_ACCESS_KEY" \
  -H "x-amz-security-token: $AWS_SESSION_TOKEN"&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;This should return something like this: &lt;code&gt;{"mode":"inherit","updatedAt":null}&lt;/code&gt; and update the data retention settings.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;curl -s -X PUT https://bedrock.us-east-1.amazonaws.com/data-retention \
  --aws-sigv4 "aws:amz:us-east-1:bedrock" \
  --user "$AWS_ACCESS_KEY_ID:$AWS_SECRET_ACCESS_KEY" \
  -H "x-amz-security-token: $AWS_SESSION_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"mode":"provider_data_share"}'&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;If everything worked as expected, you should receive a response like: &lt;code&gt;{"mode":"provider_data_share","updatedAt":"2026-06-10T16:51:39.331Z"}&lt;/code&gt;.&lt;/p&gt; 
&lt;p&gt;The latest AWS CLI supports configuring the data retention setting. Set your bearer API key as an environment variable after you generate a API key in the &lt;a href="https://console.aws.amazon.com/bedrock/home?region=us-east-1#/api-keys" target="_blank" rel="noopener noreferrer"&gt;Bedrock console&lt;/a&gt;.&lt;/p&gt; 
&lt;div&gt; 
 &lt;pre&gt;&lt;code class="lang-bash"&gt;export AWS_BEARER_TOKEN_BEDROCK=bedrock-api-key-XXXXXXXXXX&lt;/code&gt;&lt;/pre&gt; 
 &lt;p&gt;Run the following CLI command to use the Claude Fable 5 model.&lt;/p&gt; 
 &lt;pre&gt;&lt;code class="lang-bash"&gt;aws bedrock put-account-data-retention \ 
  --mode provider_data_share&lt;/code&gt;&lt;/pre&gt; 
&lt;/div&gt; 
&lt;p&gt;To learn more, visit the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/data-retention.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Data Retention API&lt;/a&gt; on the Amazon Bedrock User Guide.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;How to use the Claude Fable 5 model&lt;br&gt; &lt;/strong&gt;Let’s start with Anthropic SDK for Python using the Messages API on &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoint. Install Anthropic SDK.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;pip install anthropic&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Here is a sample Python code to call Claude Fable 5 model:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-python"&gt;import anthropic

client = anthropic.Anthropic(
    base_url="https://bedrock-mantle.us-east-1.api.aws/anthropic",
    api_key= &amp;lt;your-bedrock-api-key&amp;gt;
)

message = client.messages.create( 
     model="anthropic.claude-fable-5", 
	 max_tokens=4096, 
	 messages=[ 
	     { "role": "user", 
		   "content": "Design a distributed architecture on AWS in Python that should support 100k requests per second across multiple geographic regions", 
		 }, 
	 ], 
)

print(message.content[0].text)&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;To learn more, check out &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/api-inference-examples-claude-messages-code-examples.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Anthropic Messages API code examples&lt;/a&gt; and &lt;a href="https://github.com/aws-samples/anthropic-on-aws/tree/main/notebooks"&gt;notebook examples&lt;/a&gt; for multiple use cases and a variety of programming languages.&lt;/p&gt; 
&lt;p&gt;You can use Claude Fable 5 in the &lt;a href="https://console.aws.amazon.com/bedrock/home?#/playground?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Bedrock console&lt;/a&gt;. Choose &lt;strong&gt;Claude Fable 5&lt;/strong&gt; in the &lt;strong&gt;Playground&lt;/strong&gt; and test it.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104426" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/10/2026-claude-fable-5-in-bedrock-console.jpg" alt="" width="1800" height="1098"&gt;&lt;/p&gt; 
&lt;p&gt;You can also use Claude Fable 5 with the Invoke API and Converse API on &lt;code&gt;bedrock-runtime&lt;/code&gt; endpoint. Here’s an example to call Converse API for a unified multi-model experience using the AWS SDK for Python (Boto3):&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-python"&gt;import boto3 
bedrock_runtime = boto3.client("bedrock-runtime", region_name="us-east-1") 
response = bedrock_runtime.converse( 
    modelId="global.anthropic.claude-fable-5", 
    messages=[ 
        { 
            "role": "user", 
            "content": [ 
                { 
                    "text": "Design a distributed architecture on AWS in Python that should support 100k requests per second across multiple geographic regions." 
                } 
            ] 
        } 
    ], 
    inferenceConfig={ 
        "maxTokens": 4096 
    } 
) 
print(response["output"]["message"]["content"][0]["text"]) &lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;To learn more, visit &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/service_code_examples_bedrock-runtime_anthropic_claude.html"&gt;code examples&lt;/a&gt; that show how to use Amazon Bedrock Runtime with AWS SDKs.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Things to know&lt;/strong&gt;&lt;br&gt; Let me share some important technical details that I think you’ll find useful.&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Model access&lt;/strong&gt;: Claude Fable 5 access is gradually expanding for all AWS accounts. If your account doesn’t have access yet, it will be enabled soon depending on your Bedrock usage. If you want to get access to this model quickly, contact your usual AWS Support.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Pricing&lt;/strong&gt;: When a harmful prompt is routed to Opus 4.8 instead of Fable 5, you pay only Opus prices. If a request is blocked mid-conversation, initial tokens are charged at Fable rates and subsequent tokens at Opus rates. To learn more, visit the &lt;a href="https://aws.amazon.com/bedrock/pricing/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;amp;sc_channel=el"&gt;Amazon Bedrock pricing&lt;/a&gt; page.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Data retention&lt;/strong&gt;: For Fable 5, Mythos 5, and future models on Bedrock with similar or higher capability levels, Anthropic will require 30-day retention for all traffic on Mythos-class models. Retaining data for a limited period allows Anthropic to detect patterns of misuse that are not visible from a single exchange. Once you opt into data retention, your data will leave AWS’s data and security boundary.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Claude Mythos 5 on Bedrock (Limited Preview)&lt;/strong&gt;: You can also use Anthropic’s most capable model for cybersecurity and life sciences, including vulnerability discovery, drug design, and biodefense screening. Access is currently limited due to the dual-use nature of these domains. To learn more, visit the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/model-card-anthropic-claude-mythos-5.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;model card documentation&lt;/a&gt;.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; Anthropic’s Claude Fable 5 model is available today on Amazon Bedrock in the US East (N. Virginia) and Europe (Stockholm) Regions; check the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/model-card-anthropic-claude-fable-5.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;full list of Regions&lt;/a&gt; for future updates. Claude Fable 5 is also available on the Claude Platform on AWS in North America, South America, Europe, and Asia Pacific.&lt;/p&gt; 
&lt;p&gt;Give Claude Fable 5 a try with the Amazon Bedrock APIs, in the &lt;a href="https://console.aws.amazon.com/claude-platform/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Claude Platform on AWS&lt;/a&gt;, and send feedback to &lt;a href="https://repost.aws/tags/TAQeKlaPaNRQ2tWB6P7KrMag/amazon-bedrock?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for Amazon Bedrock&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://twitter.com/channyun"&gt;Channy&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June 9, 2026&lt;/strong&gt; – 1) Updated the console screenshot. You can use the console on &lt;code&gt;bedrock-runtime&lt;/code&gt; engine. The console support on &lt;code&gt;bedrock-mantle&lt;/code&gt; is coming soon. 2) Fixed the right model id in the sample code, 3) Fixed correct &lt;code&gt;provider_data_share&lt;/code&gt; parameter, 4) Add a data retention setting script for &lt;code&gt;bedrock-runtime&lt;/code&gt; engine.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June 10, 2026&lt;/strong&gt; – Added how to configure data retention setting through AWS SigV4 and AWS CLI.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>AWS Weekly Roundup: BYOM for Amazon RDS for SQL Server, AWS IoT Device SDK for Swift, and more (June 8, 2026)</title>
		<link>https://aws.amazon.com/blogs/aws/aws-weekly-roundup-byom-for-amazon-rds-for-sql-server-aws-iot-device-sdk-for-swift-and-more-june-8-2026/</link>
					
		
		<dc:creator><![CDATA[Sébastien Stormacq]]></dc:creator>
		<pubDate>Mon, 08 Jun 2026 21:27:12 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Bedrock AgentCore]]></category>
		<category><![CDATA[Amazon Cognito]]></category>
		<category><![CDATA[Amazon EKS Distro]]></category>
		<category><![CDATA[Amazon Elastic Container Service]]></category>
		<category><![CDATA[Amazon Elastic Kubernetes Service]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[AWS Cost and Usage Report]]></category>
		<category><![CDATA[AWS Step Functions]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[RDS for SQL Server]]></category>
		<category><![CDATA[Week in Review]]></category>
		<guid isPermaLink="false">8f2066eba72bf35221f55a1dfd87b0762e172505</guid>

					<description>This week, the&amp;nbsp;AWS IoT Device SDK for Swift&amp;nbsp;reached general availability. As a member of the&amp;nbsp;Swift Server Workgroup (SSWG), this one caught my attention. The SDK brings production-ready MQTT 5 connectivity, Device Shadow, Jobs, and fleet provisioning to Swift developers on macOS, iOS, tvOS, and Linux. I’m curious to see what you will build with it. […]</description>
										<content:encoded>&lt;p&gt;This week, the&amp;nbsp;&lt;a href="https://aws.amazon.com/blogs/developer/announcing-the-general-availability-of-the-aws-iot-device-sdk-for-swift/"&gt;AWS IoT Device SDK for Swift&lt;/a&gt;&amp;nbsp;reached general availability. As a member of the&amp;nbsp;&lt;a href="https://swift.org/sswg"&gt;Swift Server Workgroup (SSWG)&lt;/a&gt;, this one caught my attention. The SDK brings production-ready MQTT 5 connectivity, Device Shadow, Jobs, and fleet provisioning to Swift developers on macOS, iOS, tvOS, and Linux.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/Gemini_Generated_Image_sdi7hqsdi7hqsdi7.jpg"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-104245" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/Gemini_Generated_Image_sdi7hqsdi7hqsdi7-1024x559.jpg" alt="Swift on IoT and Edge devices, an AI generated illustration" width="1024" height="559"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;I’m curious to see what you will build with it. Swift on the server has matured over the past few years, and now it reaches IoT devices too. This connects to a broader trend of running Swift at the edge. &lt;a href="https://wendy.dev/"&gt;WendyOS&lt;/a&gt;, for example, is an open-source operating system for physical AI that offers first-class Swift support for deploying apps to NVIDIA Jetson and Raspberry Pi hardware. Between server-side Swift, IoT, and edge computing, the language is showing up in places that would have surprised most people a few years ago.&lt;/p&gt; 
&lt;p&gt;Now, let’s get into this week’s AWS news.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Headlines&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;&lt;strong&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/rds-sqlserver-supports-bring-your-own-media/"&gt;Amazon RDS for SQL Server supports Bring Your Own Media&lt;/a&gt;&lt;/strong&gt; – Customers who migrate SQL Server applications from on-premises environments can now reuse their existing Microsoft SQL Server licenses, including Software Assurance, through Microsoft’s License Mobility program on Amazon RDS. BYOM is integrated with AWS License Manager for tracking license usage and compliance. &lt;a href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/sqlserver-byom.html"&gt;Read more&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://aws.amazon.com/blogs/aws/improve-your-application-resilience-with-amazon-cognito-multi-region-replication/"&gt;&lt;strong&gt;Amazon Cognito now supports multi-Region replication&lt;/strong&gt;&lt;/a&gt; – You can now synchronize user and machine identity data, including credentials, user pool configurations, and federation setups, to a secondary user pool in a standby Region in near real-time. In the event of a disruption in the primary Region, signed-in users continue accessing their applications without re-authenticating, and registered users can sign in with their existing credentials. Multi-Region replication is available as an add-on for user pools in Essentials or Plus feature tiers across 16 Regions. &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-cognito-multi-region/"&gt;Read more&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://aws.amazon.com/blogs/aws/get-started-with-openai-gpt-5-5-gpt-5-4-models-and-codex-on-amazon-bedrock/"&gt;&lt;strong&gt;GPT-5.5, GPT-5.4, and Codex from OpenAI are now generally available on Amazon Bedrock&lt;/strong&gt;&lt;/a&gt; – You can now use GPT-5.5 and GPT-5.4 in production workloads on Amazon Bedrock and build with Codex for AI-powered software development, with the same security, governance, and operational controls you already use across AWS. GPT-5.5 is the most capable model from OpenAI, excelling at agentic coding, data analysis, and multi-step autonomous tasks. Codex is available through the Codex App, the Codex CLI, and IDE integrations with Visual Studio Code, JetBrains, and Xcode. Pricing matches OpenAI first-party rates, and usage counts toward existing AWS commitments. &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-openai-models-codex-generally-available/"&gt;Read more&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Last week’s launches&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Here are some launches and updates from this past week that caught my attention:&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-bedrock-supports-cloudwatch-metrics-bedrock-mantle-endpoint/"&gt;Amazon Bedrock adds CloudWatch metrics for OpenAI- and Anthropic-compatible APIs&lt;/a&gt; – You can now monitor inference traffic to the bedrock-mantle endpoint with CloudWatch metrics, including inference counts, input and output token totals, and client error counts at account, project, model, and project-and-model granularity.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/blogs/aws/try-the-new-console-experience-in-amazon-bedrock-optimized-for-anthropic-and-openai-compatible-apis/"&gt;Amazon Bedrock launches a redesigned console optimized for OpenAI- and Anthropic-compatible APIs&lt;/a&gt; – A refreshed console workflow with a model catalog, side-by-side comparison, project-based organization, and project-aware documentation with pre-filled code snippets.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/agentcore-identity-secrets-manager/"&gt;Amazon Bedrock AgentCore Identity now supports bring-your-own secrets with AWS Secrets Manager&lt;/a&gt; – Customers can now reference existing AWS Secrets Manager secret ARNs in AgentCore Identity Credential Providers, enabling full ownership of secrets governance including custom CMKs, tagging strategies, and automatic rotation.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-step-functions-agentcore/"&gt;AWS Step Functions adds AgentCore-powered agentic reasoning step&lt;/a&gt; – You can now add AI agent reasoning steps to your Step Functions workflows through an integration with the managed harness in Amazon Bedrock AgentCore. Run multiple agents in parallel or sequence, add human approval, and trace every agent decision.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-eks-distro-kubernetes-version-1-36/"&gt;Amazon EKS and Amazon EKS Distro now support Kubernetes version 1.36&lt;/a&gt; – Kubernetes 1.36 promotes User Namespaces to GA, introduces Mutating Admission Policies, In-Place Pod-Level Resources Vertical Scaling, and Resource Health Status reporting. Available in all Regions where EKS is available.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-ecs-managed-instances-neuron/"&gt;Amazon ECS Managed Instances now supports AWS Trainium and AWS Inferentia&lt;/a&gt; – You can now create an ECS Managed Instances capacity provider with Inferentia2, Trainium1, and Trainium2 instance types and have Amazon ECS automatically allocate all accelerator resources to your workload.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-quick-vpc-mcp/"&gt;Amazon Quick now supports VPC connectivity for MCP connections&lt;/a&gt; – Enterprise customers can now connect privately hosted Model Context Protocol (MCP) servers to Amazon Quick through VPC, enabling secure access to proprietary applications and internal tools without exposing them to the internet.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/aws-cur2.0-athena-redshift/"&gt;AWS Cost and Usage Report 2.0 now supports Athena and Redshift integration&lt;/a&gt; – CUR 2.0 exports are automatically delivered in the optimal format for your chosen query engine, with supporting infrastructure templates, table definitions, and data loading instructions.&lt;/li&gt; 
 &lt;li&gt;&lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/amazon-location-service/amazon-location-new-public-transit-intermodal-routing/"&gt;Amazon Location Service announces public transit and intermodal routing&lt;/a&gt; – The Routes API now supports two new travel modes, Transit and Intermodal, to plan journeys combining public transportation with walking, driving, taxi, and rental segments across 13 Regions.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;For a full list of AWS announcements, be sure to keep an eye on the&amp;nbsp;&lt;a href="https://aws.amazon.com/new/"&gt;What’s New with AWS&lt;/a&gt;&amp;nbsp;page.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Upcoming AWS events&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Learn more about AWS, browse and join upcoming &lt;a href="https://aws.amazon.com/events/explore-aws-events/?refid=e61dee65-4ce8-4738-84db-75305c9cd4fe" target="_blank" rel="noopener noreferrer"&gt;AWS-led in-person and virtual events&lt;/a&gt;, &lt;a href="https://aws.amazon.com/startups/events?tab=upcoming?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;startup events&lt;/a&gt;, and &lt;a href="https://builder.aws.com/connect/events?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;developer-focused events&lt;/a&gt; as well as &lt;a href="https://aws.amazon.com/events/summits/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;AWS Summits&lt;/a&gt; and &lt;a href="https://aws.amazon.com/events/community-day/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;AWS Community Days&lt;/a&gt;. Join the &lt;a href="https://builder.aws.com/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el" target="_blank" rel="noopener noreferrer"&gt;AWS Builder Center&lt;/a&gt; to connect with builders, share solutions, and access content that supports your development.&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;&lt;/p&gt; 
&lt;p&gt;That’s all for this week. Check back next Monday for another Weekly Roundup!&lt;/p&gt; 
&lt;a href="https://linktr.ee/sebsto"&gt;— seb&lt;/a&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Try the new console experience in Amazon Bedrock, optimized for Anthropic- and OpenAI-compatible APIs</title>
		<link>https://aws.amazon.com/blogs/aws/try-the-new-console-experience-in-amazon-bedrock-optimized-for-anthropic-and-openai-compatible-apis/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Fri, 05 Jun 2026 19:15:11 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">a0baf26d5322e6b76f8bcaeab13717a0ca9ba3d6</guid>

					<description>You can use the new console experience on Amazon Bedrock to browse and compare the latest AI models side by side, organize work into projects with streamlined evaluation workflows, and access project-aware live documentation with auto-prefilled code snippets ready to copy and run.</description>
										<content:encoded>&lt;p&gt;Today, we’re announcing a new console experience in &lt;a href="https://aws.amazon.com/bedrock/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock&lt;/a&gt; for you to experiment, iterate, and scale with the latest AI models on Amazon Bedrock’s next-generation inference engine built for high performance, reliability, and security. This console has a refreshed workflow optimized for &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoint, which supports the latest GPT, Claude, and open-weight models with the OpenAI Responses API, OpenAI Chat Completions API, and the Anthropic Messages API.&lt;/p&gt; 
&lt;p&gt;The new console experience makes it simple to find the right model and move quickly from evaluation to production.&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;New model card&lt;/strong&gt;: You can browse the full model catalog, compare them side by side on capabilities, modality support, context window, and applicable service quotas in a single view, removing the need to stitch together documentation, and limit calculators.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Project-based work&lt;/strong&gt;: You can make a project to run evaluations and review usage insights in one streamlined workflow that mirrors the lifecycle of building a generative AI application.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Live documentation&lt;/strong&gt;: You can use project-aware live documentation: code samples, SDK snippets, and API references are automatically prefilled with your project variables. You can copy a snippet straight from the console into your application and run it without modification.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;How to get started&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; You can try a new experience by choosing &lt;strong&gt;Try the Bedrock Mantle Console&lt;/strong&gt; from within the &lt;a href="https://console.aws.amazon.com/bedrock/home?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock console&lt;/a&gt;, or by using the &lt;a href="https://console.aws.amazon.com/bedrock-mantle/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;new console link&lt;/a&gt; directly.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104227 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-0.jpg" alt="" width="1796" height="779"&gt;&lt;/p&gt; 
&lt;p&gt;You can find a project-based dashboard to show inference requests and error by range of recent dates, recently used models, and the project list. You can create a project, assign models, configure API keys, and start making inference requests in minutes.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104229 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-1-home.jpg" alt="" width="1800" height="1630"&gt;&lt;/p&gt; 
&lt;p&gt;A new model catalog shows the latest GPT, Claude, and open-weight models that are supported on the &lt;code&gt;bedrock-mantle&lt;/code&gt; engine. You can see the details of features, tokens, pricing, input/output, pricing information, and Regional availability. You can also compare up to 3 models in a single view.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104230" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-1-model-card.jpg" alt="" width="1800" height="972"&gt;&lt;/p&gt; 
&lt;p&gt;When you choose the project dashboard, you can see the models used in the project, the distribution of your token usage such as total token usage, token usage per minute, inference requests per minute, and tokens per inference request. This can inform your model selection, prompt optimization, and workload consistency decisions.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104231" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-2-project.jpg" alt="" width="1800" height="991"&gt;&lt;/p&gt; 
&lt;p&gt;You can select up to 3 models to start evaluating to compare responses side by side with the same prompt.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104232" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-2-evaluation.jpg" alt="" width="1800" height="977"&gt;&lt;/p&gt; 
&lt;p&gt;To build your application in the project, choose &lt;strong&gt;Getting started&lt;/strong&gt;. You can migrate existing code, build a new app with the Anthropic or OpenAI SDK, or connect an AI coding assistant to Bedrock.&lt;/p&gt; 
&lt;p&gt;Choose the &lt;strong&gt;API &amp;amp; SDK&lt;/strong&gt;, your SDK (either Anthropic or OpenAI), your preferred programming language, and your authentication method. It shows your environment code to run these in your terminal for a quick test, or save to a &lt;code&gt;.env&lt;/code&gt; file for your application. You can also send your first request with sample code snippets to verify your setup.&lt;/p&gt; 
&lt;p&gt;When you choose &lt;strong&gt;Clients&lt;/strong&gt;, you can select the AI coding agent source such as Claude Code, Cline, Codex, Cursor, or OpenCode that you want to connect to the &lt;code&gt;bedrock-mantle&lt;/code&gt; engine. It provides instructions on how to install the AI agent, use your AWS IAM credentials or use a Bedrock API key, set environment variables, and route requests from each AI agent through Bedrock.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter size-full wp-image-104234" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-3-getstarted.jpg" alt="" width="1800" height="1932"&gt;&lt;/p&gt; 
&lt;p&gt;To learn about Anthropic- and OpenAI-compatible APIs, choose &lt;strong&gt;Live API docs&lt;/strong&gt;. You can choose&amp;nbsp;&lt;strong&gt;Anthropic API Protocol&lt;/strong&gt; for access to Claude model features like the Messages API or &lt;strong&gt;OpenAI API Protocol&lt;/strong&gt; for access to features like Responses API.&lt;/p&gt; 
&lt;p&gt;For example, when you choose OpenAI Response API, it retrieves a model response with the given model ID. These API references are automatically prefilled with the project’s selected model ID, Region, &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoint URL, and API key reference, and they update in place as you change models or settings.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104235 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/05/2026-new-bedrock-console-3-livedoc.jpg" alt="" width="1800" height="1929"&gt;&lt;/p&gt; 
&lt;p&gt;You can also choose the existing Bedrock console to manage fully-managed features such as Agents, Knowledge Bases, Guardrails, fine-tuning, or the InvokeModel and Converse APIs to run on the &lt;code&gt;bedrock-runtime&lt;/code&gt; endpoint.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; The new console experience is available in all AWS Regions where the &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoint is offered: US East (N. Virginia, Ohio), US West (Oregon), Asia Pacific (Jakarta, Mumbai, Sydney, Tokyo), Europe (Frankfurt, Ireland, London, Milan, Stockholm), and South America (São Paulo). Check the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/models-region-compatibility.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;full list of Regions&lt;/a&gt; for future updates.&lt;/p&gt; 
&lt;p&gt;Give the new console experience a try in the &lt;a href="https://console.aws.amazon.com/bedrock-mantle/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;new Amazon Bedrock console&lt;/a&gt; and send feedback to &lt;a href="https://repost.aws/tags/TAQeKlaPaNRQ2tWB6P7KrMag/amazon-bedrock?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for Amazon Bedrock&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://twitter.com/channyun"&gt;Channy&lt;/a&gt;&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Improve your application resilience with Amazon Cognito multi-Region replication</title>
		<link>https://aws.amazon.com/blogs/aws/improve-your-application-resilience-with-amazon-cognito-multi-region-replication/</link>
					
		
		<dc:creator><![CDATA[Sébastien Stormacq]]></dc:creator>
		<pubDate>Wed, 03 Jun 2026 22:37:29 +0000</pubDate>
				<category><![CDATA[Amazon Cognito]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Security, Identity, & Compliance]]></category>
		<guid isPermaLink="false">ffb6b5eaa7f07ef9c7f8b9dd92ccdd689e10d7ae</guid>

					<description>Amazon Cognito now offers multi-Region replication that automatically synchronizes user data, credentials, and pool configurations to a secondary AWS Region, enabling uninterrupted authentication during regional failovers without forced password resets—plus new support for customer managed KMS keys for encryption control.</description>
										<content:encoded>&lt;p&gt;As a developer advocate working with web and mobile application developers, I’ve often heard about the need to maintain consistent user authentication in the unlikely event of a regional service interruption. The increasing use of agentic AI, microservices, automation, and service accounts has sparked a similar need for machine-to-machine authentication. Today, I’m excited to share two important updates to Amazon Cognito: &lt;strong&gt;multi-Region replication&lt;/strong&gt; for improved resilience, and support for &lt;strong&gt;customer managed keys&lt;/strong&gt; for more control encryption control.&lt;/p&gt; 
&lt;p&gt;Many applications rely on Amazon Cognito to handle user and machine-to-machine authentication, and to manage user profiles. When building for high availability, having consistent data across different AWS Regions is a key approach, and until now, achieving that consistency came with significant challenges. Engineering teams spent significant time building and maintaining custom replication solutions to synchronize configurations across Regions. Manual export and import of user data between Regions created security risks from potential data exposure and introduced opportunities for data inconsistencies. During regional transitions, end users experienced disruptions like forced password resets and re-authentication. For machine-to-machine communications, teams had to create new app clients in the secondary region, which meant reconfiguring their applications and updating OAuth-protected resources to accept access tokens issued by the new regional issuer. These challenges made it difficult to maintain uninterrupted operations across Regions.&lt;/p&gt; 
&lt;p&gt;With multi-Region replication, Amazon Cognito automatically maintains a synchronized copy of your user data and machine secrets in a secondary AWS Region of your choice. The replication flows in one direction, from your primary Region to the secondary Region. This includes user profiles, credentials, and pool configurations. The secondary Region operates in read-only mode, focusing on maintaining authentication capabilities. Existing sessions continue uninterrupted.&lt;/p&gt; 
&lt;p&gt;When you need to direct traffic to the secondary Region, your existing users can continue signing in with their existing credentials without disruption, and currently signed-in users remain authenticated because both regions recognize access tokens issued by either region. Multi-Region replication supports all authentication methods, including federated sign-in through social providers (Amazon, Google, Apple, Facebook), Security Assertion Markup Language (SAML) and OpenID Connect (OIDC) integrations, and API authorization flows. This approach maintains availability for both customer-facing applications and machine-to-machine communications in your backend services. While authentication continues without interruption, operations like new user registration or profile updates are not available during failover.&lt;/p&gt; 
&lt;p&gt;Before configuring multi-Region replication, you must configure a multi-Region customer managed key stored in &lt;a href="https://aws.amazon.com/kms/"&gt;AWS Key Management Service (AWS KMS)&lt;/a&gt; to encrypt your user data at rest. These keys provide consistent encryption across Regions while giving you control over your encryption strategy.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;How this works in practice&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;I start this demo with an existing Cognito user pool in the &lt;code&gt;us-west-2&lt;/code&gt; (Oregon) Region. I want to configure replication to &lt;code&gt;us-east-1&lt;/code&gt; (Northern Virginia). I also have a customer managed key replicated in these two Regions.&lt;/p&gt; 
&lt;p&gt;Configuring multi-Region replication is just three steps. The &lt;a href="https://console.aws.amazon.com"&gt;AWS Management Console&lt;/a&gt; guides me through the steps: set up a custom key for encryption, configure multi-region OIDC endpoints, and configure the replication itself.&lt;/p&gt; 
&lt;p&gt;First, I set up a custom AWS KMS key to encrypt the data at rest.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/22/Screenshot-2026-04-21-at-1.27.34 PM.png"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-103762" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/22/Screenshot-2026-04-21-at-1.27.34 PM-1024x538.png" alt="Cognito Multi-Region replication - initial state" width="1024" height="538" data-wp-editing="1"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;I select the custom key I created. I also update the key policy to allow Amazon Cognito to access and use the key. The console shows the correct IAM policy statements to add to my key policy.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_14-12-57.png"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-100881" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_14-12-57-1024x754.png" alt="Cognito Multi-Region replication - select CMK" width="1024" height="754"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;The console confirms when the custom key is selected and correctly configured.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_14-13-25.png"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-100882" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_14-13-25-1024x471.png" alt="Cognito Multi-Region replication - confirm CMK" width="1024" height="471"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;Second, I follow the console instructions to configure the OIDC issuer type. On &lt;strong&gt;Step 2 – optional&lt;/strong&gt;, I choose &lt;strong&gt;Configure&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/24/Screenshot-2026-04-24-at-8.48.56 AM.png"&gt;&lt;img loading="lazy" class="aligncenter wp-image-103804 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/24/Screenshot-2026-04-24-at-8.48.56 AM-1024x442.png" alt="Cognito Multi-Region replication - configure multi region OIDC 1" width="1024" height="442"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;I make sure to update my client applications with these new endpoints. This is a required change that will need a redeployment of server-side applications and an update submission for mobile apps on the App Store and Google Play. If I don’t update the endpoints, my users will experience disruptions because requests to the old endpoints will no longer be routed correctly.&lt;/p&gt; 
&lt;p&gt;On the next screen, I select &lt;strong&gt;Updated&lt;/strong&gt;. I take note of the new URLs. I confirm the changes and choose &lt;strong&gt;Change issuer type&lt;/strong&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/24/Screenshot-2026-04-24-at-8.49.32 AM.png"&gt;&lt;img loading="lazy" class="aligncenter wp-image-103805 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/24/Screenshot-2026-04-24-at-8.49.32 AM-1024x532.png" alt="Cognito Multi-Region replication - configure multi region OIDC 2" width="1024" height="532"&gt;&lt;/a&gt;Finally, I select the target Region for replication. Only Regions where the custom encryption key is replicated are available for selection. After having chosen the target Region, I choose &lt;strong&gt;Create&lt;/strong&gt;.&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/27/Screenshot-2026-04-24-at-12.30.41 PM.png"&gt;&lt;img loading="lazy" class="aligncenter wp-image-103808 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/27/Screenshot-2026-04-24-at-12.30.41 PM-1024x463.png" alt="Cognito Multi-Region replication - start the replication process" width="1024" height="463"&gt;&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;The service prepares the replication. The time needed depends on the amount of data in the user pool.&lt;/p&gt; 
&lt;p&gt;When the replicated user pool is ready, I manually &lt;strong&gt;Activate&lt;/strong&gt; it.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-38-40.png"&gt;&lt;img loading="lazy" class="aligncenter wp-image-100888 size-large" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-38-40-e1763048830128-1024x184.png" alt="Cognito Multi-Region replication - replication process is complete" width="1024" height="184"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;The replication status becomes &lt;span style="color: #008000"&gt;&lt;strong&gt;Active&lt;/strong&gt;&lt;/span&gt;. It is ready to direct traffic to the replica.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-44-19.png"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-100889" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-44-19-1024x113.png" alt="Cognito Multi-Region replication - active" width="1024" height="113"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Additional configurations&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;The console helps me to keep track of additional configurations I have to plan. When I’m using &lt;a href="https://docs.aws.amazon.com/cognito/latest/developerguide/cognito-user-pools-working-with-lambda-triggers.html"&gt;Lambda functions&lt;/a&gt; for &lt;a href="https://docs.aws.amazon.com/cognito/latest/developerguide/amazon-cognito-user-pools-authentication-flow-methods.html"&gt;custom authentication flows&lt;/a&gt; or SMS or email notifications, I must also deploy and configure these resources in the new Region.&lt;/p&gt; 
&lt;p&gt;Similarly, log streaming or &lt;a href="https://aws.amazon.com/waf"&gt;AWS WAF&lt;/a&gt; configuration must be manually configured in the target Region before I start directing authentication traffic to it.&lt;/p&gt; 
&lt;p&gt;&lt;a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-43-11-copy.png"&gt;&lt;img loading="lazy" class="aligncenter size-large wp-image-100890" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/11/13/2025-11-13_16-43-11-copy-1024x443.png" alt="Cognito Multi-Region replication - task list" width="1024" height="443"&gt;&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Health checks and failover&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Both primary and secondary regional endpoints remain active and ready to serve your traffic at all times. To monitor system health and manage failovers, you design a strategy that aligns with your application’s specific requirements and security posture. You can implement health checks to monitor the status of authentication services in your primary Region and define criteria for when to initiate failover. These checks might look for error rates, latency patterns, or specific service alerts.&lt;/p&gt; 
&lt;p&gt;When your monitoring system detects issues meeting your failover criteria, you can redirect traffic to the secondary Region through DNS updates. This approach gives you control over the failover process while maintaining security. Consider testing your failover strategy during off-peak hours by redirecting a small portion of traffic to verify that authentication continues working as expected in the secondary Region.&lt;/p&gt; 
&lt;p&gt;When using managed login and federation with custom domains, you can also use the built-in traffic routing feature by providing an &lt;a href="https://docs.aws.amazon.com/Route53/latest/DeveloperGuide/welcome-health-checks.html"&gt;Amazon Route 53 health check ID&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;span style="text-decoration: underline"&gt;&lt;strong&gt;Pricing and availability&lt;br&gt; &lt;/strong&gt;&lt;/span&gt;Multi-Region replication is available today as an add-on feature for Amazon Cognito customers using Essentials and Plus tier. For user authentication, the add-on costs $0.0045 per monthly active user per replica Region for Essentials tier customers and $0.006 per monthly active user per replica region for Plus tier customers. For machine-to-machine (M2M) authentication, the add-on is a 30% charge on top of the standard volume-based pricing for successful tokens issued. For detailed pricing information, &lt;a href="https://aws.amazon.com/cognito/pricing/"&gt;see Amazon Cognito pricing&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;Multi-Region replication is available in the following Regions: US East (Ohio, N. Virginia), US West (N. California, Oregon), Asia Pacific (Mumbai, Seoul, Singapore, Sydney, Tokyo), Canada (Central), Europe (Frankfurt, Ireland, London, Paris, Stockholm), and South America (São Paulo).&lt;/p&gt; 
&lt;p&gt;Any of these Regions can be used as the source or the destination for the replication.&lt;/p&gt; 
&lt;p&gt;Support for customer managed keys is available for the Essentials and Plus tiers. It is available in the following Regions: US East (Ohio, N. Virginia), US West (N. California, Oregon), Africa (Cape Town), Asia Pacific (Hong Kong, Hyderabad, Jakarta, Malaysia, Melbourne, Mumbai, New Zealand, Osaka, Seoul, Singapore, Sydney, Thailand, Tokyo), Canada (Central), Canada West (Calgary), Europe (Frankfurt, Ireland, London, Milan, Paris, Spain, Stockholm, Zurich), Israel (Tel Aviv), Mexico (Central), South America (São Paulo), and AWS GovCloud (US-East, US-West)&lt;/p&gt; 
&lt;p&gt;From my conversations with customers, maintaining business continuity during regional incidents while meeting security requirements is a high priority. Multi-Region replication provides the capability to build more resilient applications without managing complex replication logic yourself. The automatic synchronization of user data and configurations reduces operational overhead while maintaining security.&lt;/p&gt; 
&lt;p&gt;For customers in regulated industries, the new support for customer managed keys provides additional control over data encryption. You can now use your own encryption keys to protect user data at rest, helping you meet regulatory requirements in industries like healthcare and financial services.&lt;/p&gt; 
&lt;p&gt;To get started with multi-Region replication and customer managed key encryption, visit t&lt;a href="https://console.aws.amazon.com/cognito"&gt;he Amazon Cognito console&lt;/a&gt; or see &lt;a href="https://docs.aws.amazon.com/cognito/latest/developerguide/user-pool-multi-region.html"&gt;the documentation&lt;/a&gt; for detailed setup instructions. I look forward to hearing how you use this feature to strengthen your application architecture.&lt;/p&gt; 
&lt;a href="https://linktr.ee/sebsto"&gt;— seb&lt;/a&gt;</content:encoded>
					
					
			
		
		
			</item>
		<item>
		<title>Get started with OpenAI GPT-5.5, GPT-5.4 models, and Codex on Amazon Bedrock</title>
		<link>https://aws.amazon.com/blogs/aws/get-started-with-openai-gpt-5-5-gpt-5-4-models-and-codex-on-amazon-bedrock/</link>
					
		
		<dc:creator><![CDATA[Channy Yun (윤석찬)]]></dc:creator>
		<pubDate>Mon, 01 Jun 2026 21:33:28 +0000</pubDate>
				<category><![CDATA[Amazon Bedrock]]></category>
		<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[News]]></category>
		<guid isPermaLink="false">b899502355a6fd291fccabfaf1fa4db3c1b64d8f</guid>

					<description>OpenAI frontier models GPT-5.5 and GPT-5.4, and Codex, the OpenAI coding agent, are available on Amazon Bedrock. Deploy frontier models on Bedrock's high performance inference engine with built-in security, governance, and pay-per-token pricing.</description>
										<content:encoded>&lt;p&gt;As we &lt;a href="https://www.aboutamazon.com/news/aws/bedrock-openai-models?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;previewed in What’s Next with AWS 2026&lt;/a&gt;, we’re announcing the availability of OpenAI GPT-5.5, GPT-5.4 models, and Codex on &lt;a href="https://aws.amazon.com/bedrock/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;Amazon Bedrock&lt;/a&gt;, giving you access to frontier models and a coding agent for software development.&lt;/p&gt; 
&lt;p&gt;According to OpenAI, GPT-5.5 and GPT-5.4 models are excellent for coding, reasoning, agentic workflows, and complex professional work. You can use GPT-5.5 for the hardest customer workloads and GPT-5.4 for the best price-performance. You can call them through &lt;code&gt;Responses&lt;/code&gt; API on Amazon Bedrock’s next-generation inference engine built for high performance, reliability, and security.&lt;/p&gt; 
&lt;p&gt;Codex is the OpenAI coding agent for AI-powered software development. According to OpenAI, more than 4 million developers use Codex every week to write, refactor, debug, test, and validate code across large codebases. With GPT-5.5 powering inference, Codex introduces a new class of intelligence optimized for complex, long-horizon developer workflows. You can use the Codex App, the Codex CLI, and IDE integrations with Visual Studio Code, JetBrains, and Xcode, with all model inference routed through the &lt;code&gt;Responses&lt;/code&gt; API on Amazon Bedrock.&lt;/p&gt; 
&lt;p&gt;For customers with data residency requirements, all processing stays within the Bedrock Region you select. You pay per token with no seat licenses and no per-developer commitments.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;GPT-5.5 and GPT-5.4 models on Bedrock in action&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; You can access the model programmatically using the OpenAI &lt;code&gt;Responses&lt;/code&gt; API to call the &lt;code&gt;bedrock-mantle&lt;/code&gt; endpoints through the OpenAI SDK, command-line tools such as &lt;code&gt;curl&lt;/code&gt;.&lt;/p&gt; 
&lt;p&gt;Let’s start with OpenAI SDK for Python. Install OpenAI SDK.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;pip install -U openai&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Set the environment variables for authentication.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;export OPENAI_BASE_URL="https://bedrock-mantle.us-east-2.api.aws/openai/v1"
export OPENAI_API_KEY="&amp;lt;BEDROCK_API_KEY&amp;gt;"
export BEDROCK_OPENAI_MODEL_ID="openai.gpt-5.5"
&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Here is a sample Python code to call GPT-5.5 model on Bedrock:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-python"&gt;import os
from openai import OpenAI
 
client = OpenAI(
    base_url=os.environ["OPENAI_BASE_URL"],
    api_key=os.environ["OPENAI_API_KEY"],
)
 
response = client.responses.create(
    model=os.environ["BEDROCK_OPENAI_MODEL_ID"],
    input=[
        {
            "role": "developer",
            "content": "You are a software engineer with excellent AWS cloud knowledge. Be concise and practical.",
        },
        {
            "role": "user",
            "content": "Design a distributed architecture on AWS in Python that should support 100k requests per second across multiple geographic regions.",
        },
    ],
    reasoning={"effort": "medium"},
    text={"verbosity": "low"},
)
 
print(response.output_text)&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;You can call directly the model endpoint using &lt;code&gt;curl&lt;/code&gt;.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;curl "$OPENAI_BASE_URL/responses" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "openai.gpt-5.5",
    "input": [
      {
        "role": "developer",
        "content": "You are a software engineer with excellent AWS cloud knowledge."
      },
      {
        "role": "user",
        "content": "Design a distributed architecture on AWS in Python that should support 100k requests per second across multiple geographic regions."
      }
    ],
    "reasoning": {"effort": "medium"},
    "text": {"verbosity": "low"}
  }'&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;You can use the &lt;code&gt;Responses&lt;/code&gt; API when you want to use model-managed multi-turn state, need hosted tools, function tools, or richer tool orchestration, and run background or long-running work. To learn more, visit the &lt;a href="https://github.com/openai/openai-cookbook/tree/main/examples/responses_api"&gt;OpenAI Cookbook Responses examples&lt;/a&gt; and &lt;a href="https://developers.openai.com/cookbook/examples/partners/aws/openai_models_with_amazon_bedrock"&gt;getting started guide&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Using OpenAI Codex with GPT-5.5 on Amazon Bedrock&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; You can download Codex CLI, Codex App or Codex VS Code extension and get started with the Bedrock for model inference. Codex supports two Bedrock authentication pathways: &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/api-keys.html"&gt;Amazon Bedrock API key&lt;/a&gt; or AWS SDK credential chain. If you set &lt;code&gt;AWS_BEARER_TOKEN_BEDROCK&lt;/code&gt;, Codex uses it first; otherwise Codex falls back to AWS SDK credential chain.&lt;/p&gt; 
&lt;p&gt;Set &lt;code&gt;AWS_BEARER_TOKEN_BEDROCK&lt;/code&gt; in the environment that Codex will read:&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-bash"&gt;export AWS_BEARER_TOKEN_BEDROCK=&amp;lt;your-bedrock-api-key&amp;gt;&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Then, configure your preferred Region and set the model ID to &lt;code&gt;openai.gpt-5.5&lt;/code&gt; in &lt;code&gt;~/.codex/config.toml&lt;/code&gt;, which is required for Bedrock API-key authentication. You can also choose &lt;code&gt;openai.gpt-5.4&lt;/code&gt;, &lt;code&gt;openai.gpt-oss-120b&lt;/code&gt;, or &lt;code&gt;openai.gpt-oss-20b&lt;/code&gt;. For the desktop app or VS Code extension, put any environment variables the app needs in &lt;code&gt;~/.codex/.env&lt;/code&gt;.&lt;/p&gt; 
&lt;pre&gt;&lt;code class="lang-json"&gt;model = "openai.gpt-5.5"
model_provider = "amazon-bedrock"
[model_providers.amazon-bedrock.aws]
region = "us-east-2"&lt;/code&gt;&lt;/pre&gt; 
&lt;p&gt;Restart the desktop app or VS Code extension after changing &lt;code&gt;~/.codex/config.toml&lt;/code&gt; or &lt;code&gt;~/.codex/.env&lt;/code&gt;. In Codex CLI, you should see a &lt;code&gt;/status&lt;/code&gt; tab that looks like this:&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104157 size-full" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/06/01/2026-codex-on-bedrock.png" alt="" width="1656" height="656"&gt;&lt;/p&gt; 
&lt;p&gt;In Codex App, you can use GPT-5.5 model through Amazon Bedrock inference.&lt;/p&gt; 
&lt;p&gt;&lt;img loading="lazy" class="aligncenter wp-image-104068 size-full" style="border: solid 1px #ccc" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/05/26/2026-codex-on-bedrock-desktop-1.jpg" alt="" width="2228" height="1214"&gt;&lt;/p&gt; 
&lt;p&gt;To learn more about how to configure Codex to use OpenAI models on Amazon Bedrock, visit &lt;a href="https://developers.openai.com/codex/amazon-bedrock"&gt;Use Codex with Amazon Bedrock&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Things to know&lt;/strong&gt;&lt;br&gt; Let me share some important technical details that I think you’ll find useful.&lt;/p&gt; 
&lt;ul&gt; 
 &lt;li&gt;&lt;strong&gt;Model latency&lt;/strong&gt;: OpenAI model information positions GPT-5.5 as fast and GPT-5.4 as medium speed, but customer-perceived latency depends on reasoning effort, output length, tool calls, background mode, Region, quotas, throttling, prompt size, and cache hits. Start GPT-5.5 at &lt;code&gt;medium&lt;/code&gt; effort. Start GPT-5.4 with effort set explicitly rather than relying on its &lt;code&gt;none&lt;/code&gt; default.&lt;/li&gt; 
 &lt;li&gt;&lt;strong&gt;Scaling and capacity&lt;/strong&gt;: Bedrock’s new inference engine is designed to rapidly provision and serve capacity across many different models. When accepting requests, we prioritize keeping steady state workloads running, and ramp usage and capacity rapidly in response to changes in demand. During periods of high demand, requests are queued, rather than rejected.&lt;/li&gt; 
&lt;/ul&gt; 
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Now available&lt;/u&gt;&lt;/strong&gt;&lt;br&gt; OpenAI GPT models and Codex on Amazon Bedrock are available today: GPT-5.5 model in the US East (Ohio) Region, GPT-5.4 model in the US East (Ohio) and US West (Oregon) Regions. Check the &lt;a href="https://docs.aws.amazon.com/bedrock/latest/userguide/models-region-compatibility.html?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;full list of Regions&lt;/a&gt; for future updates. To learn more, visit the &lt;a href="https://aws.amazon.com/bedrock/openai/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;OpenAI on Amazon Bedrock&lt;/a&gt; page and the &lt;a href="https://aws.amazon.com/bedrock/pricing/?trk=769a1a2b-8c19-4976-9c45-b6b1226c7d20&amp;amp;sc_channel=el"&gt;Amazon Bedrock pricing&lt;/a&gt; page.&lt;/p&gt; 
&lt;p&gt;Give GPT-5.5, GPT-5.4 models, and Codex on Amazon Bedrock a try today and send feedback to &lt;a href="https://repost.aws/tags/TAQeKlaPaNRQ2tWB6P7KrMag/amazon-bedrock?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS re:Post for Amazon Bedrock&lt;/a&gt; or through your usual AWS Support contacts.&lt;/p&gt; 
&lt;p&gt;— &lt;a href="https://twitter.com/channyun"&gt;Channy&lt;/a&gt;&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June, 1, 2026&lt;/strong&gt; – The GPT 5.5 and 5.4 models now support the Responses API only on Amazon Bedrock, and console support is coming soon.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June, 3, 2026&lt;/strong&gt; – Amazon Bedrock now supports GPT‑5.4 from OpenAI in &lt;a href="https://aws.amazon.com/about-aws/whats-new/2026/06/GPT54-available-in-aws-govcloud-us-west/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;AWS GovCloud (US-West) Region&lt;/a&gt;.&lt;/p&gt; 
&lt;p&gt;&lt;strong&gt;Updated on June, 7, 2026&lt;/strong&gt; – You can use the GPT 5.5 and 5.4 models in the &lt;a href="https://aws.amazon.com/blogs/aws/try-the-new-console-experience-in-amazon-bedrock-optimized-for-anthropic-and-openai-compatible-apis/?trk=d8ec3b19-0f37-4f8c-8c12-189f913e205c&amp;amp;sc_channel=el"&gt;new Amazon Bedrock console&lt;/a&gt;.&lt;/p&gt;</content:encoded>
					
					
			
		
		
			</item>
	</channel>
</rss>