You are currently viewing AWS Launches AI-Powered DevOps Agent to Accelerate Outage Detection and Recovery

AWS Launches AI-Powered DevOps Agent to Accelerate Outage Detection and Recovery

Prime Highlight 

  • AWS introduced DevOps Agent, an AI-enabled tool that helps companies detect, understand, and resolve system outages faster. 
  • The service automatically analyzes alerts, suggests fixes, and provides initial incident reports to support site reliability engineers. 

Key Facts 

  • DevOps Agent integrates data from third-party monitoring tools like Datadog and Dynatrace and uses AI models to investigate issues. 
  • Early testing by the Commonwealth Bank of Australia showed the tool could identify root causes in under 15 minutes, significantly reducing troubleshooting time. 

Background 

Amazon Web Services (AWS) has introduced a new AI-enabled tool designed to help companies detect, understand, and recover from outages much faster. The service, called DevOps Agent, uses artificial intelligence to study system alerts, identify possible causes, and suggest fixes before engineers even begin manual checks.

AWS announced the launch, saying the tool is now available for customers in a preview version. The company will begin charging for the service once it moves out of the trial stage.

According to AWS, the DevOps Agent works by pulling information from third-party monitoring tools such as Datadog and Dynatrace. It then uses AI models from Amazon and other providers to investigate the issue. Swami Sivasubramanian, AWS vice president of agentic AI, said the tool is designed to support site reliability engineers, who are responsible for preventing downtime and responding quickly during live incidents.

Instead of waiting for on-call teams to log in and start troubleshooting, the DevOps Agent automatically runs several checks, tests different explanations, and prepares an initial incident report. “By the time the on-call team joins, they already have early findings and possible fixes,” Sivasubramanian said.

The Commonwealth Bank of Australia, an early tester, reported that the DevOps Agent was able to identify the root cause of a technical issue in less than 15 minutes, work that would normally take an experienced engineer several hours.

AWS’ move comes as cloud companies race to show how generative AI can boost developer productivity. The tool will compete with Microsoft Azure’s SRE Agent and startups offering AI assistants for operations teams. Amazon is expanding its AI tools for developers, including its new code-generation tool, Kiro.

AWS aims to strengthen its leadership in cloud automation and AI-driven operations with DevOps Agent. 

Read Also :  China’s Trade Surplus Tops $1 Trillion in Record Time, Driven by Global Export Surge