-
Notifications
You must be signed in to change notification settings - Fork 56
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Deployed dc9db58 with MkDocs version: 1.6.1
- Loading branch information
Showing
4 changed files
with
106 additions
and
62 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
| Agent Name | Avg | Detection | Localization | Diagnosis | Mitigation | Time | Org. | Model | Link | | ||
|-------------------|-------|-----------|--------------|-----------|------------|--------|-----------|--------|------| | ||
| 🥇FLASH | **59.27** | **100** | 46.15 | 36.36 | **54.55** | 102.57 | AIOpsLab | GPT 4 | 🔗 | | ||
| 🥈REACT | 53.15 | 76.92 | 53.85 | **45.45** | 36.36 | 44.25 | AIOpsLab | GPT 4 | 🔗 | | ||
| 🥉GPT 4 w Shell | 49.74 | 69.23 | **61.54** | 40.9 | 27.27 | 30.57 | AIOpsLab | GPT 4 | 🔗 | | ||
| GPT 3.5 w Shell | 15.73 | 23.07 | 30.77 | 9.09 | 0 | 12.79 | AIOpsLab | GPT 3.5| 🔗 | | ||
|
||
|
||
|
||
|
||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"Home","text":"AIOpsLab A Holistic Framework to Design, Develop, and Evaluate AI Agents for Enabling Autonomous Clouds M365 Research - AIOps Team \u00a0Leaderboard \u00a0Paper \u00a0Code News <p>\ud83c\udd95 [12/2024] Microsoft Research features AIOpsLab in their latest blog post! \ud83c\udf10 [Link] </p> <p>\ud83c\udd95 [12/2024] Our code is now live on GitHub! \ud83d\ude80 [Link] </p> <p>\ud83c\udd95 [11/2024] Checkout our arxiv paper \"AIOpsLab: A Holistic Framework for Evaluating AI Agents for Enabling Autonomous Cloud\" \ud83d\udc40 [Link] </p> <p>\ud83c\udd95 [10/2024] Our vision paper \"Building AI Agents for Autonomous Clouds: Challenges and Design Principles\" was accepted by SoCC'24 \ud83d\udc40 [Link] </p> About <p>AIOpsLab is a holistic framework to enable the design, development, and evaluation of autonomous AIOps agents that, additionally, serves the purpose of building reproducible, standardized, interoperable and scalable benchmarks. AIOpsLab can deploy microservice cloud environments, inject faults, generate workloads, and export telemetry data, while orchestrating these components and providing interfaces for interacting with and evaluating agents. Moreover, AIOpsLab provides a built-in benchmark suite with a set of problems to evaluate AIOps agents in an interactive environment. This suite can be easily extended to meet user-specific needs. </p> <p>The Orchestrator coordinates interactions between various system components and serves as the Agent-Cloud-Interface (ACI). Agents engage with the Orchestrator to solve tasks, receiving a problem description, instructions, and relevant APIs. The Orchestrator generates diverse problems using the Workload and Fault Generators, injecting these into applications it can deploy. The deployed service has observability, providing telemetry such as metrics, traces, and logs. Agents act via the Orchestrator, which executes them and updates the service's state. The Orchestrator evaluates the final solution using predefined metrics for the task.</p> BibTeX <pre><code>\n @inproceedings{shetty2024building,\n title = {Building AI Agents for Autonomous Clouds: Challenges and Design Principles},\n author = {Shetty, Manish and Chen, Yinfang and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Zhang, Xuchao and Mace, Jonathan and Vandevoorde, Dax and Las-Casas, Pedro and Gupta, Shachee Mishra and Nath, Suman and Bansal, Chetan and Rajmohan, Saravan},\n year = {2024},\n booktitle = {Proceedings of 15th ACM Symposium on Cloud Computing (SoCC'24)},\n }\n @misc{chen2024aiopslab,\n title = {AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds},\n author = {Chen, Yinfang and Shetty, Manish and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Mace, Jonathan and Bansal, Chetan and Wang, Rujia and Rajmohan, Saravan},\n year = {2024},\n booktitle = {Arxiv}\n }\n </code>\n </pre>"},{"location":"pages/leaderboard/","title":"Leaderboard","text":"AIOpsLab A Holistic Framework to Design, Develop, and Evaluate AI Agents for Enabling Autonomous Clouds M365 Research - AIOps Team \u00a0Home \u00a0Paper \u00a0Code Leaderboard <p>We showcase the key results on the leaderboard. If you'd like your results to appear, please email us at [email protected]. In the table, AVG represents the average accuracy across all tasks. Time indicates the average runtime for the agents. Agent Name Avg \u21c5 Detection \u21c5 Localization \u21c5 Diagnosis \u21c5 Mitigation \u21c5 Time \u21c5 Org. Model Link \ud83e\udd47FLASH 59.27 100 46.15 36.36 54.55 102.57 AIOpsLab GPT 4 \ud83d\udd17 \ud83e\udd48REACT 53.15 76.92 53.85 45.45 36.36 44.25 AIOpsLab GPT 4 \ud83d\udd17 \ud83e\udd49GPT 4 w Shell 49.74 69.23 61.54 40.9 27.27 30.57 AIOpsLab GPT 4 \ud83d\udd17 GPT 3.5 w Shell 15.73 23.07 30.77 9.09 0 12.79 AIOpsLab GPT 3.5 \ud83d\udd17"}]} | ||
{"config":{"lang":["en"],"separator":"[\\s\\-]+","pipeline":["stopWordFilter"]},"docs":[{"location":"","title":"Home","text":"AIOpsLab A Holistic Framework to Design, Develop, and Evaluate AI Agents for Enabling Autonomous Clouds M365 Research - AIOps Team \u00a0Leaderboard \u00a0Paper \u00a0Code News <p>\ud83c\udd95 [12/2024] Microsoft Research features AIOpsLab in their latest blog post! \ud83c\udf10 [Link] </p> <p>\ud83c\udd95 [12/2024] Our code is now live on GitHub! \ud83d\ude80 [Link] </p> <p>\ud83c\udd95 [11/2024] Checkout our arxiv paper \"AIOpsLab: A Holistic Framework for Evaluating AI Agents for Enabling Autonomous Cloud\" \ud83d\udc40 [Link] </p> <p>\ud83c\udd95 [10/2024] Our vision paper \"Building AI Agents for Autonomous Clouds: Challenges and Design Principles\" was accepted by SoCC'24 \ud83d\udc40 [Link] </p> About <p>AIOpsLab is a holistic framework to enable the design, development, and evaluation of autonomous AIOps agents that, additionally, serves the purpose of building reproducible, standardized, interoperable and scalable benchmarks. AIOpsLab can deploy microservice cloud environments, inject faults, generate workloads, and export telemetry data, while orchestrating these components and providing interfaces for interacting with and evaluating agents. Moreover, AIOpsLab provides a built-in benchmark suite with a set of problems to evaluate AIOps agents in an interactive environment. This suite can be easily extended to meet user-specific needs. </p> <p>The Orchestrator coordinates interactions between various system components and serves as the Agent-Cloud-Interface (ACI). Agents engage with the Orchestrator to solve tasks, receiving a problem description, instructions, and relevant APIs. The Orchestrator generates diverse problems using the Workload and Fault Generators, injecting these into applications it can deploy. The deployed service has observability, providing telemetry such as metrics, traces, and logs. Agents act via the Orchestrator, which executes them and updates the service's state. The Orchestrator evaluates the final solution using predefined metrics for the task.</p> BibTeX <pre><code>\n @inproceedings{shetty2024building,\n title = {Building AI Agents for Autonomous Clouds: Challenges and Design Principles},\n author = {Shetty, Manish and Chen, Yinfang and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Zhang, Xuchao and Mace, Jonathan and Vandevoorde, Dax and Las-Casas, Pedro and Gupta, Shachee Mishra and Nath, Suman and Bansal, Chetan and Rajmohan, Saravan},\n year = {2024},\n booktitle = {Proceedings of 15th ACM Symposium on Cloud Computing (SoCC'24)},\n }\n @misc{chen2024aiopslab,\n title = {AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds},\n author = {Chen, Yinfang and Shetty, Manish and Somashekar, Gagan and Ma, Minghua and Simmhan, Yogesh and Mace, Jonathan and Bansal, Chetan and Wang, Rujia and Rajmohan, Saravan},\n year = {2024},\n booktitle = {Arxiv}\n }\n </code>\n </pre>"},{"location":"pages/leaderboard/","title":"Leaderboard","text":"AIOpsLab A Holistic Framework to Design, Develop, and Evaluate AI Agents for Enabling Autonomous Clouds M365 Research - AIOps Team \u00a0Home \u00a0Paper \u00a0Code Leaderboard <p>We showcase the key results on the leaderboard. If you'd like your results to appear, please email us at [email protected]. In the table, AVG represents the average accuracy across all tasks. Time indicates the average runtime for the agents. Agent Name Avg Detection Localization Diagnosis Mitigation Time Organization Link \ud83e\udd47FLASH (GPT-4) 59.27 100 46.15 36.36 54.55 102.57 AIOpsLab \ud83d\udd17 \ud83e\udd48REACT (GPT-4) 53.15 76.92 53.85 45.45 36.36 44.25 AIOpsLab \ud83d\udd17 \ud83e\udd49GPT-4 w Shell 49.74 69.23 61.54 40.9 27.27 30.57 AIOpsLab \ud83d\udd17 FLASH (Llama3-8b) 33.34 80 20 0 33.34 63.16 AIOpsLab \ud83d\udd17 GPT-3.5 w Shell 15.73 23.07 30.77 9.09 0 12.79 AIOpsLab \ud83d\udd17 REACT (Llama3-8b) 15 60 0 0 0 230.74 AIOpsLab \ud83d\udd17 LocaleXpert (Llama3-8b) - - 80 - - 102.08 AIOpsLab \ud83d\udd17"}]} |
Binary file not shown.