Enterprise Agents and Benchmarks Collection Enterprise agent ecosystem featuring AssetOpsBench (industrial) and ITBench (SRE, FinOps, CISO), CUGA to accelerate AI Automation • 19 items • Updated 2 days ago • 17
https://github.com/itbench-hub/ITBench https://huggingface.co/datasets/ibm-research/ITBench-Lite
Article ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM ibm-research • 3 days ago • 12
MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments Paper • 2605.09131 • Published 22 days ago • 57
Article Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents ibm-research • Apr 15 • 28
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents Paper • 2603.22386 • Published Mar 23 • 57
Time Series Models Collection A collection of time series models trained by IBM • 4 items • Updated Feb 25 • 1
Granite Time Series Collection Time series models for forecasting, anomaly detection, classification, and more. • 10 items • Updated 9 days ago • 52