InsightBench: A Benchmark for Evaluating End-to-End Data Analytics Agents

InsightBench: A Benchmark for Evaluating End-to-End Data Analytics Agents

Welcome to the AI research bites. This series of short and informative talks showcases cutting-edge research work from ServiceNow AI Research team. The AI Research Bites are open to all, especially those interested in keeping up with the fast-paced AI research community.

This session features the recent work of Issam Laradji and team on text-to-analytics and how it can change the user experience of data analytics. In this presentation, Issam introduces InsightBench, a benchmark dataset with 31 different business use-cases designed to evaluate end-to-end data analytics capabilities, and AgentPoirot, a Text-to-Analytics agent which can effectively perform comprehensive data analysis, from asking questions to interpreting answers and providing actionable insights.
Interested in collaborating with Issam to work on this research? Feel free to reach out to his email address (see the last slide)!
Try AgentPoirot: https://colab.research.google.com/drive/1VjZS-vZPNM3WX4QJs0e557UdAb208wXA
Evaluate on InsightBench: https://github.com/ServiceNow/insight-bench
This work is a continuation of Capture the Flag: Uncovering Data Insights with Large Language Models.

ServiceNow AI Research team:
https://www.servicenow.com/research/