
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Lee, Ji, Wen et al.

Text analytics has traditionally required specialized knowledge in Natural Language Processing (NLP) or text analysis, which presents a barrier for entry-level analysts. Recent advances in large language models (LLMs) have changed the landscape of NLP by enabling more accessible and automated text analysis (e.g., topic detection, summarization, and information extraction). We introduce VIDEE, a system that supports entry-level data analysts in conducting advanced text analytics with intelligent agents. VIDEE instantiates a human-agent collaboration workflow consisting of three stages: (1) Decomposition, which incorporates a human-in-the-loop Monte-Carlo Tree Search algorithm to support generative reasoning with human feedback, (2) Execution, which generates an executable text analytics pipeline, and (3) Evaluation, which integrates LLM-based evaluation and visualizations to support user validation of execution results. We conduct two quantitative experiments to evaluate VIDEE's effectiveness and analyze common agent errors. A user study involving participants with varying levels of NLP and text analytics experience, from none to expert, demonstrates the system's usability and reveals distinct user behavior patterns. The findings identify design implications for human-agent collaboration, validate the practical utility of VIDEE for non-expert users, and inform future improvements to intelligent text analytics systems.



Basic Information

Paper ID: 2506.21582
Title: VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
Authors: Sam Yu-Te Lee, Chenyang Ji, Shicheng Wen, Lifu Huang, Dongyu Liu, Kwan-Liu Ma
Categories: cs.CL cs.AI cs.HC
Published: October 13, 2025 (arXiv v4)
Paper link: https://arxiv.org/abs/2506.21582

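Abstract

Text analytics has traditionally required expertise in natural language processing (NLP) or text analysis, which creates a technical barrier for entry-level analysts. Recent advances in large language models (LLMs) have reshaped NLP by making text analysis (such as topic detection, summarization, and information extraction) more accessible and more automated. This paper introduces VIDEE, a system that supports entry-level data analysts in conducting advanced text analytics in collaboration with intelligent agents. VIDEE instantiates a three-stage human-agent collaboration workflow: (1) Decomposition, which incorporates a human-in-the-loop Monte Carlo Tree Search algorithm to support generative reasoning with human feedback; (2) Execution, which generates an executable text analytics pipeline; and (3) Evaluation, which integrates LLM-based evaluation and visualizations to support user validation of execution results.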

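Research Background and Motivation

Problem Definition

Traditional text analytics faces four main challenges:

Large decomposition space: The flexibility of prompting allows a goal to be decomposed in many ways through different combinations of subtasks, so analysts must trade off the difficulty of each subtask against the overall robustness of the pipeline.
Technical knowledge barrier: Analysts have varying levels of technical knowledge, especially about LLMs. LLM-related fields evolve rapidly, and analysts may be unable to keep up with the latest techniques.
Implementation and experimentation difficulty: Building and implementing a text analytics pipeline requires substantial engineering effort, including handling input and output formats, intermediate data transformations, and analysis parameters.
Evaluation challenges: Evaluating LLM-based text analytics pipelines requires distinct evaluation methods that are not yet widely known.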

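Research Motivation

These challenges motivate an agentic system that supports text analysts. Given a user goal and a dataset, an agent with sufficient technical knowledge can automatically decompose the goal, search the large decomposition space, and generate a text analytics plan; it can then implement and execute the pipeline and, finally, evaluate the results.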
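The decompose, execute, evaluate sequence described above can be sketched as a small Python skeleton. This is only an illustration of the control flow, not VIDEE's implementation: the names `Step`, `decompose`, `execute`, `evaluate`, and `run_workflow` are hypothetical, the planner returns a fixed two-step plan instead of calling an agent, and the evaluator is a trivial non-empty check standing in for LLM-based evaluation.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Step:
    """One subtask in a plan: a name plus a function over a list of documents."""
    name: str
    run: Callable[[List[str]], List[str]]


def decompose(goal: str) -> List[Step]:
    # Hypothetical stand-in for the agent's planner: always returns the same plan.
    return [
        Step("normalize", lambda docs: [d.strip().lower() for d in docs]),
        Step("deduplicate", lambda docs: list(dict.fromkeys(docs))),
    ]


def execute(plan: List[Step], docs: List[str]) -> Dict[str, List[str]]:
    # Run the steps in order, keeping every intermediate output for inspection.
    outputs: Dict[str, List[str]] = {}
    for step in plan:
        docs = step.run(docs)
        outputs[step.name] = docs
    return outputs


def evaluate(outputs: Dict[str, List[str]]) -> Dict[str, float]:
    # Hypothetical score: fraction of non-empty results per step.
    return {
        name: (sum(1 for d in docs if d) / len(docs) if docs else 0.0)
        for name, docs in outputs.items()
    }


def run_workflow(goal: str, docs: List[str]) -> Tuple[Dict[str, List[str]], Dict[str, float]]:
    plan = decompose(goal)
    outputs = execute(plan, docs)
    return outputs, evaluate(outputs)
```

Keeping each step's intermediate output, rather than only the final result, is what lets a later evaluation stage report on individual subtasks.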

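Core Contributions

A three-stage human-agent collaboration workflow: A complete workflow of Decomposition, Execution, and Evaluation, designed to accomplish complex text analytics goals.
The VIDEE system: An agentic system with a visual interface that lets data analysts perform text analytics in a no-code environment.
Technical innovations:
- A human-in-the-loop decomposition algorithm based on Monte Carlo Tree Search (MCTS)
- A conceptual framework based on units of analysis for handling changes in data structure
- An evaluation mechanism that integrates LLM judges with visualizations
Empirical findings: Through a system evaluation and a user study, the work offers new insights into agentic systems and human-agent collaboration.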
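To make the idea of MCTS-based decomposition with human feedback more concrete, the following is a minimal generic sketch, not the algorithm from the paper. It searches over ordered sequences of subtask names using standard UCB1 selection. Everything here is an assumption made for illustration: the function `best_plan`, the `score` callback (a stand-in for an automatic quality estimate), and the `approve` callback, which models human feedback simply as a yes/no filter that prunes rejected partial plans.

```python
import math
import random
from typing import Callable, List, Optional, Sequence, Tuple

Plan = Tuple[str, ...]


class Node:
    def __init__(self, plan: Plan, parent: Optional["Node"] = None):
        self.plan = plan
        self.parent = parent
        self.children: List["Node"] = []
        self.visits = 0
        self.value = 0.0


def ucb1(child: Node, c: float = 1.4) -> float:
    # Unvisited children are tried first; otherwise balance mean reward and exploration.
    if child.visits == 0:
        return math.inf
    explore = math.sqrt(math.log(child.parent.visits) / child.visits)
    return child.value / child.visits + c * explore


def best_plan(
    candidates: Sequence[str],
    max_depth: int,
    score: Callable[[Plan], float],    # hypothetical automatic reward in [0, 1]
    approve: Callable[[Plan], bool],   # hypothetical human feedback: keep or prune a plan
    iterations: int = 300,
    seed: int = 0,
) -> Plan:
    rng = random.Random(seed)
    root = Node(())

    def extensions(plan: Plan) -> List[Plan]:
        # Plans reachable by adding one unused subtask that the human has not rejected.
        if len(plan) >= max_depth:
            return []
        return [plan + (c,) for c in candidates
                if c not in plan and approve(plan + (c,))]

    for _ in range(iterations):
        node = root
        while node.children:                      # selection
            node = max(node.children, key=ucb1)
        if node is root or node.visits > 0:       # expansion
            node.children = [Node(p, node) for p in extensions(node.plan)]
            if node.children:
                node = rng.choice(node.children)
        plan = node.plan                          # rollout: extend randomly to a full plan
        while True:
            options = extensions(plan)
            if not options:
                break
            plan = rng.choice(options)
        reward = score(plan)
        while node is not None:                   # backpropagation
            node.visits += 1
            node.value += reward
            node = node.parent

    node = root                                   # follow the most-visited path
    while node.children:
        node = max(node.children, key=lambda n: n.visits)
    return node.plan
```

A usage example under the same assumptions: the human rejects any plan containing a "clean" step, and the scorer prefers segmenting before summarizing.

```python
plan = best_plan(
    candidates=["clean", "segment", "summarize"],
    max_depth=2,
    score=lambda p: 1.0 if p == ("segment", "summarize") else 0.2,
    approve=lambda p: "clean" not in p,
)
```

Applying the approval filter during both expansion and rollout means rejected subtasks never influence the reward estimates, which is one simple way to fold feedback into the search; richer feedback such as edited subtasks or adjusted scores would need a different interface.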