FrameEOL: Semantic Frame Induction using Causal Language Models
Yano, Yamada, Tsukagoshi et al.
Semantic frame induction is the task of clustering frame-evoking words according to the semantic frames they evoke. In recent years, leveraging embeddings of frame-evoking words that are obtained using masked language models (MLMs) such as BERT has led to high-performance semantic frame induction. Although causal language models (CLMs) such as the GPT and Llama series succeed in a wide range of language comprehension tasks and can engage in dialogue as if they understood frames, they have not yet been applied to semantic frame induction. We propose a new method for semantic frame induction based on CLMs. Specifically, we introduce FrameEOL, a prompt-based method for obtaining Frame Embeddings that outputs One frame-name as a Label representing the given situation. To obtain embeddings more suitable for frame induction, we leverage in-context learning (ICL) and deep metric learning (DML). Frame induction is then performed by clustering the resulting embeddings. Experimental results on the English and Japanese FrameNet datasets demonstrate that the proposed methods outperform existing frame induction methods. In particular, for Japanese, which lacks extensive frame resources, the CLM-based method using only 5 ICL examples achieved comparable performance to the MLM-based method fine-tuned with DML.
Semantic frame induction is the task of clustering frame-evoking words according to the semantic frames they evoke. Recent work has achieved high performance on this task using embeddings of frame-evoking words obtained from masked language models (MLMs) such as BERT. Although causal language models (CLMs) such as the GPT and Llama series have succeeded in a wide range of language understanding tasks and can engage in dialogue as if they understood frames, they have not yet been applied to semantic frame induction. This paper proposes FrameEOL, a prompt-based method that obtains frame embeddings from a CLM by prompting it to output one frame name as a label representing the given situation. To obtain embeddings more suitable for frame induction, the method leverages in-context learning (ICL) and deep metric learning (DML); frame induction is then performed by clustering the resulting embeddings. Experimental results on the English and Japanese FrameNet datasets show that the proposed methods outperform existing frame induction methods. Notably, for Japanese, which lacks extensive frame resources, the CLM-based method using only 5 ICL examples achieves performance comparable to the MLM-based method fine-tuned with DML.
Semantic frame induction aims to automatically identify and cluster verb instances that evoke the same semantic frame. For example, the verb "lost" may evoke different semantic frames in different contexts:
"He lost the gold medal by just .02 points" → FINISH_COMPETITION frame
"He lost his gold medal at the restaurant" → LOSING frame
Although modern CLMs such as GPT-4o demonstrate the ability to understand semantic frames (as shown in the ChatGPT example in Figure 1), they have not been systematically applied to semantic frame induction tasks. This paper aims to fill this gap.
Input: A set of sentences containing frame-evoking verbs
Output: Clustering of verb instances according to the semantic frames they evoke
Constraint: No predefined set of frame labels required
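The input/output contract above can be illustrated with a minimal sketch. It assumes each verb instance has already been mapped to an embedding vector (the toy 2-d vectors below are made up), and it uses plain average-linkage agglomerative clustering with a hypothetical distance threshold, so the number of clusters is not fixed in advance. The function names and the threshold value are illustrative assumptions, not the clustering procedure used in the paper.

```python
import math
from typing import List, Sequence


def cosine_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def cluster_instances(embeddings: List[Sequence[float]], threshold: float) -> List[int]:
    """Average-linkage agglomerative clustering over cosine distance.

    Merging stops once the closest pair of clusters is farther apart than
    `threshold` (a hypothetical hyperparameter), so no frame inventory or
    cluster count has to be supplied. Returns one cluster id per instance.
    """
    clusters = [[i] for i in range(len(embeddings))]

    def linkage(a: List[int], b: List[int]) -> float:
        total = sum(cosine_distance(embeddings[i], embeddings[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        # Find the closest pair of clusters (x < y, so deleting y keeps x valid).
        dist, x, y = min(
            (linkage(clusters[x], clusters[y]), x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters))
        )
        if dist > threshold:
            break
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]

    labels = [0] * len(embeddings)
    for cluster_id, members in enumerate(clusters):
        for i in members:
            labels[i] = cluster_id
    return labels


# Toy stand-ins for embeddings of four verb instances (invented values):
# the first two point in one direction, the last two in another.
toy_embeddings = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
toy_labels = cluster_instances(toy_embeddings, threshold=0.2)
print(toy_labels)
```

Instances that share a cluster id are treated as evoking the same (unnamed) frame; the ids themselves carry no meaning beyond grouping.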
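To handle low-resource languages, the method uses in-context learning (ICL): a few labeled demonstrations are placed before the target instance, and the final line is left open so that the model supplies the frame name.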
Example Construction:
The FrameNet frame evoked by "wear" in "On his head he wore a white nightcap..." is Wearing.
The FrameNet frame evoked by "type" in "I typed it out for Diana Morrison." is Text_creation.
The FrameNet frame evoked by "kneel" in "He knelt up and leaned towards Lucien." is Change_posture.
The FrameNet frame evoked by "lost" in "He lost his gold medal at the restaurant." is
Advantages: A few examples (5-20) yield a significant performance improvement, which makes the approach particularly suitable when training data is scarce.
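The example construction above can be sketched as a small prompt builder. The template wording is copied from the examples; the function name `build_icl_prompt` and the tuple layout of the demonstrations are assumptions made for illustration, and how the paper selects its demonstrations is not covered here.

```python
from typing import List, Tuple

# Template wording taken from the examples above.
TEMPLATE = 'The FrameNet frame evoked by "{word}" in "{sentence}" is'


def build_icl_prompt(
    demos: List[Tuple[str, str, str]], word: str, sentence: str
) -> str:
    """Build an ICL prompt from (word, sentence, frame) demonstrations.

    Each demonstration is a completed line ending in the frame name; the
    target line is left open after "is" for the model to continue.
    """
    lines = [
        TEMPLATE.format(word=w, sentence=s) + f" {frame}."
        for w, s, frame in demos
    ]
    lines.append(TEMPLATE.format(word=word, sentence=sentence))
    return "\n".join(lines)


demos = [
    ("wear", "On his head he wore a white nightcap...", "Wearing"),
    ("type", "I typed it out for Diana Morrison.", "Text_creation"),
    ("kneel", "He knelt up and leaned towards Lucien.", "Change_posture"),
]
prompt = build_icl_prompt(demos, "lost", "He lost his gold medal at the restaurant.")
print(prompt)
```

The prompt deliberately ends right after "is", so the model's continuation is expected to be a frame name, mirroring the open final line in the example.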
The paper draws on prior work in several areas, including semantic frames, deep metric learning, and prompt-based learning, which gives the method design a solid foundation. Particularly relevant are the MLM-based frame induction work of Yamada et al. (2021, 2023) and the PromptEOL method proposed by Jiang et al. (2024).
Overall Assessment: This is a high-quality research paper that successfully introduces causal language models to semantic frame induction, with significant contributions in method design, experimental validation, and practical value. The strong performance in the low-resource Japanese setting is particularly noteworthy and offers a useful reference for related work.