Recent progress in large language models (LLMs) has enabled them to express their confidence in natural language, enhancing transparency and reliability. However, their confidence often exhibits overconfidence, the cause of which remains poorly understood. In this work, we conduct a detailed analysis of the dynamics underlying verbalized confidence and identify answer-independence as a key factor, defined as the model's failure to condition confidence on its own answer. To address this, we propose ADVICE (Answer-Dependent Verbalized Confidence Estimation), a fine-tuning framework that facilitates answer-grounded confidence estimation. Extensive experiments show that ADVICE substantially improves confidence calibration while preserving task performance. Further analyses confirm that ADVICE strengthens answer-groundedness, leading to more balanced and well-calibrated confidence distributions. Our findings shed light on the origin of overconfidence and establish a framework for more trustworthy confidence verbalization.
Paper ID: 2510.10913
Title: ADVICE: Answer-Dependent Verbalized Confidence Estimation
Authors: Ki Jung Seo, Sehun Lim, Taeuk Kim (Hanyang University)
Category: cs.CL (Computation and Language)
Published: October 13, 2025 (arXiv preprint)
Paper link: https://arxiv.org/abs/2510.10913
Core problem: Large language models are severely overconfident when they verbalize confidence: they tend to report high confidence whether the answer is correct or wrong.
Importance: When LLMs are deployed in law, medicine, and other high-stakes domains, reliable confidence estimation is essential for managing the model's inherent imperfections.
Limitations of existing methods:
- Existing research mainly asks "how" to reduce overconfidence, not "why" it arises.
- There is little in-depth understanding of the internal process behind verbalized confidence.
- Prompting, sampling, and fine-tuning methods have brought improvements, but the root cause remains unclear.
The authors draw on theories of confidence estimation from neuroscience, which frame confidence estimation as a post-decisional evidence-accumulation process. Under that framing they find that LLMs ignore their own generated answer when estimating confidence, which contradicts the definition of confidence.
Theoretical finding: The first systematic identification and analysis of "answer-independence" as the root cause of LLM overconfidence.
Analysis method: A dual verification approach based on probability-distribution comparison and attribution analysis.
Solution: The ADVICE fine-tuning framework, which explicitly encourages the model to attend to its own generated answer.
Empirical validation: The method's effectiveness is verified across multiple datasets and models, demonstrating the importance of answer information.
Generalization: The method generalizes strongly to out-of-distribution tasks and yields a balanced confidence distribution.

Given a question q and a corresponding answer a, the verbalized confidence should be close to the probability that the answer is correct, P(correct | q, a). An ideal confidence estimate should:
- express high confidence when the answer is correct
- express low confidence when the answer is wrong
- adjust the confidence level based on the content of the answer

Answer-independence is tested by comparing the following two distributions:
$$P_M(C \mid q, a) \approx P_M(C \mid q) \quad \forall a \in A_q$$
where the right-hand side expands by the law of total probability:
$$P_M(C \mid q) = \sum_{a' \in A_q} P_M(C \mid q, a') \, P_M(a' \mid q)$$
The Jensen-Shannon divergence (JSD) measures the gap between the two distributions; a JSD value near 0 indicates that the model is insensitive to answer information.
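The comparison above can be illustrated with a minimal Python sketch. This is not the authors' code: the confidence distributions and answer probabilities are hypothetical toy numbers, the helper names are invented for illustration, and base-2 logarithms are an assumption (chosen so that JSD lies in [0, 1]).

```python
import math

def kl_divergence(p, q):
    """KL(p || q) in bits; terms with p_i == 0 contribute nothing."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    """Jensen-Shannon divergence with base-2 logs, bounded in [0, 1]."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def marginal_confidence(cond_dists, answer_probs):
    """P_M(C | q) = sum over a' of P_M(C | q, a') * P_M(a' | q)."""
    n_levels = len(cond_dists[0])
    return [
        sum(w * dist[c] for dist, w in zip(cond_dists, answer_probs))
        for c in range(n_levels)
    ]

# Hypothetical toy data: three candidate answers, three confidence levels
# (low, medium, high). Each row is P_M(C | q, a_i).
cond_dists = [
    [0.05, 0.15, 0.80],
    [0.06, 0.14, 0.80],
    [0.04, 0.16, 0.80],
]
answer_probs = [0.6, 0.3, 0.1]  # hypothetical P_M(a_i | q)

marginal = marginal_confidence(cond_dists, answer_probs)
scores = [jsd(dist, marginal) for dist in cond_dists]
print(scores)  # values near 0 suggest confidence barely depends on the answer
```

In this toy case every conditional distribution is nearly identical to the marginal, so all JSD scores sit close to 0, which is the answer-independence signature described above.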
Attention Rollout: analyzes the attention weight placed on the answer tokens while the confidence is generated.
Integrated Gradients: computes the contribution of the answer tokens to the confidence prediction.

- Sample 2,000 examples from TriviaQA.
- For each question q, construct a triplet (q, a_correct, a_wrong).
- Build three linguistic format variants to improve generalization.
Three loss functions are defined:
Language modeling loss:
$$L_{LM} = \frac{1}{|a_{correct}|} \sum_{x_t \in a_{correct}} -\log P(x_t \mid x_{<t})$$
Preserves the model's original QA ability.
Contrastive distribution loss:
$$L_{JSD} = \max\left(0,\ \delta_{JSD} - D_{JSD}(P_{correct} \,\|\, P_{wrong})\right)$$
Drives the model to learn different confidence distributions for correct and wrong answers.
Margin loss:
$$L_{Margin} = \max\left(0,\ \delta_{Margin} - (\mu_{correct} - \mu_{wrong})\right)$$
Ensures that the correct answer receives the higher expected confidence.
Total loss:
$$L = \lambda_{LM} L_{LM} + \lambda_{JSD} L_{JSD} + \lambda_{Margin} L_{Margin}$$
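To make the three terms concrete, here is a small Python sketch that evaluates them on toy inputs. It is an illustration under stated assumptions, not the paper's implementation: the thresholds, weights, and numeric confidence levels are hypothetical placeholders, the JSD uses base-2 logs, and the expected confidence is taken to be the mean of the confidence distribution over numeric levels.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) in bits; zero-probability terms are skipped."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    """Jensen-Shannon divergence with base-2 logs (assumed convention)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def expected_confidence(dist, levels):
    """mu = sum over c of P(c) * value(c) (assumed definition of mu)."""
    return sum(p * v for p, v in zip(dist, levels))

def advice_style_loss(answer_logprobs, p_correct, p_wrong, levels,
                      delta_jsd=0.5, delta_margin=0.5,      # hypothetical thresholds
                      lam_lm=1.0, lam_jsd=1.0, lam_margin=1.0):  # hypothetical weights
    # L_LM: mean negative log-likelihood over the correct answer's tokens
    l_lm = -sum(answer_logprobs) / len(answer_logprobs)
    # L_JSD: hinge that is positive while the two distributions are too similar
    l_jsd = max(0.0, delta_jsd - jsd(p_correct, p_wrong))
    # L_Margin: hinge on the gap in expected confidence
    gap = expected_confidence(p_correct, levels) - expected_confidence(p_wrong, levels)
    l_margin = max(0.0, delta_margin - gap)
    total = lam_lm * l_lm + lam_jsd * l_jsd + lam_margin * l_margin
    return {"lm": l_lm, "jsd": l_jsd, "margin": l_margin, "total": total}

levels = [0.0, 0.5, 1.0]                        # low, medium, high mapped to numbers
logprobs = [math.log(0.5), math.log(0.5)]       # toy token log-probabilities

# Answer-independent model: same confidence for correct and wrong answers
independent = advice_style_loss(logprobs, [0.1, 0.1, 0.8], [0.1, 0.1, 0.8], levels)
# Answer-dependent model: confident when right, unconfident when wrong
dependent = advice_style_loss(logprobs, [0.0, 0.0, 1.0], [1.0, 0.0, 0.0], levels)
print(independent, dependent)
```

With identical distributions both hinge terms are fully active, while a model that separates correct from wrong answers drives them to zero and is left with only the language modeling term.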
Root-cause analysis: The first analysis of the overconfidence problem from the perspective of answer dependence.
Dual verification: Combines probabilistic analysis with neural-network attribution methods.
Contrastive learning: Applies contrastive training with correct/wrong answer pairs.
Multi-objective optimization: Balances preserving task performance against improving confidence calibration.

Training: TriviaQA (2,000 examples)
Evaluation: TriviaQA, MMLU, SciQ, LogiQA (cross-domain generalization tests)
Models: LLAMA-3.1-8B-INSTRUCT, MISTRAL-7B-INSTRUCT-V0.3, GEMMA-2-9B-IT
Confidence formats:
ScoreText: {low, medium, high}
ScoreLetter: {E, D, C, B, A}
ScoreNumber: {0, 1, ..., 9}
ScoreFloat: [0.0, 1.0]
ScorePercent: {0%, 1%, ..., 100%}
Metrics:
ECE (Expected Calibration Error): the mean absolute difference between predicted confidence and actual accuracy
NCE (Net Calibration Error): a signed calibration error that reflects the direction of bias
BS (Brier Score): the mean squared error of the probability forecasts
AUROC: how well confidence ranks answers
Baselines:
Default: basic prompting
Self-Consistency: a sampling-based method
ConfTuner: the current best fine-tuning method
Performance comparison on TriviaQA (GEMMA-2-9B-IT):
ECE: Default (21.9%) → ADVICE (6.5%)
NCE: Default (-21.8%) → ADVICE (1.6%)
AUROC: Default (52.7%) → ADVICE (78.5%)
The cross-domain generalization results show that ADVICE achieves notable improvements on MMLU, SciQ, and LogiQA, demonstrating the robustness of the method.
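For readers unfamiliar with the calibration metrics reported above, the following Python sketch shows one common way to compute them. It is a simplified illustration, not the paper's evaluation code: the equal-width binning with 10 bins is an assumption, and the NCE sign convention (accuracy minus confidence, so overconfidence is negative) is assumed because it matches the negative Default value reported above.

```python
def calibration_metrics(confidences, correct, n_bins=10):
    """Return (ECE, NCE, Brier score) for confidences in [0, 1] and 0/1 labels."""
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # conf == 1.0 goes in the last bin
        bins[idx].append((conf, hit))
    ece = nce = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(h for _, h in b) / len(b)
        gap = accuracy - avg_conf          # negative when overconfident (assumed sign)
        ece += len(b) / n * abs(gap)       # unsigned gap, weighted by bin size
        nce += len(b) / n * gap            # signed gap, weighted by bin size
    brier = sum((c - h) ** 2 for c, h in zip(confidences, correct)) / n
    return ece, nce, brier

def auroc(confidences, correct):
    """Probability that a correct answer outranks a wrong one (ties count 0.5)."""
    pos = [c for c, h in zip(confidences, correct) if h == 1]
    neg = [c for c, h in zip(confidences, correct) if h == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical overconfident model: always says 0.9, but is right 70% of the time.
confs = [0.9] * 10
hits = [1] * 7 + [0] * 3
ece, nce, brier = calibration_metrics(confs, hits)
print(ece, nce, brier, auroc(confs, hits))
```

A model that always reports the same confidence gets an AUROC of 0.5 no matter how accurate it is, which is why an answer-independent confidence signal ranks poorly even before calibration error is considered.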
Contribution of each loss function:
- L_JSD alone: ECE drops from 19.7% to 4.9%
- L_Margin alone: ECE drops from 19.7% to 3.9%
- Full ADVICE: the best cross-dataset generalization
Answer-independence verification: The JSD distribution follows a power-law pattern with most values near 0, confirming the answer-independence hypothesis.
Attention pattern: The attention weight from the confidence to the answer is markedly lower than in other directions.
Calibration improvement: Reliability diagrams indicate that ADVICE produces finer-grained, more accurate confidence distributions.
Increased answer awareness: Masking experiments show that ADVICE appropriately expresses uncertainty when the answer is absent.
Increasing δ_JSD steadily reduces ECE, which validates the effectiveness of the contrastive learning objective.
Related work:
- Lin et al. (2022) first introduced verbalized confidence estimation.
- Subsequent research falls into three groups: prompting methods, sampling methods, and fine-tuning methods.
- This work fills the gap in mechanism analysis.
- Attention mechanism analysis: Attention Rollout, Attention Flow, and others.
- Gradient attribution methods: Integrated Gradients and others.
- This work applies these methods to confidence analysis in a novel way.

Conclusions:
- LLM overconfidence stems mainly from the answer-independence problem.
- ADVICE effectively improves confidence calibration by strengthening answer dependence.
- The method generalizes well and has practical value.

Limitations:
- The work focuses mainly on short-text QA tasks; its applicability to long-text understanding remains to be verified.
- Building contrastive answer pairs incurs additional data-construction cost.
- The effect on complex reasoning tasks needs further exploration.

Future directions:
- Extend to tasks that require long-context understanding and complex reasoning.
- Explore more efficient ways to construct training data.
- Study applications to other modalities (for example, vision-language models).

Strengths:
- Significant theoretical contribution: the first systematic analysis of the root cause of overconfidence, providing important theoretical insight.
- Methodological rigor: verification from multiple angles (probabilistic analysis plus attribution analysis), which makes the conclusions highly credible.
- Complete experimental design: broad cross-model and cross-dataset evaluation, with thorough ablation experiments.
- Notable practical value: confidence calibration improves substantially while task performance is preserved.
- Strong generalization: good performance on out-of-distribution data, demonstrating the robustness of the method.

Weaknesses:
- Limited task scope: validation is mainly on QA tasks; applicability to other NLP tasks is not adequately explored.
- Computational overhead: an additional fine-tuning step and contrastive data construction are required.
- Depth of theoretical analysis: although the answer-independence problem is identified, the deeper reasons for its origin are not sufficiently analyzed.
- Long-term effects: the stability of the fine-tuned model under long-term use is not evaluated.

Impact:
- Academic value: offers a new research perspective and analysis framework for confidence estimation.
- Practical significance: has important value for improving LLM reliability in high-stakes applications.
- Reproducibility: detailed implementation descriptions and open-source code are provided, which eases reproduction and extension.

Applicable scenarios:
- Question-answering systems that need reliable confidence estimation
- High-stakes decision-support systems
- Uncertainty expression in human-machine collaboration
- Model calibration and trustworthy AI applications

The paper cites 68 related references covering important work on verbalized confidence, LLM probing methods, calibration theory, and other areas, which gives the research a solid theoretical foundation.
Overall assessment: This is a high-quality research paper that makes important contributions in both theoretical analysis and practical methodology. The authors not only identify the root cause of LLM overconfidence but also propose an effective solution. The method is simple and effective, the experimental design is rigorous, and the results are compelling. The work has important implications for advancing trustworthy AI and for improving the reliability of LLMs in real-world applications.