2025-11-24T16:10:25.080119

Using Information Geometry to Characterize Higher-Order Interactions in EEG

Albers, Marriott, Tatsuno

In neuroscience, methods from information geometry (IG) have been successfully applied in the modelling of binary vectors from spike train data, using the orthogonal decomposition of the Kullback-Leibler divergence and mutual information to isolate different orders of interaction between neurons. While spike train data is well-approximated with a binary model, here we apply these IG methods to data from electroencephalography (EEG), a continuous signal requiring appropriate discretization strategies. We developed and compared three different binarization methods and used them to identify third-order interactions in an experiment involving imagined motor movements. The statistical significance of these interactions was assessed using phase-randomized surrogate data that eliminated higher-order dependencies while preserving the spectral characteristics of the original signals. We validated our approach by implementing known second- and third-order dependencies in a forward model and quantified information attenuation at different steps of the analysis. This revealed that the greatest loss in information occurred when going from the idealized binary case to enforcing these dependencies using oscillatory signals. When applied to the real EEG dataset, our analysis detected statistically significant third-order interactions during the task condition despite the relatively sparse data (45 trials per condition). This work demonstrates that IG methods can successfully extract genuine higher-order dependencies from continuous neural recordings when paired with appropriate binarization schemes.

academic

Using Information Geometry to Characterize Higher-Order Interactions in EEG

基本信息

论文ID: 2510.14188
标题: Using Information Geometry to Characterize Higher-Order Interactions in EEG
作者: Eric Albers, Paul Marriott, Masami Tatsuno
分类: q-bio.NC (Neurons and Cognition), q-bio.QM (Quantitative Methods)
发表时间: 2025年10月16日 (arXiv预印本)
论文链接: https://arxiv.org/abs/2510.14188

摘要

本研究将信息几何(Information Geometry, IG)方法从传统的二进制脊波序列数据扩展到连续的脑电图(EEG)信号分析。通过Kullback-Leibler散度和互信息的正交分解来识别神经元间不同阶次的相互作用。研究开发了三种二值化方法用于识别运动想象实验中的三阶相互作用，并使用相位随机化替代数据评估统计显著性。通过前向模型验证方法的有效性，量化了分析各步骤的信息衰减。结果表明，尽管数据相对稀疏(每条件45个试验)，该方法仍能在任务条件下检测到统计显著的三阶相互作用。

研究背景与动机

问题定义

传统神经科学研究主要关注脑区间的成对关系(二阶相互作用)，但大脑作为复杂系统可能存在超越成对关系的高阶相互作用。现有的功能连接网络基于成对相关性构建，可能无法完全捕捉大脑信息处理的复杂性。

重要性

理论意义: 理解大脑是否需要三阶或更高阶的相互作用来完成认知功能
方法学意义: 扩展信息几何方法从离散的脊波数据到连续的EEG信号
应用价值: 为脑机接口和神经疾病诊断提供新的分析工具

现有方法局限性

信息几何方法: 主要应用于二进制脊波数据，对连续信号缺乏有效的离散化策略
传统EEG分析: 主要基于成对相关性，忽略了高阶依赖关系
统计推断: 在稀疏数据条件下，标准渐近工具(如χ²分布)可能不适用

研究动机

将成功应用于脊波分析的信息几何方法扩展到EEG数据，开发适当的二值化策略来捕捉连续神经记录中的真实高阶依赖关系。

核心贡献

方法学创新: 开发了三种二值化方法(Sign、Diff、Power)将连续EEG信号转换为适合信息几何分析的二进制表示
验证框架: 建立了基于相位随机化替代数据的统计显著性检验方法
前向建模: 实现了已知二阶和三阶依赖关系的前向模型，量化了分析过程中的信息衰减
实证发现: 在运动想象EEG数据中检测到统计显著的三阶相互作用
理论洞察: 揭示了从理想化二进制情况到振荡信号实施依赖关系时发生最大信息损失

方法详解

任务定义

输入: 多通道EEG连续信号输出: 通道三元组间的一阶、二阶、三阶互信息分量约束: 处理稀疏数据(45个试验/条件)和连续信号的离散化挑战

信息几何理论基础

对于三个二进制变量X₁, X₂, X₃，联合概率分布可表示为8个概率的向量：

p = (p₀₀₀, p₀₀₁, p₀₁₀, p₀₁₁, p₁₀₀, p₁₀₁, p₁₁₀, p₁₁₁)

期望参数η坐标系统：

η₁, η₂, η₃: 边际激活率
η₁₂, η₁₃, η₂₃: 成对激活率
η₁₂₃: 三元激活率

自然参数θ坐标系统通过对数比值定义，如：

θ₁₂₃ = log(p₀₀₁p₀₁₀p₁₀₀p₁₁₁)/(p₁₁₀p₁₀₁p₀₁₁p₀₀₀)

KL散度的正交分解

使用混合坐标系统，KL散度可正交分解为：

D[p : q] = D[p : p̄] + D[p̄ : p̃] + D[p̃ : q]

其中：

Dp : p̄: 三元相互作用信息
Dp̄ : p̃: 成对相互作用信息
Dp̃ : q: 激活率调制信息

二值化方法

1. Sign方法

binary_signal = 1 if EEG_signal > 0 else 0

捕捉粗糙的相位信息，忽略幅度。

2. Diff方法

diff_signal = diff(EEG_signal)
binary_signal = 1 if diff_signal > 0 else 0

捕捉相位转换模式。

3. Power方法

power = EEG_signal²
envelope = moving_average(power, 30_samples)
z_scores = (envelope - mean) / std
binary_signal = 1 if z_scores > 1 else 0

捕捉高幅度时期，与相位无关。

统计显著性检验

使用测试统计量：

λ = 2N·D[p : p̄] ~ χ²(1)

由于数据稀疏，χ²近似不佳，采用基于IAAFT(迭代幅度调整傅里叶变换)替代数据的非参数检验。

实验设置

数据集

OpenNeuro运动想象数据集 (Triana-Guzman et al., 2022):

参与者: 32名健康受试者(16名女性)
电极: 17个电极，按国际10-20系统放置
采样率: 250 Hz
试验设计:
- 6个区块(3个坐姿，3个站姿)
- 每区块30个试验(15个运动想象，15个空闲状态)
- 总计每条件45个试验

试验结构:

注视(4秒): 注视屏幕十字
观察(3秒): 显示即将执行的任务
想象(4秒): 执行心理任务(运动想象或空闲状态)
休息(4秒): 自由活动

数据预处理

滤波: 0.5 Hz高通滤波，58-62 Hz陷波滤波
伪影去除: 使用ASR(伪影子空间重构)方法
频段滤波: 分为Delta(0.5-4Hz)、Theta(4-8Hz)、Alpha(8-12Hz)、Beta(12-30Hz)、Gamma(30-60Hz)
时期提取: 从想象任务开始前7秒到开始后4秒的11秒时期