A Deep State-Space Model Compression Method using Upper Bound on Output Error
Sakamoto, Sato
We study deep state-space models (Deep SSMs) that contain linear-quadratic-output (LQO) systems as internal blocks and present a compression method with a provable output error guarantee. We first derive an upper bound on the output error between two Deep SSMs and show that the bound can be expressed via the $h^2$-error norms between the layerwise LQO systems, thereby providing a theoretical justification for existing model order reduction (MOR)-based compression. Building on this bound, we formulate an optimization problem in terms of the $h^2$-error norm and develop a gradient-based MOR method. On the IMDb task from the Long Range Arena benchmark, we demonstrate that our compression method achieves strong performance. Moreover, unlike prior approaches, we reduce roughly 80% of trainable parameters without retraining, with only a 4-5% performance drop.
academic
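This paper studies deep state-space models (Deep SSMs) that contain linear-quadratic-output (LQO) systems as internal blocks and proposes a compression method with a provable output error guarantee. The authors first derive an upper bound on the output error between two Deep SSMs and show that it can be expressed via the $h^2$-error norms between the layerwise LQO systems, which gives a theoretical justification for existing model order reduction (MOR)-based compression methods. Building on this bound, they formulate an optimization problem with the $h^2$-error norm as the objective and develop a gradient-based MOR method. On the IMDb task of the Long Range Arena benchmark, the compression method performs well; unlike prior approaches, it removes roughly 80% of trainable parameters without retraining, with only a 4-5% drop in performance.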
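To make the $h^2$-error norm in the abstract concrete, the sketch below computes it for the simpler case of two purely linear, stable discrete-time systems. This is an assumption made for illustration: the paper works with LQO systems, whose quadratic output term is not modeled here, and this is not the authors' method. The function names `h2_norm` and `h2_error` and the toy matrices are hypothetical.

```python
import numpy as np
from scipy.linalg import block_diag, solve_discrete_lyapunov


def h2_norm(A, B, C):
    """h2 norm of the stable discrete-time linear system x+ = A x + B u, y = C x."""
    # Controllability Gramian P solves P = A P A^T + B B^T
    # (requires the spectral radius of A to be below 1).
    P = solve_discrete_lyapunov(A, B @ B.T)
    # Clip at zero so rounding error cannot produce a negative value under the root.
    return float(np.sqrt(max(np.trace(C @ P @ C.T), 0.0)))


def h2_error(full, reduced):
    """h2 norm of the error system between a full and a reduced linear model."""
    A, B, C = full
    Ar, Br, Cr = reduced
    # The error system stacks both state spaces and subtracts the two outputs.
    Ae = block_diag(A, Ar)
    Be = np.vstack([B, Br])
    Ce = np.hstack([C, -Cr])
    return h2_norm(Ae, Be, Ce)


# Toy example (hypothetical values): a 2-state model whose second state
# contributes very little, and a 1-state model obtained by dropping it.
full = (np.diag([0.9, 0.1]), np.array([[1.0], [0.01]]), np.array([[1.0, 0.01]]))
reduced = (np.array([[0.9]]), np.array([[1.0]]), np.array([[1.0]]))

print("h2 norm of full model:", h2_norm(*full))
print("h2 error after truncation:", h2_error(full, reduced))
```

In this toy case the error is small compared with the norm of the full model, which is the kind of quantity a MOR-based compression would try to minimize layer by layer.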