Lifting Manifolds to Mitigate Pseudo-Alignment in LLM4TS
Zheng, Liang, Zhang et al.
Pseudo-Alignment is a pervasive challenge in many large language models for time series (LLM4TS) models, often causing them to underperform compared to linear models or randomly initialised backbones. However, there is limited discussion in the community for the reasons that pseudo-alignment occurs. In this work, we conduct a thorough investigation into the root causes of pseudo-alignment in LLM4TS and build a connection of pseudo-alignment to the cone effect in LLM. We demonstrate that pseudo-alignment arises from the interplay of cone effect within pretrained LLM components and the intrinsically low-dimensional manifold of time-series data. In addition, we also introduce \textit{\textbf{TimeSUP}}, a novel technique designed to mitigate this issue and improve forecast performance in existing LLM4TS approaches. TimeSUP addresses this by increasing the time series manifold to more closely match the intrinsic dimension of language embeddings, allowing the model to distinguish temporal signals clearly while still capturing shared structures across modalities. As a result, representations for time and language tokens remain distinct yet exhibit high cosine similarity, signifying that the model preserves each modality unique features while learning their commonalities in a unified embedding space. Empirically, TimeSUP consistently outperforms state-of-the-art LLM4TS methods and other lightweight baselines on long-term forecasting performance. Furthermore, it can be seamlessly integrated into four existing LLM4TS pipelines and delivers significant improvements in forecasting performance.
의사정렬(Pseudo-Alignment)은 시계열을 위한 많은 대규모 언어모델(LLM4TS)에서 널리 존재하는 과제로, 이러한 모델들의 성능이 선형 모델이나 무작위로 초기화된 백본 네트워크보다 떨어지게 하는 경우가 많습니다. 그러나 커뮤니티에서 의사정렬이 발생하는 원인에 대한 논의는 제한적입니다. 본 논문은 LLM4TS의 의사정렬의 근본 원인을 심층 연구하고, 의사정렬과 LLM의 원뿔 효과(cone effect) 간의 연관성을 확립합니다. 연구 결과는 의사정렬이 사전학습된 LLM 구성 요소의 원뿔 효과와 시계열 데이터의 내재적 저차원 다양체의 상호작용에서 비롯됨을 보여줍니다. 더욱이, 본 논문은 이 문제를 완화하고 기존 LLM4TS 방법의 예측 성능을 향상시키기 위해 고안된 새로운 기법인 TimeSUP을 소개합니다.