2025-11-24T00:55:25.034139

PolyVer: A Compositional Approach for Polyglot System Modeling and Verification

Chen, Lin, Godbole et al.

Several software systems are polyglot; that is, they comprise programs implemented in a combination of programming languages. Verifiers that directly run on mainstream programming languages are currently customized for single languages. Thus, to verify polyglot systems, one usually translates them into a common verification language or formalism on which the verifier runs. In this paper, we present an alternative approach, PolyVer, which employs abstraction, compositional reasoning, and synthesis to directly perform polyglot verification. PolyVer constructs a formal model of the original polyglot system as a transition system where the update functions associated with transitions are implemented in target languages such as C or Rust. To perform verification, PolyVer then connects a model checker for transition systems with language-specific verifiers (e.g., for C or Rust) using pre/post-condition contracts for the update functions. These contracts are automatically generated by synthesis oracles based on syntax-guided synthesis or large language models (LLMs), and checked by the language-specific verifiers. The contracts form abstractions of the update functions using which the model checker verifies the overall system-level property on the polyglot system model. PolyVer iterates between counterexample-guided abstraction-refinement (CEGAR) and counterexample-guided inductive synthesis (CEGIS) until the property is verified or a true system-level counterexample is found. We demonstrate the utility of PolyVer for verifying programs in the Lingua Franca polyglot language using the UCLID5 model checker connected with the CBMC and Kani verifiers for C and Rust respectively.

academic

PolyVer: 다중언어 시스템 모델링 및 검증을 위한 조합적 접근법

기본 정보

논문 ID: 2503.03207
제목: PolyVer: A Compositional Approach for Polyglot System Modeling and Verification
저자: Pei-Wei Chen, Shaokai Lin, Adwait Godbole, Ramneet Singh, Elizabeth Polgreen, Edward A. Lee, Sanjit A. Seshia
분류: cs.PL (프로그래밍 언어)
발표 시간/학회: Formal Methods in Computer-Aided Design 2025
논문 링크: https://arxiv.org/abs/2503.03207

초록

다중언어 소프트웨어 시스템(polyglot systems)은 여러 프로그래밍 언어로 구현된 프로그램의 조합으로 이루어져 있으나, 기존의 프로그램 검증기는 일반적으로 단일 언어에만 맞춤화되어 있습니다. 다중언어 시스템을 검증하기 위해서는 보통 이를 공통 검증 언어 또는 형식적 표현으로 변환해야 합니다. 본 논문은 추상화, 조합 추론 및 합성 기술을 활용하여 다중언어 검증을 직접 수행하는 대안적 방법인 PolyVer를 제시합니다. PolyVer는 다중언어 시스템을 전이 시스템의 형식적 모델로 구성하며, 여기서 전이와 관련된 업데이트 함수는 목표 언어(예: C 또는 Rust)로 구현됩니다. 검증을 수행하기 위해 PolyVer는 업데이트 함수의 전제조건/후제조건 계약을 통해 전이 시스템의 모델 검사기를 언어별 검증기와 연결합니다. 이러한 계약은 구문 유도 합성 또는 대규모 언어 모델 기반 합성 오라클을 통해 자동으로 생성되며, 언어별 검증기에 의해 검사됩니다.

연구 배경 및 동기

문제 정의

현대 소프트웨어 시스템은 ROS2, Lingua Franca 등의 프레임워크가 개발자가 각 구성 요소에 가장 적합한 프로그래밍 언어를 선택할 수 있도록 하는 다중언어 아키텍처를 점점 더 많이 채택하고 있습니다. 그러나 이러한 유연성은 검증 측면에서 다음과 같은 과제를 야기합니다:

언어 의미론의 차이: 서로 다른 프로그래밍 언어는 서로 다른 구문과 의미론을 가지고 있습니다. 예를 들어, Rust의 saturating_add 함수는 오버플로우 시 최댓값으로 포화되지만, C의 덧셈은 래핑될 수 있습니다.
기존 검증기의 한계: 대부분의 프로그램 검증기(예: C용 CBMC, Rust용 Kani)는 단일 언어에 특화되어 설계되었으며 다중언어 시스템을 직접 처리할 수 없습니다.
번역의 복잡성: 전체 다중언어 시스템을 단일 검증 언어로 번역하려면 모든 언어의 완전한 구문과 의미론을 지원해야 하며, 이는 현대 언어에서는 불가능합니다.

연구의 중요성

다중언어 시스템의 복잡성은 소프트웨어 결함의 위험을 증가시키며, 자율주행, 항공우주 등의 안전 관련 분야에서는 테스트 등의 불완전한 방법이 아닌 형식적 검증이 제공하는 강력한 보증이 필요합니다.

기존 방법의 한계

단일체 번역 방법: 각 언어에 대해 완전한 컴파일러 인프라를 개발해야 함
의미론 보존의 어려움: 목표 검증 언어에서 소스 언어의 모든 언어별 구성을 충실하게 포착하기 어려움
확장성 문제: 생성된 검증 문제가 과도하게 커질 수 있음

핵심 기여

다중언어 검증 문제의 형식화: 다중언어 검증 문제를 처음으로 체계적으로 형식화하고 여러 언어별 검증기를 통합하는 조합 솔루션을 제시합니다.
자동화된 계약 합성: 중간 언어와 CEGIS-CEGAR 루프를 사용하여 전제조건/후제조건 계약을 자동으로 합성하고 정제하는 방법을 제시하며, 구문 유도 합성과 대규모 언어 모델을 합성 오라클로 지원합니다.
도구 구현: UCLID5를 기반으로 PolyVer 도구를 구현하여 C와 Rust를 지원하며, CBMC 및 Kani 검증기를 통해 LLM 기반 합성 오라클이 순수 기호 합성 방법보다 우수함을 입증합니다.
사례 연구 및 평가: Lingua Franca 조정 언어의 검증기를 개발하여 C 및 Rust 프로세스를 포함하는 다중언어 시스템과 이전 작업에서 지원할 수 없었던 C 언어 조각을 검증합니다.