2025-11-12T17:13:10.726463

Faver: Boosting LLM-based RTL Generation with Function Abstracted Verifiable Middleware

Mu, Shi, Wang et al.

LLM-based RTL generation is an interesting research direction, as it holds the potential to liberate the least automated stage in the current chip design. However, due to the substantial semantic gap between high-level specifications and RTL, coupled with limited training data, existing models struggle with generation accuracy. Drawing on human experience, design with verification helps improving accuracy. However, as the RTL testbench data are even more scarce, it is not friendly for LLMs. Although LLMs excel at higher-level languages like Python/C, they have a huge semantic gap from RTL. When implementing the same functionality, Python/C code and hardware code differ significantly in the spatiotemporal granularity, requiring the LLM not only to consider high-level functional semantics but also to ensure the low-level details align with the circuit code. It is not an easy task. In this paper, we propose a function abstracted verifiable middleware (Faver) that streamlines RTL verification in LLM-based workflows. By mixing LLM-friendly code structures with a rule-based template, Faver decouples the details of circuit verification, allowing the LLM to focus on the functionality itself. In our experiments on the SFT model and open-source models, Faver improved the model's generation accuracy by up to 14%.

academic

Faver: Boosting LLM-based RTL Generation with Function Abstracted Verifiable Middleware

基本信息

论文ID: 2510.08664
标题: Faver: Boosting LLM-based RTL Generation with Function Abstracted Verifiable Middleware
作者: Jianan Mu, Mingyu Shi, Yining Wang, Tianmeng Yang, Bin Sun, Xing Hu, Jing Ye, Huawei Li
分类: cs.SE cs.AI
发表时间: 2025年10月9日 (arXiv预印本)
论文链接: https://arxiv.org/abs/2510.08664

摘要

本文针对基于大语言模型(LLM)的RTL代码生成准确性问题，提出了一个功能抽象可验证中间件(Faver)。该方法通过将LLM友好的代码结构与基于规则的模板相结合，解耦了电路验证的细节，使LLM能够专注于功能本身。在SFT模型和开源模型的实验中，Faver将模型的生成准确率提升了高达14%。

研究背景与动机

1. 核心问题

RTL设计是芯片设计中自动化程度最低、最耗费人力的阶段。虽然LLM在RTL生成方面展现出潜力，但由于高级规范与RTL之间存在巨大的语义鸿沟，加上训练数据有限，现有模型在生成准确性方面表现不佳。

2. 问题重要性

RTL设计是集成电路设计流程中的关键瓶颈
自动化RTL生成能够显著提升芯片设计效率
现有方法无法有效利用"设计与验证"的人类经验

3. 现有方法局限性

直接LLM判断: 缺乏robust的推理工具来基于规范验证功能
RTL testbench生成: testbench数据比设计数据更稀缺，且生成难度与RTL设计相当
简单Python验证: 硬件和软件在时空粒度上差异巨大，导致co-verification困难

4. 研究动机

借鉴人类设计经验中的"设计与验证"方法，但需要解决LLM在硬件验证方面的固有困难，特别是时序相关变量和测试激励生成的挑战。

核心贡献

提出Faver框架: 允许LLM编写高级语义代码来验证电路，并从设计与验证框架中受益
设计功能-类抽象模板: 将硬件设计中的时钟和寄存器语义映射到事件驱动的Python/C函数类中，减少硬件和软件验证之间的时空鸿沟
实验验证: 在多个测试集和LLM上证明Faver将基于LLM的RTL生成准确率提升高达14%
理论分析: 提供了系统成功率和反馈真实率的数学模型

方法详解

任务定义

输入：自然语言规范描述的硬件功能需求输出：功能正确且通过验证的RTL (Verilog)代码约束：生成的RTL必须在语法和功能上都正确

模型架构

Faver框架包含四个关键步骤：

1. 验证规范生成 (Verification Specification Generation)

保持I/O端口: 保留相同的输入输出端口定义
功能抽象: 将RTL的拓扑连接转换为软件的输入输出处理逻辑
边界分析: 分析RTL的边界条件并在验证规范中枚举

2. 基于类模板的参考模型生成

核心设计：

class ref_model(Model):
    def __init__(self):
        global state_flag0, state_flag1  # 寄存器映射为全局变量
    
    @driver_hook()
    def reset(self):  # 专用重置函数
        pass
    
    @driver_hook() 
    def step(self):   # 统一功能接口
        pass
    
    def func1(self):  # 其他功能函数
        pass