2025-11-16T11:28:23.447951

Patentformer: A demonstration of AI-assisted automated patent drafting

Mudhiganti, Wang, Yang et al.

Patent drafting presents significant challenges due to its reliance on the extensive experience and specialized expertise of patent attorneys, who must possess both legal acumen and technical understanding of an invention to craft patent applications in a formal legal writing style. This paper presents a demonstration of Patentformer, an AI-powered automated patent drafting platform designed to support patent attorneys by rapidly producing high-quality patent applications adhering to legal writing standards.

academic

Patentformer: A demonstration of AI-assisted automated patent drafting

Basic Information

Paper ID: 2510.09752
Title: Patentformer: A demonstration of AI-assisted automated patent drafting
Authors: Sai Krishna Reddy Mudhiganti, Juanyan Wang, Ruo Yang, Manali Sharma (Samsung Semiconductor, Inc.)
Classification: cs.LG cs.AI cs.CY
Publication Time/Venue: EMNLP 2024 Industry Track
Paper Link: https://arxiv.org/abs/2510.09752

Abstract

Patent drafting faces significant challenges as it relies on the extensive experience and expertise of patent attorneys who must possess legal acumen and technical understanding of inventions to draft patent applications in formal legal writing style. This paper presents Patentformer, an AI-driven automated patent drafting platform designed to support patent attorneys by rapidly generating high-quality patent applications that comply with legal writing standards.

Research Background and Motivation

Problems to Address

High cost of patent drafting: Traditional patent drafting averages over $10,000 in cost and requires patent attorneys to possess dual expertise in law and technology
Patent document complexity: Patent documents contain multiple well-organized sections (title, abstract, claims, specification, etc.) requiring precise legal writing style
Limitations of existing LLMs: General-purpose large language models face significant challenges in generating patent specifications

Problem Significance

Patent specifications are the most important part of patent documents, requiring substantial drafting effort
Each claim must be explicitly supported by the specification
Patent documents differ fundamentally from general text in legal and technical nature

Limitations of Existing Methods

Poor domain adaptation: Most pre-trained LLMs are not trained on patent data, making it difficult to adapt to precise writing styles and legal requirements
Length limitations: Patent specifications typically span multiple pages, while LLMs are constrained by fixed token limits (512, 1024, 2048, or 4096 tokens)
Complex relationship handling: Patents contain complex relationships between claims and drawing descriptions that general LLMs struggle to process

Core Contributions

Developed and deployed the Patentformer platform: Accessible at https://patentformer.com, accepting patent claims and corresponding drawing text as input to generate high-quality patent specifications
Constructed specialized training data: Patent-2015-2024-G06F dataset containing 1,006,494 samples, publicly released on HuggingFace
Developed data augmentation methods: Specialized training data construction methods that convert plain text to rich representations, significantly improving output quality
Conducted user research: Quantitatively evaluated the effectiveness of Patentformer in generating patent specifications

Methodology Details

Task Definition

Given a patent document P containing:

l claim sequences: C = {c₁, c₂, ..., cₗ}
m specification paragraphs: S = {s₁, s₂, ..., sₘ}
t drawing images: I = {i₁, i₂, ..., iₜ}
t drawing brief descriptions: B = {b₁, b₂, ..., bₜ}
Component name-number pairs: N = {n₁, n₂, ..., nₜ}

Task objective: Using claims C, drawing descriptions B, and component name-number pairs N as input, generate output specification S.

Model Architecture

1. Input Processing

Claims processing: Users upload claim text C and corresponding drawings I
Drawing processing: Automatically identify key components and their numbers N'
Text enhancement: Convert input to enhanced representation T' = (C', B', N')

2. Mapping Interface

Provides user interface to define relationships between claims and drawing features
Automatic matching strategy based on cosine similarity, BLEU-1, and BLEU-2 scores
Uses 0.1 threshold to select top 5 matching components for each claim feature

3. Generation Module

Uses T5-11B model fine-tuned on patent data
Processes structured text input tuple T' = (C', N', B')
Generates enhanced patent specification S'
Post-processing steps ensure output conforms to standard patent writing conventions

Technical Innovations

Data augmentation method: Rather than using raw text T = (C, B, N), designs enriched version T' = (C', B', N') for training
Specialized training: Fine-tunes T5-11B model on USPTO patent data to learn patent writing style and structural conventions
Interactive mapping: Provides user-friendly interface to establish correspondence between claims and drawing components
Automated workflow: End-to-end automated patent drafting process from input processing to final output

Experimental Setup

Dataset

Patent-2015-2024-G06F: First dataset containing patent specifications, claims, and drawings
Scale: 1,006,494 training samples
Source: USPTO 2015-2024 G06F CPC category patents
Processing: Text truncated to 512 tokens

Evaluation Metrics

User study employs four orthogonal scoring dimensions (0-100 scale):

Language quality: Document language expression and writing quality
Legal quality: Compliance with legal requirements and patent writing standards
Drawing description quality: Accuracy of drawing descriptions
Technical quality: Accuracy and completeness of technical content

Comparison Methods

Comparison with state-of-the-art general LLM baselines (detailed results in Wang et al., 2024)

Implementation Details

Model: T5-11B
Training strategy: Fine-tuning on pre-trained version
Hardware: GPU acceleration supported, automatic CPU fallback without GPU
Post-processing: Uses strategies defined in Wang et al. (2024)

Experimental Results

Main Results

User study scoring based on 30 generations:

Quality Dimension	Average Score	Performance Evaluation
Legal quality	~95 points	Near perfect, most critical aspect
Language quality	~85 points	Good document generation capability
Drawing description quality	~60 points	Room for improvement
Technical quality	~65 points	Moderate level

Performance Analysis

Runtime

Setting	CPU	GPU (A100)	Real-world Case (GPU)
Time (seconds)	3152.4±160.00	92±5.79	807±449.25

Automatic Mapping Accuracy

Precision@5: 0.565
Precision@3: 0.6
Evaluated on approximately 6,000 samples

Experimental Findings

Excellent legal quality: As the most critical aspect of patent drafting, legal quality approaches perfect scores, demonstrating the model's strong learning of legal document characteristics
Good language expression: Despite using special tokens to encrypt input during fine-tuning, users are satisfied with language quality
Multimodal limitations: Relatively lower drawing description and technical quality, primarily due to:
- Current model uses drawing description text rather than direct image processing
- Model is text-to-text, lacking multimodal understanding capability

Main Research Directions

Patent text generation:
- Lee & Hsiang (2020): GPT-2 fine-tuning for generating patent claims
- Jiang et al. (2024): Generating patent claims from detailed descriptions
- Christofidellis et al. (2022): Patent Generative Transformer (PGT)
Patent text summarization:
- Title generation (Souza et al., 2021)
- Abstract generation (Guoliang et al., 2023; Zhu et al., 2023)
- Prior art summarization (Lee & Hsiang, 2020c)
Structured text transformation:
- Leveraging structural metadata within patent documents to guide text generation
- Semantic search-based control methods for patent text generation

Uniqueness of This Work

To the authors' knowledge, Patentformer is the first platform to generate complete high-quality patent specifications from claims and drawing text.

Conclusions and Discussion

Main Conclusions

Patentformer successfully demonstrates the feasibility of AI-assisted automated patent drafting
Through specialized data construction methods and fine-tuning, high-quality patent specifications meeting legal and technical requirements can be generated
User research confirms the system's effectiveness in language and legal quality

Limitations

Missing multimodal capability: Current system relies on user-provided drawing descriptions, unable to directly understand images
Limited technical quality: Room for improvement in accuracy and completeness of technical content
Drawing processing capability: Lacks direct visual understanding of patent drawings

Future Directions

Multimodal model development: Integrate Large Vision-Language Models (LVLMs) for automatic interpretation of text and visual information
Automated drawing generation: Utilize image generation models like Stable Diffusion to automatically generate patent drawings based on user input
End-to-end automation: Reduce manual input requirements and further improve patent drafting efficiency

In-Depth Evaluation

Strengths

Strong practicality: Addresses real pain points in the patent industry with clear commercial value
Methodological innovation: Data augmentation strategies and specialized training methods demonstrate technical innovation
Complete system: Provides comprehensive solution from input to output, including user-friendly interactive interface
Sufficient evaluation: Validated through expert user research in real application scenarios
Open-source contribution: Public release of large-scale patent dataset promotes field research development

Weaknesses

Limited evaluation scale: User study based on only 30 generations from one expert, relatively small sample size
Multimodal deficiency: Inability to process image information is a significant limitation affecting drawing description and technical quality
Domain limitations: Trained only on G06F category patents, generalization capability remains to be verified
Cost considerations: GPU running costs and time consumption may become limiting factors in practical deployment

Impact

Field contribution: First implementation of complete patent specification automatic generation, opening new directions for patent AI
Practical value: Can significantly reduce patent drafting costs and improve patent attorney work efficiency
Reproducibility: Provides online demonstration system and public dataset with good reproducibility
Industry impact: As industrial research from Samsung, demonstrates strong potential for industry transformation

Applicable Scenarios

Patent attorney assistance tool: Helps patent attorneys rapidly generate drafts, improving work efficiency
Enterprise patent departments: Automation of patent application processes in large technology companies
Patent agencies: Enhances efficiency and quality of patent agency services
Research institutions: Assists researchers in understanding patent writing norms and requirements

References

This paper primarily references important works in patent text generation, large language models, and patent legal requirements, including:

Wang et al. (2024): Patentformer core algorithms
Raffel et al. (2020): T5 model foundation
Multiple patent text generation related research (Lee & Hsiang series, Jiang et al., Christofidellis et al.)

Overall Evaluation: This is a high-quality industry paper with strong practical orientation, demonstrating pioneering significance in the patent AI field. Despite limitations in multimodal processing and evaluation scale, its technical innovation and practical value make it an important contribution to the field.