Patent drafting presents significant challenges due to its reliance on the extensive experience and specialized expertise of patent attorneys, who must possess both legal acumen and technical understanding of an invention to craft patent applications in a formal legal writing style. This paper presents a demonstration of Patentformer, an AI-powered automated patent drafting platform designed to support patent attorneys by rapidly producing high-quality patent applications adhering to legal writing standards.
- Paper ID: 2510.09752
- Title: Patentformer: A demonstration of AI-assisted automated patent drafting
- Authors: Sai Krishna Reddy Mudhiganti, Juanyan Wang, Ruo Yang, Manali Sharma (Samsung Semiconductor, Inc.)
- Classification: cs.LG cs.AI cs.CY
- Publication Time/Venue: EMNLP 2024 Industry Track
- Paper Link: https://arxiv.org/abs/2510.09752
Patent drafting faces significant challenges as it relies on the extensive experience and expertise of patent attorneys who must possess legal acumen and technical understanding of inventions to draft patent applications in formal legal writing style. This paper presents Patentformer, an AI-driven automated patent drafting platform designed to support patent attorneys by rapidly generating high-quality patent applications that comply with legal writing standards.
- High cost of patent drafting: Traditional patent drafting averages over $10,000 in cost and requires patent attorneys to possess dual expertise in law and technology
- Patent document complexity: Patent documents contain multiple well-organized sections (title, abstract, claims, specification, etc.) requiring precise legal writing style
- Limitations of existing LLMs: General-purpose large language models face significant challenges in generating patent specifications
- Patent specifications are the most important part of patent documents, requiring substantial drafting effort
- Each claim must be explicitly supported by the specification
- Patent documents differ fundamentally from general text in legal and technical nature
- Poor domain adaptation: Most pre-trained LLMs are not trained on patent data, making it difficult to adapt to precise writing styles and legal requirements
- Length limitations: Patent specifications typically span multiple pages, while LLMs are constrained by fixed token limits (512, 1024, 2048, or 4096 tokens)
- Complex relationship handling: Patents contain complex relationships between claims and drawing descriptions that general LLMs struggle to process
- Developed and deployed the Patentformer platform: Accessible at https://patentformer.com, accepting patent claims and corresponding drawing text as input to generate high-quality patent specifications
- Constructed specialized training data: Patent-2015-2024-G06F dataset containing 1,006,494 samples, publicly released on HuggingFace
- Developed data augmentation methods: Specialized training data construction methods that convert plain text to rich representations, significantly improving output quality
- Conducted user research: Quantitatively evaluated the effectiveness of Patentformer in generating patent specifications
Given a patent document P containing:
- l claim sequences: C = {c₁, c₂, ..., cₗ}
- m specification paragraphs: S = {s₁, s₂, ..., sₘ}
- t drawing images: I = {i₁, i₂, ..., iₜ}
- t drawing brief descriptions: B = {b₁, b₂, ..., bₜ}
- Component name-number pairs: N = {n₁, n₂, ..., nₜ}
Task objective: Using claims C, drawing descriptions B, and component name-number pairs N as input, generate output specification S.
- Claims processing: Users upload claim text C and corresponding drawings I
- Drawing processing: Automatically identify key components and their numbers N'
- Text enhancement: Convert input to enhanced representation T' = (C', B', N')
- Provides user interface to define relationships between claims and drawing features
- Automatic matching strategy based on cosine similarity, BLEU-1, and BLEU-2 scores
- Uses 0.1 threshold to select top 5 matching components for each claim feature
- Uses T5-11B model fine-tuned on patent data
- Processes structured text input tuple T' = (C', N', B')
- Generates enhanced patent specification S'
- Post-processing steps ensure output conforms to standard patent writing conventions
- Data augmentation method: Rather than using raw text T = (C, B, N), designs enriched version T' = (C', B', N') for training
- Specialized training: Fine-tunes T5-11B model on USPTO patent data to learn patent writing style and structural conventions
- Interactive mapping: Provides user-friendly interface to establish correspondence between claims and drawing components
- Automated workflow: End-to-end automated patent drafting process from input processing to final output
- Patent-2015-2024-G06F: First dataset containing patent specifications, claims, and drawings
- Scale: 1,006,494 training samples
- Source: USPTO 2015-2024 G06F CPC category patents
- Processing: Text truncated to 512 tokens
User study employs four orthogonal scoring dimensions (0-100 scale):
- Language quality: Document language expression and writing quality
- Legal quality: Compliance with legal requirements and patent writing standards
- Drawing description quality: Accuracy of drawing descriptions
- Technical quality: Accuracy and completeness of technical content
Comparison with state-of-the-art general LLM baselines (detailed results in Wang et al., 2024)
- Model: T5-11B
- Training strategy: Fine-tuning on pre-trained version
- Hardware: GPU acceleration supported, automatic CPU fallback without GPU
- Post-processing: Uses strategies defined in Wang et al. (2024)
User study scoring based on 30 generations:
| Quality Dimension | Average Score | Performance Evaluation |
|---|
| Legal quality | ~95 points | Near perfect, most critical aspect |
| Language quality | ~85 points | Good document generation capability |
| Drawing description quality | ~60 points | Room for improvement |
| Technical quality | ~65 points | Moderate level |
| Setting | CPU | GPU (A100) | Real-world Case (GPU) |
|---|
| Time (seconds) | 3152.4±160.00 | 92±5.79 | 807±449.25 |
- Precision@5: 0.565
- Precision@3: 0.6
- Evaluated on approximately 6,000 samples
- Excellent legal quality: As the most critical aspect of patent drafting, legal quality approaches perfect scores, demonstrating the model's strong learning of legal document characteristics
- Good language expression: Despite using special tokens to encrypt input during fine-tuning, users are satisfied with language quality
- Multimodal limitations: Relatively lower drawing description and technical quality, primarily due to:
- Current model uses drawing description text rather than direct image processing
- Model is text-to-text, lacking multimodal understanding capability
- Patent text generation:
- Lee & Hsiang (2020): GPT-2 fine-tuning for generating patent claims
- Jiang et al. (2024): Generating patent claims from detailed descriptions
- Christofidellis et al. (2022): Patent Generative Transformer (PGT)
- Patent text summarization:
- Title generation (Souza et al., 2021)
- Abstract generation (Guoliang et al., 2023; Zhu et al., 2023)
- Prior art summarization (Lee & Hsiang, 2020c)
- Structured text transformation:
- Leveraging structural metadata within patent documents to guide text generation
- Semantic search-based control methods for patent text generation
To the authors' knowledge, Patentformer is the first platform to generate complete high-quality patent specifications from claims and drawing text.
- Patentformer successfully demonstrates the feasibility of AI-assisted automated patent drafting
- Through specialized data construction methods and fine-tuning, high-quality patent specifications meeting legal and technical requirements can be generated
- User research confirms the system's effectiveness in language and legal quality
- Missing multimodal capability: Current system relies on user-provided drawing descriptions, unable to directly understand images
- Limited technical quality: Room for improvement in accuracy and completeness of technical content
- Drawing processing capability: Lacks direct visual understanding of patent drawings
- Multimodal model development: Integrate Large Vision-Language Models (LVLMs) for automatic interpretation of text and visual information
- Automated drawing generation: Utilize image generation models like Stable Diffusion to automatically generate patent drawings based on user input
- End-to-end automation: Reduce manual input requirements and further improve patent drafting efficiency
- Strong practicality: Addresses real pain points in the patent industry with clear commercial value
- Methodological innovation: Data augmentation strategies and specialized training methods demonstrate technical innovation
- Complete system: Provides comprehensive solution from input to output, including user-friendly interactive interface
- Sufficient evaluation: Validated through expert user research in real application scenarios
- Open-source contribution: Public release of large-scale patent dataset promotes field research development
- Limited evaluation scale: User study based on only 30 generations from one expert, relatively small sample size
- Multimodal deficiency: Inability to process image information is a significant limitation affecting drawing description and technical quality
- Domain limitations: Trained only on G06F category patents, generalization capability remains to be verified
- Cost considerations: GPU running costs and time consumption may become limiting factors in practical deployment
- Field contribution: First implementation of complete patent specification automatic generation, opening new directions for patent AI
- Practical value: Can significantly reduce patent drafting costs and improve patent attorney work efficiency
- Reproducibility: Provides online demonstration system and public dataset with good reproducibility
- Industry impact: As industrial research from Samsung, demonstrates strong potential for industry transformation
- Patent attorney assistance tool: Helps patent attorneys rapidly generate drafts, improving work efficiency
- Enterprise patent departments: Automation of patent application processes in large technology companies
- Patent agencies: Enhances efficiency and quality of patent agency services
- Research institutions: Assists researchers in understanding patent writing norms and requirements
This paper primarily references important works in patent text generation, large language models, and patent legal requirements, including:
- Wang et al. (2024): Patentformer core algorithms
- Raffel et al. (2020): T5 model foundation
- Multiple patent text generation related research (Lee & Hsiang series, Jiang et al., Christofidellis et al.)
Overall Evaluation: This is a high-quality industry paper with strong practical orientation, demonstrating pioneering significance in the patent AI field. Despite limitations in multimodal processing and evaluation scale, its technical innovation and practical value make it an important contribution to the field.