Migrating codebases from one instruction set architecture (ISA) to another is a major engineering challenge. A recent example is the adoption of Arm (in addition to x86) across the major Cloud hyperscalers. Yet, this problem has seen limited attention by the academic community. Most work has focused on static and dynamic binary translation, and the traditional conventional wisdom has been that this is the primary challenge.
In this paper, we show that this is no longer the case. Modern ISA migrations can often build on a robust open-source ecosystem, making it possible to recompile all relevant software from scratch. This introduces a new and multifaceted set of challenges, which are different from binary translation.
By analyzing a large-scale migration from x86 to Arm at Google, spanning almost 40,000 code commits, we derive a taxonomy of tasks involved in ISA migration. We show how Google automated many of the steps involved, and demonstrate how AI can play a major role in automatically addressing these tasks. We identify tasks that remain challenging and highlight research challenges that warrant further attention.
- Paper ID: 2510.14928
- Title: Instruction Set Migration at Warehouse Scale
- Authors: Eric Christopher, Kevin Crossan, Wolff Dobson, Chris Kennelly, Drew Lewis, Kun Lin, Martin Maas, Parthasarathy Ranganathan, Emma Rapati, Brian Yang (Google, USA)
- Classification: cs.SE (Software Engineering), cs.LG (Machine Learning)
- Publication Date: October 16, 2025 (arXiv preprint)
- Paper Link: https://arxiv.org/abs/2510.14928
This paper challenges conventional perspectives on instruction set architecture (ISA) migration by analyzing Google's large-scale migration from x86 to Arm, encompassing nearly 40,000 code commits. The research demonstrates that the primary challenges in modern ISA migration are no longer code translation, but rather a multifaceted set of engineering tasks. The paper proposes a taxonomy of ISA migration tasks, demonstrates how Google automated many migration steps, and establishes the critical role of AI in automating these tasks.
- Core Challenge: Large-scale codebase instruction set architecture migration represents a significant engineering undertaking with limited academic attention
- Practical Necessity: Major cloud service providers (Amazon, Google, Microsoft) are adopting Arm architecture to complement x86, requiring systematic migration methodologies
- Limitations of Traditional Approaches: Prior research focused primarily on static and dynamic binary translation, treating these as the primary ISA migration challenges
- Technological Evolution: Modern ISAs are well-supported in upstream compilers, runtime libraries, and the Linux kernel, making source-level recompilation feasible
- Absence of Practical Analysis: Lack of systematic analysis of tasks actually involved in modern ISA migration
- Automation Opportunities: Modern software engineering tools and AI technologies enable new possibilities for migration automation
- First Systematic Analysis: Provides the first detailed decomposition and taxonomy of large-scale ISA migration tasks, based on 38,156 real commits
- Paradigm Shift: Demonstrates that ISA migration complexity lies not in code translation but primarily in build and configuration file rewrites
- Automation Framework: Shows that many migration tasks are highly automatable and develops CogniPort, an AI-driven automation tool
- Practical Guidance: Identifies remaining challenging tasks and provides direction for future research
The core task studied is: Migrating a multi-billion line codebase from x86 architecture to a multi-architecture environment supporting both x86 and Arm
Input: Single-architecture (x86) large-scale codebase
Output: Multi-architecture (x86+Arm) capable codebase
Constraints: Maintain parity in performance, security, and stability
- Data Source: 38,156 Arm migration-related commits from Google's monolithic repository
- Automated Classification: Large-scale commit analysis using Gemini 2.5 Flash LLM
- Classification Pipeline:
- Feed commit messages and code diffs into LLM's 1M token context window
- Process 100 commits per batch, with model selecting from 20 categories
- Consolidate 400×20 categories into 50, then manually refine to 16 categories
Defines a five-stage migration process:
- Testing Phase: Fix tests and builds failing on new ISA
- Multi-Architecture CI Setup: Modify Blueprint files to ensure no regressions
- Release Configuration: Modify Blueprint files to enable multi-architecture support by default
- New Binary Rollout: Run on new ISA machines and evaluate performance stability
- Full Production: Allow production workloads to schedule on new ISA machines
- Innovatively employs large language models to analyze tens of thousands of code commits
- Establishes systematic taxonomy of ISA migration tasks (17 categories)
- Large-Scale Changes (LSCs): Implement large-scale automated code modifications via Rosie tool
- CHAMP System: Continuous health monitoring platform that automatically evaluates behavioral differences in Arm builds
- CogniPort AI Agent: Three-layer nested agent loop for automated build and test error fixing
CogniPort comprises three nested agent loops:
- Orchestrator Agent: Invokes build/test repair agents based on workspace state
- Build Repair Agent: Attempts builds and modifies files until success
- Test Repair Agent: Runs tests and modifies code until passing
- Scale: 38,156 commits, approximately 700K lines of code changes
- Time Span: Multi-year migration process
- Coverage: Representative subset of Google's multi-billion line internal codebase
- Automation Degree: Proportion of auto-generated commits and code lines per category
- AI Repair Success Rate: CogniPort's repair success rate on 245 test commits
- Migration Effectiveness: Application qualification rate improved from 4.8% to 59.6%, rollback rate controlled below 1.8%
- Historical Commit Analysis: Restore historical commits and evaluate AI agent repair capabilities
- Classification Accuracy Verification: Manual inspection of model outputs to ensure classification accuracy
- Automation Effectiveness Evaluation: Statistical analysis of LSC and CHAMP system automation coverage
- Build Configuration Dominance: 84% of commits involve build, packaging, and CI/CD configuration (category 8)
- Minimal Code Modification: Code adaptation and correction (categories 1-5) account for only 1% of commits and less than 4% of code lines
- Test Adaptation Significance: Test-related modifications (categories 6-7) constitute a substantial portion
- Early Stage: Primarily tool development and test adaptation
- Middle Stage: Increased proportion of code adaptation commits
- Late Stage: Almost entirely configuration files and support processes, with rapid commit volume growth
- Overall Automation Rate: 83.82% of commits and 14.15% of code lines automated
- LSC-Generated Commits: 31,984 commits automatically generated via Rosie
- Application Qualification Rate Improvement: From 4.8% to 59.6%
- Low Rollback Rate: Final stage rollback rate reduced to below 0.8%
CogniPort performance on 245 test commits:
- Overall Success Rate: 30%
- Best-Performing Categories: Test repair, platform-specific conditional code, data representation fixes
- Challenging Categories: Memory models, test execution environments, performance optimization
AI-assessed automation difficulty (1=simple, 5=difficult):
- Highly Automatable: Build configuration files, test execution environments
- Moderate Difficulty: Code and test adaptation (categories 1-7)
- Requiring Manual Effort: Complex performance optimization, specialized service configuration
- Binary Translation: Static and dynamic binary translation techniques, such as Rosetta systems
- Automated Code Editing: Automation methods for performance optimization, security issue fixing, and error correction
- Shift from Binary Translation to Source Recompilation: Leverages mature support from modern open-source ecosystems
- Systematic Task Analysis: First comprehensive taxonomy of ISA migration tasks
- AI-Driven Automation: Applies large language models to large-scale code migration
- Paradigm Shift: ISA migration complexity lies not in code translation but in multifaceted engineering tasks
- High Automation Potential: Majority of migration tasks can be automated with modern tools
- One-Time Work Reusability: Many tasks can be reused after initial multi-architecture migration
- Significant AI Potential: Modern AI can address many remaining manual tasks
- Evaluation Method Constraints: AI agent evaluation may suffer from information leakage or incomplete testing
- Google-Specific Environment: Conclusions may not fully generalize to other organizations' software environments
- Arm-Specific Focus: Research concentrates on x86-to-Arm migration; other ISA migrations may present different challenges
- ISA-Specific Vector Code: Requires complex performance optimization search spaces
- Deep Performance Optimization: Demands algorithm-level restructuring and intrinsic function usage
- Complex Edge Cases: Requires expertise beyond code itself
- Performance Tuning: Necessitates hyperparameter regeneration and feedback-directed optimization profiles
- Enhanced AI Agent Capabilities: Support more powerful agents with documentation search and performance measurement
- Vectorization Automation: Automatically generate high-performance ISA-specific vector code
- Multi-Architecture Toolchains: Further improve automated multi-architecture support tools
- Exceptional Practical Value: Based on genuine industrial-scale migration experience with strong guidance implications
- Methodological Innovation: First systematic analysis and classification of ISA migration tasks, establishing foundational framework for the field
- Massive Data Scale: Analysis of 38,156 commits represents exceptionally rare scale in software engineering research
- Forward-Looking Technology: Demonstrates significant potential of AI in large-scale software engineering tasks
- Counter-Intuitive Results: Challenges conventional domain wisdom with important academic value
- Environment Specificity: Research grounded in Google's particular software environment with potentially limited generalizability
- Evaluation Method Constraints: AI agent evaluation may contain biases requiring more rigorous benchmarking
- Missing Cost Analysis: Lacks quantitative analysis of migration costs (labor, time, resources)
- Insufficient Long-Term Maintenance Consideration: Limited analysis of long-term maintenance costs for multi-architecture codebases
- Academic Impact: Provides new research directions for software engineering and systems architecture
- Industrial Value: Offers practical methodology and tool references for other organizations undertaking ISA migration
- Technology Advancement: Promotes AI application development in software engineering
- Standard Establishment: Potentially becomes important reference standard for ISA migration field
- Large Technology Companies: Organizations with massive codebases requiring architecture migration
- Cloud Service Providers: Need to support multiple hardware architectures for cost and performance optimization
- Open-Source Project Maintainers: Large-scale open-source projects requiring expanded architecture support
- Research Institutions: Academic teams researching software engineering automation and AI applications
The paper cites 27 important references covering:
- Binary translation techniques (Altman et al., 2000; Gouicem et al., 2022)
- Large-scale software engineering (Potvin & Levenberg, 2016; Winters et al., 2020)
- Automated code repair (Bader et al., 2019; Lin et al., 2025)
- Systems architecture (Verma et al., 2015; Corbett et al., 2012)
Summary: This paper possesses significant practical value and academic merit, providing the first systematic analysis of actual challenges and solutions in modern large-scale ISA migration. Beyond challenging conventional wisdom, it demonstrates the tremendous potential of AI technology in software engineering, providing important guidance for future development in this field.