Learning Weighted Automata over Number Rings, Concretely and Categorically
Aristote, van Gool, PetriÅan et al.
We develop a generic reduction procedure for active learning problems. Our approach is inspired by a recent polynomial-time reduction of the exact learning problem for weighted automata over integers to that for weighted automata over rationals (Buna-Marginean et al. 2024). Our procedure improves the efficiency of a category-theoretic automata learning algorithm, and poses new questions about the complexity of its implementation when instantiated to concrete categories. As our second main contribution, we address these complexity aspects in the concrete setting of learning weighted automata over number rings, that is, rings of integers in an algebraic number field. Assuming a full representation of a number ring OK, we obtain an exact learning algorithm of OK-weighted automata that runs in polynomial time in the size of the target automaton, the logarithm of the length of the longest counterexample, the degree of the number field, and the logarithm of its discriminant. Our algorithm produces an automaton that has at most one more state than the minimal one, and we prove that doing better requires solving the principal ideal problem, for which the best currently known algorithm is in quantum polynomial time.
academic
Learning Weighted Automata over Number Rings, Concretely and Categorically
This paper develops a general active learning problem reduction procedure. The approach is inspired by recent work by Buna-Marginean et al. (2024), which polynomially reduces the exact learning problem for weighted automata over integers to the learning problem for weighted automata over rationals. The procedure improves the efficiency of categorical automata learning algorithms and raises new questions about implementation complexity when instantiated in concrete categories. As a second major contribution, the authors address these complexity questions in the concrete setting of learning weighted automata over number rings (rings of integers in algebraic number fields). Assuming a complete representation of the number ring OK, they obtain an exact learning algorithm for OK-weighted automata with polynomial time complexity in the target automaton size, the logarithm of the longest counterexample length, the degree of the number field, and the logarithm of the discriminant. The algorithm produces automata with at most one more state than the minimal automaton, and it is proven that doing better requires solving the principal ideal problem, for which the best known algorithm is quantum polynomial time.
Classical Automata Learning: Angluin's L* algorithm efficiently learns deterministic finite-state automata in the Minimally Adequate Teacher (MAT) framework, a classical result in computational learning theory.
Challenges in Weighted Automata Learning: Extending learning algorithms to more expressive models (such as weighted automata) presents challenges, particularly when weights lie in rings rather than fields.
Limitations of Existing Approaches:
Polynomial time learning algorithms exist for weighted automata over fields
For weighted automata over general rings, existing methods either have excessive complexity or limited applicability
Categorical approaches, while general, may lead to exponential complexity in concrete instantiations
Theoretical Requirement: Need for a framework that maintains the generality of categorical methods while achieving polynomial complexity in concrete cases
Practical Applications: Number rings have important applications in cryptography; efficient learning of weighted automata over them has practical value
Theoretical Boundaries: Exploring theoretical limits of weighted automata minimization, particularly generalizations of the Fatou property
General Reduction Algorithm: Proposes Algorithm 3, a general reduction procedure within the categorical framework that reduces one class of learning problems to another more tractable class
Concrete Algorithm for Number Rings: Develops Algorithm 4, a polynomial time learning algorithm specifically for weighted automata over number rings OK
Near-Optimality Results: Proves that the algorithm produces automata with at most one more state than the minimal automaton (near-minimality)
Theoretical Complexity Bounds: Proves that obtaining completely minimal automata is equivalent to solving the principal ideal problem (PIP-hard), establishing theoretical lower bounds
Generalization of Fatou Property: Proves that Dedekind domains are "almost strongly Fatou rings," generalizing the classical Fatou property
Input: An unknown OK-weighted language L: Σ* → OK (accessed via oracle)
Output: An OK-weighted automaton computing L
Constraint: Algorithm complexity is polynomial in the target automaton size, logarithm of the longest counterexample length, degree of the number field, and logarithm of the discriminant
Algorithm Idea:
1. Learn automaton in "tractable" category D
2. Establish connection via functor F: C → D
3. Use right adjoint G: D → C to pull back results to target category C
Key Assumption (Assumption 12):
F preserves certain morphism classes
F has a right adjoint G
Unit and counit morphisms have specific properties
Compute basis B of backward space of automaton A
Conjugate A by matrix B to obtain A'
Step 2: Forward Module Generation
Call Algorithm 5 to compute generating set of forward OK-module of A'
Use two-phase strategy:
- Phase 1: Find words increasing rank in K
- Phase 2: Complete module generation in OK
Step 3: Pseudo-basis Computation
Compute pseudo-basis from generating set using pseudo-Hermite Normal Form (pseudo-HNF)
Pseudo-basis form: {(ai, vi) | 1 ≤ i ≤ ℓ}, where ai are fractional ideals
Step 4: Almost-Minimal Generating Set
Convert pseudo-basis to generating set of size at most ℓ+1 via Algorithm 6
Use ideal factor refinement and Chinese Remainder Theorem
Near-Optimality (Proposition 10): For a Dedekind domain R, if L is an R-weighted language of rank n, there exists an R-weighted automaton computing L with at most n+1 states
Complexity Lower Bound (Proposition 26): Determining whether an OK-weighted automaton is state-minimal is PIP-hard
Fatou Property Generalization (Corollary 16): Dedekind domains are almost strongly Fatou rings
Pseudo-HNF computation: Polynomial time (Biasse-Fieker-Hofmann)
Strictly increasing chain length: Bounded by log(N(d)) via Lemma 24
Ideal operations: Polynomial time in CK
This paper makes important theoretical contributions to theoretical computer science, particularly at the intersection of automata learning and algebraic computation. While practical applicability remains to be verified, its theoretical value and methodological significance are substantial.