Generalized Top-k Mallows Model for Ranked Choices
Haddadan, Ahmadian
The classic Mallows model is a foundational tool for modeling user preferences. However, it has limitations in capturing real-world scenarios, where users often focus only on a limited set of preferred items and are indifferent to the rest. To address this, extensions such as the top-k Mallows model have been proposed, aligning better with practical applications. In this paper, we address several challenges related to the generalized top-k Mallows model, with a focus on analyzing buyer choices. Our key contributions are: (1) a novel sampling scheme tailored to generalized top-k Mallows models, (2) an efficient algorithm for computing choice probabilities under this model, and (3) an active learning algorithm for estimating the model parameters from observed choice data. These contributions provide new tools for analysis and prediction in critical decision-making scenarios. We present a rigorous mathematical analysis for the performance of our algorithms. Furthermore, through extensive experiments on synthetic data and real-world data, we demonstrate the scalability and accuracy of our proposed methods, and we compare the predictive power of Mallows model for top-k lists compared to the simpler Multinomial Logit model.
academic
Generalized Top-k Mallows Model for Ranked Choices
The classical Mallows model is a foundational tool for modeling user preferences, but has limitations in capturing real-world scenarios where users typically focus on a limited set of preferred items and are indifferent toward the rest. This paper addresses several challenges in the generalized top-k Mallows model, focusing on buyer choice analysis. Core contributions include: (1) a novel sampling scheme tailored for the generalized top-k Mallows model; (2) an efficient algorithm for computing choice probabilities under this model; (3) an active learning algorithm for estimating model parameters from observed choice data. These contributions provide new tools for analysis and prediction in critical decision-making scenarios, with scalability and accuracy validated through rigorous mathematical analysis and extensive experiments on both synthetic and real data.
The core problem addressed is: How to effectively model user preferences and predict choice behavior in realistic scenarios where users provide only partial ranking information (top-k lists) rather than complete rankings.
Broad practical applications: Recommendation systems, advertising platforms, search engines, news aggregators, and other domains where users typically express preferences for only a limited number of items
Business decision value: Accurate prediction of customer choice behavior is crucial for revenue management, product assortment optimization, and other strategic decisions
Theoretical significance: Extending classical probabilistic ranking models to more realistic partial ranking scenarios
Classical Mallows model: Limited to full permutations, cannot handle partial rankings
Multinomial Logit (MNL) model: While simple, has limited expressiveness and satisfies the Independence of Irrelevant Alternatives (IIA) assumption, restricting modeling of complex choice behavior
Existing top-k extensions: The top-k Mallows model proposed by Chierichetti et al. (2018) lacks efficient sampling algorithms and choice probability computation methods
Parameter learning difficulty: Learning model parameters from choice data (rather than complete rankings) lacks theoretical guarantees
The authors argue that extending the Mallows model to top-k lists can more realistically reflect user preference structures, but requires solving three key algorithmic problems: efficient sampling, choice probability computation, and parameter learning.
PRIM Sampling Algorithm: Proposes Profile-based Repeated Insertion Method (PRIM) for efficient sampling from the TopKGMM distribution, reducing time complexity from O(k²4^k + k²log n) to O(k2^k + k²log n)
DYPCHIP Choice Probability Algorithm: Designs Dynamic Programming for Choice Probabilities (DYPCHIP) algorithm to efficiently compute choice probabilities under the top-k Mallows model
BUCCHOI Active Learning Algorithm: Develops Build Center from Choices (BUCCHOI) active learning algorithm that infers the distribution center and center size k by presenting product assortments of specified sizes and observing choices, with sample complexity Θ(r²log n)
Model Generalization: Extends Chierichetti et al.'s top-k Mallows model to a generalized version (TopKGMM) that associates weights with each product
Empirical Validation: Demonstrates on the Sushi preference dataset that the top-k Mallows model achieves significantly higher predictive accuracy than the MNL model
This decomposition is the core algorithmic innovation, decomposing the complex top-k list distribution into profile selection and conditional ranking stages.
Then construct τ given S through repeated insertion
Algorithm Flow:
1. Precompute probabilities for all profiles (O(2^γk) time, γ=min{k,n-k})
2. Sample profile S
3. Initialize: uniformly randomly sample k-ℓ elements from [n]\[k]
4. Insert elements s ∈ S in priority order, with probability of inserting s at position j:
P(insert s at position j) ∝ exp(-βw_s·j)
Innovations:
Solves the open problem left by Chierichetti et al. (lack of RIM-like sampling methods)
Significantly reduces complexity through profile decomposition
Amortized cost per sample after preprocessing is only O(k log n)
Core Idea: Use dynamic programming to compute choice probabilities given profile S
Algorithm Structure:
3D DP table π_S(i, j, q):
i: i-th element in product assortment A
j: current position
q: q-th iteration of PRIM algorithm
Meaning: Probability that a_i is the highest-ranked element in A∅ and located at position j after q iterations
Recurrence Relations:
When new element not in A:
π_S(i,j,q) = π_S(i,j,q-1)·P(insert>j) + π_S(i,j-1,q-1)·P(insert≤j-1)
When new element in A:
π_S(i,j,q) = π_S(i,j,q-1)·P(insert>j)
π_S(cur,j,q) = P(insert=j)·Σ_{i<cur}Σ_{j'≥j}π_S(i,j',q-1)
Better performance with smaller β (0.01-0.1), indicating relatively uniform distribution
Best performance at p=0.5, balancing different types of inconsistencies
TopKGMM significantly outperforms MNL, validating model expressiveness
With Clustering (Table 2, 2 clusters):
p=0.1, β=0.1: test error 0.0771
p=1.25, β=0.05: test error 0.0788
Observation: Slight performance improvement after clustering, but very low silhouette coefficient (<0.012), suggesting data may come from a single distribution
This paper makes important contributions at the intersection of top-k Mallows models and choice modeling. Through clever profile decomposition, the authors design a complete algorithmic toolkit with rigorous theoretical analysis. Experiments validate the method's effectiveness, particularly showing significant advantages over MNL on real data.
Main values include: (1) theoretically solving important open problems; (2) practically providing usable tools; (3) laying foundation for future research.
Main limitation is exponential dependence on k, restricting applications in large-k scenarios. Future research on learning mixture models and developing approximation algorithms will further enhance practical utility.
For applications requiring modeling partial ranking preferences and predicting choice behavior, the TopKGMM framework and algorithms provided by this paper are valuable tools, particularly in scenarios with small k (≤12) and high prediction accuracy requirements.