We investigate the representation of arbitrary polynomials using probabilistic Bernoulli and degenerate Bernoulli polynomials associated with a random variable $Y$, whose moment generating function exists in a neighborhood of the origin. In addition, this paper explores the problem of representing arbitrary polynomials in terms of their higher-order counterparts. We develop explicit formulas for those representations with the help of umbral calculus and illustrate our results for several discrete and continuous random variables Y.
Representations by probabilistic Bernoulli and degenerate Bernoulli polynomials
- Paper ID: 2510.21558
- Title: Representations by probabilistic Bernoulli and degenerate Bernoulli polynomials
- Authors: Dae San Kim (Sogang University), Taekyun Kim (Kwangwoon University)
- Classification: math.NT (Number Theory), math.PR (Probability)
- Submission Date: October 24, 2025
- Paper Link: https://arxiv.org/abs/2510.21558v1
This paper investigates the representation of arbitrary polynomials using probabilistic Bernoulli polynomials and degenerate Bernoulli polynomials, which are associated with random variables Y whose moment generating functions exist in a neighborhood of the origin. Furthermore, the paper explores the representation of arbitrary polynomials using higher-order corresponding polynomials. Using umbral calculus, the authors develop explicit formulas for these representations and demonstrate results for several discrete and continuous random variables Y.
The central problem addressed is: how to represent arbitrary polynomials as linear combinations of probabilistic Bernoulli polynomials and probabilistic degenerate Bernoulli polynomials, and provide explicit coefficient formulas.
- Theoretical Importance: Bernoulli polynomials and their variants hold foundational positions in number theory and combinatorics; their representation theory constitutes an important component of special function theory
- Historical Context: Research on degenerate special polynomials originates from Carlitz's pioneering 1979 work on degenerate Bernoulli and Euler polynomials
- Modern Development: Probabilistic extensions of special polynomials have received extensive recent attention, combining probability theory with special function theory
- Probabilistic Stirling numbers defined via cumulant generating functions in references 2 and 18 lack orthogonality and inverse relation properties
- The absence of these properties makes the inverse problem (inferring polynomials from representation coefficients) difficult to solve
- Existing proof methods (such as proofs of Miki identities) are often extremely complex, involving sophisticated tools like p-adic analysis and quantum field theory
- Establish a probabilistic Stirling number theoretical framework based on orthogonality
- Develop concise polynomial representation formulas, avoiding complex proof techniques
- Verify the effectiveness and practicality of the theory through concrete examples
- Established a complete representation theory framework: Provided explicit formulas for representing arbitrary polynomials using probabilistic Bernoulli polynomials BkY(x) and probabilistic degenerate Bernoulli polynomials βk,λY(x) (Theorems 3.1 and 3.3)
- Extended to higher-order cases: Provided formulas for representing arbitrary polynomials using higher-order probabilistic Bernoulli polynomials BkY,(r)(x) and βk,λY,(r)(x) (Theorems 4.1 and 4.2)
- Developed key orthogonality theory: Proved that probabilistic Stirling numbers S1Y(n,k) and S2Y(n,k), as well as degenerate versions S1,λY(n,k) and S2,λY(n,k), satisfy orthogonality relations and inverse relations (Propositions 1.1 and 1.2)
- Provided abundant concrete examples: For six common random variables (Bernoulli, binomial, Poisson, geometric, exponential, and Gamma distributions), explicit representations of xn were given
- Simplified proofs of known identities: Using formula (3.22), provided simple proofs of Miki identities and FPZ identities, avoiding the original complex techniques
Input: An arbitrary polynomial p(x)∈C[x] of degree n
Output: Representation coefficients a0,a1,…,an such that
p(x)=∑k=0nakBkY(x)orp(x)=∑k=0nakβk,λY(x)
Constraints: The moment generating function E[eYt] of random variable Y exists in a neighborhood of the origin, and E[Y]=0
Second-kind probabilistic Stirling numbers are defined via generating functions:
k!1(E[eYt]−1)k=∑n=k∞S2Y(n,k)n!tn
Introducing the notation eY(t)=E[eYt]−1, then eY(t) is a delta series (with a0=0,a1=E[Y]=0).
First-kind probabilistic Stirling numbers are defined via compositional inverse:
k!1(eˉY(t))k=∑n=k∞S1Y(n,k)n!tn
where eˉY(t) is the compositional inverse of eY(t), satisfying eY(eˉY(t))=eˉY(eY(t))=t.
∑k=lnS2Y(n,k)S1Y(k,l)=δn,l
This orthogonality yields the important inverse relation:
an=∑k=0nS2Y(n,k)bk⇔bn=∑k=0nS1Y(n,k)ak
The paper employs umbral calculus to establish the theory. Key elements include:
- Sheffer sequences: sn(x)∼(g(t),f(t)) if and only if
g(fˉ(t))1exfˉ(t)=∑k=0∞sk(x)k!tk
- Differential operator properties: f(t)sn(x)=nsn−1(x)
- Sheffer representation of probabilistic Bernoulli polynomials:
BnY(x)∼(g(t)=f(t)et−1,f(t))
where fˉ(t)=logE[eYt]
Step 1: Compute a0a0=∫01f(t)tp(x)dx
Step 2: Construct auxiliary functiona(x)=p(x+1)−p(x)=Δp(x)
Step 3: Compute ar+1 (r=0,1,…,n−1)
Three equivalent forms:
(a) Based on difference operators:
ar+1=r+11∑j=rn−1S1Y(j,r)j!1Δj+1p(0)
(b) Based on derivatives and Stirling numbers:
ar+1=r+11∑k=rn−1∑j=rkk!1S2(k,j)S1Y(j,r)Δp(k)(0)
(c) Based on direct expansion:
ar+1=r+11∑j=rn−1∑k=0j+1(−1)j+1−kj!1(kj+1)S1Y(j,r)p(k)
The structure is completely analogous to Algorithm 1, with only the replacement of S1Y by S1,λY and f(t) by its corresponding degenerate version (whose compositional inverse is fˉ(t)=logE[eλY(t)]).
Through establishing orthogonality relations, the authors cleverly transform the representation problem into solving a linear system. Specifically:
- Starting from p(x)=∑k=0nakBkY(x)
- Computing the difference Δp(x)=∑k=1nkak∑j=0k−1S2Y(k−1,j)(x)j
- Using orthogonality to back-solve for ak
When Y=1, the theory reduces to the representation theory of classical Bernoulli polynomials, with formulas simplifying to:
ak=k!1∫01p(k)(x)dx
For higher-order cases (Theorems 4.1 and 4.2), two situations are distinguished: r>n and r≤n:
- When r>n: All coefficients involve the integral operator Ir−k
- When r≤n: The first r terms involve integral operators, subsequent terms involve difference operators
By introducing algebraic properties of the linear integral operator I and difference operator Δ, complex expressions are transformed into computable forms.
The paper selected six representative random variables for verification:
Discrete random variables:
- Bernoulli distribution: p(0)=1−p,p(1)=p (0<p≤1)
- Binomial distribution: Parameters (m,p), p(i)=(im)pi(1−p)m−i
- Poisson distribution: Parameter α>0, p(i)=e−αi!αi
- Geometric distribution: Parameter 0<p<1, p(i)=(1−p)i−1p
Continuous random variables:
5. Exponential distribution: Parameter α>0, f(y)=αe−αy (y≥0)
6. Gamma distribution: Parameters α,β>0, f(y)=Γ(α)βe−βy(βy)α−1
For each random variable Y, the following must be computed (using results from reference 14):
- fY(t): The compositional inverse of fˉY(t)=logE[eYt]
- fY,λ(t): The compositional inverse of fˉY,λ(t)=logE[eλY(t)]
- S1Y(n,k): First-kind probabilistic Stirling numbers
- S1,λY(n,k): Degenerate version
For each random variable, compute the representation of xn:
xn=∑k=0nakBkY(x)andxn=∑k=0nakβk,λY(x)
This is the most concise example. From reference 14:
fY(t)=α(1−e−t),S1Y(n,k)=(−1)n−k(kn)(n−1)n−kαk
Result:
xn=α1B0Y(x)+∑k=1n{k1∑j=k−1n−1(−1)j−k+1(k−1j)(j−1)j−k+1αk−1j!1Δj+10n}BkY(x)
For the degenerate version:
xn=α1∑r=0n∑l=0r(rn)S2(r,l)(−1)l−r(αλ)lBlβ0,λY(x)+⋯
Analysis:
- a0=α1 is remarkably concise, obtained through Lemma 5.1 by computing the integral
- Coefficients involve combinations of Bernoulli numbers Bl and Stirling numbers S2(r,l)
From reference 14:
fY(t)=log(1+p1(et−1)),S1Y(n,k)=pn1S1(n,k)
Result:
xn=∑l=0npl−11S2(n,l)blB0Y(x)+∑k=1n{k1∑j=k−1n−1pj1S1(j,k−1)j!1Δj+10n}BkY(x)
where bl are second-kind Bernoulli numbers, defined by log(1+t)t=∑l=0∞bll!tl.
Key computation (Formula 5.5):
fY(t)txn=∑r=0n∑l=0r(rn)pl−11S2(r,l)blBn−r(x)
fY(t)=log(1+αt),S1Y(n,k)=∑l=knαl1S1(l,k)S1(n,l)
Result:
xn=∑l=0n(ln)n−l+11αl−11blB0Y(x)+∑k=1n{k1∑j=k−1n−1∑l=k−1jαl1S1(l,k−1)S1(j,l)j!1Δj+10n}BkY(x)
This is the most complex example. It requires the use of Frobenius-Euler numbers Hj(r)(u):
(et−u1−u)r=∑n=0∞Hn(r)(u)n!tn
Computation of a0 (Formula 5.23):
a0=p1∑j=0n∑l=0∞∑r=0l(−1)rl!1(jn)(rl)(1−pp)lblHj(r)(p−1p)(1−p(1−δn,j))
Lemma 5.1:
∫01Bn(x)dx=δn,0,∫01Bn(−x)dx=(−1)n
This lemma plays a key role in computing a0 in all examples. For instance, in the exponential distribution case:
a0=∫01α(1−e−t)txndx=α1(−1)n∫01Bn(−x)dx=α1
- Significant simplification effects: Compared to complex proofs in the literature (e.g., Miki identity requiring Fermat quotients or p-adic analysis), this paper's method requires only integral and difference computations
- Universality: All examples follow the same computational framework, with only the specific fY(t) and S1Y(n,k) differing
- Computational complexity:
- Discrete distributions are typically more concise (e.g., Bernoulli, Poisson)
- Continuous distributions may involve more complex integrals (e.g., geometric distribution)
- Exponential distribution is the most concise
- Additional complexity of degenerate versions: Representations of degenerate Bernoulli polynomials typically involve additional Stirling number summations
Classical theory:
- Representation theory of Bernoulli polynomials is foundational content in special function theory
- Formula (3.22) gives the classical result: p(x)=∑k=0nakBk(x), where ak=k!1∫01p(k)(x)dx
Degenerate theory:
- Carlitz (1979) 4: Pioneering research on degenerate Stirling numbers, Bernoulli numbers, and Euler numbers
- Recent work by Kim et al. 13,16,19,20,23: Systematic development of degenerate special polynomial theory
Probabilistic extensions:
- Adell et al. 1,2,3: Introduction of probabilistic Stirling numbers
- Kim et al. 18,21,22: Development of probabilistic degenerate polynomial theory
Distinction from Adell-Bényi 2:
- 2 defines S1Y(n,k) based on cumulant generating functions
- This paper defines based on compositional inverse, ensuring orthogonality
- Key advantage: Orthogonality makes the inverse problem solvable
Distinction from Kim-Kim 18:
- 18 addresses the degenerate case but does not provide general representation theory
- This paper uniformly handles both non-degenerate and degenerate cases
Comparison with Kim-Kim 16:
- 16 provides representation of degenerate Bernoulli polynomials βk,λ(x) (with Y=1)
- This paper extends to general random variables Y
Miki identity (Formula 1.1):
∑k=1n−1k(n−k)Bk(x)Bn−k(x)=n2∑k=0n−2n−k1(kn)Bn−kBk(x)+n2Hn−1Bn(x)
Traditional proof methods:
- Miki 24: Using Fermat quotient formulas modulo p2
- Shiratani-Yokoyama 30: p-adic analysis
- Gessel 12: Two expressions for Stirling numbers
This paper's method: Direct application of formula (3.22), requiring only derivative and integral computations
- Theoretical completeness: Established a complete representation theory for probabilistic Bernoulli polynomials and degenerate versions, including base and higher-order cases
- Computational effectiveness: Provided three equivalent coefficient computation formulas suitable for different computational scenarios
- Broad applicability: The theory applies to arbitrary random variables whose moment generating functions exist in a neighborhood of the origin
- Simplified proofs: Provided more concise proof pathways for known identities
- Restrictive conditions:
- Requires E[Y]=0
- Moment generating function must exist in a neighborhood of the origin
- Excludes some important distributions (e.g., Cauchy distribution)
- Computational complexity:
- Requires pre-computation of S1Y(n,k) and S1,λY(n,k)
- For complex distributions (e.g., geometric distribution), formulas may be extremely complicated
- Numerical stability:
- Involves high-order differences and Stirling numbers, potentially causing numerical stability issues
- Paper does not discuss numerical implementation
- Theoretical depth:
- Primarily derivations of combinatorial identities
- Lacks asymptotic analysis or exploration of deeper number-theoretic properties
The paper does not explicitly propose future directions, but the following can be inferred:
- Extension to other special polynomials: Study of Euler polynomials, Genocchi polynomials, etc.
- Multivariate generalization: Research on multivariate probabilistic Bernoulli polynomials
- Numerical algorithms: Development of stable and efficient numerical computation methods
- Application exploration: Search for applications in number theory, combinatorics, and quantum field theory
- Orthogonality framework: By ensuring orthogonality of Stirling numbers, resolves key defects in references 2,18
- Umbral calculus application: Systematic use of umbral calculus theory makes proofs elegant and concise
- Unified theory: Incorporates non-degenerate, degenerate, and higher-order cases into a unified framework
- Four main theorems: Cover all important cases (Theorems 3.1, 3.3, 4.1, 4.2)
- Two foundational propositions: Establish orthogonality and inverse relations (Propositions 1.1, 1.2)
- Systematic preliminaries: Section 1 provides thorough background introduction
- Six random variables: Cover common discrete and continuous distributions
- Two representations: Each example provides both non-degenerate and degenerate versions
- Detailed computations: Key intermediate steps are shown (e.g., Formulas 5.5, 5.19-5.20)
Shortcomings:
- Lacks numerical verification
- Does not compare computational efficiency of different formulas
- Clear structure: From preliminaries → umbral calculus → main results → examples, with rigorous logic
- Standardized notation: Consistently uses superscript Y to denote association with random variables
- Sufficient detail: Proof steps are detailed, facilitating reader understanding
- Stringent conditions: E[Y]=0 excludes symmetric distributions (e.g., standard normal distribution)
- Lack of error analysis: Does not discuss truncation error or numerical precision
- No numerical implementation: All results are in symbolic form without numerical examples
- No performance comparison: Which of the three formula forms computes fastest?
- No visualization: No graphs of polynomials or coefficients
- Theory-oriented: Primarily mathematical derivations, lacking practical application scenarios
- Weak connection to probability: Although random variables are introduced, probabilistic significance is not deeply explored
- Scattered related work: Distributed across introduction and Section 5, not sufficiently concentrated
- Insufficient comparison: Technical comparison with 2,18 lacks detail
- Fills theoretical gaps: Resolves the orthogonality problem for probabilistic Stirling numbers
- Methodological contribution: Demonstrates the power of umbral calculus in probabilistic extensions
- Interdisciplinary connection: Combines probability theory, combinatorics, and special function theory
Potential impact:
- May become a standard reference for probabilistic special function theory
- May inspire probabilistic extensions of other special polynomials
- Symbolic computation: Can be used in computer algebra systems (e.g., Mathematica, Maple)
- Theoretical tool: Provides new tools for proving combinatorial identities
- Educational value: Suitable as supplementary material for special functions courses
Limitations:
- Direct application scenarios unclear
- Requires further development for practical problem applications
- Clear formulas: All formulas have clear definitions
- External dependencies: Key computation of S1Y(n,k) depends on reference 14
- No code: No implementation code provided
Recommendations:
- Provide Mathematica or Python implementations
- Establish online calculator
- Combinatorial identity proofs: Simplify proofs of complex identities
- Special function theory: Extend Bernoulli polynomial theory
- Number theory: Potentially applicable to congruence properties of Bernoulli numbers
- Polynomial expansion: Expand arbitrary polynomials in special function bases
- Integral computation: Simplify integrals using Bernoulli polynomial properties
- Special functions courses: Demonstrate modern research methods
- Combinatorics: Advanced applications of Stirling numbers
- Umbral calculus: Concrete application examples
- Quantum field theory: Applications of Bernoulli numbers in Feynman diagram calculations
- Gromov-Witten theory: Connection with FPZ identities
- Asymptotic analysis: Potentially applicable to asymptotic expansions of certain sums
| Dimension | Score | Remarks |
|---|
| Innovation | 8/10 | Orthogonality framework is key innovation |
| Theoretical Depth | 9/10 | Complete theory, rigorous proofs |
| Practicality | 6/10 | Primarily theoretical contribution |
| Writing Quality | 9/10 | Clear, systematic, thorough |
| Experimental Sufficiency | 7/10 | Rich examples but lacks numerical verification |
| Overall Evaluation | 7.8/10 | Excellent theoretical work |
2 J. A. Adell, B. Bényi, Probabilistic Stirling numbers and applications, Aequat. Math. 98 (2024), 1627-1646.
- Introduces probabilistic Stirling numbers but definition lacks orthogonality
4 L. Carlitz, Degenerate Stirling, Bernoulli and Eulerian numbers, Utilitas Math. 15 (1979), 51-88.
- Pioneering work on degenerate special numbers
14 D. S. Kim, T. Kim, Probabilisitc Stirling and degenerate Stirling numbers, Preprint.
- Provides S1Y(n,k) computation results needed for this paper
16 D. S. Kim, T. Kim, Representing polynomials by degenerate Bernoulli polynomials, Quaest. Math. 46 (2022), no. 5, 959-980.
- Prior work for the case Y=1
27-28 S. Roman, The umbral calculus series
- Standard reference for umbral calculus
Summary: This is a high-quality theoretical mathematics paper making substantive contributions to probabilistic special function theory. By establishing an orthogonality framework, the authors resolve key defects in existing literature and develop a complete representation theory, laying a solid foundation for subsequent research. The paper's main value lies in the systematicity of the theory and the elegance of the methods. Primary improvement opportunities lie in adding numerical experiments and exploring practical applications.