2025-11-21T21:46:16.082389

Representations by probabilistic Bernoulli and degenerate Bernoulli polynomials

Kim, Kim

We investigate the representation of arbitrary polynomials using probabilistic Bernoulli and degenerate Bernoulli polynomials associated with a random variable $Y$, whose moment generating function exists in a neighborhood of the origin. In addition, this paper explores the problem of representing arbitrary polynomials in terms of their higher-order counterparts. We develop explicit formulas for those representations with the help of umbral calculus and illustrate our results for several discrete and continuous random variables Y.

academic

Representations by probabilistic Bernoulli and degenerate Bernoulli polynomials

Basic Information

Paper ID: 2510.21558
Title: Representations by probabilistic Bernoulli and degenerate Bernoulli polynomials
Authors: Dae San Kim (Sogang University), Taekyun Kim (Kwangwoon University)
Classification: math.NT (Number Theory), math.PR (Probability)
Submission Date: October 24, 2025
Paper Link: https://arxiv.org/abs/2510.21558v1

Abstract

This paper investigates the representation of arbitrary polynomials using probabilistic Bernoulli polynomials and degenerate Bernoulli polynomials, which are associated with random variables Y whose moment generating functions exist in a neighborhood of the origin. Furthermore, the paper explores the representation of arbitrary polynomials using higher-order corresponding polynomials. Using umbral calculus, the authors develop explicit formulas for these representations and demonstrate results for several discrete and continuous random variables Y.

Research Background and Motivation

1. Core Research Problem

The central problem addressed is: how to represent arbitrary polynomials as linear combinations of probabilistic Bernoulli polynomials and probabilistic degenerate Bernoulli polynomials, and provide explicit coefficient formulas.

2. Problem Significance

Theoretical Importance: Bernoulli polynomials and their variants hold foundational positions in number theory and combinatorics; their representation theory constitutes an important component of special function theory
Historical Context: Research on degenerate special polynomials originates from Carlitz's pioneering 1979 work on degenerate Bernoulli and Euler polynomials
Modern Development: Probabilistic extensions of special polynomials have received extensive recent attention, combining probability theory with special function theory

3. Limitations of Existing Methods

Probabilistic Stirling numbers defined via cumulant generating functions in references 2 and 18 lack orthogonality and inverse relation properties
The absence of these properties makes the inverse problem (inferring polynomials from representation coefficients) difficult to solve
Existing proof methods (such as proofs of Miki identities) are often extremely complex, involving sophisticated tools like p-adic analysis and quantum field theory

4. Research Motivation

Establish a probabilistic Stirling number theoretical framework based on orthogonality
Develop concise polynomial representation formulas, avoiding complex proof techniques
Verify the effectiveness and practicality of the theory through concrete examples

Core Contributions

Established a complete representation theory framework: Provided explicit formulas for representing arbitrary polynomials using probabilistic Bernoulli polynomials $B_k^Y(x)$ and probabilistic degenerate Bernoulli polynomials $\beta_{k,\lambda}^Y(x)$ (Theorems 3.1 and 3.3)
Extended to higher-order cases: Provided formulas for representing arbitrary polynomials using higher-order probabilistic Bernoulli polynomials $B_k^{Y,(r)}(x)$ and $\beta_{k,\lambda}^{Y,(r)}(x)$ (Theorems 4.1 and 4.2)
Developed key orthogonality theory: Proved that probabilistic Stirling numbers $S_1^Y(n,k)$ and $S_2^Y(n,k)$ , as well as degenerate versions $S_{1,\lambda}^Y(n,k)$ and $S_{2,\lambda}^Y(n,k)$ , satisfy orthogonality relations and inverse relations (Propositions 1.1 and 1.2)
Provided abundant concrete examples: For six common random variables (Bernoulli, binomial, Poisson, geometric, exponential, and Gamma distributions), explicit representations of $x^n$ were given
Simplified proofs of known identities: Using formula (3.22), provided simple proofs of Miki identities and FPZ identities, avoiding the original complex techniques

Methodology Details

Task Definition

Input: An arbitrary polynomial $p(x) \in \mathbb{C}[x]$ of degree n

Output: Representation coefficients $a_0, a_1, \ldots, a_n$ such that $p(x) = \sum_{k=0}^n a_k B_k^Y(x) \quad \text{or} \quad p(x) = \sum_{k=0}^n a_k \beta_{k,\lambda}^Y(x)$

Constraints: The moment generating function $E[e^{Yt}]$ of random variable Y exists in a neighborhood of the origin, and $E[Y] \neq 0$

Theoretical Foundations

1. Definition of Probabilistic Stirling Numbers

Second-kind probabilistic Stirling numbers are defined via generating functions: $\frac{1}{k!}(E[e^{Yt}] - 1)^k = \sum_{n=k}^\infty S_2^Y(n,k) \frac{t^n}{n!}$

Introducing the notation $e_Y(t) = E[e^{Yt}] - 1$ , then $e_Y(t)$ is a delta series (with $a_0=0, a_1=E[Y]\neq 0$ ).

First-kind probabilistic Stirling numbers are defined via compositional inverse: $\frac{1}{k!}(\bar{e}_Y(t))^k = \sum_{n=k}^\infty S_1^Y(n,k) \frac{t^n}{n!}$ where $\bar{e}_Y(t)$ is the compositional inverse of $e_Y(t)$ , satisfying $e_Y(\bar{e}_Y(t)) = \bar{e}_Y(e_Y(t)) = t$ .

2. Orthogonality and Inverse Relations (Proposition 1.1)

$\sum_{k=l}^n S_2^Y(n,k) S_1^Y(k,l) = \delta_{n,l}$

This orthogonality yields the important inverse relation: $a_n = \sum_{k=0}^n S_2^Y(n,k) b_k \Leftrightarrow b_n = \sum_{k=0}^n S_1^Y(n,k) a_k$

3. Umbral Calculus Framework

The paper employs umbral calculus to establish the theory. Key elements include:

Sheffer sequences: $s_n(x) \sim (g(t), f(t))$ if and only if $\frac{1}{g(\bar{f}(t))} e^{x\bar{f}(t)} = \sum_{k=0}^\infty s_k(x) \frac{t^k}{k!}$
Differential operator properties: $f(t)s_n(x) = ns_{n-1}(x)$
Sheffer representation of probabilistic Bernoulli polynomials: $B_n^Y(x) \sim \left(g(t) = \frac{e^t-1}{f(t)}, f(t)\right)$ where $\bar{f}(t) = \log E[e^{Yt}]$

Core Algorithm Flow

Algorithm 1: Probabilistic Bernoulli Polynomial Representation (Theorem 3.1)

Step 1: Compute $a_0$ $a_0 = \int_0^1 \frac{t}{f(t)} p(x) dx$

Step 2: Construct auxiliary function $a(x) = p(x+1) - p(x) = \Delta p(x)$

Step 3: Compute $a_{r+1}$ ( $r=0,1,\ldots,n-1$ )

Three equivalent forms:

(a) Based on difference operators: $a_{r+1} = \frac{1}{r+1} \sum_{j=r}^{n-1} S_1^Y(j,r) \frac{1}{j!} \Delta^{j+1} p(0)$

(b) Based on derivatives and Stirling numbers: $a_{r+1} = \frac{1}{r+1} \sum_{k=r}^{n-1} \sum_{j=r}^k \frac{1}{k!} S_2(k,j) S_1^Y(j,r) \Delta p^{(k)}(0)$

(c) Based on direct expansion: $a_{r+1} = \frac{1}{r+1} \sum_{j=r}^{n-1} \sum_{k=0}^{j+1} (-1)^{j+1-k} \frac{1}{j!} \binom{j+1}{k} S_1^Y(j,r) p(k)$

Algorithm 2: Probabilistic Degenerate Bernoulli Polynomial Representation (Theorem 3.3)

The structure is completely analogous to Algorithm 1, with only the replacement of $S_1^Y$ by $S_{1,\lambda}^Y$ and $f(t)$ by its corresponding degenerate version (whose compositional inverse is $\bar{f}(t) = \log E[e_\lambda^Y(t)]$ ).

Technical Innovation Points

1. Critical Role of Orthogonality

Through establishing orthogonality relations, the authors cleverly transform the representation problem into solving a linear system. Specifically:

Starting from $p(x) = \sum_{k=0}^n a_k B_k^Y(x)$
Computing the difference $\Delta p(x) = \sum_{k=1}^n k a_k \sum_{j=0}^{k-1} S_2^Y(k-1,j) (x)_j$
Using orthogonality to back-solve for $a_k$

2. Unification with Classical Cases

When $Y=1$ , the theory reduces to the representation theory of classical Bernoulli polynomials, with formulas simplifying to: $a_k = \frac{1}{k!} \int_0^1 p^{(k)}(x) dx$

3. Treatment of Higher-Order Representations

For higher-order cases (Theorems 4.1 and 4.2), two situations are distinguished: $r>n$ and $r\leq n$ :

When $r>n$ : All coefficients involve the integral operator $I^{r-k}$
When $r\leq n$ : The first $r$ terms involve integral operators, subsequent terms involve difference operators

4. Computational Techniques

By introducing algebraic properties of the linear integral operator $I$ and difference operator $\Delta$ , complex expressions are transformed into computable forms.

Experimental Setup

Dataset (Random Variable Selection)

The paper selected six representative random variables for verification:

Discrete random variables:

Bernoulli distribution: $p(0)=1-p, p(1)=p$ ( $0<p\leq 1$ )
Binomial distribution: Parameters $(m,p)$ , $p(i) = \binom{m}{i} p^i (1-p)^{m-i}$
Poisson distribution: Parameter $\alpha>0$ , $p(i) = e^{-\alpha} \frac{\alpha^i}{i!}$
Geometric distribution: Parameter $0<p<1$ , $p(i) = (1-p)^{i-1} p$

Continuous random variables: 5. Exponential distribution: Parameter $\alpha>0$ , $f(y) = \alpha e^{-\alpha y}$ ( $y\geq 0$ ) 6. Gamma distribution: Parameters $\alpha,\beta>0$ , $f(y) = \frac{\beta e^{-\beta y} (\beta y)^{\alpha-1}}{\Gamma(\alpha)}$

Computational Elements

For each random variable Y, the following must be computed (using results from reference 14):

$f_Y(t)$ : The compositional inverse of $\bar{f}_Y(t) = \log E[e^{Yt}]$
$f_{Y,\lambda}(t)$ : The compositional inverse of $\bar{f}_{Y,\lambda}(t) = \log E[e_\lambda^Y(t)]$
$S_1^Y(n,k)$ : First-kind probabilistic Stirling numbers
$S_{1,\lambda}^Y(n,k)$ : Degenerate version

Verification Strategy

For each random variable, compute the representation of $x^n$ : $x^n = \sum_{k=0}^n a_k B_k^Y(x) \quad \text{and} \quad x^n = \sum_{k=0}^n a_k \beta_{k,\lambda}^Y(x)$

Experimental Results

Main Results Presentation

Example 1: Exponential Distribution ( $\alpha>0$ )

This is the most concise example. From reference 14: $f_Y(t) = \alpha(1-e^{-t}), \quad S_1^Y(n,k) = (-1)^{n-k} \binom{n}{k} (n-1)^{n-k} \alpha^k$

Result: $x^n = \frac{1}{\alpha} B_0^Y(x) + \sum_{k=1}^n \left\{ \frac{1}{k} \sum_{j=k-1}^{n-1} (-1)^{j-k+1} \binom{j}{k-1} (j-1)^{j-k+1} \alpha^{k-1} \frac{1}{j!} \Delta^{j+1} 0^n \right\} B_k^Y(x)$

For the degenerate version: $x^n = \frac{1}{\alpha} \sum_{r=0}^n \sum_{l=0}^r \binom{n}{r} S_2(r,l) (-1)^{l-r} (\alpha\lambda)^l B_l \beta_{0,\lambda}^Y(x) + \cdots$

Analysis:

$a_0 = \frac{1}{\alpha}$ is remarkably concise, obtained through Lemma 5.1 by computing the integral
Coefficients involve combinations of Bernoulli numbers $B_l$ and Stirling numbers $S_2(r,l)$

Example 2: Bernoulli Distribution ( $0<p\leq 1$ )

From reference 14: $f_Y(t) = \log\left(1 + \frac{1}{p}(e^t-1)\right), \quad S_1^Y(n,k) = \frac{1}{p^n} S_1(n,k)$

Result: $x^n = \sum_{l=0}^n \frac{1}{p^{l-1}} S_2(n,l) b_l B_0^Y(x) + \sum_{k=1}^n \left\{ \frac{1}{k} \sum_{j=k-1}^{n-1} \frac{1}{p^j} S_1(j,k-1) \frac{1}{j!} \Delta^{j+1} 0^n \right\} B_k^Y(x)$

where $b_l$ are second-kind Bernoulli numbers, defined by $\frac{t}{\log(1+t)} = \sum_{l=0}^\infty b_l \frac{t^l}{l!}$ .

Key computation (Formula 5.5): $\frac{t}{f_Y(t)} x^n = \sum_{r=0}^n \sum_{l=0}^r \binom{n}{r} \frac{1}{p^{l-1}} S_2(r,l) b_l B_{n-r}(x)$

Example 3: Poisson Distribution ( $\alpha>0$ )

$f_Y(t) = \log\left(1 + \frac{t}{\alpha}\right), \quad S_1^Y(n,k) = \sum_{l=k}^n \frac{1}{\alpha^l} S_1(l,k) S_1(n,l)$

Result: $x^n = \sum_{l=0}^n \binom{n}{l} \frac{1}{n-l+1} \frac{1}{\alpha^{l-1}} b_l B_0^Y(x) + \sum_{k=1}^n \left\{ \frac{1}{k} \sum_{j=k-1}^{n-1} \sum_{l=k-1}^j \frac{1}{\alpha^l} S_1(l,k-1) S_1(j,l) \frac{1}{j!} \Delta^{j+1} 0^n \right\} B_k^Y(x)$

Example 4: Geometric Distribution ( $0<p<1$ )

This is the most complex example. It requires the use of Frobenius-Euler numbers $H_j^{(r)}(u)$ : $\left(\frac{1-u}{e^t-u}\right)^r = \sum_{n=0}^\infty H_n^{(r)}(u) \frac{t^n}{n!}$

Computation of $a_0$ (Formula 5.23): $a_0 = \frac{1}{p} \sum_{j=0}^n \sum_{l=0}^\infty \sum_{r=0}^l (-1)^r \frac{1}{l!} \binom{n}{j} \binom{l}{r} \left(\frac{p}{1-p}\right)^l b_l H_j^{(r)}\left(\frac{p}{p-1}\right) (1-p(1-\delta_{n,j}))$

Case Analysis: Role of Lemma 5.1

Lemma 5.1: $\int_0^1 B_n(x) dx = \delta_{n,0}, \quad \int_0^1 B_n(-x) dx = (-1)^n$

This lemma plays a key role in computing $a_0$ in all examples. For instance, in the exponential distribution case: $a_0 = \int_0^1 \frac{t}{\alpha(1-e^{-t})} x^n dx = \frac{1}{\alpha} (-1)^n \int_0^1 B_n(-x) dx = \frac{1}{\alpha}$

Experimental Findings

Significant simplification effects: Compared to complex proofs in the literature (e.g., Miki identity requiring Fermat quotients or p-adic analysis), this paper's method requires only integral and difference computations
Universality: All examples follow the same computational framework, with only the specific $f_Y(t)$ and $S_1^Y(n,k)$ differing
Computational complexity:
- Discrete distributions are typically more concise (e.g., Bernoulli, Poisson)
- Continuous distributions may involve more complex integrals (e.g., geometric distribution)
- Exponential distribution is the most concise
Additional complexity of degenerate versions: Representations of degenerate Bernoulli polynomials typically involve additional Stirling number summations

1. Historical Development

Classical theory:

Representation theory of Bernoulli polynomials is foundational content in special function theory
Formula (3.22) gives the classical result: $p(x) = \sum_{k=0}^n a_k B_k(x)$ , where $a_k = \frac{1}{k!} \int_0^1 p^{(k)}(x) dx$

Degenerate theory:

Carlitz (1979) 4: Pioneering research on degenerate Stirling numbers, Bernoulli numbers, and Euler numbers
Recent work by Kim et al. 13,16,19,20,23: Systematic development of degenerate special polynomial theory

Probabilistic extensions:

Adell et al. 1,2,3: Introduction of probabilistic Stirling numbers
Kim et al. 18,21,22: Development of probabilistic degenerate polynomial theory

Distinction from Adell-Bényi 2:

2 defines $S_1^Y(n,k)$ based on cumulant generating functions
This paper defines based on compositional inverse, ensuring orthogonality
Key advantage: Orthogonality makes the inverse problem solvable

Distinction from Kim-Kim 18:

18 addresses the degenerate case but does not provide general representation theory
This paper uniformly handles both non-degenerate and degenerate cases

Comparison with Kim-Kim 16:

16 provides representation of degenerate Bernoulli polynomials $\beta_{k,\lambda}(x)$ (with $Y=1$ )
This paper extends to general random variables Y

3. Innovation in Application Examples

Miki identity (Formula 1.1): $\sum_{k=1}^{n-1} \frac{B_k(x) B_{n-k}(x)}{k(n-k)} = \frac{2}{n} \sum_{k=0}^{n-2} \frac{1}{n-k} \binom{n}{k} B_{n-k} B_k(x) + \frac{2}{n} H_{n-1} B_n(x)$

Traditional proof methods:

Miki 24: Using Fermat quotient formulas modulo $p^2$
Shiratani-Yokoyama 30: p-adic analysis
Gessel 12: Two expressions for Stirling numbers

This paper's method: Direct application of formula (3.22), requiring only derivative and integral computations

Conclusions and Discussion

Main Conclusions

Theoretical completeness: Established a complete representation theory for probabilistic Bernoulli polynomials and degenerate versions, including base and higher-order cases
Computational effectiveness: Provided three equivalent coefficient computation formulas suitable for different computational scenarios
Broad applicability: The theory applies to arbitrary random variables whose moment generating functions exist in a neighborhood of the origin
Simplified proofs: Provided more concise proof pathways for known identities

Limitations

Restrictive conditions:
- Requires $E[Y] \neq 0$
- Moment generating function must exist in a neighborhood of the origin
- Excludes some important distributions (e.g., Cauchy distribution)
Computational complexity:
- Requires pre-computation of $S_1^Y(n,k)$ and $S_{1,\lambda}^Y(n,k)$
- For complex distributions (e.g., geometric distribution), formulas may be extremely complicated
Numerical stability:
- Involves high-order differences and Stirling numbers, potentially causing numerical stability issues
- Paper does not discuss numerical implementation
Theoretical depth:
- Primarily derivations of combinatorial identities
- Lacks asymptotic analysis or exploration of deeper number-theoretic properties

Future Directions

The paper does not explicitly propose future directions, but the following can be inferred:

Extension to other special polynomials: Study of Euler polynomials, Genocchi polynomials, etc.
Multivariate generalization: Research on multivariate probabilistic Bernoulli polynomials
Numerical algorithms: Development of stable and efficient numerical computation methods
Application exploration: Search for applications in number theory, combinatorics, and quantum field theory

In-Depth Evaluation

Strengths

1. Methodological Innovation (★★★★☆)

Orthogonality framework: By ensuring orthogonality of Stirling numbers, resolves key defects in references 2,18
Umbral calculus application: Systematic use of umbral calculus theory makes proofs elegant and concise
Unified theory: Incorporates non-degenerate, degenerate, and higher-order cases into a unified framework

2. Theoretical Completeness (★★★★★)

Four main theorems: Cover all important cases (Theorems 3.1, 3.3, 4.1, 4.2)
Two foundational propositions: Establish orthogonality and inverse relations (Propositions 1.1, 1.2)
Systematic preliminaries: Section 1 provides thorough background introduction

3. Experimental Sufficiency (★★★★☆)

Six random variables: Cover common discrete and continuous distributions
Two representations: Each example provides both non-degenerate and degenerate versions
Detailed computations: Key intermediate steps are shown (e.g., Formulas 5.5, 5.19-5.20)

Shortcomings:

Lacks numerical verification
Does not compare computational efficiency of different formulas

4. Writing Clarity (★★★★★)

Clear structure: From preliminaries → umbral calculus → main results → examples, with rigorous logic
Standardized notation: Consistently uses superscript Y to denote association with random variables
Sufficient detail: Proof steps are detailed, facilitating reader understanding

Weaknesses

1. Theoretical Limitations

Stringent conditions: $E[Y] \neq 0$ excludes symmetric distributions (e.g., standard normal distribution)
Lack of error analysis: Does not discuss truncation error or numerical precision

2. Experimental Insufficiency

No numerical implementation: All results are in symbolic form without numerical examples
No performance comparison: Which of the three formula forms computes fastest?
No visualization: No graphs of polynomials or coefficients

3. Limited Applications

Theory-oriented: Primarily mathematical derivations, lacking practical application scenarios
Weak connection to probability: Although random variables are introduced, probabilistic significance is not deeply explored

4. Literature Review

Scattered related work: Distributed across introduction and Section 5, not sufficiently concentrated
Insufficient comparison: Technical comparison with 2,18 lacks detail

Impact Assessment

1. Contribution to the Field (★★★★☆)

Fills theoretical gaps: Resolves the orthogonality problem for probabilistic Stirling numbers
Methodological contribution: Demonstrates the power of umbral calculus in probabilistic extensions
Interdisciplinary connection: Combines probability theory, combinatorics, and special function theory

Potential impact:

May become a standard reference for probabilistic special function theory
May inspire probabilistic extensions of other special polynomials

2. Practical Value (★★★☆☆)

Symbolic computation: Can be used in computer algebra systems (e.g., Mathematica, Maple)
Theoretical tool: Provides new tools for proving combinatorial identities
Educational value: Suitable as supplementary material for special functions courses

Limitations:

Direct application scenarios unclear
Requires further development for practical problem applications

3. Reproducibility (★★★☆☆)

Clear formulas: All formulas have clear definitions
External dependencies: Key computation of $S_1^Y(n,k)$ depends on reference 14
No code: No implementation code provided

Recommendations:

Provide Mathematica or Python implementations
Establish online calculator

Applicable Scenarios

1. Theoretical Research

Combinatorial identity proofs: Simplify proofs of complex identities
Special function theory: Extend Bernoulli polynomial theory
Number theory: Potentially applicable to congruence properties of Bernoulli numbers

2. Symbolic Computation

Polynomial expansion: Expand arbitrary polynomials in special function bases
Integral computation: Simplify integrals using Bernoulli polynomial properties

3. Education

Special functions courses: Demonstrate modern research methods
Combinatorics: Advanced applications of Stirling numbers
Umbral calculus: Concrete application examples

4. Potential Applications

Quantum field theory: Applications of Bernoulli numbers in Feynman diagram calculations
Gromov-Witten theory: Connection with FPZ identities
Asymptotic analysis: Potentially applicable to asymptotic expansions of certain sums

Comprehensive Scoring

Dimension	Score	Remarks
Innovation	8/10	Orthogonality framework is key innovation
Theoretical Depth	9/10	Complete theory, rigorous proofs
Practicality	6/10	Primarily theoretical contribution
Writing Quality	9/10	Clear, systematic, thorough
Experimental Sufficiency	7/10	Rich examples but lacks numerical verification
Overall Evaluation	7.8/10	Excellent theoretical work

Key References

2 J. A. Adell, B. Bényi, Probabilistic Stirling numbers and applications, Aequat. Math. 98 (2024), 1627-1646.

Introduces probabilistic Stirling numbers but definition lacks orthogonality

4 L. Carlitz, Degenerate Stirling, Bernoulli and Eulerian numbers, Utilitas Math. 15 (1979), 51-88.

Pioneering work on degenerate special numbers

14 D. S. Kim, T. Kim, Probabilisitc Stirling and degenerate Stirling numbers, Preprint.

Provides $S_1^Y(n,k)$ computation results needed for this paper

16 D. S. Kim, T. Kim, Representing polynomials by degenerate Bernoulli polynomials, Quaest. Math. 46 (2022), no. 5, 959-980.

Prior work for the case $Y=1$

27-28 S. Roman, The umbral calculus series

Standard reference for umbral calculus

Summary: This is a high-quality theoretical mathematics paper making substantive contributions to probabilistic special function theory. By establishing an orthogonality framework, the authors resolve key defects in existing literature and develop a complete representation theory, laying a solid foundation for subsequent research. The paper's main value lies in the systematicity of the theory and the elegance of the methods. Primary improvement opportunities lie in adding numerical experiments and exploring practical applications.