Issue | 4open, Volume 6, 2023, Statistical Inference in Markov Processes and Copula Models
---|---
Article Number | 2
Number of page(s) | 7
Section | Mathematics - Applied Mathematics
DOI | https://doi.org/10.1051/fopen/2023001
Published online | 14 February 2023
Research Article
A method for the elicitation of copulas
Department of Statistics, University of Campinas, Sergio Buarque de Holanda, 651, Campinas, S.P. CEP: 13083-859, Brazil
* Corresponding author: v245362@dac.unicamp.br; viniliff@gmail.com
Received: 15 September 2022
Accepted: 4 January 2023
In this paper, we introduce a method to construct copulas. The method is based on combining the partial derivatives of two copulas. We prove that the proposed method provides a copula. Then, we exemplify the application of the method in several cases, illustrating the versatility of the method. We also prove that using copulas from the family introduced in Rodríguez-Lallena and Úbeda-Flores (2004) [Stat Probab Lett 66, 3, 315–325. https://doi.org/10.1016/j.spl.2003.09.010], the method provides a copula inside that family.
Key words: Dependence / Exchangeability / Representations of de Finetti
© V.A. González-López et al., Published by EDP Sciences, 2023
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Introduction
Because it is convenient to have a variety of copulas at our disposal, which widens the options for modeling, in this paper we propose a method to build copulas. The method provides a tool to elicit a copula by combining other copulas. We limit the exposition to dimension 2, using 2-copulas to build 2-copulas, but the technique can be easily extended to d-copulas, with d > 2.
Consider two absolutely continuous random variables U and V with Uniform distributions on [0, 1]; our goal is to build a 2-copula between U and V. To do that, we consider Θ, a third absolutely continuous random variable linking U and V. The proposal of this paper requires previous knowledge of the dependence between (a) U and Θ, and (b) V and Θ. To simplify the exposition, and because we want to propose a method for eliciting 2-copulas between U and V, we also assume that Θ has a Uniform distribution on [0, 1]. In this way, to construct a copula between U and V, it is necessary to postulate a 2-copula between U and Θ and another 2-copula between V and Θ.
To give a clear meaning to Θ, which, roughly speaking, is a latent variable embedded in the dependence between U and V, we appeal to the principles of de Finetti’s representation theorems. Those theorems, proved in de Finetti [1] and Hewitt and Savage [2], indicate that under certain conditions (see Aldous [3] and Mai [4]) there is a random variable Θ given which U and V are conditionally independent. Those results provide the intuition behind the method proposed in this paper.
For practicality and to expand the scope of the method, in this paper we do not necessarily take Θ to be the variable identified in de Finetti [1] and Hewitt and Savage [2], since the idea we want to present is that two 2-copulas, one between U and Θ and one between V and Θ, are enough to postulate the resulting 2-copula between U and V.
The paper is organized as follows. The section A copula construction method presents the method for the construction of 2-copulas, together with examples covering different situations. The section Rodríguez–Lallena and Úbeda–Flores Family studies the performance of the method on the family of 2-copulas proposed in Rodríguez-Lallena and Úbeda-Flores [5]. The Conclusion section presents the conclusions.
A copula construction method
In this section, we present the notion of copula and some implications of its definition, on partial derivatives. These notions allow us to define the copula elicitation method that is based on a theorem that we postulate and prove in this section. We exemplify the applicability of the method by addressing examples that allow us to assess the scope of the method.
Definition 2.1
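A 2-copula is a function $C: [0, 1]^2 \to [0, 1]$ with the following properties,
- i. $C(u, 0) = 0 = C(0, v)$, $C(u, 1) = u$ and $C(1, v) = v$, for all $u, v \in [0, 1]$;
- ii. for every $u_i, v_i \in [0, 1]$, $i = 1, 2$, such that $u_1 \le u_2$ and $v_1 \le v_2$, $V_C([u_1, u_2] \times [v_1, v_2]) = C(u_2, v_2) - C(u_2, v_1) - C(u_1, v_2) + C(u_1, v_1) \ge 0$.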
Definition 2.1 implies that a 2-copula is a joint cumulative distribution of variables with Unif([0, 1]) marginal distributions. In this paper, we simply use the term copula (and not 2-copula) since the technique and concepts can be extended to dimensions greater than 2, and we limit ourselves to their development in dimension 2.
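The two properties of Definition 2.1 can be checked numerically on a grid. The following is a minimal sketch, not part of the original paper: the helper names `amh1` and `is_copula_on_grid` are hypothetical, and a grid check is only a necessary condition, since it cannot prove the properties for all points of $[0, 1]^2$.

```python
def amh1(u, v):
    """Ali-Mikhail-Haq copula with parameter 1: uv / (u + v - uv)."""
    if u == 0.0 or v == 0.0:
        return 0.0
    return u * v / (u + v - u * v)


def is_copula_on_grid(C, n=50, tol=1e-12):
    """Check properties i and ii of Definition 2.1 on an (n+1) x (n+1) grid."""
    grid = [k / n for k in range(n + 1)]
    # Property i: C vanishes on the lower edges and has uniform margins.
    for x in grid:
        if abs(C(x, 0.0)) > tol or abs(C(0.0, x)) > tol:
            return False
        if abs(C(x, 1.0) - x) > tol or abs(C(1.0, x) - x) > tol:
            return False
    # Property ii: every grid cell has non-negative C-volume. Volumes are
    # additive, so this covers every rectangle with corners on the grid.
    for i in range(n):
        for j in range(n):
            u1, u2, v1, v2 = grid[i], grid[i + 1], grid[j], grid[j + 1]
            vol = C(u2, v2) - C(u2, v1) - C(u1, v2) + C(u1, v1)
            if vol < -tol:
                return False
    return True
```

For instance, `is_copula_on_grid(amh1)` passes, while the function $uv + 2u(1-u)v(1-v)$ keeps the correct margins but fails property ii near the corner $(0, 1)$.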
From Theorem 2.2.7 of Nelsen [6], if C is a copula, for any $v \in [0, 1]$ the partial derivative $\partial C(u, v)/\partial u$ exists for almost all u (in the sense of Lebesgue measure). Also, the function $v \mapsto \partial C(u, v)/\partial u$ is nondecreasing almost everywhere on [0, 1]. Similarly, these properties are valid for the coordinate v. We see an example below.
Example 2.2
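Let C be a function given by $$C(u, v) = \frac{uv}{u + v - uv}, \quad u, v \in [0, 1].$$ Then, C is a copula. Note that this copula comes from the Ali–Mikhail–Haq family of copulas, with parameter equal to 1, since the general expression of that family is $C_\theta(u, v) = uv / [1 - \theta(1 - u)(1 - v)]$. And, $$\frac{\partial C(u, v)}{\partial u} = \frac{v^2}{(u + v - uv)^2}.$$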
Functions like $\partial C(u, v)/\partial u$ are frequently used, together with Sklar’s theorem, for generating samples from joint distributions with prescribed marginal distributions and copula, see for instance Nelsen [6]. On closer inspection, note that $$\frac{\partial C(u, v)}{\partial u} = \mathrm{Prob}(V \le v \mid U = u). \tag{1}$$ In a similar way to equation (1), we have that $\partial C(u, v)/\partial v = \mathrm{Prob}(U \le u \mid V = v)$. We show in Theorem 2.3 how the functions $\partial C_1(u, t)/\partial t$ and $\partial C_2(v, t)/\partial t$ can be used to propose a copula. More specifically, Theorem 2.3 constitutes the basis of a new method for elicitation of copulas. We consider two copulas $C_1$ and $C_2$, following Definition 2.1, we calculate their partial derivatives (Eq. (1)) and finally we combine them (this is the proposal of the theorem), obtaining a new copula.
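The sampling use of the partial derivative mentioned above can be sketched for the Ali–Mikhail–Haq copula with parameter 1, $C(u, v) = uv/(u + v - uv)$. This is an illustration under stated assumptions, not the authors' code: the function names are hypothetical, and the conditional inverse is obtained here by solving $\partial C(u, v)/\partial u = w$ for v by hand.

```python
import random


def amh1(u, v):
    """Ali-Mikhail-Haq copula with parameter 1."""
    if u == 0.0 or v == 0.0:
        return 0.0
    return u * v / (u + v - u * v)


def d_amh1_du(u, v):
    """Closed-form partial derivative in u: (v / (u + v - uv))**2, for u + v > 0."""
    return (v / (u + v - u * v)) ** 2


def sample_v_given_u(u, w):
    """Solve d_amh1_du(u, v) = w for v, with w in [0, 1)."""
    s = w ** 0.5
    return s * u / (1.0 - s * (1.0 - u))


def sample_pair(rng):
    """Draw (u, v) from the copula: u is uniform, v comes from the conditional inverse."""
    u = rng.random()
    w = rng.random()
    return u, sample_v_given_u(u, w)
```

A call such as `sample_pair(random.Random(1))` returns one pair in $[0, 1]^2$; repeating the call with the same generator yields a sample from the copula.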
Theorem 2.3
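If $C_1$ and $C_2$ are two copulas, see Definition 2.1, then $$C(u, v) = \int_0^1 \frac{\partial C_1(u, t)}{\partial t}\,\frac{\partial C_2(v, t)}{\partial t}\,dt \tag{2}$$ is a copula.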
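Proof. Property i. of Definition 2.1: from equation (1) applied to $C_1$, $$C(u, v) = \int_0^1 \mathrm{Prob}(U \le u \mid \Theta = t)\,\frac{\partial C_2(v, t)}{\partial t}\,dt. \tag{3}$$
If $u = 0$, $\mathrm{Prob}(U \le 0 \mid \Theta = t) = 0$, then $C(0, v) = 0$, $\forall v \in [0, 1]$, since $\partial C_2(v, t)/\partial t$ is well defined ($0 \le \partial C_2(v, t)/\partial t \le 1$ by Theorem 2.2.7 in Nelsen [6]). Similarly, $C(u, 0) = 0$, $\forall u \in [0, 1]$.
If $u = 1$, $\mathrm{Prob}(U \le 1 \mid \Theta = t) = 1$, then equation (3) is $$C(1, v) = \int_0^1 \frac{\partial C_2(v, t)}{\partial t}\,dt = C_2(v, 1) - C_2(v, 0) = v,$$ since the marginal distribution of the copula $C_2$ is Uniform on [0, 1]. Similarly, $C(u, 1) = u$, $\forall u \in [0, 1]$.
Property ii. of Definition 2.1: consider $u_i, v_i \in [0, 1]$, $i = 1, 2$, such that $u_1 \le u_2$ and $v_1 \le v_2$; we need to prove $V_C(A) \ge 0$ for $A = [u_1, u_2] \times [v_1, v_2]$. Write C as $$C(u, v) = \int_0^1 \mathrm{Prob}(U \le u \mid \Theta = t)\,\mathrm{Prob}(V \le v \mid \Theta = t)\,dt,$$ then, $$V_C(A) = C(u_2, v_2) - C(u_2, v_1) - C(u_1, v_2) + C(u_1, v_1) = \int_0^1 \mathrm{Prob}(u_1 < U \le u_2 \mid \Theta = t)\,\mathrm{Prob}(v_1 < V \le v_2 \mid \Theta = t)\,dt \ge 0, \tag{4}$$ where equation (4) follows from the assumptions $u_1 \le u_2$ and $v_1 \le v_2$.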
Then, C is a function following Definition 2.1. □
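The construction of Theorem 2.3 can be approximated numerically for arbitrary input copulas. The sketch below assumes that equation (2) reads $C(u, v) = \int_0^1 \partial C_1(u, t)/\partial t \cdot \partial C_2(v, t)/\partial t \, dt$; it replaces the partial derivatives with central finite differences and the integral with a midpoint rule, so it is an approximation and not an exact evaluation. All function names are hypothetical.

```python
def d_dt(C, x, t, h=1e-5):
    """Central finite-difference approximation of dC(x, t)/dt, kept inside [0, 1]."""
    lo, hi = max(t - h, 0.0), min(t + h, 1.0)
    return (C(x, hi) - C(x, lo)) / (hi - lo)


def combine(C1, C2, u, v, n=2000):
    """Midpoint-rule approximation of the integral in equation (2)."""
    total = 0.0
    for k in range(n):
        t = (k + 0.5) / n
        total += d_dt(C1, u, t) * d_dt(C2, v, t)
    return total / n


def independence(x, t):
    """Independence copula, Pi(x, t) = x * t."""
    return x * t


def fgm(lam):
    """Farlie-Gumbel-Morgenstern copula with parameter lam in [-1, 1]."""
    return lambda x, t: x * t + lam * x * (1 - x) * t * (1 - t)
```

As a sanity check, combining the independence copula with any copula should return $uv$, so `combine(independence, fgm(0.5), 0.3, 0.6)` should be close to 0.18, and `combine(fgm(1.0), fgm(1.0), 1.0, 0.4)` should be close to 0.4, in line with property i.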
Thus, the strategy that we propose in this article seeks to fabricate a copula between U and V, with the help of a third quantity, say Θ.
We can then say that, given two copulas $C_1$ and $C_2$, Theorem 2.3 states how to combine them to build a copula between U and V. That is, if $C_1$ is a copula between U and Θ and $C_2$ is a copula between V and Θ, we calculate $\partial C_1(u, t)/\partial t$ and $\partial C_2(v, t)/\partial t$; then, through equation (2), we obtain a copula between U and V.
Equation (2) shows that the role that the variable Θ plays is to make the variables U and V conditionally independent given Θ = t, since, $$C(u, v) = \int_0^1 \mathrm{Prob}(U \le u \mid \Theta = t)\,\mathrm{Prob}(V \le v \mid \Theta = t)\,dt.$$
The variable Θ that allows U and V to be linked could be interpreted as being the variable identified by de Finetti representations, or a transformation of it, since in Theorem 2.3, Θ has Uniform distribution on [0, 1]. Although this aspect escapes the focus of this article, we recommend the reader some papers on the subject, de Finetti [1], Hewitt and Savage [2].
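The conditional independence reading suggests a simple way to simulate from the constructed copula: draw Θ, then draw U and V independently from their conditional distributions given Θ. The following is a sketch under assumptions not stated in the paper: both input copulas are taken to be Farlie–Gumbel–Morgenstern copulas, whose conditional distribution $x + \lambda x(1 - x)(1 - 2t)$ can be inverted in closed form, and the function names are hypothetical.

```python
import random


def fgm_conditional_inverse(w, t, lam):
    """Solve x + lam * x * (1 - x) * (1 - 2t) = w for x in [0, 1].

    Uses the numerically stable form of the quadratic root, which also
    covers the case lam * (1 - 2t) = 0 (then x = w).
    """
    a = lam * (1.0 - 2.0 * t)
    disc = (1.0 + a) ** 2 - 4.0 * a * w
    return 2.0 * w / ((1.0 + a) + disc ** 0.5)


def sample_uv(rng, lam1, lam2):
    """Draw (U, V): latent Theta first, then U and V independently given Theta."""
    t = rng.random()  # Theta ~ Unif[0, 1)
    u = fgm_conditional_inverse(rng.random(), t, lam1)
    v = fgm_conditional_inverse(rng.random(), t, lam2)
    return u, v
```

Only the pair (u, v) is returned; the latent value t is discarded, which corresponds to integrating over t in equation (2).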
We present three examples, which seek to show the potential of equation (2), as a method of constructing copulas. The first produces a copula that generalizes the Farlie–Gumbel–Morgenstern family (see Eyraud [7]), the second produces a copula in the Ali–Mikhail–Haq family (see Ali et al. [8]), the third example shows that although we choose two copulas C1 and C2 in the same family of copulas, equation (2) does not necessarily generate a copula in the same family of C1 and C2.
The following example shows a case that combines two copulas, one of them being a member of the Farlie-Gumbel–Morgenstern family. Through the method proposed by Theorem 2.3, we build a copula that is a generalization of the Farlie–Gumbel–Morgenstern copula.
Example 2.4
Consider two copulas, and , and with ,
In this case, equation (2) is which is a type of generalization of the Farlie–Gumbel–Morgenstern family of copulas (see Rodríguez-Lallena and Úbeda-Flores [5]), and note that C2 is the Farlie–Gumbel–Morgenstern copula, with parameter α.
The following example shows a case that produces a copula, member of the Ali–Mikhail–Haq family, see Ali et al. [8] and Nelsen [6].
Example 2.5
Consider two copulas, C1 and C2, ,
Now, in this case, equation (2) is $C(u, v) = uv/(u + v - uv)$, which is a case of the Ali–Mikhail–Haq copula (with parameter equal to 1). Note that this copula is the one used in Example 2.2.
The following example exposes a case that shows that even if the copulas C1 and C2 are coming from the same copula’s family, the resulting copula (by the application of Theorem 2.3) is not a member of the same family of C1 and C2.
Example 2.6
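Let $C_1$ and $C_2$ be two Gumbel–Hougaard copulas (see Hougaard [9]), $C_1(u, t) = ut\exp(-\alpha \ln(u)\ln(t))$, $C_2(v, t) = vt\exp(-\beta \ln(v)\ln(t))$, $u, v, t \in [0, 1]$, with $\alpha, \beta \in (0, 1]$, $$\frac{\partial C_1(u, t)}{\partial t} = u\,(1 - \alpha\ln(u))\exp(-\alpha\ln(u)\ln(t)), \quad \frac{\partial C_2(v, t)}{\partial t} = v\,(1 - \beta\ln(v))\exp(-\beta\ln(v)\ln(t)).$$
Equation (2) is $$C(u, v) = \frac{uv\,(1 - \alpha\ln(u))(1 - \beta\ln(v))}{1 - \alpha\ln(u) - \beta\ln(v)}.$$ That is, the constructed copula is not a member of the Gumbel–Hougaard family.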
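The result of Example 2.6 can be cross-checked numerically. The sketch below is a check worked out here, not taken from the paper: it uses the analytic derivative $\partial C_1(u, t)/\partial t = u(1 - \alpha\ln u)\,t^{-\alpha\ln u}$, integrates the product of the two derivatives with a midpoint rule, and compares the value with the closed form $uv(1 - \alpha\ln u)(1 - \beta\ln v)/(1 - \alpha\ln u - \beta\ln v)$. Function names are hypothetical, and inputs are assumed to lie in (0, 1].

```python
import math


def d_c_dt(x, t, theta):
    """d/dt of x*t*exp(-theta*ln(x)*ln(t)), i.e. x*(1 - theta*ln x)*t**(-theta*ln x)."""
    p = -theta * math.log(x)
    return x * (1.0 + p) * t ** p


def combined_numeric(u, v, alpha, beta, n=20000):
    """Midpoint-rule approximation of the integral of the two derivatives over t."""
    total = 0.0
    for k in range(n):
        t = (k + 0.5) / n
        total += d_c_dt(u, t, alpha) * d_c_dt(v, t, beta)
    return total / n


def combined_closed_form(u, v, alpha, beta):
    """Closed form obtained by integrating t**(-alpha*ln u - beta*ln v) over [0, 1]."""
    a = 1.0 - alpha * math.log(u)
    b = 1.0 - beta * math.log(v)
    return u * v * a * b / (a + b - 1.0)
```

Setting $v = 1$ in the closed form returns $u$, as required of a copula margin.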
In this section, we introduce the notion of the partial derivative $\partial C(u, v)/\partial u$, fundamental for the copula construction method, which is the result of Theorem 2.3. This notion is familiar in random vector simulation methods, as described in Nelsen [6]. The introduced method proposes to construct a copula between two variables U and V, using a third variable Θ and combining the copula between U and Θ and the copula between V and Θ, through equation (2). Θ represents a random variable connecting U and V. We conclude the section by presenting three examples illustrating the construction of three different copulas.
The next subsection deals with the case of the family of copulas introduced in Rodríguez-Lallena and Úbeda-Flores [5]. Our goal is to identify if using two copulas C1 and C2 from that family, Theorem 2.3 gives us a copula in that family. We know that it is not always the case, see for instance Example 2.6. But Example 2.4 seems to indicate that in certain families (like Farlie–Gumbel–Morgenstern family) the construction of copulas proposed by Theorem 2.3 could behave in a closed way.
Rodríguez–Lallena and Úbeda–Flores Family
We start this section by introducing the analytic form of a family of copulas present in Rodríguez-Lallena and Úbeda-Flores [5]. We formalize the conditions for such a function to be a copula. It should be noted that the analytic form of this copula involves a huge range of particular cases, widely investigated in the literature. For instance it includes families of copulas with quadratic sections, see Quesada Molina and Rodríguez-Lallena [10], the family of copulas with cubic cross-sections proposed in Nelsen et al. [11], and families of positive quadrant dependent copulas as the one introduced in Lai and Xie [12]. See also Amblard and Girard [13] to obtain details about several properties of semi parametric families of copulas in this class.
Following Theorem 2.3 of Rodríguez-Lallena and Úbeda-Flores [5], the function $$C(u, v) = uv + f(u)g(v), \quad u, v \in [0, 1], \tag{5}$$ is a copula if and only if f and g are absolutely continuous, f(0) = f(1) = g(0) = g(1) = 0, and, with the derivatives of f and g existing on $A_f$ and $A_g$, respectively, $$\alpha = \inf\{f'(u) : u \in A_f\}, \quad \beta = \sup\{f'(u) : u \in A_f\}, \tag{6}$$ $$\gamma = \inf\{g'(v) : v \in A_g\}, \quad \eta = \sup\{g'(v) : v \in A_g\}, \tag{7}$$ the following condition holds, $$\min\{\alpha\eta, \beta\gamma\} \ge -1. \tag{8}$$
The family of copulas of the form given by equation (5) generalizes the Farlie–Gumbel–Morgenstern copula and others that were developed based on the idea of constructing copulas that are perturbations of the independence copula, Π(u, v) = uv, attaining, for example, weak types of dependence. It is worth mentioning here that the independence copula Π(u, v) corresponds to the Boltzmann–Gibbs–Shannon notion of entropy, while the copula given by equation (5) corresponds to the Tsallis–Havrda–Chavát entropy (see García et al. [14]). That is, while the copula that maximizes the Boltzmann–Gibbs–Shannon entropy is the copula Π, the copula that maximizes the Tsallis–Havrda–Chavát entropy is the one given by equation (5). In summary, this last family acquires a practical meaning in terms of the quantification of chaos made by the Tsallis–Havrda–Chavát notion of entropy.
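The copula condition of Rodríguez-Lallena and Úbeda-Flores [5] for the form $uv + f(u)g(v)$ can be explored numerically. The sketch below assumes the condition reads $\min\{\inf f' \cdot \sup g', \sup f' \cdot \inf g'\} \ge -1$ and approximates the infimum and supremum of each derivative on a grid, which is only reliable for smooth f and g. Function names are hypothetical.

```python
def derivative_bounds(df, n=1000):
    """Approximate the inf and sup of a derivative df over a grid on [0, 1]."""
    values = [df(k / n) for k in range(n + 1)]
    return min(values), max(values)


def satisfies_condition(df, dg, tol=1e-12):
    """Check min{inf f' * sup g', sup f' * inf g'} >= -1 (up to tol)."""
    f_inf, f_sup = derivative_bounds(df)
    g_inf, g_sup = derivative_bounds(dg)
    return min(f_inf * g_sup, f_sup * g_inf) >= -1.0 - tol


def fgm_df(lam):
    """Derivative of f(u) = lam * u * (1 - u)."""
    return lambda u: lam * (1.0 - 2.0 * u)
```

With $g(v) = v(1 - v)$, that is `fgm_df(1.0)` as the second argument, the check passes for $\lambda = 1$ and fails for $\lambda = 1.5$, which matches the usual parameter range $[-1, 1]$ of the Farlie–Gumbel–Morgenstern family.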
The following corollary shows that the family of copulas introduced in Rodríguez-Lallena and Úbeda-Flores [5] is preserved, by applying the copula construction given by Theorem 2.3. The following result is established using Corollary 2.4 (Rodríguez-Lallena and Úbeda-Flores [5]). Corollary 2.4 adapts what is established by Theorem 2.3 (Rodríguez-Lallena and Úbeda-Flores [5]). Note that Theorem 2.3 shows that under the constraints of equations (6)–(8) the functional form uv + f(u)g(v) is a copula, while Corollary 2.4 investigates a functional form that introduces a constant (a parameter δ), being the investigated form uv + δf(u)g(v). The result identifies the range of possible values for such a parameter δ, to guarantee that the equation follows Definition 2.1.
Corollary 3.1
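Let $C_1$ and $C_2$ be two copulas given by equation (5), $C_1(u, t) = ut + f_1(u)g_1(t)$, $C_2(v, t) = vt + f_2(v)g_2(t)$; then the copula resulting from equation (2) follows the form given by equation (5), with $$C(u, v) = uv + \delta f_1(u) f_2(v), \quad \delta = \int_0^1 g_1'(t)\,g_2'(t)\,dt,$$ and $$-\frac{1}{\max\{\alpha_1\alpha_2, \beta_1\beta_2\}} \le \delta \le -\frac{1}{\min\{\alpha_1\beta_2, \beta_1\alpha_2\}},$$ where $\alpha_i = \inf\{f_i'(u) : u \in A_{f_i}\}$ and $\beta_i = \sup\{f_i'(u) : u \in A_{f_i}\}$, $i = 1, 2$, follow (6).
Proof. $C(u, v) = uv + \delta f_1(u) f_2(v)$ is a direct consequence of equation (2). In turn, the condition on δ follows from Corollary 2.4 of Rodríguez-Lallena and Úbeda-Flores [5]. □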
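The computation behind Corollary 3.1 can be written out step by step. The following is a sketch, assuming that equation (2) is the integral over t of the product $\partial C_1(u, t)/\partial t \cdot \partial C_2(v, t)/\partial t$ and writing δ for the constant that appears; it uses only $g_i(0) = g_i(1) = 0$, $i = 1, 2$.

```latex
\begin{aligned}
C(u, v) &= \int_0^1 \bigl(u + f_1(u)\,g_1'(t)\bigr)\bigl(v + f_2(v)\,g_2'(t)\bigr)\,dt \\
        &= uv + u f_2(v)\,[g_2(1) - g_2(0)] + v f_1(u)\,[g_1(1) - g_1(0)]
           + f_1(u) f_2(v) \int_0^1 g_1'(t)\,g_2'(t)\,dt \\
        &= uv + \delta\, f_1(u) f_2(v), \qquad \delta = \int_0^1 g_1'(t)\,g_2'(t)\,dt .
\end{aligned}
```

The two middle terms vanish because $g_1$ and $g_2$ are zero at both endpoints, which leaves a function of the form $uv + f(u)g(v)$ with $f = f_1$ and $g = \delta f_2$.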
What we see in Corollary 3.1 allows us to say that all generalizations of the Farlie–Gumbel–Morgenstern copula, which are particular cases of equation (5), are preserved by the copula construction method proposed in this paper.
Example 3.2
Consider two copulas, C1 and C 2 , and with u, v, t ∈ [0, 1], a i , b i ≥ 1, and , with i = 1,2. For details on the properties of the copulas C1 and C2, see specific literature Lai and Xie [12] and Rodríguez-Lallena and Úbeda-Flores [5]. We can also see the use of these models in practice in Fernández et al. [15].
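Define $f_i$ and $g_i$, $i = 1, 2$, as the factors in $C_i(x, t) = xt + f_i(x)g_i(t)$. Applying Corollary 3.1, $C(u, v) = uv + \delta f_1(u) f_2(v)$ and $\delta = \int_0^1 g_1'(t)\,g_2'(t)\,dt$. If we fix the values $a_i = b_i = 1$, $i = 1, 2$, $C_1$ and $C_2$ are the usual Farlie–Gumbel–Morgenstern copulas with parameters $\lambda_1$ and $\lambda_2$ respectively and $-1 \le \lambda_i \le 1$, $i = 1, 2$. Then, $\delta = \int_0^1 (1 - 2t)^2\,dt = 1/3$ and the resulting copula is also a Farlie–Gumbel–Morgenstern copula with parameter $\lambda_1\lambda_2/3$.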
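The Farlie–Gumbel–Morgenstern special case can be verified with a few lines of code. The sketch below assumes the split $f_i(u) = \lambda_i u(1 - u)$, $g_i(t) = t(1 - t)$, so that $g_i'(t) = 1 - 2t$, and approximates $\delta = \int_0^1 g_1'(t)\,g_2'(t)\,dt$ with a midpoint rule. Function names are hypothetical.

```python
def delta(dg1, dg2, n=100000):
    """Midpoint-rule approximation of the integral of dg1(t) * dg2(t) over [0, 1]."""
    return sum(dg1((k + 0.5) / n) * dg2((k + 0.5) / n) for k in range(n)) / n


def dg_fgm(t):
    """Derivative of g(t) = t * (1 - t)."""
    return 1.0 - 2.0 * t


def combined_fgm_parameter(lam1, lam2):
    """Parameter of the combined copula, lam1 * lam2 * delta, under the split above."""
    return lam1 * lam2 * delta(dg_fgm, dg_fgm)
```

Since δ is close to 1/3 here, the combined parameter always lies in $[-1/3, 1/3]$, so the construction applied to two Farlie–Gumbel–Morgenstern copulas yields a weaker dependence than either input can reach.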
The following corollary tells us how the method behaves when only one of the copulas chosen for the construction of C follows equation (5).
Corollary 3.3
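Let $C_1$ and $C_2$ be two copulas, with $C_1$ given by equation (5), $C_1(u, t) = ut + f_1(u)g_1(t)$; then the copula resulting from equation (2) follows the form given by equation (5), $$C(u, v) = uv + f_1(u)\int_0^1 g_1'(t)\,\frac{\partial C_2(v, t)}{\partial t}\,dt.$$
Proof. Since $C_1(u, t) = ut + f_1(u)g_1(t)$ is a copula, the conditions of Theorem 2.3 of Rodríguez-Lallena and Úbeda-Flores [5] apply (Eqs. (6), (7) and (8)), and $\partial C_1(u, t)/\partial t = u + f_1(u)g_1'(t)$. Then, $$C(u, v) = \int_0^1 \bigl(u + f_1(u)g_1'(t)\bigr)\frac{\partial C_2(v, t)}{\partial t}\,dt = u\,[C_2(v, 1) - C_2(v, 0)] + f_1(u)\int_0^1 g_1'(t)\,\frac{\partial C_2(v, t)}{\partial t}\,dt.$$
The form (5) results from $C_2$ being a copula ($C_2(v, 1) = v$ and $C_2(v, 0) = 0$) and using Theorem 2.3. □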
Example 3.4
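Consider two copulas, $C_1$ and $C_2$, $C_1(u, t) = ut + \lambda u(1 - u)t(1 - t)$ with $\lambda \in [-1, 1]$, and $C_2(v, t) = (v^{-1} + t^{-1} - 1)^{-1}$, $u, v, t \in [0, 1]$. $C_1$ is a Farlie–Gumbel–Morgenstern copula and $C_2$ is a copula from the Clayton family with parameter one, see Clayton [16].
Define $f_1(u) = \lambda u(1 - u)$ and $g_1(t) = t(1 - t)$. Applying Corollary 3.3, $$C(u, v) = uv + \lambda u(1 - u)\int_0^1 (1 - 2t)\,\frac{\partial C_2(v, t)}{\partial t}\,dt,$$ and since $\partial C_2(v, t)/\partial t = v^2/(v + t - vt)^2$, the copula is, for $v \in (0, 1)$, $$C(u, v) = uv + \lambda u(1 - u)\left[\frac{v(1 + v)}{1 - v} + \frac{2v^2\ln(v)}{(1 - v)^2}\right],$$ that is, a case of equation (5).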
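The integral in Example 3.4 can be checked numerically. The sketch below is a check worked out here, not taken from the paper: it compares a midpoint-rule approximation of $g(v) = \int_0^1 (1 - 2t)\,v^2/(v + t - vt)^2\,dt$ with the closed form $v(1 + v)/(1 - v) + 2v^2\ln(v)/(1 - v)^2$, valid for $v \in (0, 1)$. Function names are hypothetical.

```python
import math


def g_numeric(v, n=20000):
    """Midpoint rule for the integral of (1 - 2t) * (v / (v + t - v*t))**2 over [0, 1]."""
    total = 0.0
    for k in range(n):
        t = (k + 0.5) / n
        total += (1.0 - 2.0 * t) * (v / (v + t - v * t)) ** 2
    return total / n


def g_closed_form(v):
    """Closed form of the same integral, for v strictly between 0 and 1."""
    return v * (1.0 + v) / (1.0 - v) + 2.0 * v * v * math.log(v) / (1.0 - v) ** 2


def combined(u, v, lam):
    """Copula of the form uv + f(u) g(v), with f(u) = lam * u * (1 - u)."""
    return u * v + lam * u * (1.0 - u) * g_closed_form(v)
```

At $v = 0.5$ both versions give a value of about 0.1137, and the closed form tends to 0 as v approaches 1, consistent with the requirement $g(1) = 0$ for the form $uv + f(u)g(v)$.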
In this section, we investigate how the copula built by equation (2) turns out if we use at least one copula from the family introduced in Rodríguez-Lallena and Úbeda-Flores [5]. We present our results in two corollaries. The first identifies the analytic form of the resulting copula if both copulas $C_1$ and $C_2$ belong to the family introduced in Rodríguez-Lallena and Úbeda-Flores [5]: Corollary 3.1 shows that the resulting copula is also a member of that family. Corollary 3.3 gives the analytic form of the resulting copula if only one copula, say $C_1$, is a member of the family. Again, the resulting copula is a member of the family introduced in Rodríguez-Lallena and Úbeda-Flores [5]. Both results are exemplified, see Examples 3.2 and 3.4.
Conclusion
In this paper we show how to use the functions $\partial C_1(u, t)/\partial t$ and $\partial C_2(v, t)/\partial t$ to obtain a copula between U and V, where u is a value of U and v a value of V. We prove, in Theorem 2.3, that it is only necessary for $C_1$ and $C_2$ to be copulas to obtain a copula of (U, V) by equation (2). Note that the quantities $\partial C_1(u, t)/\partial t$ and $\partial C_2(v, t)/\partial t$ are also used to generate random vectors with prescribed dependence and prescribed marginal distributions, as Nelsen [6] shows. This means that all those functions usually employed to generate vectors can also be used to create copula models. This fact shows the potential of the method proposed in this paper. For a variety of models see Joe [17] and Nelsen [6].
We illustrate the method in three situations, (i) generating a copula that generalizes the Farlie–Gumbel–Morgenstern copula family, see Example 2.4 (Eyraud [7]), (ii) generating an Ali-Mikhail-Haq type copula, see Example 2.5 (Ali et al. [8]), and (iii) showing the type of dependence that results when combining two copulas coming from the Gumbel-Hougaard family, see Example 2.6 (Hougaard [9]).
Returning to the problem cited in the introduction of this paper, we can then postulate an inferential strategy: from the dependence between U and Θ, represented by a copula C1, and the dependence between V and Θ, represented by a copula C2, it is possible to postulate a dependence relationship between U and V that is generated from equation (2). This type of construction of the dependence between U and V requires the identification of an auxiliary variable Θ, linking U and V. For a subjective reading/interpretation of the meaning of the variable Θ see de Finetti [1], Hewitt and Savage [2] and O’Neill [18].
We finish our contribution with two results (Corollaries 3.1 and 3.3) that identify the behavior of our method when applied to copulas from the family introduced in Rodríguez-Lallena and Úbeda-Flores [5]. We see that the method is able to preserve it, guaranteeing that the copula between U and V is a member of the Rodríguez-Lallena and Úbeda-Flores [5] family. Corollaries 3.1 and 3.3 are exemplified in popular cases, see Examples 3.2 and 3.4 respectively.
Conflict of interest
The authors declare no conflict of interest.
Acknowledgments
Vinícius Litvinoff Justus gratefully acknowledges the support provided by CNPq through a scientific initiation fellowship at the University of Campinas. The authors wish to thank the referees and editors for their many helpful comments and suggestions on an earlier draft of this paper.
References
[1] de Finetti B (1929), Funzione caratteristica di un fenomeno aleatorio, in Atti del Congresso Internazionale dei Matematici: Bologna del 3 al 10 de settembre di 1928, Vol. 6 (Comunicazioni, sezione IV (A)-V-VII), pp. 179–190.
[2] Hewitt E, Savage LJ (1955), Symmetric measures on Cartesian products. Trans Am Math Soc 80, 2, 470–501. https://doi.org/10.2307/1992999.
[3] Aldous DJ (1985), Exchangeability and related topics, in: P.L. Hennequin (Ed.), École d'Été de Probabilités de Saint-Flour XIII – 1983, Vol. 1117, Lecture Notes in Mathematics, Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0099421.
[4] Mai J-F (2020), The infinite extendibility problem for exchangeable real-valued random vectors. Probab Surv 17, 677–753. https://doi.org/10.1214/19-PS336.
[5] Rodríguez-Lallena JA, Úbeda-Flores M (2004), A new class of bivariate copulas. Stat Probab Lett 66, 3, 315–325. https://doi.org/10.1016/j.spl.2003.09.010.
[6] Nelsen RB (2006), An introduction to copulas, Springer-Verlag, New York. https://doi.org/10.1007/0-387-28678-0.
[7] Eyraud H (1938), Les principes de la mesure des correlations. Ann Univ Lyon Series A 1, 30–47.
[8] Ali MM, Mikhail NN, Haq MS (1978), A class of bivariate distributions including the bivariate logistic. J Multivar Anal 8, 3, 405–412. https://doi.org/10.1016/0047-259X(78)90063-5.
[9] Hougaard P (1986), A class of multivariate failure time distributions. Biometrika 73, 3, 671–678. https://doi.org/10.1093/biomet/73.3.671.
[10] Quesada-Molina JJ, Rodríguez-Lallena JA (1995), Bivariate copulas with quadratic sections. J Nonparametr Statist 5, 4, 323–337. https://doi.org/10.1080/10485259508832652.
[11] Nelsen RB, Quesada-Molina JJ, Rodríguez-Lallena JA (1997), Bivariate copulas with cubic sections. J Nonparametr Statist 7, 3, 205–220. https://doi.org/10.1080/10485259708832700.
[12] Lai CD, Xie M (2000), A new family of positive quadrant dependent bivariate distributions. Stat Probab Lett 46, 4, 359–364. https://doi.org/10.1016/S0167-7152(99)00122-4.
[13] Amblard C, Girard S (2002), Symmetry and dependence properties within a semiparametric family of bivariate copulas. J Nonparametr Statist 14, 6, 715–727. https://doi.org/10.1080/10485250215322.
[14] García JE, González-López VA, Nelsen RB (2016), The structure of the class of maximum Tsallis–Havrda–Chavát Entropy Copulas. Entropy 18, 7, 264. https://doi.org/10.3390/e18070264.
[15] Fernández M, González-López VA, Rifo LR (2015), A note on conjugate distributions for copulas. Math Methods Appl Sci 38, 18, 4797–4803. https://doi.org/10.1002/mma.3394.
[16] Clayton DG (1978), A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 65, 1, 141–151. https://doi.org/10.1093/biomet/65.1.141.
[17] Joe H (2014), Dependence modeling with copulas. Chapman and Hall/CRC, New York. https://doi.org/10.1201/b17116.
[18] O’Neill B (2009), Exchangeability, correlation, and Bayes’ effect. Int Stat Rev 77, 2, 241–250. https://doi.org/10.1111/j.1751-5823.2008.00059.x.
Cite this article as: González-López VA & Litvinoff Justus V. 2023. A method for the elicitation of copulas. 4open, 6, 2.