( PDF ) Sample average approximation for chance constrained programming

Download PDF

ads:

Bernardo Kulnig Pagnoncelli

Sample average approximation for chance

constrained programming

TESE DE DOUTORADO

Thesis presented to the Postgraduate Program in Mathemtics of

the Departamento de Matem´atica, PUC-Rio as partial fulﬁllment

of the requirements for the degree of Doutor em Matem´atica

Adviser : Prof. Carlos Tomei

Co–Adviser: Prof. Shabbir Ahmed

Co–Adviser: Prof. Alexander Shapiro

Co–Adviser: Prof. Humberto Jos´e Bortolossi

Rio de Janeiro

February 2009

PUC-Rio - Certificação Digital Nº 0510535/CA

ads:

Livros Grátis

http://www.livrosgratis.com.br

Milhares de livros grátis para download.

Bernardo Kulnig Pagnoncelli

Sample average approximation for chance

constrained programming

Thesis presented to the Postgraduate Program in Mathema-

tics, of the Departamento de Matem´atica do Centro T´ecnico

Cient´ıﬁco da PUC-Rio, as partial fulﬁllment of the requirements

for the degree of Doutor.

Prof. Carlos Tomei

Adviser

Departamento de Matem´atica — PUC-Rio

Prof. Shabbir Ahmed

Co–Adviser

H. Milton Stewart School of Industrial & Systems Engineering

— Georgia Institute of Technology

Prof. Alexander Shapiro

Co–Adviser

H. Milton Stewart School of Industrial & Systems Engineering

— Georgia Institute of Technology

Prof. Humberto Jos´e Bortolossi

Co–Adviser

Instituto de Matem´atica — UFF

Prof. H´elio Cˆortes Vieira Lopes

Departamento de Matem´atica — PUC-Rio

Prof. Cristiano Fernandes

Departamento de Engenharia El´etrica — PUC-Rio

Prof. Alexandre Street

Departamento de Engenharia El´etrica — PUC-Rio

Prof. H´elio dos Santos Migon

Instituto de Matem´atica — UFRJ

Prof. Beatriz Vaz de Melo Mendes

Instituto de Matem´atica — UFRJ

Prof. Roberto Imbuzeiro

IMPA

Prof. Jos´e Eugenio Leal

Coordenador Setorial do Centro T´ecnico Cient´ıﬁco — PUC–Rio

Rio de Janeiro — February 3, 2009

PUC-Rio - Certificação Digital Nº 0510535/CA

ads:

Bernardo Kulnig Pagnoncelli

Bernardo Kulnig Pagnoncelli obtained his B.Sc. diploma in

pure mathematics from PUC-Rio in 2002 a nd his M.Sc. title

in applied mathematics in 2004, with the dissertation “Ap-

plications of the tensor product in numerical analysis”, under

the supervision of Carlos Tomei. He joined the Ph.D. program

at PUC-Rio in March 2005 and was invited to spend 2007 at

H. Milton Stewart School of Industrial Systems & Engineering

Department at the Georgia Institute of Technology (Atlanta,

U.S.A.) by Professor Shabbir Ahmed, one of his co-advisers.

Bibliographic data

Pagnoncelli, Bernardo K.

Sample average approximation for chance constrained

programming / Bernardo Kulnig Pagnoncelli; adviser: Carlos

Tomei; co–adviser: Shabbir Ahmed et.al. – 2009.

53 f: il. ; 30 cm

Tese (Doutorado em Matem´atica) – Pontif´ıcia Universi-

dade Cat´olica do Rio de Janeiro, Rio de Janeiro, 2009.

Inclui bibliograﬁa.

1. Matem´atica – Teses. 2. Stochastic Programming. 3.

Chance Constraints. 4. Sampling Methods. 5. Provisioning

problem. I. Tomei, Carlos. II. Ahmed, S.. III. Pontif´ıcia

Universidade Cat´olica do Rio de Janeiro. Departamento de

Matem´atica. IV. T´ıtulo.

CDD: 510

PUC-Rio - Certificação Digital Nº 0510535/CA

A minha mulher Daniela e aos meus pais Dante e Zuza

PUC-Rio - Certificação Digital Nº 0510535/CA

Acknowledgments

To my Adviser Carlos Tomei for the constant help and friendship during

those four years and for always challenging me to do my best. Thank you for

placing your bets on a diﬀerent project and for believing in my potential.

To Shabbir Ahmed for the invitation to spend one year at Georgia

Tech under his supervision. Thank you for showing me how to do research

in sto chastic programming and for all the support during my stay in Atlanta.

To Alexander Shapiro for participating so lively in my research project

and for always being available to answer my questions with inﬁnite patience.

To Humberto Bortolossi for inviting me to join a series of weekly seminars

during my PhD, which were my very ﬁrst contact with stochastic programming.

Thank you for all the collaboration we had during those four years and I hope

we will continue to work to gether for a long time.

To the members o f my committee for the careful reading of the manus-

cript and for the the suggestions that certainly made the text clearer.

To Helder Inacio for helping me with computational issues on the thesis

that made my life a lot easier. Many thanks for listening carefully to me while

I spoke about my work a nd for the suggestions of improvement of the text.

Thank you for the friendship during my stay at Georgia Tech and for t he

uncountable soccer matches.

To Ricardo, Marta and Felipe Fukasawa for everything they did for me

and for my wife during our stay in Atlanta. Your help, advice and friendship

were very imp ortant to us and without them our life a t Georgia Tech would

have been harder and less funny.

To Felipe Pina and Marcos Lage for the help and suggestions that

contributed directly to the improvement of the thesis.

To all t he g reat friends I made in Atlanta: Yao-Hsuan Chen, Claudio

Santiago, Daniel Faissol, Eliana & Jim, Luiz & Diana, Roberto & Mariana,

Luciano & Denise, Danny & Johnny, Daniel & L´ıvia and Lili.

To all my friends at PUC for the support a nd friendship, in particular

Miguel Schnoor, Aldo Ferreira da Silva, Renato Zanforlin (in memoriam),

Eduardo Teles, G uilherme Frederico Lima, R egina Fukuda, Jo˜ao Romanelli,

Jos´e Cal Neto, Yuri Ki, Jessica Kubrusly, Juliana Val´erio, Kennedy Pedroso,

Renato Adelino, Andr´e Carneiro, Marina Dias, D´ebora Mondaini, Carlos

Pe˜naﬁel, Inˆes de Oliveira, F´abio de Souza, Jo˜ao Domingos, Lhaylla Crissaﬀ,

Luiz Felipe Fran¸ca.

To all the Professors of the Mathematics Department at PUC-Rio for the

inestimable contribution to my education, specially Fred Palmeira.

PUC-Rio - Certificação Digital Nº 0510535/CA

To my friends Diogo Haddad, Diogo Montes, Paula Abreu, Leandro

& Tatiana, Fl´avia Cordeiro, Beatriz Malajovich, Tania Vasconcelos, S´ergio

Almaraz, Lucas Sigaud and Priscila Almeida.

To my grandmother Deliza Pagnoncelli and to my brother Eduardo

Pagnoncelli for always thinking I was doing something important.

To N´eia for tolerating me all those years and for providing me my every

day food.

To my family and my wife’s family for the constant support.

To the staﬀ of the Mathematics Department at PUC-Rio K´atia, Creuza,

Ana, Orlando and Ot´avio for always being so nice to me.

To PUC-Rio, FUNENSEG and CAPES for the ﬁnancial support.

PUC-Rio - Certificação Digital Nº 0510535/CA

Abstract

Pagnoncelli, Bernardo K.; Tomei, Carlos; Ahmed, S.; Shapiro,

A.; Bortolossi, Humberto Jos´e. Sample average approximation

for chance constrained programming. Rio de Janeiro, 2009.

53p. TESE DE DOUTORADO — Department of Mathematics,

Pontif´ıcia Universidade Cat´olica do R io de Janeiro.

We study sample approximations of chance constrained problems through

the sample average approximation (SAA) approach and prove the related

convergence properties. We discuss how to use the SAA method to obtain

good candidate solutions and bounds for the optimal value of t he original

problem. In o r der to tune the parameters of SAA, we apply the method

to two chance constrained problems. The ﬁrst is a linear portfolio selection

problem with returns following a multivariate lognormal distribution. The

second is a joint chance constrained version of a simple blending problem.

We conclude with a more demanding application of SAA methodology to

the determination of the minimum provision an economic agent must have

in order to meet a series of future payment o bligations with suﬃciently high

probability.

Keywords

Stochastic Programming. Chance Constraints. Sampling Methods.

Provisioning problem.

PUC-Rio - Certificação Digital Nº 0510535/CA

Resumo

Pagnoncelli, Bernardo K.; Tomei, Carlos; Ahmed, S.; Shapiro, A.;

Bortolossi, Humberto Jos´e. M´etodo da aproxima¸c˜ao amostral

para restri¸c˜oes probabil´ısticas. Rio de Janeiro, 2009. 53p.

Tese de Doutorado — Departa mento de Matem´atica, Pontif´ıcia

Universidade Cat´olica do Rio de Janeiro.

Estudamos aproxima¸c˜oes amostrais de problemas com restri¸c˜oes proba-

bil´ısticas atrav´es da aproxima¸c˜ao pela m´edia amostral (SAA) e demons-

tramos as propriedades de convergˆencia relacionadas. Utilizamos SAA para

obter bons candidatos `a solu¸c˜ao e cotas estat´ısticas para o valor ´otimo

do problema original. Para ajustar corretamente parˆametros, aplicamos o

m´etodo a dois problemas com restri¸c˜oes probabil´ısticas. O primeiro ´e um

problema de sele¸c˜ao de po r tfolio linear com retornos seguindo uma distri-

bui¸c˜ao lognormal multivariada. O segundo ´e uma vers˜ao com restri¸c˜oes pro-

babil´ısticas conjuntas de um problema da mistura simpliﬁcado. Conclu´ımos

com uma aplica¸c˜ao mais exigente ao problema de se determinar a provis˜ao

m´ınima que um agente econˆomico deve ter de forma a satisfazer uma s´erie

de obriga¸c˜oes futuras com probabilidade suﬁcientemente alta.

Palavras–chave

Otimiza¸c˜ao Estoc´astica. Restri¸c˜oes Probabil´ısticas. M´etodos Amos-

trais. Problema de Reserva.

PUC-Rio - Certificação Digital Nº 0510535/CA

Contents

1 Introduction 11

2 Chance constrained programming 15

2.1 An example 15

2.2 Single and joint constraints 16

2.3 Some properties and special cases 18

3 Sample Average Approximation 21

3.1 Theoretical background for SAA 21

4 Two applications of SAA 27

4.1 Portfolio problem 27

4.2 A blending problem 35

5 From separated to j oint variables: the hurdle-race problem 39

5.1 The hurdle-race problem and comonotonicity 40

5.2 The joint hurdle-race problem 41

5.3 Stochastic hurdles 44

6 Conclusion 47

PUC-Rio - Certificação Digital Nº 0510535/CA

Ah! Doutor! Doutor!... Era m ´agico o t´ıtulo,

tinha poderes e alcances m´ultiplos, v´arios, po-

lif´ormicos... Era um palli um, era alguma coisa

como clˆamide sagrada, tecida com um ﬁo

tˆenue e quase imponder´avel, mas a cujo en-

contro os ele mentos, os maus olhare s , os exor-

cismos se quebravam. De posse dela, as go-

tas de chuva af astar-se-iam transidas do meu

corpo, n˜ao se animariam a tocar-me nas rou-

pas, no calado sequer. O invisvel distribui-

dor dos raios solares escolheria o s mais me i -

gos para me aquecer, e gastaria os fortes, os

inexor´aveis, com o comum dos homens que

n˜ao ´e doutor. Oh! Ser formad o, de anel no

dedo, sobrecasaca e cartola, inﬂado e grosso,

como um sapo antes de ferir a martelada `a

beira do brejo; andar assim pelas ruas, pelas

pra¸cas, pelas estradas, pelas sala s, recebendo

cumprimentos: Doutor, como passou? Como

est´a, doutor? Era sobre-humano!...

Lima Barreto, Recorda¸c˜oes do Escriv˜ao Isa´ıas Caminha.

PUC-Rio - Certificação Digital Nº 0510535/CA

Introduction

The ﬁeld of stochastic programming is mainly concerned with t he deve-

lopment of models a nd algorithms for optimization pro blems with uncertainty.

More often than not the constants of an optimization problem are only ap-

proximations of measured quantities that could hardly be known to high ac-

curacy. For instance in [BN], the authors analyze the set of problems of the

well-known NETLIB library and perform their sensitivity analysis. Using robust

optimization techniques, they show that feasibility of the usual optimal solu-

tion of linear programs can indeed be heavily aﬀected by small perturbations

in problem data.

The ﬁrst publications in the ﬁeld of stochastic programming appeared

in the 1950’s [BEA], [DAN], [CCS]. The subject received moderate attention

until the early nineties, when an explosion in the numb er of publications took

place. Stochastic programming is a powerful tool to deal with uncertainty and,

unlike other approaches such as ro bust optimization, models the coeﬃcients

as random variables with known joint distribution. From the point of view

of applications, such assumption may be quite demanding, specially if t here

is not enough data to correctly approximate the distribution of parameters.

However, in some cases one does not need to have a complete knowledge

of the distribution o f the parameters. It is enough, instead, to have an

algorithm which generates samples from the random variables in the problem.

Furthermore, we believe that in most situations even a subjective choice of the

joint distribution serves the decision maker well.

There are two main approaches to stochastic programming: two-stage

problems with recourse and chance constrained progr amming. In two-stage

models, the decision maker chooses an action in the present without knowing

the outcome of future events. After the uncertainty is revealed, he then

takes the best possible recourse action to (possibly) correct the unwanted

consequences of his ﬁrst decision. Deviations fro m the goals are penalized

by the objective function. Such framework has applications in several ﬁelds

such as ﬁnance [AHM], electricity generation [LS],[PP], hospital budgeting

[KQ], production planning [PBK], etc. We refer the reader to [SR] for a

PUC-Rio - Certificação Digital Nº 0510535/CA

detailed discussion of the theoretical properties and more examples of two-

stage problems. Among the eﬃcient algorithms which deal with two-stage

problems, one of t he most popular is t he L-shap ed method, which is essentially

Benders decompositions applied to the so-called extensive form of a two-stage

problem (see [HV], [BP]). O t her important methods based on sampling are

the Stochastic Decomposition ([HS]) and the sample average approximation

for two-stage problems ([LSW]).

The second approach, which is the focus of this thesis, is chance cons-

trained programming, sometimes referred to as probabilistic programming. The

subject was introduced by Charnes, Cooper and Symmonds [CCS] and have

been extensively studied since. For a theoretical background we may refer to

Pr´ekopa [PREb] where an extensive list of references can be found. Applica-

tions include, e.g., water management [DGKS], soil management [ZTSK] and

optimization of chemical processes [HLMS],[HM].

In chance constrained programming, the decision maker is interested in

satisfying his goal “most of the time”, that is, he admits constraint violation

for some realizations of the r andom events. While two-stage problems penalize

deviations from goals, chance constrained programming considers only the

possibility of infeasibility, regardless of the amount by which the constraints

are violated. In other words, the former approach measures risk quantitatively

while the latter does it qualitatively. We consider problems of the fo r m

Min

x∈X

f(x)

s.t. Prob



G(x, ξ) ≤ 0



≥ 1 − ε,

(1-1)

where X ⊂ R

, ξ is a random vector

with probability distribution P supported

on a set Ξ ⊂ R

, constraints are expressed through G : R

× Ξ → R

f : R

→ R is the objective function and ε ∈ (0, 1) is the reliability level.

Although chance constraints were introduced almost 50 years ago, little

progress was made until recently. Even for simple functions G (·, ξ), e.g., linear,

problem (1-1) may be extremely diﬃcult to solve numerically. One of the

reasons is that for a given x ∈ X the quantity Prob {G(x, ξ) ≤ 0} requires

a multi-dimensional integration. Thus, it may happen t ha t the only way t o

check feasibility of a point x ∈ X is by Monte-Carlo simulation. Moreover,

convexity of X and of G(·, ξ) does not imply the convexity of the feasible set

of problem (1-1).

That led to two somewhat diﬀerent directions of research. One consists of

discretizing the probability distribution P a nd solving the related combinato-

We use the same notation ξ to denote a random vector and its particular realization.

Which of these two meanings will be used will be clear from the context.

PUC-Rio - Certificação Digital Nº 0510535/CA

rial problem (see, e.g., [DPR], [LAN]). Another approach is to employ convex

approximations of chance constraints ([NS]). As active members in this lines

of research, A. Shapiro and S. Ahmed have been working recently in the the-

ory of sampling and simulation applied to chance constrained programming.

The sample average approximation (SAA) studied in this thesis is a sampling

method for joint chance constrained problems or problems with a single chance

constraint. The approach is natural, and, as will be seen, it is a ﬂexible tool

which can alleviate several diﬃculties such as non-convexity and the intracta-

bility of the probabilistic constraint.

In the third year of my doctorate in Atlanta, Ahmed and Shapiro

introduced me to some theoretical aspects of SAA, and we proceeded to clarify

foundations in order to advance to interesting applications. Ahmed’s previous

sup ervision of J. Luedtke [LA] gave rise to convergence results o f SAA on

speciﬁc scenarios, which were then used on probabilistic versions of the set

covering and transpo r t ation problems. In this text we continue this path with

the fo llowing contributions: ﬁrst, the theoretical results vindicate t he numerical

approximations; then we provide further empirical evidence on how t o choose

the parameters involved in SAA and how to use it to solve chance constrained

problems. Part of this material may be found in [PAS].

Chapter 2 contains some basic results about chance constrained pro-

blems, with emphasis on hypotheses leading to the convexity of the feasible

set. In Chapter 3 we provide theoretical background and present the main re-

sults about sample approximations of (1-1). We state and prove convergence

results and describe how to construct bounds for the optimal value of chance

constrained programs.

In Chapter 4, we apply SAA to two rather simple problems, which

allow for veriﬁcation of our methods. The ﬁrst is a linear portfolio selection

problem with 10 assets, in the spirit of Markowitz ([MAR]). We consider two

very distinct situations: the distribution of the returns of the assets is either

multivariate normal or lognormal. In the ﬁrst case, the explicit solution is well

known: we use it as a benchmark to our numerics. The second problem is a

simple blending pro blem modeled as a joint chance constrained problem, for

which, again, the explicit solution is known.

In Chapter 5 we turn our attention to a more realistic problem arising

from actuarial sciences, the hurdle-race problem, proposed in [VDGK]. It

consists of a decision maker who needs to determine the current capital

(provision) required to meet future obligations. Furthermore, for each period

separately he needs to keep his capital above given thresholds, the hurdles,

with high probability. In [VDGK], the a uthor s make use of comonotonicity

PUC-Rio - Certificação Digital Nº 0510535/CA

([DDGa], [DDGb]) to obtain candidate solutions to the problem.

I contacted S. Vanduﬀel, the corr esponding author of [VDGK], and

proposed a variant of the hurdle-race problem, the joint hurdle-race problem.

Instead of separate hurdles, the decision maker has to pass the whole collection

of hurdles with high probability. This phrasing is clearly more adequate from

an actuarial point of view. Comonotonicity cannot be easily applied to the joint

version, but SAA yields good candidate solutions t o the problem. Both models

are compared in the original hurdle-race format, and numerical evidence of the

robustness of the joint formulation is provided in Section 5.2.1.

In addition, we extend the formulation to include stochastic hurdles, so

that the hurdles itself are not known at (known) future times and depend on

discounted values of futures obligations at the risk free rate. Although the

model becomes more involved, SAA handles the extension by essentially the

same computational cost.

Chapter 6 concludes the thesis with a summary of the results and future

directions of research.

We use the following notation throughout the text. The integer part of

number a ∈ R is denoted by ⌊a⌋. By Φ(z) we denote the cumulative distribution

function (cdf) of standard normal random variable and by z

the corresponding

ε−quantile, i.e., Φ(z

) = 1−ε, for ε ∈ (0, 1). The cdf B(k; p, N) of the binomial

distribution is

B(k; p, N ) :=



i=0





(1 − p )

N −i

, k = 0, ..., N.

(1-2)

For sets A, B ⊂ R

we denote by

D(A, B) := sup

x∈A

dist(x, B)

(1-3)

the deviation of set A from set B.

PUC-Rio - Certificação Digital Nº 0510535/CA

Chance constrained programming

In this Chapter we give a brief introduction to chance constrained

programming. The goals are to motivate the subject and to give the reader an

idea of the r elated diﬃculties. All proofs are omitted: we indicate references

where rigorous deductions can be found.

2.1

An example

At t he risk of being repetitive, we start giving an example f rom [HV] of a

simple chance constraint that illustrates one of the diﬃculties associated with

this formulation. The simplicity of the example makes it essentially unique.

For x

, x

∈ R, ε ∈ [0, 1], let

p(x) = Prob{ξx

+ x

≥ 7} ≥ 1 − ε

be a chance constraint, where ξ is uniformly distributed in [0, 1], with cumu-

lative distribution

F (t) =











0, if t ∈ (−∞, 0),

t, if t ∈ [0, 1],

1 otherwise.

In the general framework deﬁned in (1-1), we have

G(x, ξ) = −ξx

− x

+ 7. (2-1)

We are interested in a n explicit representation of the set

C(ε) = {(x

, x

) ∈ R

: p(x) ≥ 1 − ε}. (2-2)

If x

= 0, we clearly need to have x

≥ 7. If x

> 0,

p(x) = P (ωx

+ x

≥ 7) = P



ω ≥

7 − x



= 1 − F



7 − x



. (2-3)

Thus,

p(x) ≥ 1 − ε ⇔ F

−1

(ε)x

+ x

≥ 7.

PUC-Rio - Certificação Digital Nº 0510535/CA

-10

α = 0.3

Figure 2.1: The set C(0.3).

-10

α = 0.7

Figure 2.2: The set C(0.7).

Proceeding in an a na logous way fo r the case x

< 0, we end up with

C(ε) = C

(ε)



(ε)



−

(ε), where

(ε) =



x ∈ R

| x

> 0 , F

−1

(ε)x

+ x

≥ 7



(ε) =



(0, x

) ∈ R

| x

≥ 7



−

(ε) =



x ∈ R

| x

< 0 , x

−1

(1 − ε) + x

≥ 7



Figures 2.1 and 2.2 show the sets C(0.3) and C(0.7).

Clearly, from Figure 2.2, one cannot expect to have convex feasible sets

for chance constrained programs even for linear functions G . Convexity is

restored by requiring additional hypothesis, as shown below.

2.2

Single and joint constraints

There are essentially two ways of writing a chance constrained model. We

can have several separated constraints, each one representing one goal. Formu-

lation (1-1) represent the situation of a single separated chance constraint,

which is amenable to the SAA approach we discuss later. A general separated

chance constrained problem can be written as follows.

min

x∈X

f(x)

s.t. p

(x) := Prob



(x, ξ) ≤ 0



≥ 1 − ε

, i = 1, . . . , m,

(2-4)

where ε

∈ [0, 1 ]. A point x is feasible to problem (2-4) if it belongs to the set

C(ε

, ε

, . . . , ε

) =



i=1

(ε

where

(ε

) =



x ∈ R

| p

(x) ≥ 1 − ε



PUC-Rio - Certificação Digital Nº 0510535/CA

Another possibility is having a number of constraints modeled as a single

one as follows.

Min

x∈X

f(x)

s.t. p(x) := Prob



(x, ξ) ≤ 0, G

(x, ξ) ≤ 0, . . . , G

(x, ξ) ≤ 0



≥ 1 − ε,

(2-5)

with ε ∈ [0, 1]. Formulation (2-5) is refereed to as a joint chance constrained

problem since all the constraints G

(x, ξ) ≤ 0 are taken jointly. A point x is

feasible to problem (2-5) if it belongs to the set

C(ε) =



x ∈ R

| p(x) ≥ 1 − ε



Fr om a modeling point of view, sometimes it makes sense to model all the

constraints jointly if they together describe o ne g oal. In [HEN], the author

presents a cash matching problem using both separated and joint chance

constraints. He compares the robustness of both formulations in the ﬁnancial

context and performs experiments showing the diﬀerence between the two

approaches. We will have the opportunity to compare both formulations when

we discuss the joint hurdle- race problem in Chapter 5.

Joint chance constrained problems are usually hard to solve because

the joint expression in (2-5) requires a multidimensional integration to be

computed. Even checking feasibility for a given candidate solution cannot be

done easily and Monte-Carlo is required. There are some algorithms available

for those problems such as Sz´antai’s method ([HV]) or the solvers PCSPIOR,

PROCON and PROBALL [KM], but they are restricted to multivariate normal

distributions. Furthermore, they only deal with linear chance constraints with

constant technology matrix [HV]. Other examples of algorithms are the SUMT

and the supporting hyperplane method, described in detail in Chapter 5 of

[PREa].

There is an interesting result linking the two formulations.

Theorem 1 Let (2-5) be a joint chance constrained problem with reliability

level ε. If w e c hoose reliability levels ε

= 1 − (1 − ε)/m, i = 1 , . . . , m for the

separated problem (2 - 4), then



i=1



1 −

1 − ε



⊂ C(ε),

that is, any feasible solution to the separated pro blem is feasible fo r the joint

problem for a suitable choice of reliability levels ε

Proof. The result follows directly from Bonferroni inequality [HV].

PUC-Rio - Certificação Digital Nº 0510535/CA

We can convert any joint chance constrained problem such as (2-5) into

the form (1-1) by using the max-function as follows.

Min

x∈X

f(x)

s.t. Prob



max

i=1...,n



(x, ξ)



≤ 0



≥ 1 − ε.

(2-6)

It is straightforward to check t ha t problems (2-6) and (2-5) are equivalent.

Of course in some cases desired properties of the considered functions may b e

destroyed, but convexity is preserved and if the functions G

(·, ξ) are linear we

still can write (2-6) as a linear program. We will see an explicit example of

such operation when we discuss the blending problem.

2.3

Some properties and special cases

The following result gives basic properties of feasible sets of chance

constrained problems.

Theorem 2 a) Let p(x) be deﬁ ned as in (2-5). We have that p(x) is upper

semicontinuous, that is

p(x) ≥ lim sup

y →x

p(y), x ∈ R

and thus the set C(ε) is a closed set for all ε ∈ [0, 1].

b) The set C(ε) is nondecreasing: if 0 ≤ ε

< ε

≤ 1, then C(ε

) ⊂ C(ε

In addition, C(1) = R

and C(ε) = ∅ for all ε ∈ [0, 1] if and only if the

set C(0) is non em pty.

Part b) is trivial. A proof of part a) can be found in [HV].

As shown in Section 2.1, the feasible set of a chance constrained problem

in general is not convex. However, there are results establishing convexity under

certain hypothesis on the function G and on the density function of the random

vector ξ. The most important is due to Pr´ekopa and Borell ([PREa]) and is

stated without proof.

Theorem 3 Assume the random vector ξ has a continuous probability distri-

bution with density function f. The followi ng statements hold:

a) If log f is concave (with log 0 = −∞), or

b) If f

−1/m

is conve x (with 0

−1/m

= ∞),

then the cumulative d i stribution function F is quasi-concave and hence C(ε)

is a convex set for all ε ∈ [0, 1].

PUC-Rio - Certificação Digital Nº 0510535/CA

Fortunately, there are several important distributions that satisfy the hypothe-

sis of Theorem 3. We give two examples:

– Uniform distribution. Let D be a convex subset of R

with ﬁnite measure

|D|. The probability density function is given by

f(x) =



|D|

if x ∈ D,

0 if x /∈ D.

– Normal distribution. The probability density function is deﬁned by

f(x) =



|Σ|(2π)

exp

−

(x−µ)

−1

(x−µ)

, x ∈ R

where µ is the expectation vector, Σ the covariance matrix of the

distribution and |Σ| is the determinant of Σ.

Other examples are the (multivariate) Beta and Gamma distributions. More

examples can be found in [PREa].

In the case the random variable ξ is discretely distributed, we can

formulate problem (1-1) as a mixed-integer linear program. Let us assume

Prob{ξ = ξ

} = p

, k = 1, . . . , K. Problem (1-1) becomes

Min

x∈X

f(x)

s.t.



k=1

(−∞,0)

(G(x, ξ

)) ≥ 1 − ε,

(2-7)

or, equivalently,

Min

x∈X

f(x)

s.t.



k=1

(0,∞)

(G(x, ξ

)) ≤ ε,

(2-8)

Consider the following mixed-integer program.

Min

x∈X

f(x)

s.t. G(x, ξ

) − Mz

≤ 0, k = 1, . . . , K ,



k=1

≤ ε,

z ∈ {0, 1}

(2-9)

where M is a suﬃciently large constant. We claim that (2-8) and (2-9) are

equivalent. Indeed, let (x, z

, . . . , z

) be a solution of problem (2-9). The

ﬁrst constraint of (2-9) tells us that z

≥ 1l

(0,∞)

(G(x, ξ

)). From the second

constraint of (2-9) we have

ε ≥



k=1

≥



k=1

(0,∞)

(G(x, ξ

)),

PUC-Rio - Certificação Digital Nº 0510535/CA

which implies that x is feasible for (2-8), with same objective value. Conversely,

let x be feasible for problem (2-8) and deﬁne z

:= 1l

(0,∞)

(G(x, ξ

)). We have

that (x, z

, . . . , z

) is feasible for (2-9) with same objective value, and thus both

problems are equivalent.

We conclude with a convexity result for discrete distributions. A proo f

can be found in [KW].

Proposition 4 Consider problem (2-5) with discrete distribution, that is, let

= Prob{ξ = ξ

}, k = 1, . . . , K. Then for

ε < min

k=1...,K

}

the feasible set C(ε) is convex.

PUC-Rio - Certificação Digital Nº 0510535/CA

Sample Average Approximation

We have seen in Chapter 2 that chance constrained problems are usually

hard to solve and explicit solutions are only available in very pa rt icular cases.

The main idea of SAA is to replace the original problem by an approximate

problem obta ined via sampling from the distribution of the original problem.

We claim that good candidate solutions and bo unds on the true optimal

value can be obtained by solving such approximations. We start with the

theoretical background for the method, stating and proving consistency results.

The discussion follows [PAS].

3.1

Theoretical background for SAA

As stated in Chapter 1, we consider chance constrained problems

Min

x∈X

f(x)

s.t. Prob



G(x, ξ) ≤ 0



≥ 1 − ε.

(3-1)

In order t o simplify the presentation we assume in this section that the

constraint function G : R

× Ξ → R is real valued. Of course, a number

of constraints G

(x, ξ) ≤ 0, i = 1, . . . , m, can be equivalently replaced by one

constraint with the max-function as discussed in (2-6). We assume that the

set X is closed, the function f(x) is continuous and the function G(x, ξ) is a

Carath´eodory function, i.e., G(x, ·) is measurable for every x ∈ R

and G(·, ξ)

continuous for a.e. ξ ∈ Ξ.

Problem (3-1) can be written in the following equivalent form

Min

x∈X

f(x) s.t. p(x) ≤ ε, (3-2)

where

p(x) := P {G(x, ξ) > 0}.

Now let ξ

, . . . , ξ

be an independent identically distributed (iid) sample of N

realizations o f random vector ξ and P

:= N

−1



j=1

∆(ξ

) be the respective

empirical measure. Here ∆(ξ) denotes measure of mass one at point ξ, and

hence P

is a discrete measure assigning probability 1/N t o each point ξ

PUC-Rio - Certificação Digital Nº 0510535/CA

j = 1, . . . , N. The sample average approximation ˆp

(x) of function p(x) is

obtained by replacing the ‘true’ distribution P by the empirical measure P

That is, ˆp

(x) := P

{G(x, ξ) > 0 }. Let 1l

(0,∞)

: R → R be the indicator

function of (0, ∞), i.e.,

(0,∞)

(t) :=



1, if t > 0,

0, if t ≤ 0.

Then we can write that p(x) = E

[1l

(0,∞)

(G(x, ξ))] and

ˆp

(x) = E

[1l

(0,∞)

(G(x, ξ))] =



j=1

(0,∞)



G(x, ξ

)



That is, ˆp

(x) is equal t o the proportion of times that G(x, ξ

) > 0. The

problem, associated with the generated sample ξ

, . . . , ξ

, is

Min

x∈X

f(x) s.t. ˆp

(x) ≤ γ. (3-3)

We refer to problems (3-2) and (3-3) as the true and SAA problems, respec-

tively, at the respective signiﬁcance levels ε and γ. Note that, fo llowing [LA]

and [PAS], we allow t he signiﬁcance level γ ≥ 0 of the approximate problem to

be diﬀerent from the signiﬁcance level ε of the true problem. Next we discuss

the convergence of a solution of the SAA problem (3-3 ) to that of the true

problem (3-2 ) with respect to the sample size N and the signiﬁcance level γ. A

convergence analysis of problem (3-3) has been given in [LA]. Here we present

complementary results under slightly diﬀerent assumptions.

Recall that a sequence f

(x) of extended real valued functions is said to

epiconverge to a function f(x), written f

→ f, if f or any point x the following

two conditions hold: (i) for any sequence x

converging to x one has

lim inf

k→∞

) ≥ f(x), (3-4)

(ii) there exists a sequence x

converging to x such that

lim sup

k→∞

) ≤ f(x). (3-5)

Note that by the (strong) Law of Large Numbers (LLN) we have that for a ny

x, ˆp

(x) converges w.p.1 to p(x).

Proposition 5 Let G(x, ξ) be a Carath´eodory function. Then the functions

p(x) and ˆp

(x) are lower semicontinuous, and ˆp

→ p w . p.1. Moreover,

suppose that for every ¯x ∈ X the set {ξ ∈ Ξ : G(¯x, ξ) = 0} has P -m easure zero,

i.e., G(¯x, ξ) = 0 w.p.1. Then the function p(x) is continuous at every x ∈ X

and ˆp

(x) converges to p(x) w.p.1 uniformly on any compact set C ⊂ X, i.e . ,

PUC-Rio - Certificação Digital Nº 0510535/CA

sup

x∈C

|ˆp

(x) − p(x)| → 0 w.p.1 as N → ∞. (3-6)

Proof. Consider function ψ(x, ξ) := 1l

(0,∞)



G(x, ξ)



. Recall that p(x) =

[ψ(x, ξ)] and ˆp

(x) = E

[ψ(x, ξ)]. Since the function 1l

(0,∞)

(·) is lower

semicontinuous and G(·, ξ) is a Carath´eodory function, it follows that the

function ψ(x, ξ) is random lower semicontinuous

(see, e.g., [RW, Proposition

14.45]). Then by Fatou’s lemma we have for any ¯x ∈ R

lim inf

x→¯x

p(x) = lim inf

x→¯x



ψ(x, ξ)dP (ξ)

≥



lim inf

x→¯x

ψ(x, ξ)dP (ξ) ≥



ψ(¯x, ξ)dP (ξ) = p(¯x).

This shows lower semicontinuity o f p(x). Lower semicontinuity o f ˆp

(x) can

be shown in the same way.

The epiconvergence ˆp

→ p w.p.1 is a direct implication of Artstein and

Wets [AW, Theorem 2.3]. Note that , of course, |ψ(x, ξ)| is dominated by an

integrable function since |ψ(x, ξ)| ≤ 1.

Suppose, further, that for every ¯x ∈ X, G(¯x, ξ) = 0 w.p.1, which

implies that ψ(·, ξ) is continuous at ¯x w.p.1. Then by the Lebesgue Dominated

Convergence Theorem we have for any ¯x ∈ X,

lim

x→¯x

p(x) = lim

x→¯x



ψ(x, ξ)dP (ξ)



lim

x→¯x

ψ(x, ξ)dP (ξ) =



ψ(¯x, ξ)dP (ξ) = p(¯x).

This shows that p(x) is continuous at x = ¯x. Finally, the uniform convergence

(3-6) follows by a version of the uniform Law of Large Numbers (see, e.g.,

[SHA, Proposition 7, p.363]).

By lower semicontinuity o f p(x) and ˆp

(x) we have that the feasible

sets of the ‘true’ problem (3-2) and its SAA counterpart (3-3) ar e closed sets.

Therefore, if the set X is bounded (i.e., compact), then problems (3-2) and (3-

3) have nonempty sets of optimal solutions denoted, respectively, as S and

provided that these problems have nonempty feasible sets. We also denote by

∗

and

the optimal values of the true and the SAA problems, r espectively.

The following result shows that for γ = ε, under mild regularity conditions,

and

converge w.p.1 to their counterparts of the true problem.

We make the following assumption.

(A) There is an optimal solution ¯x of the true problem (3-2) such that fo r

any ε > 0 there is x ∈ X with x − ¯x ≤ ε and p(x) < ε.

Random lower semicontinuous functions are called normal integrands in [RW].

PUC-Rio - Certificação Digital Nº 0510535/CA

In other words the above condition (A) assumes existence of a sequence

} ⊂ X converging to an optimal solution ¯x ∈ S such that p(x

) < ε

for all k, i.e., ¯x is an accumulatio n point of the set {x ∈ X : p(x) < ε}.

Proposition 6 Suppose that the signiﬁcance levels of the true and SAA

problems are the sa me, i.e . , γ = ε, the set X is com pact, the function f (x) is

continuous, G(x, ξ) is a Carath´eodory function, an d condition (A) holds. Then

→ ϑ

∗

and D(

, S) → 0 w.p.1 as N → ∞.

Proof. By the condition (A), the set S is nonempty and there is x ∈ X

such that p(x) < ε. We have that ˆp

(x) converges to p(x) w.p.1. Consequently

ˆp

(x) < ε, and hence the SAA problem has a feasible solution, w.p.1 for N

large enough. Since ˆp

(·) is lower semicontinuous, the feasible set of the SAA

problem is compact, and hence

is nonempty w.p.1 for N large enough. Of

course, if x is a feasible solution of an SAA pro blem, then f(x) ≥

. Since

we can ta ke such point x arbitrary close to ¯x and f(·) is continuous, we obtain

that

lim sup

N→∞

≤ f(¯x) = ϑ

∗

w.p.1. (3-7)

Now let ˆx

∈

, i.e., ˆx

∈ X, ˆp

(ˆx

) ≤ ε and

= f(ˆx

). Since the

set X is compact, we can assume by passing to a subsequence if necessary that

ˆx

converges to a point ¯x ∈ X w.p.1. Also we have that ˆp

→ p w.p.1, and

hence

lim inf

N→∞

ˆp

(ˆx

) ≥ p(¯x) w.p.1.

It fo llows that p(¯x) ≤ ε and hence ¯x is a feasible point of the true problem,

and thus f(¯x) ≥ ϑ

∗

. Also f(ˆx

) → f(¯x) w.p.1, and hence

lim inf

N→∞

≥ ϑ

∗

w.p.1. (3-8)

It follows from (3-7) and (3-8) that

→ ϑ

∗

w.p.1. It also follows that the

point ¯x is an optimal solution of the true problem and consequently we obtain

that D(

, S) → 0 w.p.1.

Condition (A) is essential for the consistency of

and

. Think, for

example, about a situation where the constraint p(x) ≤ ε deﬁnes just one

feasible point ¯x such that p(¯x) = ε. Then arbitrary small changes in the

constraint ˆp

(x) ≤ ε may result in that t he feasible set of the corresponding

SAA problem becomes empty. Note also that condition (A) was not used in

the proof of inequality (3-8). Veriﬁcation of condition (A) can be done by ad

hoc methods.

Suppose now that γ > ε. Then by Proposition 6 we may expect that with

increase of the sample size N, an optimal solution of the SAA problem will

PUC-Rio - Certificação Digital Nº 0510535/CA

approach an o ptimal solution of the true problem with the signiﬁcance level

γ rather than ε. Of course, increasing the signiﬁcance level leads to enlarging

the feasible set o f the true problem, which in turn may result in decreasing

of the optimal value of the true problem. For a point ¯x ∈ X we have that

ˆp

(¯x) ≤ γ, i.e., ¯x is a feasible point of the SAA problem, iﬀ no more than γN

times the event “G(¯x, ξ

) > 0” happens in N trials. Since probability of the

event “G(¯x, ξ

) > 0” is p(¯x), it follows that

Prob



ˆp

(¯x) ≤ γ



= B



⌊γN⌋; p(¯x), N



. (3-9)

Recall that by Chernoﬀ inequality [CHE], for k > Np,

B(k; p, N) ≥ 1 − exp



−N(k/N − p)

/(2p)



It follows that if p(¯x) ≤ ε and γ > ε, then 1 − Prob



ˆp

(¯x) ≤ γ



approaches

zero at a rate of exp(−κN), where κ := (γ − ε)

/(2ε). Of course, if ¯x is an

optimal solution of the true problem a nd ¯x is a feasible po int of t he SAA

problem, then

≤ ϑ

∗

. That is, if γ > ε, then the probability o f the event

“

≤ ϑ

∗

” approaches one exponentially fast. By similar a na lysis we have

that if p(¯x) = ε and γ < ε, then probability t ha t ¯x is a feasible point of the

corresponding SAA problem approaches zero exponentially fast (cf., [LA]).

The above is a qualitative analysis. For a given candidate point ¯x ∈ X,

say obtained as a solution of a SAA problem, we would like to validate its

quality as a solution of the true problem. This involves two questions, namely

whether ¯x is a feasible po int of the true problem, and if so, then what is the

optimality ga p f(¯x) −ϑ

∗

. Of course, if ¯x is a feasible point of the true problem,

then f (¯x) − ϑ

∗

is nonnegative and is zero iﬀ ¯x is a n optimal solution of the

true problem.

Let us start with veriﬁcation of feasibility of ¯x. For that we need to

estimate the probability p(¯x). We proceed by employing again the Monte Carlo

sampling techniques. Generate an iid sample ξ

, ..., ξ

and estimate p(¯x) by

ˆp

(¯x). Note that this random sample should be generated independently of

a random procedure which produced the candidate solution ¯x, and that we

can use a very large sample since we do not need to solve any optimization

problem here. The estimator ˆp

(¯x) of p(¯x) is unbiased and for large N and

not “too small” p(¯x) its distribution can be approximated reasonably well by

the no r mal distribution with mean p(¯x) and variance p(¯x)(1 − p(¯x))/N. This

leads to the following approximate (1 − β)-conﬁdence upper bound on p(¯x):

β,N

(¯x) := ˆp

(¯x) + z



ˆp

(¯x)(1 − ˆp

(¯x))/N. (3-10)

PUC-Rio - Certificação Digital Nº 0510535/CA

A more accurate (1 − β)-conﬁdence upper bound is given by (cf., [NS]):

∗

β,N

(¯x) := sup

ρ∈[0,1]

{ρ : B(k; ρ, N) ≥ β} , (3-11)

where k := N ˆp

(¯x) =



j=1

(0,∞)

(G(¯x, ξ

)).

In order to get a lower bound for the optimal va lue ϑ

∗

we proceed as

follows. Let us choose two positive integers M and N, and let

:= B



⌊γN⌋; ε, N



and L be the largest integer such that

B(L − 1; θ

, M) ≤ β. (3-12)

Next generate M independent samples ξ

1,m

, . . . , ξ

N,m

, m = 1, . . . , M, each of

size N, of random vector ξ. For each sample solve the associated optimization

problem

Min

x∈X

f(x)

s.t.



j=1

(0,∞)

(G(x, ξ

j,m

)) ≤ γN,

(3-13)

and hence calculate its optimal value

, m = 1, . . . , M. That is, solve M times

the corresponding SAA problem at the signiﬁcance level γ. It may happen

that problem (3-13) is either infeasible or unbounded from below, in which

case we assign its optimal value as +∞ or −∞, respectively. We can view

, m = 1, . . . , M, a s an iid sample of the random variable

, where

is the optimal value of the respective SAA problem at signiﬁcance level γ.

Next we rearrange the calculated optimal values in the nondecreasing order as

follows

(1)

≤ · · · ≤

(M)

, i.e.,

(1)

is the smallest,

(2)

is the second smallest

etc, among the values

, m = 1, . . . , M. We use the random quantity

(L)

as a lower bound of the true optimal value ϑ

∗

. It is possible to show that

with probability at least 1 − β, the random quantity

(L)

is below the true

optimal value ϑ

∗

, i.e.,

(L)

is indeed a lower b ound of the true optimal value

with conﬁdence at least 1 − β (see

[NS]). We will discuss later how to choose

the constants M, N and γ based on numerical experiments.

In [NS] this lower bound was derived for γ = 0. It is straightforward to extend the

derivations to the case of γ > 0.

PUC-Rio - Certificação Digital Nº 0510535/CA

Two applications of SAA

In this Chapter we apply SAA to a portfolio problem and to a blending

problem. In the ﬁrst the decision maker must choose the composition of a

portfolio of assets such that the expected return is maximized. Due to the

chance constraint, the total gain has to be greater than a pre-speciﬁed return

level v with high proba bility. When returns follow a multivariate normal

distribution, we compute the solution explicitly and compare it with the

results of SAA. When returns are lognormally distributed, we have to rely

on approximations.

The second problem is a joint version of a two dimensional blending

problem. We show that SAA can be readily applied to this situation at no

extra cost. D ue to the independence assumption, we compute explicit answers

for this problem and use them as benchmarks to tune the parameters of SAA.

4.1

Portfolio problem

Consider the following maximization problem subj ect to a single chance

constraint:

Max

x∈X





s.t. Prob



x ≥ v



≥ 1 − ε.

(4-1)

Here x ∈ R

is vector of decision variables, r ∈ R

is a random vector

with known probability distribution, v ∈ R, ε ∈ (0, 1), e is a vector whose

components are all equal to 1 and

X := {x ∈ R

: e

x = 1, x ≥ 0}.

Note that, of course, E





= µ

x, where µ := E[r] is the corresponding

mean vector. That is, the objective function of problem (4-1) is linear and

deterministic.

The motivation to study ( 4-1) is the portfolio selection problem going

back to Markowitz [MAR]. The vector x represents the percentage of a total

wealth of one do lla r invested in each of n available assets, r is the vector of

random returns of these assets and the decision agent wants to maximize the

PUC-Rio - Certificação Digital Nº 0510535/CA

mean return subject to having a return greater or equal to a desired level v,

with proba bility at least 1 − ε. In terms of risk measures, this requirement is

equivalent to a Value-at-Risk constraint. We note that problem (4-1) is not

realistic because it do es not incorporate crucial features of real markets such

as cost of transactions, short sales, lower and upper bounds on the holdings,

etc. However, it will serve to our purposes as an example of an application

of the SAA method. For a more realistic model we can refer the reader, e.g.,

to [WCZ], where the authors include market f rictions and discuss the best

distribution function for asset returns.

A very similar version of problem (4-1) was analyzed in [ZTSK], where the

authors obtained important information about the diﬀerent policies available

in soil management as well as the trade oﬀ between net returns and soil loss.

We consider two diﬀerent situations, namely when the vector o f ra ndom

returns r follows multivar iate normal and multivaria t e lognor mal distributions.

The two cases are very distinct; on the for mer one can solve explicitly the

chance constraint, while in the la t t er no explicit formula is known. Under

normality, we can compare the quality of the approximations with the true

optimal value, while in the lognormal case we have to rely on approximations.

4.1.1

SAA of the portfolio problem

First assume that r follows a multivariate normal distribution with

mean vector µ and covariance matrix Σ, written r ∼ N (µ, Σ). In that case

x ∼ N



x, x

Σ x



, and hence (as it is well known) the chance constraint

in (4-1) can be written as a convex second order conic constraint (SOCC) as

follows.

Prob



x ≥ v



≥ 1 − ε ⇔

Prob



x − µ

√

Σx

≥

v − µ

√

Σx



≥ 1 − ε ⇔

1 − Prob



x − µ

√

Σx

≤

v − µ

√

Σx



≥ 1 − ε ⇔

1 − Φ



v − µ

√

Σx



≥ 1 − ε ⇔

v −µ

√

Σx

≤ z

1−ε

⇔

v − µ

x + z

1−ε

√

Σx ≤ 0. (4-2)

Using the explicit fo r m (4-2) of the chance constraint, one can eﬃciently solve

the convex problem (4-1) for diﬀerent values of ε. An eﬃcient frontier of

PUC-Rio - Certificação Digital Nº 0510535/CA

portfolios can be constructed in an objective function value versus conﬁdence

level plot, that is, for every conﬁdence level ε we associate the optimal value

of problem (4-1). The eﬃcient frontier dates back to Markowitz [MAR]. A

discussion of the subject can be found, e.g., in [CM].

If r follows a multivar ia t e lognormal distribution, then no closed form

solution for the chance constrained problem (4-1 ) is available. The related

SAA problem (4-1) can be written as

Max

x∈X

s.t. ˆp

(x) ≤ γ,

(4-3)

where ˆp

(x) := N

−1



i=1

(0,∞)

(v − r

x) and γ ∈ [0, 1). The reason we use γ

instead of ε is t o suggest that for a ﬁxed ε, a diﬀerent choice of the parameter

γ in (4-3) might be suitable. For instance, if γ = 0, then the SAA problem

(4-3) becomes the linear program

Max

x∈X

s.t. r

x ≥ v, i = 1, . . . , N.

(4-4)

A recent paper by Campi and Garatti [CG], building on the work of Calaﬁore

and Campi [CC], provides an expression for the probability of an optimal

solution ˆx

of the SAA problem (3-3), with γ = 0, to be infeasible for the true

problem (3-2) . That is, under the assumptions that the set X and functions

f(·) and G(·, ξ), ξ ∈ Ξ, are convex a nd that w.p.1 the SAA problem attains

unique optimal solution, we have that for N ≥ n,

Prob {p(ˆx

) > ε} ≤ B(n − 1; ε, N), (4-5)

and the above bound is tight. We apply this bound to the considered portfolio

selection problem to conclude that for a conﬁdence parameter β ∈ (0, 1) and

a sample size N

∗

such that

B(n − 1; ε, N

∗

) ≤ β, (4-6)

the optimal solution of problem (4-4) is feasible for the corresponding true

problem (4-1) with probability at least 1 − β.

For γ > 0, problem (4-3 ) can be written as the mixed-integer linear

program

Max

x,z

s.t. r

x + vz

≥ v,



i=1

≤ Nγ,

x ∈ X, z ∈ {0, 1}

(4-7)

The equivalence of problems (4-3) and (4-7) was already proved when we

PUC-Rio - Certificação Digital Nº 0510535/CA

showed the equivalence of problems (2-8) and (2-9) in Chapter 2 , Section 2.3.

Given a ﬁxed ε in (4-1), it is not clear what are the best choices of γ and

N for approximation (4-7). We believe it is problem dependent and numerical

investigations will be perfor med with diﬀerent values for both parameters. We

know from Proposition 6 that, for γ = ε the larger the N the closer we are to

the original problem (4-1). However, the number of samples N must be chosen

carefully because problem (4-7) is a binary problem. Even moderate values of

N can generate instances that are very hard to solve.

4.1.2

Obtaining candidate solutions

First we perform numerical experiments applying SAA to the portfo lio

problem (4-1) assuming that r ∼ N(µ, Σ). We considered 10 assets (n = 10)

and the da t a for the estimation of the parameters was taken from historical

monthly returns adjusted for dividends from 1997 to 2007 of 10 US major

companies

. The sample was generated by the Triangular Factorization Method

[BS]. We wrote the codes in GAMS and solved the linear and binary problems

using CPLEX 9.0. The computer was a PC with an Intel Core 2 processor and

2GB of RAM.

Let us ﬁx ε = 0.10 and β = 0.01. For these values, t he sample size

suggested by (4-6) is N

∗

= 183. We ran 10 independent replications of (4-4)

for each of the sample sizes N = 30, 40, . . . , 200 and for N

∗

= 183. We also

build an eﬃcient frontier plot of optimal portfolios with an objective value

versus Prob{r

≥ v} axes, where x

is the optimal solution of problem (4-

1) f or a given ε. We show in the same plot (Figure 4.1) the corresponding

objective function va lues and Prob{r

ˆx

≥ v} for each optimal solution ˆx

found for the problem (4-4). To identify each point with a sample size, we

used a gray scale that attributes light tones of gray to smaller sample sizes

and darker ones to larger samples. The eﬃcient frontier curve is calculated for

ε = 0.8, 0.81, . . . , 0.99 and then connected by lines. The vertical and horizontal

lines are for reference only: they represent the optimal value for problem (4-1)

with ε = 0.10 and the 90% reliability level, respectively.

Figure 4.1 shows interesting f eatures of the SAA problem (4-4). Although

larger sample sizes always generate feasible points, the value of the objective

function, in general, is quite small if compared with the optimal value 1.004311

of problem (4-1) with ε = 0.10. We also observe the absence of a convergence

property: if we increase the sample size, the feasible region of problem (4-4)

JP Morgan, Oracle, Intel, Exxon, Wal-Mart, Apple, Sun Microsystems, Microsoft, Yahoo

and Procter & Gamble

PUC-Rio - Certificação Digital Nº 0510535/CA

0.80

0.85

0.90

0.95

1.00

0.998

1.004

1.010

Figure 4.1: Normal returns for γ = 0.

gets smaller and the approximation becomes more and more conservative and

therefore suboptimal. The reason is that fo r increasingly large samples the

condition r

x ≥ v for all i approaches the condition Prob{r

x ≥ v} = 1.

In order to ﬁnd better candidate solutions for problem (4-1), we need to

solve the SAA problem with γ > 0, (problem (4-7)), which is a combinatorial

problem. Since o ur port f olio problem is a linear one, we can solve problem

(4-3) eﬃciently for a moderate number (e.g., 200 constraints) of instances. We

performed tests for problem (4-3) ﬁxing γ = 0.05 and 0.10 and changing N as

in the sample approximation case. The results are in Figures 4.2 and 4.3.

The best candidate solutions to problem (4-1) were obtained by choosing

γ = 0 .0 5. We considered diﬀerent sample sizes from 30 to 200. Although several

points are infeasible to the original problem (4-1), we observe in Fig ure 4.2 that

whenever a point is feasible it is close to the upper bound. For γ = 0.10, Figure

4.3 shows us that almost every generated point is infeasible. To further justify

this claim, note that among the feasible po ints in Figure 4.2, more than 70%

of them are within 0.2% of the true optimal value 1.004311 of problem (4-1)

with ε = 0.10. If we r elax the tolerance to 0.3%, then more than 93% of the

points are no more than 0.3% away from the optimal value.

PUC-Rio - Certificação Digital Nº 0510535/CA

0.80

0.85

0.90

0.95

1.00

1.000

1.005

1.010

0.995

Figure 4.2: Normal with γ = 0.05.

0.80

0.85

0.90

0.95

1.00

1.000

1.010

1.005

0.995

Figure 4.3: Normal with γ = 0.10.

To investigate the possible choices of γ and N in problem (4- 7), we

created a three dimensional representation which we call γN-plot. The domain

is a discretization of values of γ and N, forming a grid with pairs (γ, N). For

each pair we solve an instance of problem (4 -7) with these parameters and

stored the optimal value and the approximate probability of being feasible to

the original problem (4-1 ) . The z-axis represents the optimal value associated

to each point in the domain in the grid. Finally, we created a surface of triangles

based on this grid as follows. Let i be the index for the values of γ and j for the

values of N. If candidate points associated with grid members (i, j), (i + 1, j)

and (i, j + 1) or (i + 1, j + 1), (i + 1, j) and (i, j + 1) are feasible to problem

(4-1) (with pro bability greater than or equal t o (1 −ε)), then we draw a dark

gray triangle connecting the three points in the space. Otherwise, we draw a

light gray triangle.

We created a γN-plot for problem (4-1) with normal returns. The result

can be seen in Figure 4.4, where we a lso included the plane corresponding

to the optimal solution with ε = 0.10. The values for par ameter γ were

0, 0.01, . . . , 0.10 and for N = 30, 40, . . . , 200. From Figure 4.4 we see that

for any ﬁxed γ small sample sizes tend to generate infeasible solutions and

large samples feasible ones. As predicted by the results of Campi and Garatti,

when γ = 0, large sample sizes generate feasible solutions, although they can

be seen to be of poor quality judging by the low peaks observed in this region.

The concentration of high peaks corresponds to γ values around ε/2 = 0.05 for

almost all sample sizes, including small ones (varying from 50 until 120). We

generated diﬀerent instances of Figure 4.4 and the output followed the pattern

described here.

PUC-Rio - Certificação Digital Nº 0510535/CA

1.005

1.0

0.995

130

180

0.0

0.025

0.05

0.075

0.1

1.01

Figure 4.4: γN-plot for the portfolio problem with normal returns.

Even thoug h there are peaks in other regions, Figure 4.4 suggests a

strategy to obtain good candidates for chance constrained problems: choose

γ close to ε/2, solve instances of SAA problems with small sizes of N (e.g. one

third of the Campi-Garatt i estimate (4-6)) and keep the best solution. This

is f ortunate because SAA problems with γ > 0 are binary problems that can

be hard to solve. Our experience with the p ortfolio problem and with others

suggest this strategy works better than trying to solve SAA problems with

large sample sizes. The choice γ = ε/2 came from our empirical experience.

We believe in general the choice of γ is problem dependent.

4.1.3

Upper bounds

A method to compute lower bounds o f chance constrained problems of

the form (3-1) was suggested in [NS]. We summarized their procedure at the

end of Section 3.1, leaving the question of how to choose the constants L, M

and N. Given β, M and N, it is straightforward to specify L: it is the largest

integer that satisﬁes (3-12). For a given N, the lar ger M the better because we

are approximating the L-th order statistic of the random variable

. However,

note that M represents the number of problems to be solved and this value is

often constrained by computational limitations.

In [NS] an indication of how N should be chosen is not given. It is possible

to gain some insight on the magnitude of N by doing some algebra in inequality

(3-12). With γ = 0, the ﬁrst t erm (i = 0) of the sum (3-12) is



1 − (1 − ε)



≈



1 − e

−Nε



. (4-8)

PUC-Rio - Certificação Digital Nº 0510535/CA

Approximation (4-8) suggests that for small values of ε we should take N of

order O(ε

−1

). If N is much bigger than 1/ε then we would have to choose a very

large M in order to honor inequality (3-12). For instance if ε = 0.10, β = 0.01

and N = 100 instead of N = 1/ ε = 10 or N = 2/ ε = 20, we need M to be

greater than 100 000 in order to satisfy (3-12), which can be computationally

intractable for some problems. If N = 200 then M has to be grater then 10

which is impractical for most applications.

In [LA], the a uthor s applied the same technique to generate bounds on

probabilistic versions of the set cover problem and the transportation problem.

To construct the bounds they varied N and used M = 10 and L = 1. For many

instances they obtained lower bounds slightly smaller (less than 2%) or even

equal to the best o ptimal values g enerated by SAA. In the portfolio problem,

the choice L = 1 generated poor bounds as we will see.

We performed experiments for the portfolio problem with returns now

following a lognormal distribution. Figure 4.5 shows the sample points obtained

by SAA with γ = 0 and with the corresponding proba bility estimated by

Monte-Carlo. The reader is referred to [LK] for detailed instructions of how

to generate samples from a multivariate lognormal distribution. Since in the

lognormal case one cannot compute the eﬃcient frontier, we also included in

Figure 4.5 upper bo unds

for ε = 0.02, . . . , 0.20, calculated according to (3-

12). We ﬁxed β = 0.01 for all cases and chose three diﬀerent values for the

constants L, M and N.

First we ﬁxed L = 1 and N = ⌈1/ε⌉ (solid line in Figure 4.5, upper bound

A). The constant M was chosen to satisfy the inequality (3-12). The results

were not satisfactory, mainly because M ended up being too small. Since the

constant M deﬁnes the number of samples from ˆv

and since our problem is

a linear one, we decided to ﬁx M = 1 000 . Then we chose N = ⌈1/ε⌉ (dashed

line in Figure 4.5, upp er bo und B) and deﬁned L to be the largest integer such

that ( 3-12) is satisﬁed. Finally, we generated an upper bound with M = 1 000

and N = ⌈2/ε⌉ (dotted line in Figure 4.5, upper bound C).

It is harder to construct upper bounds with γ > 0. The diﬃculty lies

in solving integer problems and it is hard to ﬁnd an appropriate choice of

the parameters M or N in order to keep the problem size manageable. Based

on experiments, a g ood choice for this problem is M = 500, N = 50 and

γ = ε/2 = 0.05, which originated the dotted upper bound in Figure 4.5.

Although it is slightly b etter than the bounds obtained with γ = 0, in many

situations one often wants an upper bound without much computational eﬀort.

If that is the case, it might be appropriate to use equation (3-12) for γ = 0

The portfolio problem (4-1) is a maximization problem.

PUC-Rio - Certificação Digital Nº 0510535/CA

0.80

0.85

0.90

0.95

1.00

1.02

1.01

1.00

Figure 4.5: Lognormal with γ = 0 .

since the corresp onding problems are easier to solve.

0.80

0.85

0.90

0.95

1.00

1.010 1.0301.020

Figure 4.6: Lognormal with γ =

0.05.

0.80

0.85

0.90

0.95

1.00

1.010 1.020

1.030

Figure 4.7: Lognormal with γ =

0.10.

Following the normal case, we performed similar experiments with γ =

ε/2 and γ = ε. The results are in Figures 4.6 and 4.7 respectively, where we

only included the dashed upper bound. The experiments for the lognormal

case conﬁrmed the tendency observed in the nor mal case: γ = ε generated

infeasible points, γ = 0 generated feasible points of poor quality if measured

by the distance to the upper bound curves and γ = ε/2 yielded t he best

candidate solutions.

4.2

A blending problem

Let us consider a second example of a chance constrained problem.

Suppose a farmer has some crop and wants to use fertilizers to increase the

PUC-Rio - Certificação Digital Nº 0510535/CA

production. He hires an agronomy engineer who recommends 7 g of nutrient

A and 4 g of nutrient B. He has two kinds of fertilizers available: the ﬁrst has

g of nutrient A and ω

g of nutrient B per kilogram. The second has 1 g of

each nutrient per kilogram. The quantities ω

and ω

are uncertain: we assume

they are (independent) continuous uniform random variables with support in

the intervals [1, 4] and [1/3, 1] respectively. Furthermore, each fertilizer has a

unitary cost per kilogram.

There are several ways to model this blending problem. A detailed

discussion can be found in [HV], where the authors use this problem to motivate

the ﬁeld of stochastic prog r amming. We consider a joint chance constrained

formulation as follows:

Min

+ x

s.t. Prob{ω

+ x

≥ 7, ω

+ x

≥ 4} ≥ 1 − ε,

, x

≥ 0,

(4-9)

where x

represents the quantity of f ertilizer i purchased, i = 1, 2, and ε ∈ [0, 1 ]

is the reliability level. The independence assumption allows us to convert the

joint probability in (4-9) into a product of probabilities. After some tedious

calculations, one can explicitly solve (4-9) for all values of ε. For ε ∈ [1/2, 1]

the solution (x

∗

, x

∗

) and the optimal value v

∗

= x

∗

+ x

∗

are

∗

9 + 8(1 − ε)

, x

∗

2(9 + 28(1 − ε))

9 + 8(1 − ε)

, v

∗

4(9 + 14(1 − ε))

9 + 8(1 − ε)

For ε ∈ [0, 1/2], v

∗

2(25 − 18(1 − ε))

11 − 9(1 − ε)

. (4-10)

Our goal is to exemplify the use o f SAA to joint chance constrained

problems. In addition, we use problem (4-9) as a benchmark to gain more

understanding of tuning of the underlying parameters of the SAA approach

since an explicit solution is available in this case. As mentioned in Section

2.3 of Chapter 2, we can convert a joint chance constrained problem into a

problem of the form (3- 1) using the min (or max) operators. Problem (4-9)

becomes

Min

+ x

s.t. Prob {min{ω

+ x

− 7, ω

+ x

−4} ≥ 0} ≥ 1 − ε,

, x

≥ 0.

(4-11)

Introducing one auxiliary variable z

per scenario, it is possible to formulate the

SAA method to problem (4-11) as a mixed integer linear program as follows.

PUC-Rio - Certificação Digital Nº 0510535/CA

0.90

0.95

1.00

6.0

5.5

7.0

6.5

7.5

Figure 4.8: SAA for the blending problem with γ = 0.025.

Min

+ x

s.t. ω

+ x

−7 + Kz

≥ 0, i = 1, . . . , N,

+ x

−4 + Kz

≥ 0, i = 1, . . . , N,



i=1

≤ Nγ,

z ∈ {0, 1}

, x

≥ 0,

(4-12)

where N is the number of samples, ω

and ω

are samples from the random

variables ω

and ω

, γ ∈ (0, 1) and K is a po sitive constant greater or equal

than 7.

4.2.1

Numerical experiments

We performed experiments similar to the ones for the po rt folio problem

so we present the results without details. In Figure 4.8 we generated appro-

ximations for problem (4- 9) with ε = 0.05 using SAA. The sample points

were obta ined by solving a SAA problem with γ = 0.025 and sample sizes

N = 60, 70, . . . , 150. The Campi-Ga r atti inequality (4-6) suggested value is

∗

= 130. In addition, we included the eﬃcient frontier for problem (4-9). We

will no t show the corresponding F igures for other values of γ, but the pattern

observed in the portfolio pro blem repeated: with γ = 0 almost every point

was feasible but far from the optimal, with γ = ε = 0.05 almost every point

was infeasible. Again, the parameter choice that generated the best candidate

solutions was γ = ε/2 = 0.025.

We also show the γN-plot for SAA applied to problem (4-9). We tested γ

values in the range 0, 0.005, 0.01, . . . , 0.05 and N = 60, 70, . . . , 150. We included

PUC-Rio - Certificação Digital Nº 0510535/CA

6.95

5.45

6.45

5.95

0.025

0.05

0.075

0.1

120

100

140

Figure 4.9: γN-plot for the blending problem.

a plane representing the o ptimal value of problem (4-9) for ε = 0.05, which is

readily obtained by applying formula (4 -10).

In accordance with Figure 4.4, we note that in Figure 4.9 the best

candidate solutions are the ones with γ around 0.025. Even for very small

sample sizes we have feasible solutions (dark gray triangles) relatively close to

the optimal plane. On the other hand, this experiment gives more evidence that

SAA with γ = 0 is excellent to generate feasible solutions (dark gray triangles)

but the quality of the solutions is poor. As shown in Figure 4.9, t he high peaks

associated with γ = 0 persist for any sample size, generating points far form

the optimal plane. In agreement with Figure 4.4, the candidates obta ined for

γ close to ε were mostly infeasible.

PUC-Rio - Certificação Digital Nº 0510535/CA

From separated to joint variables: the hurdle-race problem

We consider a general pro v isioning problem, where an economic agent

aims to determine an initial amount that would be needed in order t o meet

a series of future payment obligations with a suﬃciently large probability. We

assume a “hold-to-maturity” (actuarial) approach: the initial provision is such

that upon investing it in a p ortfolio one is always able in the future to pay-

oﬀ the cash ﬂows. In contrast, the “available-for-sales” (ﬁnancial) framework

admits t he possible fall in the arbitrage-free price of the series of cash ﬂows

over a given time period and use it as means of assessing risk and establishing

buﬀers. Although the latter a pproach is at the core of the latest r egulatory

documents such as Basel II and Solvency II, it can induce crashes when they

would not otherwise occur. Furthermore, we believe better risk measures a r e

available, such as the so-called coherent ri s k measures [ADEH]. This does not

mean that the ﬁnancial approach is wrong but the a ctuarial framework is at

least a complementing alternative. For a detailed criticism of those regulatory

documents we refer the reader to [D EGK].

In [VDGK], the authors study the provisioning problem and impose

minimum requirements of available capital at each period, called hurdles. In

their framework, hurdles are modeled as separated chance constraints with

given reliability levels, one for each period of time. They coined the term hurdle-

race problem to describe the provisioning problem with hurdles. We start by

summarizing the approach proposed in [VDGK] and stating their main result.

Then we propose an alternative chance constrained model which requires that

all the obligations should be met jointly with a given reliability level. To this

end we make use of a joint chance constraint and refer to the problem as

the joint hurdle-race problem. We pro pose to solve the corresponding problem

using SAA and to compare the results with [VDGK]. In addition, we consider

another generalization in which the hurdles are not determined by the model

builder but are deﬁned as discounted values of f uture obligations by stochastic

risk-free rates.

PUC-Rio - Certificação Digital Nº 0510535/CA

5.1

The hurdle-race problem and comonotonicity

An insurer wants to determine the initial provision R

required to meet

n future obligations, of costs α

, . . . , α

at ﬁxed, prescribed times t

, . . . , t

, for

t ∈ [0, t

]. Among obligations, t he insurer may invest his capital, with random

returns. More precisely, the stochastic return process (Y

, . . . , Y

) such that 1

unit at time 0 grows to exp(Y

+ · · · + Y

) at time t

determines the evolution

of capital R

in time,

= R

j− 1

exp(Y

) − α

, j = 1, . . . , n. (5-1)

In [VDGK], t he authors impose probabilistic constraints (the hurdles) tha t

have to be met every time t

, that is, provision R

has to be larger than a

deterministic value V

with high probability 1 − ε

. They formulate the hurdle-

race problem as follows:

= Min

≥V

(5-2)

Prob{R

≥ V

| R

} ≥ 1 − ε

, j = 1, . . . , n

for given hurdles V

, V

, . . . , V

and given tolerances ε

, ε

, . . . , ε

∈ [0, 1 ].

To determine the optimal provision

in (5-2) set

[0,j]

j− 1



i=1

exp(−Y

− · · · − Y

) + (V

+ α

) exp(−Y

− · · · − Y

), (5-3)

the stochastically discounted value of the future obligations in the restricted

time period [0, j]. Theorem 1 in [VDG K], below, gives the optimal solution of

problem (5-2) in terms of the quantiles of the distributions of S

[0,j]

Theorem 7 The optimal initial provision

deﬁned in (5-2) is given by

= Max{V

, F

−1

[0,1]

(1 − ε

), F

−1

[0,2]

(1 − ε

), . . . , F

−1

[0,n]

(1 − ε

)},

where F

[0,j]

is the cumulative distribution function of S

[0,j]

, j = 1, . . . , n.

A basic ingredient in the proo f is the simple fact that

Prob {R

≥ V

| R

} = Prob



[0,j]

≤ R



, j = 1, . . . , n.

Thus, in order to determine the optimal

we are led to compute the quan-

tiles of the random variables S

[0,j]

, which is very hard in most relevant cases.

For instance, if the random return process (Y

, . . . , Y

) follows a multivariate

normal distribution, we have that S

[0,j]

is a sum of lognormal distributions, a

PUC-Rio - Certificação Digital Nº 0510535/CA

random variable with no known distribution. The approach in [VDGK] repla-

ces the random variables S

[0,j]

by simpler random variables using comonotonic

approximations, whose quantiles can be calculated explicitly. Such approxima-

tions assume the random vector is strongly correlated, with all the components

depending on the same univa r iate random variable. A detailed description of

the t heory of comonotonicity can be found in [DDGa]. For examples and ap-

plications we refer the reader to [DDG b].

In [VDGK], numerical experiments are performed for the case in which

the random variables (5-3) are sums of lognormals, and hence the return

process (Y

, . . . , Y

) follows a multivariate normal distribution. They compare

their results with the va lues obtained f r om the empirical distribution of S

[0,j]

In the next section, in the more general joint hurdle -race problem, we apply

SAA to obtain good candidate solutions. We compare the results for the joint

hurdle-race to the ones obtained in [VDGK]. From now on we refer to this

model (5-2) as the separated hurdl e -race problem.

5.2

The joint hurdle-race problem

The deﬁnition of

in (5-2) does not reﬂect the proper safety requi-

rements: for each ﬁxed time t

the probability of not satisfying a hurdle is

small, but the probability of having missed one of the hurdles may remain

high. Indeed, there is no guarantee that the optimal provision keeps the joint

probability of missing at least one hurdle low. In [HEN], the author exempli-

ﬁes the contrast between both models in a cash matching problem somewhat

similar to the separated hurdle race problem (5-2).

We then consider the joi nt hurdle - race problem,

= Min

≥V

(5-4)

Prob{R

≥ V

, j = 1, . . . , n | R

} ≥ 1 − ε (5-5)

for ε ∈ [0, 1]. As opposed to problem (5-2), the optimal provision

in problem

(5-4) is the smallest value such that with high probability no hurdle is violated.

Although we have a single constraint in (5-4) opposed to n constraints

in (5-2), problem (5- 4) is harder to solve: the joint probability calculation in

(5-4) involves the computation of a quantile of the cumulative distribution

of the random vector (S

[0,1]

, S

[0,2]

, . . . , S

[0,n]

), an extremely diﬃcult task. Even

checking feasibility for a given candidate R

is usually hard.

We use SAA to obtain good candidate solutions of (5-4) and lower bounds

for the optimal value. Indeed, (5-4) is a joint chance constrained problem so

PUC-Rio - Certificação Digital Nº 0510535/CA

the techniques of the previous Chapters apply.

The jo int hurdle-race problem is not explicitly written in format (1-1),

but it can be easily converted by employing the max-function

Prob {R

≥ V

, . . . , R

≥ V

| R

} = Prob



[0,1]

≤ R

, . . . , S

[0,n]

≤ R



= Prob



max

j=1,...,n

[0,j]

} ≤ R



= Prob



max

j=1,...,n

[0,j]

} − R

≤ 0



. (5-6)

Using (5-6) we have that problem (5-4) is a particular case of (1-1):

Min

≥V

s.t. Prob



max

j=1,...,n

[0,j]

} − R

≤ 0



≥ 1 − ε.

(5-7)

But how do we solve (5-7) with SAA? Given a sample size N and a reliability

level γ, the SAA formulation becomes a MILP as follows.

Min

≥V

s.t. (V

+ α

) exp(−Y

) − R

− Kz

≤ 0, s = 1, . . . , N,



i=1

i(j)

exp(−Y

− · · · − Y

) − R

− Kz

≤ 0, s = 1, . . . , N,



i=1

i(n)

exp(−Y

− · · · − Y

) − R

− Kz

≤ 0, s = 1, . . . , N,



s=1

≤ Nγ,

∈ {0, 1}

(5-8)

where K is a suﬃciently large positive constant, Y

are samples f r om the return

process (Y

, . . . , Y

) and

i(j)



, i = j,

+ α

, i = j.

The feasibility check cannot be performed exactly and uses Monte-Carlo

methods.

5.2.1

Numerical experiments

We compare separated and the joint hurdle-race problems. Following

[VDGK], we performed experiments with n = 40 periods of investment and

with normal iid returns Y

with mean µ = log 1.10 and standard deviation

σ = 0.10. The hurdles and t he liabilities are equal to 10 and 0.8 respectively for

all time periods. Fo r the separated hurdles we choose ε

= 0 .05 for all periods

PUC-Rio - Certificação Digital Nº 0510535/CA

and for the joint case we take ε = 0.05. Our numerical exp eriments showed tha t

the stochastic lower bound approximation in [VDGK] is extremely accurate

for the separated hurdle-race problem, and we obtained

= 13.56411337.

An (sto chastic) upper bound, obtained by the comonotonic approximation, is

14.4812509 9.

For the joint hurdle-race problem, we ﬁrst choose values of γ a nd N.

Following the empirical ﬁndings for the portfo lio problem and the blending

problem of the previous Chapter, we set γ = ε/2 = 0.025. We then compute

the value of N for which the optimal solution of the SAA problem is f easible

for the original problem with probability greater than 99% (see [CG]), which

in this case is N = 90. This estimate is usually t oo conservative and should

be regarded as an upper bound for the actual value of N to be used in the

experiments. The best result was obtained f or N = 50 and t he smallest initial

provision was

= 15.81238194. A Monte-Carlo experiment with 100 000

samples estimated the true probability o f this candidate as 0.9527 and it is

thus feasible for the original problem.

We compute statistical lower bounds for the optimal value of the joint

hurdle-race problem following section 4.1.3. Fixing M = 1 000 and L according

to (3-12), the 99%-conﬁdence lower bound for the optimal value is 15.302594.

Obviously, the candidate solution obtained by SAA can be regarded as an

upper bound for the true optimal value (if feasible).

Similar experiments were performed with ε

= 0.01. The techniques in

[VDGK] g ive 16.34858684 for the solution of the separated problem, with

comonotonic upper bound

equal to 18.37208466. It is not clear how these

methods should be extended to the joint hurdle problem. In this case, applying

SAA with N = 150 and γ = ε/2 = 0.005, the new method obtained

18.2164034 5. The true probability was estimated by Monte-Carlo and was

equal to exactly 1, with 100 000 samples. The 99%-conﬁdent lower bound

obtained was 17.515987.

It is interesting to compare the solution for the joint case (5-4) with the

one for the separate case (5-2). The best provisions obtained for the latter were

less than those for the former in both experiments. One might be tempted to

adopt the separated version as the best model since smaller provisions are

usually preferred. However, the solution of the joint problem is obviously more

robust. We proceed to substantiate this claim with some simulations.

The separated case was treated as in [VDGK], whereas we used SAA

for the joint problem. For illustration, we ﬁrst present small size simulation

The comonotonic upper bound is constructed using conditional expectations of como-

notonic random variables.

PUC-Rio - Certificação Digital Nº 0510535/CA

experiments that show the diﬀerences between the two approaches. Figures

5.1 and 5.2 show the estimated probability of violating each hurdle for the

separated problem. The values in the y-axis are close to 0.0 5 and 0.01

respectively, in accordance with the chosen reliability levels. Figures 5.3 and

5.4 show 100 sample paths for each choice of ε in the separated case. In Figure

5.3, 12 paths violated one of the hurdles at least once, g iving a violation

probability of 0.12, signiﬁcantly higher than the original 0.05 conﬁdence level.

In Figure 5.4, the probability of violation was 0.06, also higher tha n the original

signiﬁcance level 0.01. In the joint case, Figures 5.5 and 5.6 indicate that at

least one constraint was violated only 3% and 1% of the time, in accordance

with the joint reliability levels ε = 0.05 and ε = 0.01, a much more robust

situation.

In order improve numerical accuracy, we ran the same experiments for

10 000 paths. For ε

= ε = 0.05, the estimated probability of violation of

at least one hurdle for the separated hurdle-race problem was 0.1173, much

higher than the corresponding value 0.0328 for the j oint version with the same

reliability level. For ε

= ε = 0.01, the estimated values were 0.024 7 for the

separated for mulation and 0.009 4 for the joint counterpart. In both cases the

separated hurdle-race problem misses the joint reliability level by roughly twice

the pre-determined reliability level, substantially underestimating the more

robust provision given by SAA for the joint case.

0 5 10 15 20 25 30 35 40 45

0.01

0.02

0.03

0.04

0.05

0.06

0.07

0.08

Figure 5.1: ε = 0.05.

0 5 10 15 20 25 30 35 40 45

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

Figure 5.2: ε = 0.01.

5.3

Stochastic hurdles

We now consider a joint hurdle-race model with the possibility of mo-

deling stochastic hurdles. Now we deﬁne the hurdles as the market consistent

value of future liabilities when evaluating the portfolio at period j, i.e., the

PUC-Rio - Certificação Digital Nº 0510535/CA

0 5 10 15 20 25 30 35 40 45

−200

200

400

600

800

1000

1200

Figure 5.3: Sample paths for ε =

0.05, SHR.

0 5 10 15 20 25 30 35 40 45

500

1000

1500

2000

2500

3000

Figure 5.4: Sample paths for ε =

0.01, SHR.

0 5 10 15 20 25 30 35 40 45

500

1000

1500

2000

2500

Figure 5.5: Sample paths for ε =

0.05, JHR.

0 5 10 15 20 25 30 35 40 45

200

400

600

800

1000

1200

1400

1600

1800

Figure 5.6: Sample paths for ε =

0.01, JHR.

hurdles will be the discounted cash ﬂows at the stochastic risk-free rate. In

particular, the hurdles V

are not known at period j.

In addition to the stochastic return process (Y

, . . . , Y

), the model will

have a risk-free rate process (r

, . . . , r

) that will determine the (stochastic)

hurdles as follows.

= α

exp(−r

) + α

exp(−r

− r

) + · · · + α

exp(−r

− · · · − r

= α

+ α

exp(−r

) + · · · + α

exp(−r

− · · · − r

. (5-9)

= α

SAA obtains candidate solutions with no additional computational eﬀort.

In the numerics, the risk-free rate process was composed of iid normal random

variables with mean µ

= 0.04 and standard deviation σ

= 0.05. The

stochastic return process is the same as described in section (5.2.1). For

PUC-Rio - Certificação Digital Nº 0510535/CA

ε = 0.05, the best solution was 21.68791898, with N = 60 and γ = ε/2 = 0.025.

The estimated probability was 0.9527 5, using 20 000 samples. For ε = 0.01,

the best solution was 25.179 16015, with N = 120 and γ = ε/2 = 0.005. The

estimated probability was exactly 1, with 20 000 samples.

PUC-Rio - Certificação Digital Nº 0510535/CA

Conclusion

We believe that t he SAA method is a valuable tool to obtain good

candidate solutions for chance constrained problems. An important advantage

of the method is the fairly general assumption on the underlying distribution

of the random variables of the problem: SAA only requires the ability to

sample from the given distribution. The related numerical analysis is tra ctable

and stable, allowing for practical implementations. The thesis bridges the

gap between theory and application, by presenting theoretical foundations,

techniques for parameter calibration and a collection of applications, which

exemplify as benchmarks and as new amenable problems.

SAA was used in a portfolio chance constrained problem with random

returns. In the normal case, the eﬃcient f rontier can be computed explicitly

and we used it as a benchmark solution. Fixing the parameters ε = 0.10 and

γ = 0 in the method, we concluded that the sample size suggested by Campi-

Garatti inequality (4-6 ) was t oo conservative for our problem: a much smaller

sample gave rise to better feasible solutions. Similar results were obtained

for the log no r mal case, where upper bounds were computed using a method

developed in [NS].

As another illustration of the SAA method, we modeled a two dimen-

sional blending problem as a joint chance constrained problem. Due to the

independence assumption, o ne can again solve the problem explicitly and ﬁnd

the optimal solution for any given reliability level. This served as a benchmark

for SAA on the class of approximate joint chance constrained problems.

In both examples, the choice γ = ε/2 obtains very good candidate

solutions. Even though it generated more infeasible points if compared t o the

choice γ = 0, the feasible ones were of better quality. Using the γN-plot

(Figures (4.4) and (4.9)) we were able to conﬁrm these empirical ﬁndings for

our two t est problems. Relatively small sample sizes (e.g., if compared to (4-6)

estimates) can yield good candidate solutions, which is crucial since for γ > 0

the SAA problem is an integer program. Upper bounds were constructed for

the portfolio pro blem with γ = 0 and continuous linear programs were solved

to obtain the estimates. According to approximation (4-8), the number of

PUC-Rio - Certificação Digital Nº 0510535/CA

samples N should be of order 1/ε. Since no closed solution is available for

the por t f olio problem with lognormal returns, having an upper bo und is an

important information abo ut the variability of the solution.

Finally we described the hurdle-race problem, proposed in [VDGK]. We

proposed a more adequate formulation in which the hurdles were taken jointly.

We obtained good candidate solutions and lower bounds for the true optimal

value using SAA. An additional extension, assuming stochastic hurdles instead

of deterministic ones, is handled by SAA at almost no extra cost.

Future work will include writing a SAA solver. For this goal, the empirical

ﬁndings in this text are crucial. Together with H. Bortolo ssi, we started to write

a program in C++ using Osi, a uniform API for interacting with callable solver

libraries. At present, we handle SAA problems with SYMPHONY, an open-

source generic MILP developed by T.K. Ralphs. Both Osi and SYMPHONY

can be freely downloaded at www.coin-or.org. The idea is to make our solver

freely available f or use with SLP-IOR

, a user-friendly interface for sto chastic

programs developed by P. Kall and J. Mayer. As of yet, problems such as

the portfolio problem of Chapter 4, where the uncertainty is multiplying the

decision var ia bles, cannot be solved by any solver available at SLP-IOR.

The ﬁrst Chapters of the thesis, in a condensed fo rm, are the content

to [PAS]. A text with S. Vanduﬀel about the hurdle-race problem is in prepa-

ration. We plan to extend the results described here by assuming stochastic

liabilities of the type

= a

+ f

), i = 1, . . . , n,

where a

are constants, (E

, . . . , E

) is a multivariate normal vector and f

are given functions. Such a setting may be appropriate in the context of life

insurance activities. The a

would be t he total amount of ﬁxed guaranteed

beneﬁts to be paid t o the policy holders. The insurance company may also

provide a proﬁt sharing mechanism linked to some return process (E

, . . . , E

)

of an external benchmark (e.g a stock index).

www.ior.uzh.ch/Pages/English/Research/StochOpt/index

en.php

PUC-Rio - Certificação Digital Nº 0510535/CA

Bibliography

[AHM] AHMED, S. Convexity and decomposition of mean-risk sto-

chastic programs. Mathematical Proga mming, v. 106, p. 433–446,

2006.

[AW] ARTSTEIN, Z.; WETS, R .J.-B. Consistency of minimizers and

the slln for stochastic programs. Journal of Convex Analysis,

v. 2, p. 1–17, 1996.

[ADEH] ARTZNER, P.; DELBAEN, F.; EBER, J-.M.; HEATH, D. Cohe-

rent measures of risk. Mathematical Finance, v. 9, p. 203–228,

1999.

[BS] BARR, D.R.; SLEZAK, N.L. A comparison of multivariate

normal generators. Communications of the ACM, v. 15, p. 1048–

1049, 1972.

[BEA] BEALE, E.M.L. On minimizing a convex function subject to

linear inequalities. Journal of the Royal statistical Society, Series

B, v. 17, p. 173–184, 1955.

[BN] BEN-TAL, A.; NEMIROVSKI, A. Robust optmization –

methodology and applications. Mathematical Programming, v.

93, p. 453–480 , 2002.

[BP] BORTOLOSSI, H.J.; PAGNONCELLI, B. Uma

introdu¸c˜ao `a otimiza¸c˜ao sob incerteza. III

Bienal da SBM, Goiˆa nia, Brazil. Available at

http://www.mat.puc-rio.br/∼hjbortol/seminarios

/2006.1/soe/arquivos/iii-bienal-sbm-texto.pdf, 2006.

[CC] CALAFIORE, G.; CAMPI, M.C. The scenario approach to

robust control design. IEEE Transactions on Automatic Control,

v. 51, p. 742–753 , 2006.

[CCS] CHARNES, A.; COOPER, W. W.; SYMMONDS, G.H. Cost ho-

rizons and c ertainty equivalents: an approach to stochastic

PUC-Rio - Certificação Digital Nº 0510535/CA

programming of heating oil. Management Science, v. 4, p. 235–

263, 1958.

[CG] CAMPI, M.C.; GARATTI, S. The exact feasibility of rando-

mized solutions of robust convex programs. Available a t Op-

timization online (www.optimization-online.org), 2007.

[CHE] CHERNOFF, H. A measure of asymptotic eﬃciency for tests

of a hypothesis based on the sum observations. Annals of

Mathematical Statistics, v. 23, p. 493–507, 1952.

[CM] CONSTANTINIDES, G.; MALLIARIS, A.G. Portfolio theory. In

R.A. Jarrow, V. Maksimovic, and W.T. Ziemba, editors, Finance, v.

9 of Handbooks in OR & MS, North- Holland Publishing Company,

p. 1–30, 1995.

[DEGK] DAN

IELSSON, J. et al. An academic response to Basel II.

LSE Financial Markets Group, Special Paper Series, number 130,

2001.

[DAN] DANTZIG, G.B. Linear programming under uncertainty.

Management science, vol. 1, 197–206, 1955.

[DPR] DENTCHEVA, D.; PR

EKOPA, A.; RUSZCZY

NSKI, A. Conca-

vity and eﬃcient points of discrete distributions in pro-

babilistic programming. Mathematical Programming, v. 89, p.

55–77, 2000.

[DDGa] DHAENE, J. et al. The concept of comonotonicity in actu-

arial science and ﬁnance: theory. Insurance: Mathematics and

Economics, v. 31, p. 3–33 , 2002.

[DDGb] DHAENE, J. et al. The concept of comonotonicity in actua-

rial science and ﬁnance: applications. Insurance: Mathematics

and Economics, v. 31, p. 133–16 1, 2002.

[DGKS] DUPA

COV

A, J.; GAIVORONSKI, A.; KOS, Z.; SZ

ANTAI, T. Sto-

chastic programming in water management: a case study

and a comparison of solution techniques. Europ ean Journal

of Operational Research, v. 52, p. 28–44, 1991.

[HV] KLEIN HANEVELD, W.K.; VAN DER VLERK, M.H. Stochastic

Programming (lecture notes). 20 07.

PUC-Rio - Certificação Digital Nº 0510535/CA

[HEN] HENRION, R. Introduction to chance constrained program-

ming. Available at http://www.stoprog.org/.

[HM] HENRION, R.; M

OLLER, A. Optimization of a continuous

distillation pro cess under random inﬂow rate. Computers &

Mathematics with Applications, v. 45, p. 247–262, 2003.

[HLMS] HENRION, R. et al. Stochastic optimization for opera-

ting chemical processes under uncertainty. In Grtschel, M.,

Krunke, S., Rambau, J., (editors), Online optimization o f large scale

systems, Springer, p. 457–478, 2001.

[HS] HIGLE, J.; SEN, S. Stochastic Decomposition: A Statistical

Method for Large Scale Stochastic Linear Programming.

Springer, Berlin, 1996.

[KM] KALL, P.; MAYER, J. Stochastic Linear P rogramming. Sprin-

ger, 2005.

[KW] KALL, P.; WALLACE, S. Stochastic P rogramming. Wiley John

& Sons, 1995. Freely available at http://www.stoprog.org/.

[KQ] KAO, E.P.C.; QUEYRANNE, M. Budgeting cost of nursing in

a hospital. Management Science, v. 31, p. 608–621, 1985.

[LSW] LINDEROTH, J.; SHAPIRO, A.; WRIGHT, S. The empirical

behavior of sampling methods for stochastic programming,

Annals of Operations Research, v. 1 42, p. 215–241, 2006.

[LS] LOUVEAUX, F.V.; SMEERS, Y. Optimal investments for elec-

tricity generation: a st ochastic model and a test problem.

In Yu. Ermoliev and R.J.-B. Wets, editors, Numerical techniques for

stochastci optimization, chapter 24, Springer-Verlag, Berlin, 1988.

[LA] LUEDTKE, J.; AHMED, S. A sample approximation approach

for optimization with probabilistic constraints SIAM Journal of

Optimization, v. 19, p. 674–699 , 2008.

[LAN] LUEDTKE, J.; AHMED, S.; NEMHAUSER, G.L. An integer

programming approach for linear programs with proba-

bilistic constraints. To appear in Mathematical Programming,

2007.

PUC-Rio - Certificação Digital Nº 0510535/CA

[LK] LAW, A.; KELTON, W.D. Simulation Modeling and Analysis.

Industrial Engineering and Management Science Series. McGraw-

Hill Science/Engineering/Math, 1999.

[MAR] MARKOWITZ, H. Portfolio selection. Journal of Finance, v. 7,

p. 77–97, 1952.

[NS] NEMIROVSKI, A.; SHAPIRO, A.: Convex approximations of

chance constrained programs. SIAM Journal on Optimization,

v. 17, 969–996, 2006.

[PAS] PAGNONCELLI, B.K.; AHMED, S.; SHAPIRO, A. The Sample

Average Approximation method for chance constrained

programming: theory and applications. To appear in Journal

of Optimization Theory and Appplications, 2009 .

[PP] PEREIRA, M.V.F.; PINTO, L.M.V.G. Multi-stage stochastic

optimization applied to energy planning. Mathematical Pro-

gramming, v. 52, p. 359–3 75, 1991.

[PBK] PETERS, R.J.; BOSKMA, K.; KUPPER, H.A.E. Stochastic pro-

gramming in production planning: a case with non-simple

recourse. Statistica Neerlandica, v. 31, p. 113– 126, 1977.

[PREa] PR

EKOPA, A. Probabilistic programming. In A. Ruszczy´nski

and A. Shapiro, editors, Stochastic Programming, volume 10 of

Handbooks in OR & MS, North-Holland Publishing Company, p.

267-351, 2003.

[PREb] PR

EKOPA, A. Stochastic Programming. Kluwer Acad. Publ.,

Dordrecht, Boston, 1995 .

[RW] ROCKAFELLAR, R.T.; WETS,R.J.-B. Variational Analysis.

Springer, Berlin, 1998.

[SHA] SHAPIRO, A. Monte carlo sampling methods. In

A. Ruszczy´nski and A. Shapiro, editors, Stochastic Program-

ming, volume 1 0 of Handbooks in OR & MS, North-Holland

Publishing Company, p. 3 53–425, 2003.

[SR] SHAPIRO, A.; RUSZCZY

NSKI, A. Lectu-

res on stochastic progr amming. Available at

http://www2.isye.gatech.edu/~ashapiro/publications.html,

2008.

PUC-Rio - Certificação Digital Nº 0510535/CA

[VAN] VANDUFFEL, S. Comonotonicity: from risk measurement to

risk management. PhD thesis, University of Amsterdam, 2005.

[VDGK] VANDUFFEL, S., DHAENE, J., GOOVAERTS, M., KASS, R. The

hurdle-race problem. Insurance: Mathematics and Economics, v.

33, p. 405–413 , 2003.

[WCZ] WANG, Y.; CHEN, Z.; ZHANG, K. A chance-constrained port-

folio selection problem under t-distrbution. Asia Paciﬁc Jour-

nal of Operational Research, v. 24, p. 53 5–556, 2007.

[ZTSK] ZHU, M.; TAYLOR , D.B.; S. C. SARIN, S.C.; KRAMER, R.A.

Chance constr ained programming models for risk-based

economic and policy analysis of soil conservation. Agricul-

tural and Resource Economics Review, v. 23, p. 58–65, 1994.

PUC-Rio - Certificação Digital Nº 0510535/CA

Livros Grátis
( http://www.livrosgratis.com.br )
 
Milhares de Livros para Download:
 
Baixar livros de Administração
Baixar livros de Agronomia
Baixar livros de Arquitetura
Baixar livros de Artes
Baixar livros de Astronomia
Baixar livros de Biologia Geral
Baixar livros de Ciência da Computação
Baixar livros de Ciência da Informação
Baixar livros de Ciência Política
Baixar livros de Ciências da Saúde
Baixar livros de Comunicação
Baixar livros do Conselho Nacional de Educação - CNE
Baixar livros de Defesa civil
Baixar livros de Direito
Baixar livros de Direitos humanos
Baixar livros de Economia
Baixar livros de Economia Doméstica
Baixar livros de Educação
Baixar livros de Educação - Trânsito
Baixar livros de Educação Física
Baixar livros de Engenharia Aeroespacial
Baixar livros de Farmácia
Baixar livros de Filosofia
Baixar livros de Física
Baixar livros de Geociências
Baixar livros de Geografia
Baixar livros de História
Baixar livros de Línguas

Baixar livros de Literatura
Baixar livros de Literatura de Cordel
Baixar livros de Literatura Infantil
Baixar livros de Matemática
Baixar livros de Medicina
Baixar livros de Medicina Veterinária
Baixar livros de Meio Ambiente
Baixar livros de Meteorologia
Baixar Monografias e TCC
Baixar livros Multidisciplinar
Baixar livros de Música
Baixar livros de Psicologia
Baixar livros de Química
Baixar livros de Saúde Coletiva
Baixar livros de Serviço Social
Baixar livros de Sociologia
Baixar livros de Teologia
Baixar livros de Trabalho
Baixar livros de Turismo