OR in an OB World: Modeling the All Units Discount

Friday, December 10, 2010

Modeling the All Units Discount

In a previous entry, I discussed how to incorporate continuous piecewise linear functions in a mathematical program. There are times, however, when you have to model a discontinuous piecewise linear function. A classic example is the all units discount, where exceeding a quantity threshold (called a breakpoint) for a product order garners a reduced price on every unit in the order. (The incremental discount case, where the price reduction applies only to units in excess of the breakpoint, is covered in the prior entry.) SOS2 variables, which I used in the first post, will not work here because they model a continuous function.

Let $y$ be a quantity (typically cost) that we wish to minimize. As before, we will assume that $y$ is a piecewise linear function of some variable $x$; for simplicity, I'll assume bounds $0 \le x \le U$ and $y\ge 0$. Let $0 = \gamma_1 < \cdots < \gamma_N < \gamma_{N+1}=U$ denote the breakpoints, and let $y=f_i(x)=\alpha_i x + \beta_i$ when $\gamma_i \le x < \gamma_{i+1}$, with the adjustment that the last interval is closed on the right ($f(U)=\alpha_N U + \beta_N$). With all units discounts, it is typical that the intercepts are all zero ($\beta_1 = \cdots = \beta_N = 0$) and cost ($y$) is proportional to volume ($x$), as seen in the first graph below, but we need not assume that. The second graph illustrates a somewhat more general case. In fact, we need not assume an economy of scale; the same modeling approach will work with a diseconomy of scale.
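To make the definition concrete, here is a minimal Python sketch that evaluates the discontinuous function directly from the breakpoints, slopes and intercepts. The function name `all_units_cost` and the price schedule are illustrative assumptions, not part of the original post.

```python
from bisect import bisect_right


def all_units_cost(x, gammas, alphas, betas):
    """Evaluate y = f_i(x) = alphas[i] * x + betas[i] on the segment containing x.

    gammas holds gamma_1, ..., gamma_{N+1} (so len(gammas) == N + 1), with
    gammas[0] == 0 and gammas[-1] == U. Segments are [gamma_i, gamma_{i+1}),
    except the last one, which is closed on the right.
    """
    n = len(alphas)
    if not (gammas[0] <= x <= gammas[-1]):
        raise ValueError("x is outside [0, U]")
    # Index of the last breakpoint <= x, clamped so that x == U uses segment N.
    i = min(bisect_right(gammas, x) - 1, n - 1)
    return alphas[i] * x + betas[i]


# Hypothetical schedule: 10 per unit below 100 units, 9 from 100, 8 from 500.
gammas = [0, 100, 500, 1000]
alphas = [10, 9, 8]
betas = [0, 0, 0]

print(all_units_cost(99, gammas, alphas, betas))   # 99 units at the full price
print(all_units_cost(100, gammas, alphas, betas))  # 100 units at the first discount
```

With this schedule, ordering 100 units costs less than ordering 99, which is exactly the downward jump that a continuous (SOS2) model cannot represent.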

To model the discount, we need to know into which interval $x$ falls. Accordingly, we introduce binary variables $z_1,\ldots ,z_N$ and add the constraint $$\sum_{i=1}^N z_i = 1,$$ effectively making $\{z_1,\ldots , z_N\}$ an SOS1 set. Let $$M_i = \max_{0 \le x \le U} f_i(x)$$ for $i=1,\ldots,N$. Typically $M_i = f_i(U)$, but again we need not assume that. To connect the binary variables to $x$, we add for each $i\in \{1,\ldots,N\}$ the constraint $$\gamma_i z_i \le x \le \gamma_{i+1} z_i + U(1-z_i).$$ This just repeats the domain $0\le x \le U$ for all but the one index $i$ with $z_i = 1$; for that index, we get $\gamma_i \le x \le \gamma_{i+1}$.
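The constants $M_i$ and the linking constraints can be checked by hand on a small instance. The following Python sketch does that; the helper names `big_m` and `link_feasible` and the data are hypothetical, chosen only for illustration.

```python
def big_m(alphas, betas, U):
    """M_i = max of f_i(x) = alphas[i] * x + betas[i] over [0, U].

    A linear function attains its maximum over an interval at an endpoint.
    """
    return [max(b, a * U + b) for a, b in zip(alphas, betas)]


def link_feasible(x, z, gammas):
    """Check sum(z) == 1 and gamma_i z_i <= x <= gamma_{i+1} z_i + U (1 - z_i)."""
    U = gammas[-1]
    if any(zi not in (0, 1) for zi in z) or sum(z) != 1:
        return False
    return all(
        gammas[i] * z[i] <= x <= gammas[i + 1] * z[i] + U * (1 - z[i])
        for i in range(len(z))
    )


# Hypothetical data: breakpoints gamma_1..gamma_4, so N = 3 and U = 1000.
gammas = [0, 100, 500, 1000]
print(big_m([10, 9, 8], [0, 0, 0], gammas[-1]))
print(link_feasible(250, [0, 1, 0], gammas))  # x in the second interval
print(link_feasible(250, [1, 0, 0], gammas))  # wrong interval selected
```

Note that at $x=100$ both `[1, 0, 0]` and `[0, 1, 0]` pass the check, since each selected interval is closed in the constraints.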

The last piece is to relate the binary variables to $y$. We handle that by adding for each $i\in \{1,\ldots,N\}$ the constraint $$y\ge \alpha_i x + \beta_i - M_i(1-z_i).$$ For all but one index $i$, the right-hand side reduces to $f_i(x)-M_i \le 0$, so the constraint is redundant given $y \ge 0$. For the index $i$ corresponding to the interval containing $x$, we have $y\ge f_i(x)$. Since $y$ is being minimized, the solver will automatically set $y=f_i(x)$.
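As a sanity check on the full formulation, the sketch below fixes $x$, enumerates every admissible choice of $z$, and computes the smallest $y$ that satisfies all of the constraints. This is a brute-force stand-in for what a MIP solver would do, not a solver model; the function name `min_y` and the data are assumptions made for illustration.

```python
def min_y(x, gammas, alphas, betas):
    """Return (y, k): the minimal y for fixed x and the 0-based interval k chosen.

    Enumerates the N one-hot vectors z, keeps those satisfying the linking
    constraints, and takes the smallest y with y >= 0 and
    y >= alphas[i] * x + betas[i] - M[i] * (1 - z[i]) for every i.
    Returns None if no z is feasible for this x.
    """
    n = len(alphas)
    U = gammas[-1]
    M = [max(b, a * U + b) for a, b in zip(alphas, betas)]
    best = None
    for k in range(n):
        z = [1 if i == k else 0 for i in range(n)]
        linked = all(
            gammas[i] * z[i] <= x <= gammas[i + 1] * z[i] + U * (1 - z[i])
            for i in range(n)
        )
        if not linked:
            continue
        y = max([0] + [alphas[i] * x + betas[i] - M[i] * (1 - z[i]) for i in range(n)])
        if best is None or y < best[0]:
            best = (y, k)
    return best


# Hypothetical schedule: 10 per unit below 100 units, 9 from 100, 8 from 500.
gammas = [0, 100, 500, 1000]
alphas = [10, 9, 8]
betas = [0, 0, 0]
print(min_y(250, gammas, alphas, betas))  # interior of the second interval
print(min_y(100, gammas, alphas, betas))  # exactly on a breakpoint
```

On the breakpoint $x=100$, two intervals are feasible and the minimization picks the cheaper one.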

Adjustments for nonzero lower bounds for $x$ and $y$ are easy and left to the reader as an exercise. One last thing to note is that, while we can talk about semi-open intervals for $x$, in practice mathematical programs abhor regions that are not closed. If $x=\gamma_{i+1}$, then the solver has a choice between $z_i=1$ and $z_{i+1}=1$ and will choose whichever one gives the smaller value of $y$. This is consistent with typical all units discounts, where hitting a breakpoint exactly qualifies you for the lower price. If your particular problem requires that you pay the higher cost when you are sitting on a breakpoint, the model will break down. Unless you are willing to tolerate small gaps in the domain of $x$ (so that, for instance, the semi-open interval $[\gamma_i ,\gamma_{i+1})$ becomes $[\gamma_i , \gamma_{i+1}-\epsilon]$ and $x$ cannot fall in $(\gamma_{i+1}-\epsilon,\gamma_{i+1})$), you will have trouble modeling your problem.
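The $\epsilon$ workaround can be illustrated with the same kind of brute-force check. In this sketch the upper end of every interval except the last is pulled in by `eps`, so a point sitting on a breakpoint can only belong to the interval that starts there. The function name `min_y_with_gap`, the diseconomy-of-scale prices and the value of `eps` are all hypothetical.

```python
def min_y_with_gap(x, gammas, alphas, betas, eps):
    """Minimal y for fixed x when interval i is shrunk to [gamma_i, gamma_{i+1} - eps].

    The last interval keeps its upper end U. Returns (y, k) with k the 0-based
    interval chosen, or None when x falls into one of the excluded gaps.
    """
    n = len(alphas)
    U = gammas[-1]
    M = [max(b, a * U + b) for a, b in zip(alphas, betas)]
    # Shrunken upper ends; the final interval stays closed at U.
    ub = [gammas[i + 1] - eps if i < n - 1 else U for i in range(n)]
    best = None
    for k in range(n):
        z = [1 if i == k else 0 for i in range(n)]
        linked = all(
            gammas[i] * z[i] <= x <= ub[i] * z[i] + U * (1 - z[i])
            for i in range(n)
        )
        if not linked:
            continue
        y = max([0] + [alphas[i] * x + betas[i] - M[i] * (1 - z[i]) for i in range(n)])
        if best is None or y < best[0]:
            best = (y, k)
    return best


# Hypothetical diseconomy of scale: the unit price rises at each breakpoint.
gammas = [0, 100, 500, 1000]
alphas = [8, 9, 10]
betas = [0, 0, 0]
print(min_y_with_gap(100, gammas, alphas, betas, 0))   # no gap: cheaper price wins
print(min_y_with_gap(100, gammas, alphas, betas, 1))   # gap: higher price is forced
print(min_y_with_gap(99.5, gammas, alphas, betas, 1))  # x inside the excluded gap
```

The third call shows the price of the workaround: values of $x$ strictly between $\gamma_{i+1}-\epsilon$ and $\gamma_{i+1}$ become infeasible.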

10 comments:

Scott R, July 2, 2014 at 8:19 PM
Dear Paul,

Is there a reason you wouldn't use this technique to also model a continuous function? Moreover, if you wanted a concave piecewise linear (<=) constraint on a linear function L(x), could you define y as you have (without the minimization term in the objective) and then make the constraint: y <= f(x). I would think this would create a feasible region with the constraint L(x)>= f(x), correct? Would this only work if f(x) is concave? My first thought is it'd work for any piecewise linear function.

Also, would this method be slower than the SOS2 implementation?
Unknown, November 13, 2015 at 4:16 PM
Dear Paul,
I want to minimize a piecewise price function of multiple variables, where the price is defined as a function of the total quantity consumed by many users at different hours of the day. The price applied to each hour depends on the total consumption in that hour and is defined on successive increasing intervals. The price gets higher when we hit a breakpoint.
The function to minimize is :
F = ∑ (P(h). Q(hi))
with Q(H) = Q(h1)+Q(h2)+ …+Q(hn), where H is some hour and N is the number of consumers.
I want to use binary variables z for every hour and apply the corresponding price in the constraints the same way you mentioned.
L(i)z(i) <= ∑ Q(hi) < U(i)z(i)
I am also using the 2 large constants M1 and M2 in my model.
∑ Q(hi) < U(i+1)z(i) + M1(1-z(i))
Y(h) >= (P(h). Q(hi)) - M2(1- z(i))
The objective function is rewritten as:
F = ∑ Y(h)
My question is: how can I choose appropriate values for the large constants M1 and M2 so that the optimization problem gets solved? (I am working with Matlab.)
Right now my model is working, but I am just trying multiple values of M2 to get it to work, and I am not sure at all about the values of M1.
Which other constraints or conditions should I take care of to make this model return the minimum total cost?
Thank you very much.
Unknown, November 18, 2015 at 12:33 PM
Hi Paul,

Thank you for the clarifications and for this very interesting blog.

I tried to enter the mathematical formulas correctly to explain the details, but I could not find a good online editor for authoring them. I am sorry about this.

M1 is used in the constraint that specifies the upper limit of the interval into which the total consumption ∑ Q(Ht) for a specific hour Ht may fall. There is one such constraint for each possible interval and for each hour.
So, for Interval3 = (L3,U3) and hour h = (Ht), I will have this constraint for the upper limit :
∑ Q(Ht) < U3 . z3 + M1(1-z3) . While the constraint for the lower limit will be
L3 . z3 <= ∑ Q(Ht).
So I have: L3.z3 <= ∑ Q(Ht) < U3.z3 + M1(1-z3), which uses the binary variables z to let the branch and bound optimizer place ∑ Q(Ht) in interval 3.

M2, on the other hand, is used in the constraints on the cost incurred for a specific hour. I will use M2 in the constraint on the total cost P(Ht) for a specific hour Ht when the total consumption is in interval 3:
Y(Ht) >= (P(Ht) * Q(Ht)) - M2 . (1- z3)

The objective function after variable substitution becomes:
F = ∑ Y(Ht) instead of
F = ∑ (P(Ht). Q(Ht)), where Ht = 1, ..., 24.

Right now I am using a single value of M1 in all constraints and a single value of M2. I can see that M2 gives good results only above some threshold, while for M1 I was not able to see any effect from the different values I tested.

Thank you.
