OR in an OB World: Farkas Certificates in CPLEX

Saturday, July 23, 2011

Farkas Certificates in CPLEX

This is a post about linear programs, specifically infeasible linear programs, and how to deal with them in CPLEX. In response to a question on a CPLEX forum, I played around a bit with Farkas certificates as implemented in CPLEX 12.2. I've never really worked with them, as I have a tendency to produce feasible models. The posted question, though, aroused my curiosity. It asked how to extend a Farkas certificate to a ray for the (unbounded) dual when the primal problem contains bounds on variables.

A Farkas certificate is a vector of constraint multipliers that proves a primal linear program to be infeasible. The name comes from Farkas's Lemma. Assume, without loss of generality, that we have a linear program of the form\[
\textrm{(P) }\begin{array}{lrcl}
\textrm{minimize} & 0\\
\textrm{subject to} & Ax & \ge & b\\
& x & \ge & l\\
& x & \le & u
\end{array}.
\]We are interested here only in the feasibility of (P), so we may as well assume the objective function (irrelevant to feasibility) is zero. Vectors $l$ and $u$ are lower and upper bounds for the variables $x$. To simplify the discussion, we will assume that $l$ and $u$ are finite (but not necessarily nonnegative). What will transpire extends easily to infinite bounds.

Assume that the lower and upper bounds are entered as bounds, rather than as constraints (so that the solver sees $Ax\ge b$ as the only set of constraints). A Farkas certificate for (P) will be a vector $y\ge0$ with the following property. Set $q'=y'A$ and define a vector $z$ of the same dimension as $x$ as follows:\[
z_{j}=\begin{cases}
l_{j} & \textrm{if }q_{j}<0\\
u_{j} & \textrm{if }q_{j}\ge0
\end{cases}.
\](Actually,the definition of $z_{j}$ is arbitrary and irrelevant if $q_{j}=0$; I chose $u_{j}$ just for concreteness.) The key property of the Farkas certificate is that it will satisfy\[
q'z=y'Az < y'b.
\]Now suppose that $x$ is any feasible solution, so that in particular $l\le x\le u$. Then \[
q_{j}\ge0\implies z_{j}=u_{j}\ge x_{j}\implies q_{j}z_{j}\ge q_{j}x_{j}
\]and
\[
q_{j}<0\implies z_{j}=l_{j}\le x_{j}\implies q_{j}z_{j}\ge q_{j}x_{j}.
\]So $q'x\le q'z<y'b$. If $x$ is feasible in (P), though, we have $Ax\ge b$; given that $y\ge0$, it must follow that $q'x=y'Ax\ge y'b$, a contradiction. Thus the existence of a Farkas certificate for (P) tells us that there cannot be any feasible solutions $x$ to (P).

Now let's tie this to the dual (D) of (P), which is\[
\textrm{(D) }\begin{array}{lrcl}
\textrm{maximize} & y'b+v'l-w'u\\
\textrm{subject to} & y'A+v'-w' & = & 0\\
& y,v,w & \ge & 0
\end{array}
\]if we now include the bounds on $x$ as constraints (and treat $x$ as unrestricted in sign). The feasible region of (D) is a cone with vertex at the origin. Denoting by $h^{+}$ and $h^{-}$ the positive and negative parts respectively of a vector $h$ (i.e., $h_{j}^{+}=\max(h_{j,}0)$ and $h_{j}^{-}=-\min(h_{j},0)$), let $y$ be a Farkas certificate for (P), let $q'=y'A$ as before, and let $v=q^{-}$ and $w=q^{+}$. Then $(y',v',w')\ge0$ and $(y'A+v'-w')'=q+v-w=q+q^{-}-q^{+}=0$;
so $(y',v',w')$ is a feasible solution to (D). Moreover, based on our definition of $z$ above, $q'z=-(q^{-})'l+(q^{+})'u=-v'l+w'u$, and so $y'b+v'l-w'u=y'b-q'z>0$. By extending the Farkas certificate $y$ with $v=q^{-}$ and $w=q^{+}$, we obtain a feasible solution to the dual with a positive dual objective value. Since the dual is a cone, $(y',v',w')$ is a recession direction along which the dual objective is unbounded.

Let's go in the other direction. Suppose that (D) is unbounded (making (P) infeasible), and let $(y',v',w')$ be a feasible dual solution with objective value $y'b+v'l-w'u>0$. Let $q'=y'A=w'-v'$ and define $z$ as before. Then $l\le z\le u$, so\[
q'z=w'z-v'z\le w'u-v'l<y'b
\](the last step since the dual objective is positive at $(y',v',w')$); thus $y$ must be a Farkas certificate for (P).

To answer the original question (finally!), suppose we are solving an infeasible primal problem in CPLEX. CPLEX provides a mechanism (the IloCplex.dualFarkas method in the Java API) to retrieve a Farkas certificate $y$. To extend that to the corresponding dual multipliers $v$, $w$ for the lower and upper bounds respectively, compute $y'A$ and assign the positive parts to $w$ and the negative parts to $v$. It would be nice if there were a more direct method (one that would avoid the multiplication $y'A$), but I do not know of one.

One last note: the point of extending the Farkas certificate to a full dual solution is usually to obtain an extreme ray of the dual. As shown above, any solution to (D) with positive objective value provides a Farkas certificate, not just extreme rays. So we rely on the mechanism by which the solver generates the certificate to pick one corresponding to an extreme ray.

12 comments:

Thomas OpferJanuary 13, 2012 at 7:29 AM
Hi!

I think you should use (A | I | I) as constraint matrix of (D) instead of (A | I | -I) for the following reason: If the bounds of (P) were constraints, the constraint matrix of (P) would be (A' | I' | I')' (or the upper bound-constraints should be -x >= -u). w should then be a non-positive variable ( w<=0 ).

The answer to the original Problem is then: Set v = w = 0. Compute q' = -y'A. For i = 1..n: If q[i] >= 0 then v[i] = q[i] else w[i] = q[i].

Best regards,
Thomas
ReplyDelete
Replies
Paul A. RubinJanuary 13, 2012 at 9:11 AM
Thomas,

Making the third block of the constraint matrix of (D) I, making w <= 0, and adding w'u in the dual objective is equivalent to what I did (third block -I, w >= 0, subtract w'u in the objective). I guess it's a matter of taste which way you write it; I lean toward all nonnegative variables where possible. Among other things, I think it makes the connection between q, v and w a bit cleaner.

I think, though, that your q may have the opposite signs of what CPLEX (and other solvers?) produces.
ReplyDelete
Replies
Thomas OpferJanuary 13, 2012 at 11:09 AM
Hi!

Thank you for your reply.

Of course, this is equivalent. We are working on benders cuts and I spent a lot of time in finding a mistake in our code until I saw that I got confused with the signs. Maybe others have similar problems.

For me, q was just some variable. Maybe I should write: Set v = w = 0. Compute q' = y'A. For i = 1..n: If q[i] <= 0 then v[i] = -q[i] else w[i] = -q[i].

By the way: Thank you for this page. It helped us a lot!

Best regards,
Thomas
ReplyDelete
Replies
Paul A. RubinJanuary 13, 2012 at 11:25 AM
You are most welcome - and thank you for the comments!

Cheers,
Paul
ReplyDelete
Replies
AnonymousOctober 28, 2015 at 12:59 PM
Hi Paul.Thanks for the post. I have two questions:
1) In CPLEX, if the variables bounds are explicitly entered as constraints, rather than bounds, then can the dualFarkas return the complete extreme ray? As for the method you stated in this post, I wonder how to calculate y′A in CPLEX?
2) I found the following descriptions about dualFarkas(), at the CPLEX Optimizer (https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014433103). Does this mean we can obtain the extreme ray by combining the results returned by dualFarkas() and getReducedcost()?

CPXdualfarkas() is somewhat tricky to understand. The method provides a set of dual multipliers y', such that the aggregated row y'Ax >= y'b is a valid inequality (the signs of y' match the inequality sense of the constraints) that proves infeasibility of the system. Namely, this aggregated row a'x >= b' cannot be made feasible even if you substitute the "most feasible" bound of x_j for each variable x_j. In other words, max{a'x | l <= x <= u} < b'. In this sense, y' is not a dual ray (because y'A does not need to be 0), but you can extend it with suitable reduced cost values (i.e., duals for the bounds) to a dual ray.

For example, consider the following system:

1x1 + 3x2 - 2x3 >= 5
1x1 - 4x2 + 4x3 >= 6

with bounds 0 <= x1 <= 3, 0 <= x2 <= 5, 0 <= x3 <= 1.

Then, the dual vector y'=(1,2) yields the aggregated row y'Ax >= y'b that reads

3x1 - 5x2 + 6x3 >= 17.

This shows the infeasibility of the model, because substituting the "most feasible" bounds for x yields

3*3 - 5*0 + 6*1 = 15 < 17.

Now, if you extend the dual vector y' with reduced costs r' = (-3,5,-6), you get a dual ray:

1x1 + 3x2 - 2x3 >= 5 (y'_1 = 1)
1x1 - 4x2 + 4x3 >= 6 (y'_2 = 2)
x1 <= 3 (r'_1 = -3)
x2 >= 0 (r'_2 = 5)
x3 <= 1 (r'_3 = -6)

Summing this up with the weights of the dual ray yields

0x1 + 0x2 + 0x3 >= 2,

which proves the infeasibility of the model.
ReplyDelete
Replies
Ralf GollmerMay 1, 2017 at 7:25 AM
I think it depends on the purpose whether one needs the full dual ray.
My goal is not the proof of infeasibility, but to add Benders feasibility cuts to a two-stage stochastic MIP problem.
Here the infeasible problems stem from one original scenario problem with the first-stage variables fixed to values f.
When CPLEX mipopt indicates infeasibility for one of these and dualopt applied to the relaxed problem gives LP-infeasibility, too, dualfarkas yields dual weights to constraints (ray) and a violation (viol) of the weighted sum inequality with the best-feasible values of the remaining variables.
In my case I read an arbitrary model file, so for the construction of y'A I have to use getcolumn for the first-stage variables to compute their coefficients by the scalar product with the weights from the ray argument of dualfarkas. This results in a vector w of coefficients for the first-stage variables.
lhs = w'f gives the value in that point.
If I'm not wrong
w'x >= lhs+viol
yields a valid inequality.
ReplyDelete
Replies
UnknownOctober 17, 2019 at 6:17 AM
Dear Prof. Rubin,
I am working on a Benders Decomposition with Gurobi, and I would like to add feasiblity cuts to master problem. I use farkas dual to generate feasiblity cuts, which is similar in CPLEX.

The primal sub-problem is minimziation. I added " 0 >= r * (B-Dy)" to the master
probelm, where "r" is obtained by fakas dual. But I cannot get the right solution.
When I added " 0 >= (-1) * r * (B-Dy)", then the solution is right. Do you konw the reason for this?

Sometimes, r * (B-Dy) is possible positvie or negative, it's uncertain.
ReplyDelete
Replies

Add comment

Due to intermittent spamming, comments are being moderated. If this is your first time commenting on the blog, please read the Ground Rules for Comments. In particular, if you want to ask an operations research-related question not relevant to this post, consider asking it on Operations Research Stack Exchange.

OR in an OB World

Saturday, July 23, 2011

Farkas Certificates in CPLEX

12 comments:

Previous Posts

Labels