OR in an OB World: Benders Decomposition in CPLEX

Friday, July 27, 2012

Benders Decomposition in CPLEX

Author's note: This post is getting a bit long in the tooth. For users with CPLEX 12.7 or later, you might want to read this more recent post (after reading this one).

I finally got around to constructing a reasonably small example of Benders decomposition for solving a mixed-integer linear program (MIP) in CPLEX. What I am about to describe works in CPLEX 12.4 (possibly 12.2 or 12.3) and hopefully will work in the future (since I have no ambition to revise the post). The significance of the CPLEX version is twofold: IBM refactored the API for cuts, emphasizing the distinction between "user cuts" (cuts that tighten bounds without eliminating integer-feasible solutions) and "lazy constraints" (cuts that eliminate feasible solutions); and they decided that lazy constraint callbacks would be called whenever CPLEX found an incumbent, regardless of when and how it found the incumbent (node LP solution, application of heuristics, lucky guess ...), which eliminated the need for incumbent and branch callbacks. If you are still using an earlier version of CPLEX, I strongly recommend you upgrade. Among various reasons, you will need a bunch more code to pull off Benders correctly with an old version.

My test problem is simple: a transportation problem with fixed charges for the use of supply sources. (In the code, I refer to sources as "warehouses" and sinks as "customers"). The fixed-charge transportation problem is well know and rather easy to solve -- too easy to warrant Benders decomposition, in fact. I chose it because it is sufficiently straightforward that it should not interfere with understanding how the code works. (Subsequent to writing the first draft of this post, I discovered a nice document by Erwin Kalvelagen that demonstrates Benders decomposition in GAMS. He too used a fixed-charge transportation problem, but in his version there is a fixed charge for each arc used, rather than for each source.)

The mathematical model is as follows: \begin{gather*} \textrm{minimize }c'x+f'y\\ \textrm{s.t. }\sum_{i}x_{ij}\ge d_{j}\quad\forall j\\ \sum_{j}x_{ij}\le s_{i}y_{i}\quad\forall i\\ x\ge0\\ y_{i}\in\{0,1\}\quad\forall i. \end{gather*} Continuous variable $x_{ij}$ represents the flow from source $i$ to sink $j$; binary variable $y_{i}$ signals whether source $i$ is used (1) or not (0). Parameters $c\ge0$ and $f\ge0$ are unit flow costs and facility use charges respectively, while $s$ and $d$ are supplies (source capacities) and demands.

My Benders decomposition puts the facility decisions in a master problem (MIP) and the flow decisions in a subproblem (linear program). The subproblem is just \begin{gather*} \textrm{minimize }z=c'x\\ \textrm{s.t. }\sum_{i}x_{ij}\ge d_{j}\quad\forall j\\ \sum_{j}x_{ij}\le s_{i}y_{i}\quad\forall i\\ x\ge0 \end{gather*} in which the $y_{i}$ are treated as constants. The master problem is \begin{gather*} \textrm{minimize }f'y+\hat{z}\\ \textrm{s.t. }a^{(k)\prime}y-\hat{z}\le b^{(k)}\quad\forall k\in\mathcal{O}\\ a^{(k)\prime}y\le b^{(k)}\quad\forall k\in\mathcal{F}\\ y_{i}\in\{0,1\}\quad\forall i\\ \hat{z}\ge0 \end{gather*} where $\hat{z}$ is a surrogate for the flow cost component ($z$) of the original objective function (i.e., the value of the subproblem objective), $a^{(k)}$ and $b^{(k)}$ are coefficient vector and constant term for a cut generated during the solution of the problem, $\mathcal{O}$ is the index set of "optimality" cuts (cuts that attempt to correct the master underestimating the true flow costs for a given set of decisions $y$, i.e., $\hat{z}<z$) and $\mathcal{F}$ is the index set of "feasibility" cuts (cuts that eliminate use decisions $y$ that provide too little capacity to satisfy demand). Initially $\mathcal{O}=\emptyset=\mathcal{F}$, and the first master problem solution is both trivial ($y=0$, $\hat{z}=0$) and infeasible in the original model.

To generate cuts, we will need to work indirectly with the dual of the subproblem: \begin{gather*} \textrm{maximize }z=d'\lambda+\sum_{i}(s_{i}\mu_{i})y_{i}\\ \textrm{s.t. }\lambda_{j}+\mu_{i}\le c_{ij}\quad\forall i,j\\ \lambda\ge0\\ \mu\le0 \end{gather*} where $\lambda$ is the vector of shadow prices of the demand constraints and $\mu$ the vector of shadow prices of the supply constraints. Given the assumption that all flow costs are nonnegative ($c\ge0$), the dual is obviously feasible ($\lambda=0=\mu$ works). If, for a given choice of $y$, the LP subproblem has an optimal solution, then so will the dual (with equal objective value $z$). Should the master problem generate a solution $(y,\hat{z})$ for which the subproblem is feasible with optimal value $z=\hat{z}$, we accept that solution as the new incumbent. If, instead, the subproblem is feasible but $\hat{z}<z$, then we add a new optimality cut $\hat{z}\ge d'\lambda+\sum_{i}(s_{i}\mu_{i})y_{i}$ where $(\lambda,\mu)$ is the dual solution to the subproblem (so that $a_{i}^{(k)}=s_{i}\mu_{i},b^{(k)}=-d'\lambda$).

Should the master problem generate a solution $(y,\hat{z})$ for which the subproblem is infeasible, then the dual will be unbounded, and there will be a dual solution $(\lambda,\mu)$ such that $t(\lambda,\mu)$ is feasible in the dual for all $t>0$ and $d'\lambda+\sum_{i}(s_{i}\mu_{i})y_{i}>0$. Note that the master problem variables ($y$) have no role in defining the dual problem's feasible region; in the dual, they occur only in the objective. So the dual ray we just obtained will remain a recession direction for the dual regardless of $y$, and to get a feasible subproblem we need to modify $y$ so that $d'\lambda+\sum_{i}(s_{i}\mu_{i})y_{i}\le0$ (i.e., so that the ray now points in an undesirable direction). This gives us our feasibility cut ($a_{i}^{(k)}=s_{i}\mu_{i},b^{(k)}=-d'\lambda$). Observe that optimality and feasibility cuts have the same constant term and coefficient vector; the only difference is the inclusion of $\hat{z}$ in optimality cuts and its exclusion in feasibility cuts.

On to the code. Let me start with my usual disclaimer: I am not a professional coder, and I make no claim that my code is particularly efficient, nor that I took the fastest/most straightforward route to the solution. As usual, the code is in Java. For the most part, conversion to other APIs that expose the callback functionality should be straightforward, but there are a few wrinkles to note.

If you code the comparisons $\hat{z}<z$ and $z=\hat{z}$ literally, the gods of computing will wait until your back is turned and then bite you in your derrière. There will be rounding errors in both $z$ and $\hat{z}$, so asking for exact equality is inviting disaster. Insert some rounding tolerance into your comparisons. In my code, look for a constant named FUZZ.
When the LP subproblem is feasible, we can get the dual values directly with a function named getDuals. When the LP is infeasible, it is tempting to use the getRay function to get the dual ray, but to do that you would have to formulate the dual itself, solve it directly, and then invoke getRay on the dual. It is easier to ask for a Farkas certificate, which provides the ray without further ado.
I found it necessary to turn off presolving of the subproblem. If the presolver detects that the subproblem is infeasible, it reports that fact with no option to get a Farkas certificate. So I sacrifice whatever speed advantage the presolver confers in order to have access to the Farkas certificate when the subproblem is infeasible.
Update 11/30/16: I recently tripped over another thing that I should have realized but somehow overlooked. In order to access the Farkas certificate, the LP subproblem must be solved by one of the simplex solvers (primal, dual or network). I'd been leaving the choice up to CPLEX, and on one instance it used the barrier method, decided the LP was infeasible, and then threw an exception when the Farkas certificate was requested. There's also a glitch in the CPLEX documentation for the dualFarkas method. It turns out that dualFarkas only works if the LP was solved by the dual simplex algorithm. So the key is to set the RootAlg parameter for the LP to 2 (dual simplex).
Forming a cut (either optimality or feasibility) involves multiplying the right-hand side of a subproblem constraint, expressed as a linear function of the master variables, by the corresponding dual value. In this particular example, the right-hand side always has a single term, either a constant ($d_{j}$) in a demand constraint or a multiple of a single master variable ($s_{i}y_{i}$) in a supply constraint.

More generally, though, each right-hand side could contain multiple terms. I deliberately over-engineered my code a bit so that it will generalize to other problems. What I do in the code is form and store an instance of IloNumExpr for each right-hand side, at the time that I am creating the subproblem. Inside the lazy constraint callback, I multiply those expressions by the corresponding dual values and sum to get the necessary expression ($a^{(k)\prime}y-b^{(k)}$) for the new cut. Somewhat counterintuitively, and for rather arcane reasons, this cannot be done with IloLinearNumExpr, even though the right-hand sides are in fact linear.
Since any master variable can occur in multiple subproblem right-hand sides, the method I just described for assembling the cut expression will typically result in multiple terms involving the same variable ($\dots+\alpha_{3}y_{7}+\dots+\alpha_{12}y_{7}+\dots+\alpha_{23}y_{7}+\dots$). Not to worry: when you actually add the cut, CPLEX will combine like terms ($\dots+(\alpha_{3}+\alpha_{12}+\alpha_{23})y_{7}+\dots$) for you.

Interpreting the Farkas certificate is a bit tricky, since CPLEX returns the dual values in an arbitrary order (see my earlier post "Infeasible LPs and Farkas Certificates"). To deal with this, I store the subproblem constraints in two arrays (one for supply constraints, one for demand constraints), and then use a HashMap to map each constraint to the corresponding right-hand side expression.

The C++ Standard Template Library has some analogous container class you can use (don't ask me which; I'm blissfully C++-free and mean to stay that way). With Python, you would create a dict. With other APIs, you are on your own.
The reason I use two arrays, rather than storing all constraints in a single array, is that it facilitates updating right-hand sides in the subproblem each time the callback is invoked. I need to recompute the RHS of each supply constraint ($s_{i}y_{i}$, using the new value of $y_{i}$), but not the RHS of each demand constraint ($d_{j}$). Having the supply and demand constraints separate makes this a bit easier, but in a more general application (where most or all constraints require an update), you may want to use more than two arrays, or just a single array.

Once the master problem has been solved, you have the optimal set of warehouse use decisions ($y$). There's a fairly good chance you will also want to know the optimal flows ($x$). The subproblem may well have been solved more recently than when the callback accepted what proved to be the optimal $y$; after that node, CPLEX may have generated some incumbents that your callback tested and rejected. So the most recent subproblem solution may not correspond to the optimal $y$. One resolution is to solve the subproblem one last time, using the optimal $y$. The other (which I use in the code) is to store the subproblem solution any time an incumbent is accepted, overwriting any previously stored solution. When the master is solved, the stored subproblem solution will be the correct flows.

That's it. To avoid inducing comas among readers, I will not go through the code line by line here. Instead, I will just provide ~~this link~~ for anyone who wants to download it. [UPDATE: A newer version of CPLEX broke the code. I have created a repository for the updated code; please see this more recent post for details.]

82 comments:

GelarehJuly 28, 2012 at 4:01 AM
Many thanks Paul, very much appreciated, I have written Benders codes several times using C++ Concert interface but I was always curious to see how others do that, perhaps more efficient, and what I am missing in my codes.
ReplyDelete
Replies
UnknownJanuary 27, 2013 at 8:26 PM
Hi, I am new to OR and CPLEX programming and I was trying to convert your program into multi-commodity transportation problem.The singlemodel.java file I tried to convert into multi commoodity by using 3 dimensional array, but CPLEX is saying "unable to solve model". Could you please help me ? as I could not find any reference on using multi dimensional arrays with CPLEX java APIs.
ReplyDelete
Replies
SohaibFebruary 7, 2013 at 3:59 PM
Great post Paul
What I want now is to implement it for the VRPTW problem.
ReplyDelete
Replies
AnonymousFebruary 14, 2013 at 6:57 AM
Hello. I downloaded your code and ran it with CPLEX 12.2. SingleModel.java solves the generated problem. But when Benders.java starts iterating, part of output generated is below. It looks like there is a cycling. Can you please comment what might be the cause?
Sincerely.

Parallel mode: none, using 1 thread.
Root relaxation solution time = 0.02 sec.
Parallel mode: deterministic, using up to 2 threads for concurrent optimization.
>>> Adding feasibility cut: IloRange : -infinity <= -0.6895988277118706*Use_4 - 6.387051152521494*Use_3 - 7.888060535570556*Use_2 - 5.304215302137547*Use_1 - 5.825244285340782*Use_0 <= -2.829154110490712

Nodes Cuts/
Node Left Objective IInf Best Integer Best Node ItCnt Gap Variable B NodeID Parent Depth

0 0 0.9050 1 0.0000 0
Parallel mode: deterministic, using up to 2 threads for concurrent optimization.
>>> Adding feasibility cut: IloRange : -infinity <= -0.6895988277118706*Use_4 - 6.387051152521494*Use_3 - 7.888060535570556*Use_2 - 5.304215302137547*Use_1 - 5.825244285340782*Use_0 <= -2.829154110490712
0 0 0.9050 1 User: 1 0
Parallel mode: deterministic, using up to 2 threads for concurrent optimization.
>>> Adding feasibility cut: IloRange : -infinity <= -0.6895988277118706*Use_4 - 6.387051152521494*Use_3 - 7.888060535570556*Use_2 - 5.304215302137547*Use_1 - 5.825244285340782*Use_0 <= -2.829154110490712
0 0 0.9050 1 User: 1 0
Parallel mode: deterministic, using up to 2 threads for concurrent optimization.
ReplyDelete
Replies
UnknownFebruary 19, 2013 at 1:47 PM
Hi Paul,

I am currently working on a similar problem but with a quadratic subproblem. The program stops with "no solution exists" without generating any feasibility cut. I wonder, does the Farkas Certificate only work for LPs? If yes, do you know whether there exists any alternative for QPs?

Thank you very much.

Wenyi
ReplyDelete
Replies
UnknownFebruary 20, 2013 at 1:28 PM
Paul, I am dealing with a subproblem with quadratic objective function only, which makes my life a bit easier. As you suggested, I change my objective function in getting a Farkas certificate.

Meanwhile, for my curiosity, I did the following to see whether "getDual" is meaningful to QPs (though it is supposed to be, according to their manual): I constructed a dual problem myself and solved it to optimality (this part is 100% correct, I swear). And then I compared them with those I got from "getDuals". And they are completely different. Could you give me some comments on what might go wrong in using your benders method?
ReplyDelete
Replies
UnknownJuly 16, 2013 at 2:23 PM
Dear Mr Paul,
It would be a great help if you suggest a solution for my following problem:
I am using MATLAB GUROBI interface to solve my nonlinear nonconvex optimization problem using piecewise lienar approxiamtion
and separable programming concepts.My Program in given below. In which my objective is to minimize A from the given search region (5 to 100)
subjct to one noninear constraint and four sos2 constraints. Here u is a vector data representing a curve with data points using separable prog concept.
I want that the point obtained after solution of the problem must be below of u curve. So i am formulating constraint as
20*log10(A)+10*log10((power(10,2)./(B.^2))+1)-10*log10((power(10,2)./(C.^2))+1)-20 <=u' .In gurobi it is,
20*log10(A)+10*log10((power(10,2)./(B.^2))+1)-10*log10((power(10,2)./(C.^2))+1)-u' <=20
Program:
A=linspace(5,100,19);
B=linspace(3,50,19);
C=linspace(1,100,19);
a=ones(19);
a1=a(1,:);
b=zeros(19);
b1=b(1,:);
u = Columns 1 through 12
7.6510 7.2751 6.1274 4.1760 1.4977 -1.5030 -4.1812 -6.1327 -7.2803 -7.6563 -7.2803 -6.1327
Columns 13 through 19
-4.1812 -1.5030 1.4977 4.1760 6.1274 7.2751 7.6510
(here u represents some non linear non convex curve as a piecewise linear approximation
using separable programming concepts using lamda method where horizontal axis is 0 t0 -250 with 19 brekapoints)
model.obj =[A b1 b1 b1];
model.ub = [a1 a1 a1 a1];
model.lb = [b1 b1 b1 b1];
model.modelsense = 'Min';
model.A = sparse([20*log10(A) 10*log10((power(10,2)./(B.^2))+1) -10*log10((power(10,2)./(C.^2))+1) -u'
a1 b1 b1 b1
b1 a1 b1 b1
b1 b1 a1 b1
b1 b1 b1 a1
]);
model.rhs =[20 1 1 1 1];
model.sense=['<' '=' '=' '=' '='];
% % Add SOS2 constraint 1:
model.sos(1).type = 2;
model.sos(1).index = [linspace(1,19,19)];
model.sos(1).weights = [linspace(1,19,19)];
% Add SOS2 constraint 2:
model.sos(2).type = 2;
model.sos(2).index = [linspace(20,38,19)];
model.sos(2).weights = [linspace(20,38,19)];
% Add SOS2 constraint 3:
model.sos(3).type = 2;
model.sos(3).index = [linspace(39,57,19)];
model.sos(3).weight = [linspace(39,57,19)];
% Add SOS2 constraint 4:
model.sos(4).type = 2;
model.sos(4).index = [linspace(58,76,19)];
model.sos(4).weights =[linspace(58,76,19)];

result = gurobi(model);

for i=1:76
fprintf('x%d %e\n', i, result.x(i))
end
fprintf('Obj: %e\n', result.objval);
A_optimal=A*result.x(1:19);%(answer is 5)
B_optimal=B*result.x(20:38);%(answer is 3)
C_optimal=C*result.x(39:57);%(answer is 50)

The problem is, after geting optimal values of A,B and C ,when i put them in a constraint
20*log10(A_optimal)+10*log10((power(10,2)./(B_optimal.^2))+1)-10*log10((power(10,2)./(C_optimal.^2))+1)-20 (which is a signle number),It satisfyis constraint.
but when i plot it, a point obtained from 20*log10(A_optimal)+10*log10((power(10,2)./(B_optimal.^2))+1)-10*log10((power(10,2)./(C_optimal.^2))+1)-20
does not lie below the graph or curve u. Means rathar than taking u as a piecewise lienar function of curve,GUROBI takes it as any feasible single value from u.
Would anyone please answer following,
1) My constraint formulation is right or wrong?
2) How i can represent curve with data points only (u) in GUROBI MATLAB interface so does my resultant points lies below the curve?
3) How can i incorporate X cordinate of u (which is 0 to -250) in constraint....
Thanks.
ReplyDelete
Replies
Paul A. RubinJuly 16, 2013 at 2:28 PM
I'm afraid that I use neither MATLAB nor GUROBI, so I will not be of help. Someone else is free to answer, but I think this question would be more appropriately targeted at a GUROBI support forum.
ReplyDelete
Replies
UnknownOctober 14, 2013 at 6:41 AM
This comment has been removed by the author.
ReplyDelete
Replies
AnonymousNovember 27, 2013 at 4:23 PM
hi paul I am trying to modify your code to solve an MIP,not using regular benders but codato and fischetti combinatorial benders cuts. also rather than adding objective values and constraints, I already have it in a textfile so can I call the textfile if I want to add a constraint to master problem, or how can that be done.
ReplyDelete
Replies
Vedat BayramJune 17, 2014 at 3:32 PM
Paul, thanks for posting the Benders Code. It would not be easy to start without it. I am writing my own code for a two stage stochastic model. In the first stage facilities are located (binary) and in the second stage depending on the scenario realised, facility and route assignments are made. I do not need to check for the feasibility because in the first stage (master problem) I ensure that the subproblem is feasible by using induced constraints. So I just create optimality cuts. When I run the code it starts with the master problem and stops at the root node with an error message that reads:
Nodes Cuts/
Node Left Objective IInf Best Integer Best Bound ItCnt Gap Variable B NodeID Parent Depth

0 0 0.0000 0 0.0000 1

Root node processing (before b&c):
Real time = 0.00 sec. (0.08 ticks)
Sequential b&c:
Real time = 0.00 sec. (0.00 ticks)
------------
Total (root+branch&cut) = 0.00 sec. (0.08 ticks)
Exception in thread "main" java.lang.NullPointerException
at ilog.cplex.IloCplex$MIPInfoCallback.getIndex(IloCplex.java:13779)
at ilog.cplex.IloCplex$ControlCallback.getValues(IloCplex.java:15511)
at ilog.cplex.IloCplex$ControlCallback.getValues(IloCplex.java:15487)
at bendersdecomposition.Benders$BendersCallback.main(Benders.java:548)
at ilog.cplex.CpxCallback.callmain(CpxCallback.java:151)
at ilog.cplex.CpxLazyConstraintCallbackFunction.callIt(CpxLazyConstraintCallbackFunction.java:44)
at ilog.cplex.Cplex.CPXmipopt(Native Method)
at ilog.cplex.CplexI$SolveHandle.start(CplexI.java:2527)
at ilog.cplex.CplexI.solve(CplexI.java:2650)
at ilog.cplex.IloCplex.solve(IloCplex.java:10948)
at bendersdecomposition.Benders.solve(Benders.java:652)
at bendersdecomposition.BendersEvacuation.main(BendersEvacuation.java:568)

The error message is related to the getValues(y) where y is the first stage binary facility location decision variable. Could you please comment what the problem might be? Thanks in advance.

public static class BendersCallback extends IloCplex.LazyConstraintCallback {

@Override
protected void main() throws IloException {
// TODO Auto-generated method stub
double zMaster = getValue(surVar);
double[] yNew = getValues(y);

Best regards,

Vedat Bayram
PhD. Candidate
Department of Industrial Engineering
Bilkent University
ReplyDelete
Replies
UnknownAugust 29, 2014 at 1:54 PM
Hi, I am relatively new to CPLEX. I am implementing Lagrangian Relaxation using CPLEX. I am trying to figure out how to save and then to add cuts that were generated from previous iterations. It seems that addUserCuts should help, I am just wondering how to operationalize it in CPLEX. Thank you.
ReplyDelete
Replies
UnknownOctober 14, 2014 at 1:57 PM
Dear Mr. Paul Rubin:
I am doing a MIP model, using Bender Descomposition, for my thesis proyect in my Master degree of Industrial engineer. Nevertheless this model is very big and have a lot of parameter and variables. Also my master problem is interger, that's why this works very slow. Therefore, i want to now if in each iteration of the bender descomposition, i can set a initial solution for the branch and bound process. The idea is try to set the initial solution in each iteration with the solution of the iteration before, so with this, the time to find the new solution will be smaller. I am working with c++ and cplex.
How can i do that?

please i appreciate if you can help me
ReplyDelete
Replies
AnonymousAugust 6, 2015 at 11:37 AM
Hi
I am using LazyCallBack in my BD algorithm. I know that BD is genreating many violated cuts (I print them out!) but they never affect the lower bound. I am using standard code as the one in the Ilocplex example (add(cut<=Z).end();). Is there any way I force cplex to take them into account or is there anything I should be aware of?
Thanks a lot
Ragheb
ReplyDelete
Replies
UnknownSeptember 16, 2015 at 1:25 AM
Dear Prof. Paul Rubin,

I have a problem using getDuals() function in JAVA. I modified your code to incorporate my model. Right now the returning values of the dual does not match the actual dual obtaining by CPLEX.
Let me explain a little bit more, In the callback I am exporting the sub model and solve it using cplex solver and get the solution and in the code I get the duals using getDuals() when I cross check the numbers I have different value for couple of constraints. I have no idea why this happens.
Can you please give me an idea what might went wrong?

Thank you
ReplyDelete
Replies
AnonymousDecember 24, 2015 at 12:06 PM
hi
is there anyway to know which variables are fixed in the current node of the tree by using the callbacks?
Thanks you very much, in advance!
ReplyDelete
Replies
UnknownApril 5, 2016 at 10:04 PM
Dear Paul,

I was looking at your blog while trying to implement Benders using the Python API that CPLEX exposes. Simultaneously, I am trying to look at the sample code that CPLEX supplies. In that file (named bendersatsp.py), only ray cuts are used and point cuts are not. Any ideas on why this was done? Also, the objective function of the dual subproblem has only a part of it implemented. The 'u' dual variable doesn't feature in the objective function. Any help on this would be appreciated.

Thank you.
ReplyDelete
Replies
UnknownMay 20, 2016 at 4:56 AM
Hi Dear Prof. Paul Rubin,

I am a second year PhD student and I am using Benders decomposition to solve large MILP instances.
In my implementation, I am having problems with abnormal memory consumption by Cplex.
To verify if this situation is due to my code or to Cplex, I executed the asymmetric traveling saleman problem example coded with the Benders algorithm supplied with the Cplex installation package (/opt/ibm/ILOG/CPLEX_Studio126/cplex/examples/src/cpp/ilobendersatsp.cpp).
I changed nothing in the code and I increased the size of the instance to 120 cities (see instance attached). I have chosen lazyConstraintCallBack (argument 0 ).
With this instance, Cplex memory consumption increases very fast and it ends up exceeding my computer RAM. These are my computer characteristics:

Memory : 7.5 GiB
Processor : intel R core TM i7-3520M CPU @2.90GHzx4
Graphics : intel R Ivybridge Mobile
OS type : 64-bit
Disk : 243.8 GB

I'm using Cplex 12.6
Have you ever noticed this constant and very significant increase of memory consumption? Did you have a suggestion on how I may solve it?
Currently, my Benders decomposition algorithm does not manage to solve even instances which are solved by the plain Cplex in a few seconds.
The instances I'm using are confidential, otherwise I would have sent you one of them.

Thank you in advance for your help.
Best regards

Kaba Keita
ReplyDelete
Replies
UnknownMay 26, 2016 at 8:51 AM
Thanks a lot Paul for your help.

I applied your recommendation but the memory consumption did not decrease very much.

As to Benders algorithm, I do not understand how the iterations are done with the Benders algorithm using the callback approach. In the main function before solving the master, I attach the callback to the master model as it is usually done. But how the different iterations are done ? I means after solving the master at the first time how it is ran again in the next iteration ? There is no loop and the callback is called one time in the code.

As to my model, I have a mixed integer linear model. the model does not contain binary variables in the objective function. So the master objective function contain only z (the value of the subproblem objective). I wonder if the Benders algorithm is appropriate for this model ?

Thank you in advance for your help.
Best regards
ReplyDelete
Replies
UnknownMay 27, 2016 at 4:40 PM
Thanks you Dear Paul,

I have a last question about the Benders on two points.

My model is a routing and a scheduling model. It contains three sets of big M constraints. Two disjunctive constraints with big M and another big M constraints. The big M formulation could make my model inappropriate for the Benders ?

I solved a small instance with the Benders. I have the same result as the plain Cplex. But I do not understand the value of z returned when I solved the restricted master problem (its objective function contains only z dummy variable).
For seven iterations, I have these values for z ( z=0, z=0, z=0, z=0, z=508, z=0, z=0, z=53). 53 is the original problem optimal solution. In the iteration 4 z=508, that corresponds to the dual objective value (dual objVal= 1462, 1123, 508, 672, 508, 217, 53, 53), then in the next iteration z=0. Normally the value of z may increase iteration by iteration ? When I solved the restricted master problem and added the cuts manually. I have this case (z=0, z=0, z=0, z=0, z=0, z=0, z=53). I hoped to have the same case with the Benders but it is not the case.

Thank you in advance for your help.
Best regards

Kaba Keita
ReplyDelete
Replies
AnonymousSeptember 15, 2016 at 3:21 PM
Hi,

I have been trying using callbacks for my benders, and apparently for different threads, some of the BD's solutions (approximately 10 out of 400 problems) are not exactly the same as MIP. I tried different problems on my PC, and HPC with different threads, and there is absolutely no pattern which one always works accurately. Sometime everything is the same, but my PC and HPC solutions are not the same.
Also when I fed the solution of my MIP to the BD it actually got it! while without feeding the BFS of the MIP, it says the best solution is something worse than that.

I was wondering if you know why this has been happening to my BD. Is there some kind of bug in some versions of cplex for lazy callback?

Thanks
ReplyDelete
Replies
UnknownNovember 26, 2016 at 9:49 PM
Hi Professor
Cplex 12.7 has added its BD function.
Have you tried to compare your BD algorithm efficiency with cplex 12.7?

THANKS
Keji
ReplyDelete
Replies
Diako BaloochJanuary 18, 2017 at 1:40 AM
Hi Paul,
Thanks for sharing your thoughts on Benders Decompostion.

I am trying to use Benders Decompostion for my problem, in which the subproblem can be decomposed to N smaller problems, solvable in parallel. In fact the subproblem has N^3 varaibles and N^3 constraints. And the matrix of constraints is only independent blocks (not L shaped). The subproblem can be decomposed into N problems, each N^2 variables and N^2 constraints. I want to use IloLazyConstraintsCallback to add these different N cuts. Part of my code in IloLazyConstraintsCallback is:
for ( i=0 ; i < N; i++){
GenCut(X, Xsol, i , cplx_subp, U, sub_Obj , cutLhs, cutRhs);
add(cutLhs+ eta[i] >= cutRhs);
}

However Cplex throws an error in second iteration of the loop when calls GenCut. I am not sure how IloLazyConstraintsCallback works, whether it is smart enough to decompose the subproblem into N problems, or not. Do you have any suggestion how one can get this done? I appreciate any thought.

Regards,

Diako
ReplyDelete
Replies
AnonymousJanuary 28, 2023 at 9:29 PM
Dr. Rubin, many thanks for your usefull blog. Is there any specific command in CPLEX to report the run time spent solving MP and SP at each itteration?(Lets say we don't use any callbacks, so MP should've been solved every itteration.)
ReplyDelete
Replies

Add comment

Due to intermittent spamming, comments are being moderated. If this is your first time commenting on the blog, please read the Ground Rules for Comments. In particular, if you want to ask an operations research-related question not relevant to this post, consider asking it on Operations Research Stack Exchange.

OR in an OB World

Friday, July 27, 2012

Benders Decomposition in CPLEX

82 comments:

Previous Posts

Labels