From Optflux
Jump to: navigation, search
Line 53: Line 53:
 
The layer (a) addresses the objective functions at the bioengineering level to be addressed by the MO algorithm, while the layer (b) depicts the constraints of the inner phenotype simulation (cellular level). In this bi-level formulation the strain optimization and the phenotype simulation methods can be chosen independently from all available options (e.g. SPEA2, EA or SA in a) and FBA, MOMA or ROOM in b)).
 
The layer (a) addresses the objective functions at the bioengineering level to be addressed by the MO algorithm, while the layer (b) depicts the constraints of the inner phenotype simulation (cellular level). In this bi-level formulation the strain optimization and the phenotype simulation methods can be chosen independently from all available options (e.g. SPEA2, EA or SA in a) and FBA, MOMA or ROOM in b)).
 
The output of phenotype simulation is the set of flux values for all reactions in the model. These are used by the optimization algorithms to compute the fitness value of the solution, using an appropriate objective function.
 
The output of phenotype simulation is the set of flux values for all reactions in the model. These are used by the optimization algorithms to compute the fitness value of the solution, using an appropriate objective function.
 +
 +
= Performing strain optimization in OptFlux =
 +
 +
The operation can be accessed in the  menu '''Optimization->Evolutionary...'''
 +
 +
[[File:menu.png]]
 +
 +
This will launch the optimization dialog
 +
 +
[[File:maindialog.png|400px|thumb]]

Revision as of 03:26, 8 February 2013


Framework description

The new version of OptFlux's Strain Optimization plug-in includes all the previous single objective (SO) optimization methods, seamlessly integrated with the new multi-objective (MO) architecture.

In this version of the Optimization plug-in, the following features are available:

  • Phenotype methods:
    • FBA, pFBA, ROOM, MOMA, Linear MOMA, MiMBL
  • Optimization methods:
    • EA, SA, SPEA2
    • Single and multi-objective
  • Types of Knockout:
    • Reactions, Genes
  • Over/under expression optimization
Overview of the developed framework. The upper part shows the major steps of the three algorithms. The evaluation gray box illustrates the processes of solution decoding, from Gene Knockout Sets (GKS) to Reaction Knockout Sets (RKS) (upper- left), phenotype simulation showing the added constraints (bottom) and fitness evaluation for both MO and SO cases (upper-right).


The Archive Manager

In this version of OptFlux, some changes were applied to the EA and SA algorithms, regarding the information that is kept during each run, although not changing their functioning nor the best final solution. The major change was the implementation of an archive that keeps the best solutions found by the EA and SA during the run. This is not used by the algorithms for selection or for creating new solutions, but allows having a richer final result. The archive runs in a parallel thread, and smartly manages the solutions that are kept, removing duplicate solutions and non-better super solutions of already existing ones.

Solution Simplification

The end step of an optimization procedure is the solution simplification. These heuristics, although intelligent, may generate solutions that include unnecessary knockouts/regulations. To prevent this, all the non-meaningful genetic modifications are removed from the solutions.

Representation scheme and operators

The optimization problem here addressed consists in selecting, from a set of genes of an organism (represented in the respective metabolic model), a subset to be deleted (or over/under regulated), in order to optimize a set of objective functions. The encoding of a solution is achieved by a set-based representation (Rocha et al., 2008), where only gene deletions are represented. The representation scheme uses variable-sized genomes and hence sets with distinct cardinalities can compete within the search process. Two types of reproduction operators were used: crossover and mutation.

Solution decoding and evaluation

Each solution is represented as a gene knockout set GKSi,t (i-th solution of the population at generation t) containing a list of indexes for genes in the model, a subset of the full set of genes that can be deleted (e.g. essential genes are no allowed). Transcriptional/ translational information is encoded in the model as a set of Gene Reaction (GR) associations, represented by Boolean rules, including AND and OR operations.

Each GKS is converted through those rules into a reaction knockout set (RKSi,t). Each RKS consists of a set of integer values between 1 and N, where N is the number of reactions in the model. When simulating this solution, for each reaction in this set, the flux is forced to be 0, therefore disabling it from the model. The process proceeds with the phenotype simulation of the mutant using one of the available methods. In the experiments performed within this work, the FBA method was used, but any of the methods available within the OptFlux framework can be used (e.g. MOMA or ROOM). FBA is a constraint-based approach where feasible flux distributions are mainly defined by three types of constraints (Equation 1b): i) stoichiometric constraints, defining the mass balance equations over internal metabolites assuming steady-state (in the formulation vj corresponds to the flux of re- action j and Sij, stands for the stoichiometric coefficient of metabolite i in reaction j); ii) thermodynamic or capacity constraints, mainly defining re- action reversibility (vj,min for the lower limit and vj,max for the upper limit of the reaction vj); and iii) those imposed by the knockouts defined in the respective RKS. FBA uses linear programming to determine the optimal flux distributions using a specified objective function, maximizing a flux representing biomass production.

The optimization problem can be formulated as:

Formulation.png

where N corresponds to the set of reactions and M to the set of metabolites in the model.

The layer (a) addresses the objective functions at the bioengineering level to be addressed by the MO algorithm, while the layer (b) depicts the constraints of the inner phenotype simulation (cellular level). In this bi-level formulation the strain optimization and the phenotype simulation methods can be chosen independently from all available options (e.g. SPEA2, EA or SA in a) and FBA, MOMA or ROOM in b)). The output of phenotype simulation is the set of flux values for all reactions in the model. These are used by the optimization algorithms to compute the fitness value of the solution, using an appropriate objective function.

Performing strain optimization in OptFlux

The operation can be accessed in the menu Optimization->Evolutionary...

Menu.png

This will launch the optimization dialog

Maindialog.png