Design of experiment

SOSstat

Introduction

Experimental designs consist of a series of organised experiments or assays, allowing the acquisition of new knowledge based solely on experimental results. The relationships between the input parameters (variables or factors) of the system under study and one (or several) output quantity(ies), also called response, are modelled. The model implementing a response $y$ and variables or factors $x_i$ can be presented in a generic form :

$$y=f(x_1, x_2,x_3, ... , x_k) $$

Experimental designs are used in a wide range of fields, for example :

allow to prioritize the influence of parameters on an industrial process,
optimize the performance of a product from prototypes,
model a complex physical phenomenon of a material,
or optimize the performance of a chemical mixture ...

SOSstat provides intuitive and simple functionality for trial design and analysis of experimental design results. In addition to a comprehensive library of classical designs of experiments, SOSstat offers an innovative custom design creation feature that allows user to adapt the design of experiments to the experimental constraints while minimizing the number of trials. Once the data is collected, SOSstat provides a simple and accurate analysis of the model (with many graphs), the prediction functions, then allow to find the configurations that best fit our problem.

First of all, it is important to note, that the modeling of a system cannot be done by performing tests without a prior strategy. Indeed, the organization of the trials has a decisive influence on the quality of the model, and thus of the resulting optimizations. As this property is not intuitive, it is important to follow the design of experiments methodology to guarantee the quality of the results.

Here are some of the advantages of the design of experiments methodology:

Define the number of experiments required in advance (planning assistance)
Propose a simple and systematic method of analysis
Build a model to predict responses across the field of experimentation
Maximize the accuracy of the results (for a given number of experiments)
Reduce the number of trials (under certain conditions)

Experimental designs adapted to each problem

Over the years, the experimental plans have been enriched, with many tools to respond to a wide variety of problems. In this section, we will try to give an overview of the main experimental techniques.

Plans d'expériences disponibles dans SOSstat

Factorial designs are the oldest DOE. They allow both continuous and categorical variables to be used. The models studied are linear models with or without interaction. Factorial designs can be used in two situations:

Doing screening, i.e. prioritizing the influence of a large number of factors
Model a system in order to optimize it

Factorial designs

Several techniques can be used to meet these objectives:

The full factorial designs (which exploit all combinations of factors in the design of experiments)
Plackett and Burman Screening designs . These fractional designs which preferentially deal with two-level factors, are limited to quantifying effects and ignore interactions. The number of experiments increases quite slowly with the number of factors, since it's a multiple of 4.
Fractional factorial designs, which may, depending on the case, do screening or modeling. These plans are generally appreciated by experimenters, as they drastically limit the number of trials.

Response surface designs

When the input variables are continuous and the model has a local optimum, it is preferable to use a second-order model. It is in this context that response surface designs develop their full power. They allow the optimization of a system by modeling its complex response surface. The best known surface planes are :

the central composite designs, are appreciated for their sequential implementation and their ability to meet several criteria of optimality (rotational isovariance and near-orthogonality).
the Box-Behnken designs, provide a model of reasonable and economical quality, while limiting the number of factor modalities to 3.
It should also be noted, that response surface designs can also be algorithmically created, with D-optimal designs.This is a useful option if you want to reduce the number of experiments, or if the experimental domain has exclusion zones (impossible combinations of factors).

Répartition des essais dans un plan avec une contrainte sur la combinaison des
facteurs température et pression.

The case of mixtures is close to the problem of response surface designs, since we are often in a context of optimization of a non-linear system. The particularity of mixture designs is to be able to take into account the relational constraints between the different constituents of the mixture. These experimental design techniques need to be computerized, because the experiment matrices are not orthogonal, moreover the definition of the experiments requires to establish beforehand the experimentation area, which strongly depends on the constraints applied to each constituent (limitation of the experimental field).

Example of application

Using a full factorial design

The design of a DOE always starts with the identification of factors that are supposed to influence the response. Then, the experimenter determines the number of levels for each of them, as well as their levels.

When designing a design of experiments, it is also important to determine the interactions to be quantified, but in the particular case of full factorial designs, this step is not essential because all interactions can be quantified.

In the example below, 4 two-level factors are studied on a CNC lathe. The 4 factors are cutting parameters, which are assumed to have an influence on the tool life (system response).

Factors are :

VI: Feed speed
PP: Depth of Pass
VC : Cutting speed
DL: Cutting fluid flow

The implicit model is therefore (where M is the mean of the model):

$$y=M+VI+PP+VC+DL +VI \cdot PP+VI \cdot VC+VI \cdot DL+PP \cdot VC+PP \cdot DL+VC \cdot DL $$

Conception d'un plan factoriel complet avec
SOSstat

Once the model is defined (list of factors with their modalities and list of interactions), SOSstat can create the matrix of experiments. In this example, it is a full factorial design. The matrix of experiments displayed in the grid describes the combination sequence of factors. It contains the following information:

Experience number
Random number of the experiment, if one wishes to randomize the trials
Replication number of each experiment. Replications help improve model accuracy
The matrix of experiments, on the following four columns
The response column (it is possible to process several responses), where the experimenter can enter the results of each experiment

Conception de la matrice d'expériences

Model analysis

The model consists of a constant, effects and interactions.

Table of effects

	Lev.1	Lev.2
Feed speed	-2.075	2.075
Depth	-3.425	3.425
Cutting speed	6.2875	-6.2875
Flow	-0.1	0.1

We notice that for two-level factors the effects are opposite ( $\sum \text{Effets}=0$ )

Interaction Tables

The table below shows only some of the interactions.

Effect	Depth		Depth		Flow
	Lev1	Lev2	Lev1	Lev2	Lev1	Lev2
Feed speed (Lev1)	0.6	-0.6	1.1125	-1.1125	0.35	-0.35
Feed speed (Lev2)	-0.6	0.6	-1.1125	1.1125	-0.35	0.35

For two-level factors, we observe a symmetry on the value of interactions $\sum \text{Interactions}=0$ ).

Analysis of variance of coefficients

Analysis of variance (ANOVA) is an important step in running a design of experiments, as it identifies the significant terms in the model. Only parameters with a p-Value of less than 5 percent are kept in our case. (This analysis can also be performed with a T-test table).

Source	df	S.Square	Mean Square	p-value
Feed speed	1	68.89	68.89	0.002232439	*
Depth	1	187.69	187.69	0.000219593	*
Cutting speed	1	632.523	632.523	1.14195e-05	*
Flow	1	0.16	0.16	0.792806
Feed speed:Depth	1	5.76	5.76	0.1573
Feed speed:Cutting speed	1	19.8025	19.8025	0.02738993	*
Forward speed:Flow	1	1.96	1.96	0.3766902
Depth:Cutting speed	1	40.3225	40.3225	0.007030376	*
Depth:Flow	1	1.69	1.69	0.4091185
Cutting speed:Flow	1	6.5025	6.5025	0.1375826
Residual	5	10.42	2.084

After exploitation of the ANAVAR, the model, which initially included 4 factors and 6 interactions, now includes only 3 factors and 2 interactions; This model can be used directly to optimize the system (SOSstat prediction tool).

$$y = M+VI+PP+VC + VI \cdot VC + PP \cdot VC $$

DOE plots

Usually the coefficients of the model are represented graphically to facilitate their interpretation. The effects graph represents the evolution of the average response as a function of the individual fluctuations of each factor. The interaction graph represents the evolution of the averages when two factors change together.

On the effects plot, we can clearly see the major influence of the factor Cutting speed.

Only one graph of the interactions has been shown. This is the interaction Depth:Cutting speed (the most significant one). The presence of the interaction can be clearly seen because the two segments are not parallel.

Other plots are generally proposed to perform residue analysis

Optimization of results

Desirability functions

It's quite common for a design of experiments to analysis several responses. In this case, we have as many models as there are responses. In the case of two responses, for example, we have two models $y_1=f(x_1,x_2, ... , x_k)$ and $y_2=f(x_1,x_2, ... , x_k)$ , involving the same factors.

How can we optimize these two answers simultaneously, given that the models are different, and that the objectives for $y_1$ and $y_2$ are certainly different too?

To achieve a reasonable compromise, we will describe our goals using desirability functions. These functions define the attainment of the goal by a value between 0 (non-optimal) and 1 (optimal). Three families of functions are used, depending on whether we wish to minimize, maximize a response, or target a particular value. These functions must be maximized to achieve the objectives. To do this, the objectives assigned to each response are synthesized by constructing the "composite desirability" function. By maximizing the composite desirability function, the best compromise is found.

On maximise la désirabilité composite

Using Taguchi's Signal to Noise Ratio

Genichi Taguchi, a Japanese engineer, developed the concept of robustness in engineering in the 1960s. The aim is to be able to optimize systems, while ensuring that their performance is not degraded by disruptive factors. With this in mind, Taguchi then had the idea of creating a summary variable that would make it possible to report on the achievement of objectives in terms of performance and robustness against disruptive factors. This variable is a signal/noise ratio that allows, depending on the case, to maximize or minimize a response, while minimizing the variance of the responses.

Bibliography

DUCLOS, E - Introduction aux plans d'expériences , LULU , 2019 , ISBN13 : 5800133354798

DROESBEKE Jean-Jacques , FINE Jeanne , SAPORTA Gilbert - Plans d'expériences, Applications à l'entreprise , Editions Technip , ISBN : 9782710807339

Goupy, J. - Introduction aux plans d'expériences , Dunod, 2017 , EAN13: 9782100778027

D.R. Cox, Nancy Reid - The Theory of the Design of Experiments , CRC Press, 6 juin 2000 - 336 pages

Douglas C. Montgomery - Design and Analysis of Experiments ,John Wiley & Sons, 2008 - 680 pages

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search