- identification of critical sources of variability
- management of this variability by the manufacturing process
- ability to predict quality attributes, accurately and reliably.
- raw material measurements
- processing data from various unit operations
- intermediate quality measurements
- environmental data
Review of the types of data generated throughout the complete production cycle of a product yields a new set of challenges. There are a mix of real-time measurements, both univariate (temperature, pressure, pH) and multivariate (NIR or other spectroscopic method), sampled during processing, as well as static data sampled from raw materials, intermediates and finished product.MV methods are an excellent choice for analysis for many reasons. The greatest strength of PCA and PLS methods is their ability to extract information from large, highly correlated sets of data with many variables and relatively few observations. Models generated for prediction of quality attributes also provide information on which of the potentially thousands of variables are most highly correlated with quality. This is an important property for identification of critical parameters.Other strengths include performance on data with significant noise, missing values and the ability to model not only relationships between the X (raw materials and in-process data) and Y space (quality metrics), but also the internal correlational structure of X.The ability to model the internal structure of the X space is of fundamental importance, because a prediction method is only valid within the range of its calibration. Modeling the X space provides a means for recognizing whether a new set of data is similar or different from the training set the model was built on.Thus, MV methods can help predict and justify quality metrics. If the raw materials are dissimilar or a processing unit was operated differently from what the calibration data show, the confidence of the predicted quality metrics must be considered low.The FDA alludes to this point in its definition of process understanding in the PAT guidance as the accurate and reliable prediction of product quality attributes over the design space established for materials used, process parameters, manufacturing, environmental and other conditions.The model of the X space contained in these MV modeling methods is essentially a mathematical description of the design space the FDA is referring to in their guidance.Generating Process UnderstandingThe objective in modeling and analysis of this data is to develop process understanding. The FDA provides direction on what it means and how to develop process understanding. The FDA guidance on process analytical technology (PAT) provides four types of tools for generation and application of process understanding including:
- multivariate tools for design, data acquisition and analysis
- process analyzers
- process control tools
- continuous improvement and knowledge management tools.
- the identification of critical attributes relating to quality
- sensors that provide information related to these attributes and tools to translate the data generated into meaningful and reliable metrics