Health index#
Dataset#
The dataset includes the result of an experiment conducted on a Matsuura machining centre (MC-510V) that performed dry, rough milling of cast iron or J45 stainless steel workpieces with a six-tooth face mill with KC710 carbide inserts at various cutting parameters [1]. The dataset contains 16 sub-datasets in which the cutting depths, feed rates and material types were changed. The sensors used to record the milling process are two vibroaccelerometers, two “AE” sensors, and two “CTA” current sensors.
Task#
The dataset’s target space represents the level of sawing surface wear measured based on optical microscopy. Of the total 16 cases in the dataset, 17 steps, including five in case 6, six in case 8, and six in case 16, contain no tool wear level information. The following dataset parameters are used for prediction: AC spindle motor current (smcAC), DC spindle motor current (smcDC), Table vibration (vib_table), Spindle vibration (vib_spindle), Acoustic emission at the table (AE_table), Acoustic emission at spindle (AE_spindle). The main idea of the dataset is to build a process-independent feature extraction model for predicting the equipment wear value. Due to this, the dataset is divided by processes: numbers 1,3,5 for testing and others for training and validation.
The runs always start with a perfectly operable device state, confirmed by microscopy. To construct a process-based HI on wear, we use the Taylor equation proposed in the dataset documentation:
where \(T\) is tool life, \(C_{v}\) is the constant related to 1 min tool life, \(v_{c}\) is cutting speed, \(k\) is the Taylor exponent.
We performed an approximation (Fig. 1) of the wear measured at the end of each cut using the Nelder Mead method from SciPy library [2]. The obtained equation parameters characterize the process itself by definition, and the character of the curve behavior between the measured points corresponds to the empirically verified equation.

Figure 1: Milling dataset. A – Sensor reading examples of the first dataset case, used as input for pipeline. B – Target tool wear constructed using the Taylor equation to approximate missing sawtooth wear values, used for HI construction.#
Next, we transformed the curves characterizing the process and the state of the device simultaneously to the device Health Index. To do this, we performed min-max scaling (using train subdatasets), where the minimum value of wear is 0 mm, and the maximum corresponds to the maximum allowable wear of 1.58 mm, and subtracted the obtained curves from 1 to reduce the impact of implicit weighting of healthy states.

Figure 2: Milling dataset trajectory targets.#
Metrics#
The performance of the models on the milling dataset HI problem was evaluated using RMSE on 1,3,5 cuts chosen for evaluation. RMSE metric is applying both to wear and acquired HI values:
where \(\hat{y}_i\) and \(y_i\) are predicted and true value of HI targets during equipment \(i\), \(N\) is number of test samples.
References#
[1] Agogino, A.; Goebel, K. Milling Data Set. NASA Prognostics Data Repository 2007.
[2] Virtanen, P. et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods 2020, 17, 261–272.