Predictive modelling of colossal ATR-FTIR spectral data using PLS-DA: empirical differences between PLS1-DA and PLS2-DA algorithms
文献情報
Loong Chuen Lee, Abdul Aziz Jemain
In response to our review paper [L. C. Lee et al., Analyst, 2018, 143, 3526–3539], we present a study that compares empirical differences between PLS1-DA and PLS2-DA algorithms in modelling a colossal ATR-FTIR spectral dataset. Over the past two decades, partial least squares-discriminant analysis (PLS-DA) has gained wide acceptance and huge popularity in the field of applied research, partly due to its dimensionality reduction capability and ability to handle multicollinear and correlated variables. To solve a K-class problem (K > 2) using PLS-DA and high-dimensional data like infrared spectra, one can construct either K one-versus-all PLS1-DA models or only one PLS2-DA model. The aim of this work is to explore empirical differences between the two PLS-DA algorithms in modeling a colossal ATR-FTIR spectral dataset. The practical task is to build a prediction model using the imbalanced, high dimensional, colossal and multi-class ATR-FTIR spectra of blue gel pen inks. Four different sub-datasets were prepared from the principal dataset by considering the raw and asymmetric least squares (AsLS) preprocessed forms: (a) Raw-global region; (b) Raw-local region; (c) AsLS-global region; and (d) AsLS-local region. A series of 50 models which includes the first 50 PLS components incrementally was constructed repeatedly using the four sub-datasets. Each model was evaluated using six different variants of v-fold cross validation, autoprediction and external testing methods. As a result, each PLS-DA algorithm was represented by a number of figures of merit. The differences between PLS1-DA and PLS2-DA algorithms were assessed using hypothesis tests with respect to model accuracy, stability and fitting. On the other hand, confusion matrices of the two PLS-DA algorithms were inspected carefully for assessment of model parsimony. Overall, both the algorithms presented satisfactory model accuracy and stability. Nonetheless, PLS1-DA models showed significantly higher accuracy rates than PLS2-DA models, whereas PLS2-DA models seem to be much more stable compared to PLS1-DA models. Eventually, PLS2-DA also proved to be less prone to overfitting and is more parsimonious than PLS1-DA. In conclusion, the relatively high accuracy of the PLS1-DA algorithm is achieved at the cost of rather low parsimony and stability, and with an increased risk of overfitting.
関連文献
Electrochemical synthesis of metal and semimetal nanotube–nanowire heterojunctions and their electronic transport properties
Guowen Meng, Shuyuan Zhang, Lide Zhang
DOI: 10.1039/B614147A
Quantifying the working stroke of tetrathiafulvalene-based electrochemically-driven linear motor-molecules
Amar H. Flood, Camilla N. Hansen, Jan O. Jeppesen, J. Fraser Stoddart
DOI: 10.1039/B511575B
Ionic liquidsin vacuo; solution-phase X-ray photoelectron spectroscopy
Emily F. Smith, Ignacio J. Villar Garcia, David Briggs, Peter Licence
DOI: 10.1039/B512311A
Genotoxicity screening for N-nitroso compounds. Electrochemical and electrochemiluminescent detection of human enzyme-generated DNA damage from N-nitrosopyrrolidine
Sadagopan Krishnan, Eli G. Hvastkovs, Besnik Bajrami, Ingela Jansson, John B. Schenkman
DOI: 10.1039/B703012F
Cross-metathesis of unsaturated natural oils with 2-butene. High conversion and productive catalyst turnovers
Jim Patel, Jomana Elaridi, W. Roy Jackson, Andrea J. Robinson, Algirdas K. Serelis, Chris Such
DOI: 10.1039/B511626K
Stabilisation of a paramagnetic BH4−-bridged dinickel(ii) complex by a macrodinucleating hexaaza-dithiophenolate ligand
Yves Journaux, Vasile Lozan, Julia Klingele, Berthold Kersting
DOI: 10.1039/B512744K
A highly sensitive oxygen sensor operating at room temperature based on platinum-doped In2O3nanocrystals
Giovanni Neri, Anna Bonavita, Giuseppe Micali, Giuseppe Rizzo, Signorino Galvagno, Markus Niederberger, Nicola Pinna
DOI: 10.1039/B510832B
Bioconjugation onto biological surfaces with fluorescently labeled polymers
Julien Nicolas, Ezat Khoshdel, David M. Haddleton
DOI: 10.1039/B617596A
Synthesis and magnetic properties of a 4-(2′-pyrimidyl)-1,2,3,5-dithiadiazolyl dimanganese complex
Michael Jennings, Kathryn E. Preuss, Jian Wu
DOI: 10.1039/B512312G
こちらもおすすめ
2,3-スチオエポキシマドルを取り扱う際の実験室安全事項は何ですか?
取り扱いにはPPE(プロテクティブ・パーソナル・エイド)が必要で、防ぐ手袋と保護眼鏡を着用してください。ドラフトチャンバーの使用を推奨します。漏洩した場合は、適...
BOC-S-3-アミニ-4-(4-メチオキシベンチル)-ブタン酸の代替品はありますか?
この化合物の代替品としては、BOC保護基を有さないアミノ酸やその他の保護基化合物が考えられます。また、メチオキシ基を有しない他の芳香族アミノ酸も代替品として挙げ...
Methyl 2-(chloromethyl)-3-nitrobenzoate(1218910-61-2)の代替品はありますか?
Methyl 2-(chloromethyl)-3-nitrobenzoate(1218910-61-2)の代替品としては、化学組成を変えることで効果を達成する...
(2R)-2-アミノ-N-ベンジル-3-ヒドロキシプロパナミドを含む廃棄物はどのように処理すべきですか?
(2R)-2-アミノ-N-ベンジル-3-ヒドロキシプロパナミドを含む廃棄物は、適切な廃棄物管理ガイドラインに基づき処理する必要があります。まず、廃棄物を適切に収...
6,7-二氢-咪唑並[1,2-a]ピリドイン-8(5h)-酮はどのように合成されますか?
6,7-二氢-咪唑並[1,2-a]ピリドイン-8(5h)-酮は、2-ブロモフェニルアセトインとリン酸ハロゲン化物を反応させることで合成できます。この反応は高温で...
エチル(3R)-3-ピロリジニル酢酸水和塩とは何ですか?
エチル(3R)-3-ピロリジニル酢酸水和塩は、CAS番号1332459-32-1の化合物で、(R)-乙基2-(ピロリジン-3-基)酢酸塩水和塩と呼ばれます。この...
(2S)-{[(2-メチルエチルオキシ]カルボニル}アミノ)[2-(トリアフルオロメチルフェニル]エチカシック酸の物理化学的性質は何ですか?
(2S)-{[(2-メチルエチルオキシ]カルボニル}アミノ)[2-(トリアフルオロメチルフェニル]エチカシック酸のCAS番号は1203454-45-8です。この...
2-ブロモ-1-(2-メチル-2-プロパニル)-4-ニトロベンゼンはどのように保存すればよいですか?
2-ブロモ-1-(2-メチル-2-プロパニル)-4-ニトロベンゼンは、直射日光を避けて暗所で、室温(約15℃〜25℃)、乾燥した場所に保存する必要があります。ま...
1-[(4-硝基フェニル)スルホニル]-1H-1,2,4-三唑の市場動向や研究トレンドはどうですか?
市場動向としては、1-[(4-硝基フェニル)スルホニル]-1H-1,2,4-三唑は主に農業用除草剤や合成化学製品の原料として利用されています。研究トレンドとして...
掲載誌
Analyst

Analyst publishes analytical and bioanalytical research that reports premier fundamental discoveries and inventions, and the applications of those discoveries, unconfined by traditional discipline barriers.













