TY - JOUR
T1 - Automatic organofacies identification by means of Machine Learning on Raman spectra
AU - Vergara Sassarini, Natalia A.
AU - Schito, Andrea
AU - Gasparrini, Marta
AU - Michel, Pauline
AU - Corrado, Sveva
N1 - Funding Information: IFP Energies nouvelles (France) is warmly acknowledgment for kindly providing access to samples, laboratory facilities and unpublished database. Dr. Amalia Spina and Prof. Simonetta Cirilli from the University of Perugia are warmly acknowledged for the high-quality kerogen isolate extraction. This research was funded by: MIUR grants to Roma Tre PhD School in Earth Sciences (XXXIV doctoral cycle, 2018–2021) and IFP Energies nouvelles PhD program.
Publisher Copyright: © 2023 The Authors
PY - 2023/4/15
Y1 - 2023/4/15
N2 - In this study we compare and evaluate different unsupervised clustering algorithms for organofacies discrimination in low maturity dispersed organic matter based on Raman spectroscopic analyses. A total of 1363 Raman spectra were collected from a set of 27 organic-rich samples from the Lower Toarcian shale interval of the Paris Basin sub-surface. Rock-Eval pyrolysis data indicate a type II to type III kerogen with a vitrinite reflectance (Ro%) between 0.45% and 0.65%, and Tmax between 415 °C and 438 °C. Organic petrographic observations under transmitted light reveal the presence of organofacies composed by amorphous organic matter, opaque, and translucent phytoclasts. An optical classification of organic particles was performed on about 40–60 fragments per sample and used as the ground truth. Raman spectra were obtained for all the classified fragments and principal component analysis was performed to underline the variability among spectra. Unsupervised clustering was then applied on Raman spectra principal components. Three clustering methods were applied to evaluate their effectiveness in predicting number, shape and density of clusters and a contingency matrix was used to quantify their ability to predict different organofacies. Gaussian Mixture Model (GMM) was found to be the best algorithm for organofacies identification showing an accuracy mostly between 80% and 90%. This work outlines how unsupervised clustering of Raman spectra of dispersed organic matter can reduce the uncertainty in thermal maturity assessment and help the classification of highly heterogeneous organofacies when using large datasets for Earth and planetary sciences studies.
AB - In this study we compare and evaluate different unsupervised clustering algorithms for organofacies discrimination in low maturity dispersed organic matter based on Raman spectroscopic analyses. A total of 1363 Raman spectra were collected from a set of 27 organic-rich samples from the Lower Toarcian shale interval of the Paris Basin sub-surface. Rock-Eval pyrolysis data indicate a type II to type III kerogen with a vitrinite reflectance (Ro%) between 0.45% and 0.65%, and Tmax between 415 °C and 438 °C. Organic petrographic observations under transmitted light reveal the presence of organofacies composed by amorphous organic matter, opaque, and translucent phytoclasts. An optical classification of organic particles was performed on about 40–60 fragments per sample and used as the ground truth. Raman spectra were obtained for all the classified fragments and principal component analysis was performed to underline the variability among spectra. Unsupervised clustering was then applied on Raman spectra principal components. Three clustering methods were applied to evaluate their effectiveness in predicting number, shape and density of clusters and a contingency matrix was used to quantify their ability to predict different organofacies. Gaussian Mixture Model (GMM) was found to be the best algorithm for organofacies identification showing an accuracy mostly between 80% and 90%. This work outlines how unsupervised clustering of Raman spectra of dispersed organic matter can reduce the uncertainty in thermal maturity assessment and help the classification of highly heterogeneous organofacies when using large datasets for Earth and planetary sciences studies.
KW - Cluster analysis
KW - Dispersed organic matter
KW - Machine learning
KW - Principal component analysis
KW - Raman spectroscopy
KW - Thermal maturity
UR - http://www.scopus.com/inward/record.url?scp=85153086274&partnerID=8YFLogxK
U2 - 10.1016/j.coal.2023.104237
DO - 10.1016/j.coal.2023.104237
M3 - Article
AN - SCOPUS:85153086274
VL - 271
JO - International Journal Of Coal Geology
JF - International Journal Of Coal Geology
SN - 0166-5162
M1 - 104237
ER -