Fast Decliner Phenotype of Chronic Obstructive Pulmonary Disease (COPD): Applying Machine Learning for Predicting Lung Function Loss

Vasilis Nikolaou; Sebastiano  Massaro; Wolfgang Garn; Masoud  Fakhimi; Lampros  Stergioulas; David Price

doi:10.1136/bmjresp-2021-000980

Fast Decliner Phenotype of Chronic Obstructive Pulmonary Disease (COPD): Applying Machine Learning for Predicting Lung Function Loss

Vasilis Nikolaou^* (Corresponding Author), Sebastiano Massaro, Wolfgang Garn, Masoud Fakhimi, Lampros Stergioulas, David Price

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

8 Downloads (Pure)

Abstract

Background: Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous group of lung conditions challenging to diagnose and treat. Identification of phenotypes of patients with lung function loss may allow early intervention and improve disease management. We characterized patients with the “fast decliner” phenotype, determined its reproducibility, and predicted lung function decline after COPD diagnosis.
Methods: A prospective 4-years observational study that applies machine learning tools to identify COPD phenotypes among 13260 patients from the UK Royal College of General Practitioners and Surveillance Centre database. The phenotypes were identified prior to diagnosis (training dataset), and their reproducibility was assessed after COPD diagnosis (validation dataset).
Results: Three COPD phenotypes were identified, the most common of which was the “fast decliner” - characterized by patients of younger age with the lowest number of COPD exacerbations and better lung function - yet a fast decline in lung function with increasing number of exacerbations. The other two phenotypes were characterized by a) patients with the highest prevalence of COPD severity and b) patients of older age, most males and the highest prevalence of diabetes, cardiovascular comorbidities, and hypertension. These
phenotypes were reproduced in the validation dataset with 80% accuracy. Gender, COPD severity, and exacerbations were the most important risk factors for lung function decline in the most common phenotype.

Conclusions: In this study, three COPD phenotypes were identified prior to patients being diagnosed with COPD. The reproducibility of those phenotypes in a blind dataset following COPD diagnosis suggests their generalizability among different populations.

Original language	English
Article number	e000980
Number of pages	11
Journal	BMJ Open Respiratory Research
Volume	8
Issue number	1
Early online date	29 Oct 2021
DOIs	https://doi.org/10.1136/bmjresp-2021-000980
Publication status	Published - 31 Jan 2022

Bibliographical note

Acknowledgements
We acknowledge patients for allowing their data to be used for surveillance and research. Practices who have agreed to be part of the RCGP RSC and allow us to extract and used health data for surveillance and research. Ms. Filipa Ferreira from RCGP and Mr. Julian Sherlock from the University of Surrey. Apollo Medical Systems for data extraction. Collaboration with EMIS, TPP, In-Practice and Micro-test CMR supplier for facilitating data extraction. Colleagues at Public Health England.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors

Keywords

fast decliner phenotype
Machine learning
cluster analysis
ensemble models

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1136/bmjresp-2021-000980Licence: CC BY-NC

Nikolau_etal_BMJO_Fast_Decliner_Phenotype_VoR
© Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/
Final published version, 1.24 MBLicence: CC BY-NC

Cite this

@article{f7307e3f5060458b9e9b5c0d02c97d31,

title = "Fast Decliner Phenotype of Chronic Obstructive Pulmonary Disease (COPD): Applying Machine Learning for Predicting Lung Function Loss",

abstract = "Background: Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous group of lung conditions challenging to diagnose and treat. Identification of phenotypes of patients with lung function loss may allow early intervention and improve disease management. We characterized patients with the “fast decliner” phenotype, determined its reproducibility, and predicted lung function decline after COPD diagnosis.Methods: A prospective 4-years observational study that applies machine learning tools to identify COPD phenotypes among 13260 patients from the UK Royal College of General Practitioners and Surveillance Centre database. The phenotypes were identified prior to diagnosis (training dataset), and their reproducibility was assessed after COPD diagnosis (validation dataset).Results: Three COPD phenotypes were identified, the most common of which was the “fast decliner” - characterized by patients of younger age with the lowest number of COPD exacerbations and better lung function - yet a fast decline in lung function with increasing number of exacerbations. The other two phenotypes were characterized by a) patients with the highest prevalence of COPD severity and b) patients of older age, most males and the highest prevalence of diabetes, cardiovascular comorbidities, and hypertension. Thesephenotypes were reproduced in the validation dataset with 80% accuracy. Gender, COPD severity, and exacerbations were the most important risk factors for lung function decline in the most common phenotype.Conclusions: In this study, three COPD phenotypes were identified prior to patients being diagnosed with COPD. The reproducibility of those phenotypes in a blind dataset following COPD diagnosis suggests their generalizability among different populations.",

keywords = "fast decliner phenotype, Machine learning, cluster analysis, ensemble models",

author = "Vasilis Nikolaou and Sebastiano Massaro and Wolfgang Garn and Masoud Fakhimi and Lampros Stergioulas and David Price",

note = "Acknowledgements We acknowledge patients for allowing their data to be used for surveillance and research. Practices who have agreed to be part of the RCGP RSC and allow us to extract and used health data for surveillance and research. Ms. Filipa Ferreira from RCGP and Mr. Julian Sherlock from the University of Surrey. Apollo Medical Systems for data extraction. Collaboration with EMIS, TPP, In-Practice and Micro-test CMR supplier for facilitating data extraction. Colleagues at Public Health England. Funding This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors",

year = "2022",

month = jan,

day = "31",

doi = "10.1136/bmjresp-2021-000980",

language = "English",

volume = "8",

journal = "BMJ Open Respiratory Research",

issn = "2052-4439",

publisher = "BMJ Publishing Group",

number = "1",

}

TY - JOUR

T1 - Fast Decliner Phenotype of Chronic Obstructive Pulmonary Disease (COPD)

T2 - Applying Machine Learning for Predicting Lung Function Loss

AU - Nikolaou, Vasilis

AU - Massaro, Sebastiano

AU - Garn, Wolfgang

AU - Fakhimi, Masoud

AU - Stergioulas, Lampros

AU - Price, David

N1 - Acknowledgements We acknowledge patients for allowing their data to be used for surveillance and research. Practices who have agreed to be part of the RCGP RSC and allow us to extract and used health data for surveillance and research. Ms. Filipa Ferreira from RCGP and Mr. Julian Sherlock from the University of Surrey. Apollo Medical Systems for data extraction. Collaboration with EMIS, TPP, In-Practice and Micro-test CMR supplier for facilitating data extraction. Colleagues at Public Health England. Funding This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors

PY - 2022/1/31

Y1 - 2022/1/31

N2 - Background: Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous group of lung conditions challenging to diagnose and treat. Identification of phenotypes of patients with lung function loss may allow early intervention and improve disease management. We characterized patients with the “fast decliner” phenotype, determined its reproducibility, and predicted lung function decline after COPD diagnosis.Methods: A prospective 4-years observational study that applies machine learning tools to identify COPD phenotypes among 13260 patients from the UK Royal College of General Practitioners and Surveillance Centre database. The phenotypes were identified prior to diagnosis (training dataset), and their reproducibility was assessed after COPD diagnosis (validation dataset).Results: Three COPD phenotypes were identified, the most common of which was the “fast decliner” - characterized by patients of younger age with the lowest number of COPD exacerbations and better lung function - yet a fast decline in lung function with increasing number of exacerbations. The other two phenotypes were characterized by a) patients with the highest prevalence of COPD severity and b) patients of older age, most males and the highest prevalence of diabetes, cardiovascular comorbidities, and hypertension. Thesephenotypes were reproduced in the validation dataset with 80% accuracy. Gender, COPD severity, and exacerbations were the most important risk factors for lung function decline in the most common phenotype.Conclusions: In this study, three COPD phenotypes were identified prior to patients being diagnosed with COPD. The reproducibility of those phenotypes in a blind dataset following COPD diagnosis suggests their generalizability among different populations.

AB - Background: Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous group of lung conditions challenging to diagnose and treat. Identification of phenotypes of patients with lung function loss may allow early intervention and improve disease management. We characterized patients with the “fast decliner” phenotype, determined its reproducibility, and predicted lung function decline after COPD diagnosis.Methods: A prospective 4-years observational study that applies machine learning tools to identify COPD phenotypes among 13260 patients from the UK Royal College of General Practitioners and Surveillance Centre database. The phenotypes were identified prior to diagnosis (training dataset), and their reproducibility was assessed after COPD diagnosis (validation dataset).Results: Three COPD phenotypes were identified, the most common of which was the “fast decliner” - characterized by patients of younger age with the lowest number of COPD exacerbations and better lung function - yet a fast decline in lung function with increasing number of exacerbations. The other two phenotypes were characterized by a) patients with the highest prevalence of COPD severity and b) patients of older age, most males and the highest prevalence of diabetes, cardiovascular comorbidities, and hypertension. Thesephenotypes were reproduced in the validation dataset with 80% accuracy. Gender, COPD severity, and exacerbations were the most important risk factors for lung function decline in the most common phenotype.Conclusions: In this study, three COPD phenotypes were identified prior to patients being diagnosed with COPD. The reproducibility of those phenotypes in a blind dataset following COPD diagnosis suggests their generalizability among different populations.

KW - fast decliner phenotype

KW - Machine learning

KW - cluster analysis

KW - ensemble models

U2 - 10.1136/bmjresp-2021-000980

DO - 10.1136/bmjresp-2021-000980

M3 - Article

SN - 2052-4439

VL - 8

JO - BMJ Open Respiratory Research

JF - BMJ Open Respiratory Research

IS - 1

M1 - e000980

ER -

Fast Decliner Phenotype of Chronic Obstructive Pulmonary Disease (COPD): Applying Machine Learning for Predicting Lung Function Loss

Abstract

Bibliographical note

Keywords

UN SDGs

Access to Document

Fingerprint

Cite this