Title |
A novel approach for selecting combination clinical markers of pathology applied to a large retrospective cohort of surgically resected pancreatic cysts
|
---|---|
Published in |
Journal of the American Medical Informatics Association, June 2016
|
DOI | 10.1093/jamia/ocw069 |
Pubmed ID | |
Authors |
David L Masica, Marco Dal Molin, Christopher L Wolfgang, Tyler Tomita, Mohammad R Ostovaneh, Amanda Blackford, Robert A Moran, Joanna K Law, Thomas Barkley, Michael Goggins, Marcia Irene Canto, Meredith Pittman, James R Eshleman, Syed Z Ali, Elliot K Fishman, Ihab R Kamel, Siva P Raman, Atif Zaheer, Nita Ahuja, Martin A Makary, Matthew J Weiss, Kenzo Hirose, John L Cameron, Neda Rezaee, Jin He, Young Joon Ahn, Wenchuan Wu, Yuxuan Wang, Simeon Springer, Luis L Diaz, Nickolas Papadopoulos, Ralph H Hruban, Kenneth W Kinzler, Bert Vogelstein, Rachel Karchin, Anne Marie Lennon |
Abstract |
Our objective was to develop an approach for selecting combinatorial markers of pathology from diverse clinical data types. We demonstrate this approach on the problem of pancreatic cyst classification. We analyzed 1026 patients with surgically resected pancreatic cysts, comprising 584 intraductal papillary mucinous neoplasms, 332 serous cystadenomas, 78 mucinous cystic neoplasms, and 42 solid-pseudopapillary neoplasms. To derive optimal markers for cyst classification from the preoperative clinical and radiological data, we developed a statistical approach for combining any number of categorical, dichotomous, or continuous-valued clinical parameters into individual predictors of pathology. The approach is unbiased and statistically rigorous. Millions of feature combinations were tested using 10-fold cross-validation, and the most informative features were validated in an independent cohort of 130 patients with surgically resected pancreatic cysts. We identified combinatorial clinical markers that classified serous cystadenomas with 95% sensitivity and 83% specificity; solid-pseudopapillary neoplasms with 89% sensitivity and 86% specificity; mucinous cystic neoplasms with 91% sensitivity and 83% specificity; and intraductal papillary mucinous neoplasms with 94% sensitivity and 90% specificity. No individual features were as accurate as the combination markers. We further validated these combinatorial markers on an independent cohort of 130 pancreatic cysts, and achieved high and well-balanced accuracies. Overall sensitivity and specificity for identifying patients requiring surgical resection was 84% and 81%, respectively. Our approach identified combinatorial markers for pancreatic cyst classification that had improved performance relative to the individual features they comprise. In principle, this approach can be applied to any clinical dataset comprising dichotomous, categorical, and continuous-valued parameters. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Philippines | 1 | 50% |
Unknown | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 2 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 1 | 2% |
Unknown | 44 | 98% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 10 | 22% |
Other | 6 | 13% |
Professor | 5 | 11% |
Student > Doctoral Student | 3 | 7% |
Student > Postgraduate | 3 | 7% |
Other | 7 | 16% |
Unknown | 11 | 24% |
Readers by discipline | Count | As % |
---|---|---|
Medicine and Dentistry | 25 | 56% |
Biochemistry, Genetics and Molecular Biology | 2 | 4% |
Computer Science | 2 | 4% |
Nursing and Health Professions | 1 | 2% |
Earth and Planetary Sciences | 1 | 2% |
Other | 1 | 2% |
Unknown | 13 | 29% |