Identification of Patients with Family History of Pancreatic Cancer--Investigation of an NLP System Portability.

Overview of attention for article published in this source, January 2015

Altmetric Badge

Readers on

mendeley: 36 Mendeley

Title	Identification of Patients with Family History of Pancreatic Cancer--Investigation of an NLP System Portability.
Pubmed ID	26262122
Authors	Saeed Mehrabi, Anand Krishnan, Alexandra M Roch, Heidi Schmidt, DingCheng Li, Joe Kesterson, Chris Beesley, Paul Dexter, Max Schmidt, Mathew Palakal, Hongfang Liu
Abstract	In this study we have developed a rule-based natural language processing (NLP) system to identify patients with family history of pancreatic cancer. The algorithm was developed in a Unstructured Information Management Architecture (UIMA) framework and consisted of section segmentation, relation discovery, and negation detection. The system was evaluated on data from two institutions. The family history identification precision was consistent across the institutions shifting from 88.9% on Indiana University (IU) dataset to 87.8% on Mayo Clinic dataset. Customizing the algorithm on the the Mayo Clinic data, increased its precision to 88.1%. The family member relation discovery achieved precision, recall, and F-measure of 75.3%, 91.6% and 82.6% respectively. Negation detection resulted in precision of 99.1%. The results show that rule-based NLP approaches for specific information extraction tasks are portable across institutions; however customization of the algorithm on the new dataset improves its performance.

View on publisher site Alert me about new mentions

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 36 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Korea, Republic of	1	3%
Unknown	35	97%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	10	28%
Student > Master	4	11%
Student > Ph. D. Student	3	8%
Student > Doctoral Student	2	6%
Other	2	6%
Other	6	17%
Unknown	9	25%

Readers by discipline	Count	As %
Medicine and Dentistry	10	28%
Computer Science	9	25%
Pharmacology, Toxicology and Pharmaceutical Science	1	3%
Mathematics	1	3%
Economics, Econometrics and Finance	1	3%
Other	3	8%
Unknown	11	31%