Publication

Development of Query Strategies to Identify a Histologic Lymphoma Subtype in a Large Linked Database System

Downloadable Content

Persistent URL
Last modified
  • 02/20/2025
Type of Material
Authors
    Michael Graiser, Emory UniversitySusan J. Moore, Emory UniversityRochelle Victor, Emory UniversityAshley Hilliard, Emory UniversityLeroy Hill, Emory UniversityMichael S. Keehan, NuTec Health SystemsChristopher R Flowers, Emory University
Language
  • English
Date
  • 2007
Publisher
  • Libertas Academica
Publication Version
Copyright Statement
  • © 2007 The authors.
License
Final Published Version (URL)
Title of Journal or Parent Work
ISSN
  • 1176-9351
Volume
  • 3
Start Page
  • 149
End Page
  • 158
Grant/Funding Information
  • Sources of funding used to assist in the preparation of this manuscript include the PhRMA Foundation Health Outcomes Research Starter Grant, the Winship Cancer Institute Faculty Development Award, and the Georgia Cancer Coalition Distinguished Clinician and Scientists Award.
Abstract
  • Background: Large linked databases (LLDB) represent a novel resource for cancer outcomes research. However, accurate means of identifying a patient population of interest within these LLDBs can be challenging. Our research group developed a fully integrated platform that provides a means of combining independent legacy databases into a single cancer-focused LLDB system. We compared the sensitivity and specifi city of several SQL-based query strategies for identifying a histologic lymphoma subtype in this LLDB to determine the most accurate legacy data source for identifying a specifi c cancer patient population. Methods: Query strategies were developed to identify patients with follicular lymphoma from a LLDB of cancer registry data, electronic medical records (EMR), laboratory, administrative, pharmacy, and other clinical data. Queries were performed using common diagnostic codes (ICD-9), cancer registry histology codes (ICD-O), and text searches of EMRs. We reviewed medical records and pathology reports to confirm each diagnosis and calculated the sensitivity and specificity for each query strategy. Results: Together the queries identified 1538 potential cases of follicular lymphoma. Review of pathology and other medical reports confirmed 415 cases of follicular lymphoma, 300 pathology-verifi ed and 115 verified from other medical reports. The query using ICD-O codes was highly specific (96%). Queries using text strings varied in sensitivity (range 7–92%) and specifi city (range 86–99%). Queries using ICD-9 codes were both less sensitive (34–44%) and specific (35–87%). Conclusions: Queries of linked-cancer databases that include cancer registry data should utilize ICD-O codes or employ structured free-text searches to identify patient populations with a precise histologic diagnosis.
Author Notes
  • Correspondence: Christopher R. Flowers, M.D., M.S., Medical Director, Oncology Data Center, Assistant Professor, Winship Cancer Institute, 1365 Clifton Road, N.E. Building C, Suite 3006, Emory University, Atlanta, GA 30322. Tel: 404-778-5554; Fax: 404-778-5520; Email: crflowe@emory.edu
Keywords
Research Categories
  • Health Sciences, Oncology
  • Health Sciences, Education
  • Health Sciences, General

Tools

Relations

In Collection:

Items