Georgia Tech Leads Effort to Convert Electronic Health Records into Meaningful Data

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Contact

Phillip Taylor

News and Media Relations Manager

ptaylor@cc.gatech.edu

Sidebar Content
No sidebar content submitted.
Summaries

Summary Sentence:

Four universities collaborate on NSF-sponsored project to develop methods and algorithms to turn enormous clinical health record databases into useful phenotypes

Full Summary:

No summary paragraph submitted.

Media
  • Jimeng Sun Jimeng Sun
    (image/jpeg)

Ever since the adoption of electronic health records (EHRs), medical universities, hospitals and other health institutions have amassed enormous databases of information, comprising a diverse array of information such as diagnoses, medications and lab results.

While such databases promise to serve as rich resources for clinical research, the data tends to be difficult, time-extensive and costly to analyze. A new project funded by the National Science Foundation (NSF) aims to change that.

“As available now, databases of electronic health records are diverse and massive, but they are also messy and heterogeneous. There’s a lot of noise,” said Jimeng Sun, associate professor at Georgia Tech’s School of Computational Science and Engineering. “Our charge is to find ways to make the information more robust and easier to read, thus leading to meaningful clinical concepts without extensive labor and time.”

As part of the four-year, $2.1 million NSF research project, data analytic teams from Georgia Tech and the University of Texas, Austin, will develop algorithms and methods to convert the EHR data into meaningful clinical concepts or phenotypes focused on diseases and specific health traits. Vanderbilt University will provide initial EHR data and phenotype validation.

Resulting phenotypes will be refined and adapted in conjunction with data from Northwestern University so that the information and data can be used across multiple health institutions.

In addition to Sun, who serves as the lead principal investigator of the project, the team includes Bradley Malin and Joshua Denny, associate professors of biomedical informatics and computer science at Vanderbilt; Joydeep Ghosh, professor of electrical and computer engineering at Texas; and Abel Kho, associate professor of medicine-biomedical informatics at Northwestern. 

Past efforts to create phenotypes from data tended to be costly and time-intensive.  Several challenges face physicians and researchers in developing scalable phenotype methods. These include accurate patient representations, working with data across multiple dimensions, sufficient expert refinement and adaptability across multiple health institutions.

“Traditionally it takes six to 18 months to develop an algorithm for a single phenotype, which is too long,” Denny said. “There is also a tremendous need for developing high-throughput phenotyping methods that can directly model the interactions among heterogeneous information sources.”

The project will focus on three specific applications, including a system to accurately and effectively identify patients, even with multiple symptoms and health traits, for clinical research and developing predictive models for health studies.

The project can also provide effective phenotypes for genomic-wide association studies (GWAS). At present, health researchers can only work with one phenotype at a time. But this project will enable researches to quickly study multiple phenotypes jointly. Finally, those identified phenotypes can help analyze specific risk about patients, such as key health factors, exhibited by Type 2 diabetes patients.

In addition to developing the algorithms and methods, the professors will try to develop new health analytics curricula as a massive open online course (MOOC) and for tutorial sessions at conferences.

This research is supported by the National Science Foundation (NSF) under Award 1418511. Any conclusions or opinions are those of the authors and do not necessarily represent the official views of the NSF.

Additional Information

Groups

College of Computing

Categories
No categories were selected.
Related Core Research Areas
Bioengineering and Bioscience, People and Technology
Newsroom Topics
No newsroom topics were selected.
Keywords
EHRs, electronic health records, health records, jimeng sun, National Science Foundation, NSF, phenotypes, Press Release, School of Computational Science and Engineering, University of Texas at Austin
Status
  • Created By: Brittany Aiello
  • Workflow Status: Published
  • Created On: Nov 18, 2014 - 6:29am
  • Last Updated: Oct 7, 2016 - 11:17pm