Large-Scale Mining of Medical Text- a Hybrid Statistical/Semantic Approach
published: May 28, 2013, recorded: September 2012, views: 38
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
The British Medical Journal Group (BMJ Group) has a wide and varied content set, including a suite of medical journals, online learning materials, best practice guidelines, clinical evidence summaries, a doc-2-doc online forum, and a portfolio system for doctors. There is an emerging need to aggregate accross these content types, providing a unified tagging and I inking system, so that related content can easily be retrieved across the group. The main use-cases include an improved search and browse capability, and the (semi-)automatic construction of "specialty portals", which may be clinical in nature (e.g. diabetes) or non-clinical (e.g. NHS reform). This provides a challenge to standard Pattern Analysis algorithms, due in part to the highly technical nature of the documents. Prior work has mainly been focussed on the use of tools that automatically index against a medical ontology (such as Meta Map and UMLS), but this approach has drawbacks in terms of computational resources, lack of user control, and limitations to medical-only concepts. A hybrid approach based on statistical and semantic methods appears to have some notable advantages. The presentation will focus on the first phase of work taking the two approaches, and talk about some specific technical issues that have arisen along the way. This is based on joint work with Jonathon Peterson, Keith Marshall, Chris Wroe, and Rob Challen.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !