About MDE Purchase Demo Product Partners

M Data Extractor (MDE) White Paper

Download this document
White Paper

Executive Summary

The M Data Extractor (MDE) is an intermediary (middle-ware) software component that makes possible robust, high performance connections between the FileMan database residing in an M system and modern, commercial off-the-shelf (COTS) software tools such as relational database systems, decision support systems, and ad hoc report generation systems.

Introduction

FileMan, a component of Vista, is a nationally standardized database with a dictionary-based architecture. It is built on top of the M operating environment and employs a hierarchical data storage methodology.

FileMan is the result of a government-sponsored database development effort over the past 20 years, and is used principally within the Veterans Health Administration. However, FileMan is public-domain software, and as such, has been adopted by non-government healthcare as well as non-healthcare businesses. Thus, FileMan software is capable of running on every major computer operating system.

The Issue: Large Scale Data Analysis on M-based VA Healthcare Systems

FileMan database systems greatly impact a healthcare facility’s effectiveness through its methods for locating, acquiring, and analyzing information. Many healthcare facilities require voluminous data to be gathered and summarized for making informed clinical and business decisions. However, the M-based FileMan database systems do not typically facilitate this type of decision-making. While data is available for the routine healthcare operations, this same data is relatively inaccessible for in-depth analysis and ad hoc queries and reports.

The present FileMan reporting capabilities, while powerful, are not easy to learn. They also consume a significant portion of system resources when complex queries are executed. Plus, the resulting operational data is usually not in the form that users want, since data often contains internal codes, unneeded groupings of data items, and redundant data fields. And, an additional problem with FileMan is that it requires a person highly familiar with the file structure to locate information.

Although many healthcare facilities follow FileMan standards for storing data in data dictionaries, many make minor modifications so that FileMan can meet their unique needs. Although very helpful for facilities on an individual basis, this flexibility can make the analysis of data difficult, whether over single or multiple locations. For example, consider the database definition of a lab test, such as potassium. There may be differences in the way this test is named at various facilities. There can also be multiple forms of these tests at a single site, as in the following example from the Pittsburgh VA:

Test Location of test in File Manager
Potassium Field 6 in File 63
Potassium (BG) Field 645059 in File 63
Potassium (CX7-OUT) Field 646074 in File 63

The location of these tests would be different at different VA sites. Therefore, it is necessary to use a data extraction tool that works with FileMan and outputs the data in a correlated fashion.

One approach to improving the accessibility of data stored in M databases is the utilization of ODBC mapping directly from within the M system. However, the direct ODBC approach does not provide an efficient solution for large-scale data analysis for the following reasons:

The data remains in its hierarchical form - making performance slow
There is no aggregation or cleansing of data - two tasks that greatly improve the ease and speed of data analysis
There is no segmentation of domain-specific data elements - making analysis more complex

There is great potential for clinical and administrative enlightenment in unlocking the data stored in FileMan, the basis of the Veterans Health Care delivery system. There are 172 VA medical centers and outpatient clinics that span the nation. Since FileMan is standard throughout the entire system, the data contained in the VA database is perhaps the largest unified collection of administrative and clinical data anywhere.

The Solution: the M Data Extractor (MDE)

The M Data Extractor (MDE), combined with readily available, relationally-based software tools, provides a solution to these M data analysis difficulties. The MDE is an intermediary (middle-ware) software component between the FileMan database residing in an M system and modern, commercial off-the-shelf (COTS) software tools, such as relational database systems, decision support systems, and ad hoc report generation systems.

In other words, the MDE performs data migration from FileMan's hierarchical format to a modern, high-performance, relational format, while transforming the hierarchical data relationships within FileMan to equivalent relationally-based linkages. Once this data transformation is performed, third-party tools allow the migrated data to be fully analyzed.

Features and Functions

The MDE includes the following features and functions:

Access to right-out-of-the-box FileMan files, such as:
  Predefined (demographics, medications, admission / discharge)
  Customized (group your own files)
Extraction of data from M FileMan databases into a relational format that does the following:
  creates tables and keys
  Flattens hierarchically-defined data structures
  Removes transitive data dependencies
Control of the amount and type of File Manager data to extract, either interactively or via a predefined data file. For example, a researcher could create a list of the CPK lab values from 50 specific patients on his/ her desktop PC
Conversion of FileMan data types to standard SQL data types
Ability to extract data from FileMan and load it into SQL in bulk
Ability to performs small ad hoc data transfers from FileMan to SQL using network (TCP/IP) or serial connections
Ability to clean and filter data as it is transferred from File Manager to SQL
Merger of data listed under separate, but essentially the same names, such as lab tests
Evaluation of arcane data pointers into values, suitable for relational storage

The reporting capabilities of FileManager-based M/Caché, while powerful, are not easy to learn.
FileManager consumes a significant portion of system resources when complex reports are executed.
The resulting operational data is usually not in the form that users want, since the data often contains internal codes, unneeded groupings of data items and redundant data fields.

The Solution: The MDE

The MDE performs data migration from FileManager-based M/Caché's hierarchical format to SQL databases, such as Microsoft SQL Server's high-performance relational format, while transforming the hierarchical data relationships within FileManager-based M/Caché to equivalent relationally-based linkages. Once this data transformation is performed, third-party tools allow the migrated data to be fully analyzed.

Features and Benefits

The M Data Extractor offers several features and benefits because it does the following:

Accesses right-out-of-the-box FileManager-based M/Caché file categories, such as:
  Predefined (demographics, medications, admission / discharge)
  Customed (group your own files)
Extracts data from FileManager-based M/Caché databases to any SQL database, data warehouse, or data mart
Allows the use of COTS (Commercial Off-The-Shelf) data analysis tools which permit the following:
  Natural language specifications and graphical diagrams
  User control over the type and appearance of data on reports, using functions that sort and filter data
Enables administrators, clinicians, and researchers to analyze their own data
  Allows administrators to view trends in large amounts of data using the server component — clinicians and researchers to extract data from more specialized FileManager files using the desktop component
  Allows extractions to be performed without the need to involve I/S personnel
Converts FileManager data types to standard SQL data types, enabling users to take advantage of the COTS tool functionality that is available when data is stored as standard SQL data types
Filters data as it is extracted so only the desired data is actually extracted
Offloads slow-running, computationally-intensive FileManager-based M/Caché reports from primary hospital data systems to Microsoft SQL Server-equipped peripheral systems without impacting production.
Provides scalability — since M itself is highly scalable, so the MDE is also scalable
Uses data warehousing and data mart tools to:
  Structure large amounts of data into a format suitable for providing answers to questions that healthcare analysts and epidemiologists wish to determine, such as:
    Persian Gulf veterans who have never been seen in the Persian Gulf Clinic
    Numbers of World War II Veterans taking various types of hypertensive medications
    Outcomes of treating specific diseases with different approaches
    Efficacy of lab tests to determine disease prognoses
    Total costs incurred for the same illnesses treated at different sites
    Forecasting of trends
  Combine data from different departments, such as clinical and financial, to create insightful data "views"
  Use executive information-type systems for decision support

Benefits

Using the MDE provides many benefits. It does the following:

Organizes FileMan data relationally, and thereby allows the use of third-party relational analysis tools
Opens the door to web access and graphical display of analytical results
Unifies patient data identifiers, allowing integration of data from disparate FileMan database sites and concentrating data from various sites to produce composite reports of all relevant fields of interest
Groups data in ways most suitable for analysis
Strips off unneeded fields, making analysis easier
Removes redundant data fields
Simplifies access to data so that intimate knowledge of FileMan is unnecessary
Offloads the slow, computationally-intensive File Manager reports from primary hospital data systems to SQL-equipped peripheral systems
Allows the use of commercial off-the-shelf (COTS) tools that non-programmers can use to generate reports, which permit the following:
  Natural language specifications and graphical diagrams
  User control over the type and appearance of data on reports, using functions that sort and filter the data
Provides the ability to use data warehousing and data mart tools, which then allows for:
  Structuring large amounts of data into a format suitable for providing answers to questions that healthcare analysts and epidemiologists wish to determine, such as the following:
    Persian Gulf veterans who have never been seen in the Persian Gulf Clinic
    Numbers of World War II Veterans taking various types of hypertensive medications
    Outcomes of treating specific diseases with different approaches
    Efficacy of lab tests for determining disease prognoses
    Total costs incurred for the same illnesses treated at different sites
    Forecasting of trends
  Combining data from different departments, such as clinical and financial, to create insightful data "views"
  Using Executive Information Systems for decision support
Provides scalability— Since M itself is highly scalable, so is the MDE scalable
Uses industry standards such as ODBC and standard SQL to achieve independence from vendor-specific third-party relational database tools

Recap

The difficulty in analyzing very large data sets is not inherent to FileMan alone as other system designs suffer from the same shortcomings. The issue here is that systems, such as FileMan, are designed to provide the very discrete detail required for the daily functioning of real-world businesses— that is, OLTP (on-line transaction processing).

Enterprise-wide data analysis requires an overview of all data. It is not only the MDE and associated relational software tools that provide a data analysis solution, but it is also the process of the data migration — the steps taken to get from the microscopic view to the macroscopic— that offers the solution.

The following table summarizes the differences between transactional-type database systems and relational-type warehouse systems:

Characteristic Transactional Database Relational Warehouse
Type of user All users Management, Epidemiologists, Healthcare providers
Goal Record Keeping Information analysis
Update frequency Continuous Batch
Level of data detail Detailed Summary
Database processing architecture Transactions Analysis
Database structure Fixed Dynamic

Creation of data warehouses requires appropriate selection and structure of data. Once the data has been identified, the MDE becomes the pathway to enable the healthcare facility to migrate the data as it wishes in preparation for analytical use.

About MDE

Purchase

Demo

Product

Partners