Using document stores in business model transformation
by user
Comments
Transcript
Using document stores in business model transformation
www.pwc.com/technologyforecast Technology Forecast: Remapping the database landscape Issue 1, 2015 Using document stores in business model transformation By Alan Morrison and Colin White Healthcare providers need data model flexibility now. The US healthcare industry’s ongoing business model shift is having a considerable impact on how providers collect, analyze, and store data. Although the fee-for-service model still dominates, payers are refining pay-forperformance programs, which measure and reward the value, not the volume, of healthcare delivered. To get paid for performance, providers are finding they need data collection and analysis capabilities that are different from those that relational databases deliver. More than 14 percent of Medicare beneficiaries— nearly 8 million people—already receive care through pay-forperformance plans. The pay-for-performance model encourages providers to focus less on the most aggressive treatment of each distinct ailment and more on a balanced approach to the patient’s overall health. For example, a payer might seek evidence that a provider has successfully brought blood pressure, blood glucose and lipid levels under control for a given patient population with both high blood pressure and diabetes. By building in incentives, program advocates hope to reduce the overall cost and to improve the quality of care. More than 14 percent of Medicare beneficiaries—nearly 8 million people—already receive care through a payfor-performance program, and the Obama administration aims to reach 30 percent by the end of 2016.1 Pay for performance triggers changes in the data collection and analysis that surrounds patient care. Providers need to document, aggregate, and share detailed analyses of the impact of their care on overall patient health. Reliable and persuasive health outcome reporting is essential to the billing process and the success of pay for performance. To achieve successful outcome reporting, providers such as Avera (a chain of nonprofit hospitals, clinics, and nursing homes in the American Midwest) know they need to run longitudinal queries on the same patient populations over time across a continuum of care. Unfortunately, electronic medical record (EMR) databases, the primary data source, weren’t designed for longitudinal analysis. Most EMR data models are visit oriented to facilitate the old fee-for-service billing. To address this challenge, Avera uses Apache CouchDB, a NoSQL document Medicare patients moving out of fee-for-service programs Number of patients receiving pay-for-performance care, millions 8 million 2 015 6 2 014 4 2 013 2 2 012 0 Source: Bloomberg Business, 2015 1 John Tozzi, “How Obama’s $3 Trillion Health-Care Overhaul Would Work,” Bloomberg Business, January 26, 2015, http://www.bloomberg.com/news/articles/2015-01-26/how-obama-s-3-trillion-health-care-overhaul-would-work, accessed January 26, 2015. 2 Structured query language, or SQL, is the dominant query language associated with relational databases. NoSQL stands for not only structured query language. In practice, the term NoSQL is used loosely to refer to non-relational databases designed for distributed environments, rather than the associated query languages. PwC uses the term NoSQL, despite its inadequacies, to refer to non-relational distributed databases because it has become the default term of art. See the section “Database evolution becomes a revolution” in “Enterprises hedge their bets with NoSQL databases” at http://www.pwc.com/us/en/technology-forecast/2015/remapping-databaselandscape/features/enterprises-nosql-databases.jhtml for more information on relational versus non-relational database technology. 2 PwC Technology Forecast Using document stores in business model transformation store, as a bridge between the EMR and the healthcare provider’s analytics systems.2 NoSQL document stores such as CouchDB, MongoDB, or Microsoft DocumentDB can offer an expedient bridge between a rigid EMR database and a more permissive, versatile, general-purpose data repository. Healthcare is just one domain where a sea change is under way in database technology. This issue of the Technology Forecast continues to explore the upheaval caused by these technologies. This article provides a deeper look at the NoSQL document store and how it differs from the relational database. How document stores and relational databases compare Unlike a typical EMR relational database, a document store accommodates data in many structures and formats and allows the longitudinal queries necessary for patient outcome reporting purposes. This kind of flexibility pays off when working with all the different data formats providers collect. Take the case of diabetics who’ve also had high cholesterol levels during three visits over 12 months. With a document store, providers can preserve the full fidelity of the data, regardless of the structure or quality of the data. The CouchDB database that Avera uses identifies a test result as an element connected to its parent, the low-density lipid cholesterol test. The tags describing the relationship connecting them indicate the data type, such as a fax image, a PDF, or other file type. Any query in CouchDB would indicate the presence of a lab result and its data type, even if the user can’t read the result value itself. Contemporary relational databases can store data in various forms, but they weren’t originally designed for that purpose, which limits their utility for querying. To function properly, queries to relational databases generally require data to be in a specifically keyed, tabular form with predetermined, explicit schema (that is, predefined tables, fields, views, indices, and other elements) and all the table cells filled in. Document objects, by comparison, may conform to a schema (a data description list matched to the tabular data), but the schema is rarely explicit and can be unique to each document in the store. Document stores typically support familiar schemas but won’t strictly enforce them. How relational databases and document stores handle four records Relational data model Highly structured table organization with rigidly defined data formats and record structure Document data model Collections of complex documents with arbitrary, nested data formats and varying “record” format Collections R1C1 R1C2 R1C3 R1C4 R2C1 R2C2 R2C3 R2C4 R3C1 R3C2 R3C3 R3C4 R4C1 R4C2 R4C3 R4C4 { { "UUID": “21f7f8de-8051-5b89-8 { "UUID": “21f7f8de-8051-5b89-8 "Time": "2011-0401T13:01:02.42” { "UUID": “21f7f8de-8051-5b89-8 "2011-0401T13:01:02.42” "Server": "Time": "A2223E", "UUID": “21f7f8de-8051-5b89-8 "Time": "2011-0401T13:01:02.42” "Server": "A2223E", "Calling Server": "A2213W", "Time": "2011-0401T13:01:02.42” "Server": "A2223E", "Calling Server": "A2213W", "Type": "E100", "Server": "A2223E", "Calling Server": "A2213W", "E100", "Initiating"Type": User": "[email protected]", "Calling Server": "A2213W", "Type": "E100", "Details": "Initiating User": "[email protected]", "E100", User": "[email protected]", "Details": "Initiating"Type": { "Details": "Initiating User": "[email protected]", { "IP": "10.1.1.22", "Details": { "IP": "10.1.1.22", "API": "InsertDVDQueueItem", { "IP": "10.1.1.22", "API": "InsertDVDQueueItem", "Trace": "cleansed", "IP": "10.1.1.22", "API": "InsertDVDQueueItem", "Tags": "Trace": "cleansed", "API": "InsertDVDQueueItem", "Trace": "cleansed", "Tags": [ "Tags": "Trace": "cleansed", [ "SERVER", "Tags": [ "SERVER", "US-West", [ "SERVER", "US-West", "API" "SERVER", "US-West", "API" ] "US-West", "API" ] } "API" ] } } ] } } } } } Source: “Making the Shift from Relational to NoSQL,” Couchbase, 2014, p. 1, http://www.couchbase.com/sites/default/files/ uploads/all/whitepapers/Couchbase_Whitepaper_Transitioning_Relational_to_NoSQL.pdf, accessed January 27, 2015. 3 PwC Technology Forecast Using document stores in business model transformation A lack of strict enforcement in this sense means more flexibility overall. Document stores typically store data in either a version of JSON (JavaScript Object Notation) or XML (Extensible Markup Language) objects. Inside the objects are inherently hierarchical, tagged data consisting of values and their associated name tags, also known as name-value or attribute-value pairs. is, table linking) metadata that demands consistency across tables. This metadata in essence occurs between the tables and can include patient name metadata that needs to be queried. Treating the metadata the same as the data, which is what happens in document stores, improves ease of use. Document stores treat data and metadata equally within documents, and they aren’t fussy about whether each document has a particular tag or not. In CouchDB, for example, running a map function on lastName creates a view in documents that have the lastName attribute (or tag), which a user can then query. With the help of multiple queries, users can either query multiple collections that have this tag or embed all documents within a single document or collection. All the data elements can describe a constantly changing patient, rather than a single transactional visit. By contrast, adding new tables to relational databases can complicate the data model substantially, resulting in foreign key (that Finally, in document stores, the documents themselves can contain hierarchical collections of subdocuments. This capability suits continuous, small, volatile read and write operations, such as telemetry from a bedside monitor. When to use NoSQL document stores For most enterprises in most industries, document stores will augment, rather than replace, relational databases. For lots of writes, relational technology continues to be the norm. Relational databases are write intensive, and immediate consistency is mandatory—known as ACID (atomicity, consistency, isolation, and durability) compliance. By requiring immediate consistency, they allow thousands of users to concurrently trigger changes in the data. This sample of JSON side by side with its XML equivalent shows the name-value pairs Source: “JSON,” Wikipedia, February 6, 2015, http://en.wikipedia.org/wiki/JSON, accessed February 11, 2015. 4 PwC Technology Forecast Using document stores in business model transformation Mission-critical transactional systems will continue to be relational. Pros and cons of document stores and relational databases Many of the other distinctions between relational databases and document stores involve the heritage and maturity of these different types. Fundamentally, the distinctions boil down to a tradeoff. Relational databases have maturity on their side; a broad range of management and visualization tools are available, as well as comprehensive security and transactional integrity features. Document stores, by contrast, are horizontally scalable; they are generally designed to scale out across clusters and between data centers, and they have developer ease of use and flexibility on their side. Document store workloads are read intensive and generally designed to be highly available. NoSQL document stores relax ACID requirements somewhat, and many—such as MongoDB, CouchDB, and DocumentDB—claim tunable consistency. But consistency expectations coming from the relational data world can’t always be met in the document world, nor should they be. The focus is on the richness of the data for analytics purposes and thus read capabilities are more important than write capabilities. For analytics and auditability, the principle of immutability—preserving versions intact for data mining, audit trail, and historical A comparison of document stores and relational databases Document stores Relational databases Consistency (ACID compliance) Moderate, tunable consistency High, concurrency controlled Integrity of writes Within a single document Cross-system Query language Fragmented and evolving Standard SQL Developer API REST and method invocation a ODBC and derivatives b Horizontal scalability Across and between clusters at different data centers Local clusters Schema flexibility Fluid Rigid Management and visualization tools Evolving Mature Tree and graph processing Optimized Adequate Security paradigm Generally application layer oriented Role oriented and directory service friendly Encryption Application layer only Well supported Audit and compliance Undetermined and evolving Transaction logging (SOX, PCIDSS, and HIPAA friendly) c a. Representational State Transfer (REST) is a simplified, web-based method for communicating with IT systems. b. Open Database Connectivity (ODBC) is a long-established method for communicating with database management systems. Established in 1992, ODBC predates the commercial web. c. SOX = Sarbanes–Oxley Act of 2002 PCI-DSS = Payment Card Industry Data Security Standard HIPAA = Health Insurance Portability and Accountability Act of 1996 5 PwC Technology Forecast Using document stores in business model transformation consistency purposes—becomes a higher priority than immediate writability. As storage costs continue to decline, immutability becomes more feasible. In healthcare as in other industries, the absence of a transaction is as important as its presence, and inconsistencies or gaps can be crucial clinical indicators, rather than something that needs normalizing, smoothing, or interpolating. Document stores also allow developers to be much more efficient. The favored way to access document stores are RESTful application programming interfaces (APIs), which is one reason for their efficiency. Another element of efficiency is ease of use. Click-to-deploy capability is emerging for Amazon DynamoDB, MongoDB, and DocumentDB, for example. Conclusion Healthcare organizations will increasingly explore NoSQL document stores as a means of gaining insight into their growing data lakes. Providers are largely technical, science-driven To have a deeper conversation about remapping the database landscape, please contact: Gerard Verweij Principal and US Technology Consulting Leader +1 (617) 530 7015 [email protected] Chris Curran Chief Technologist +1 (214) 754 5055 [email protected] Oliver Halter Principal, Data and Analytics Practice +1 (312) 298 6886 [email protected] Bo Parker Managing Director Center for Technology and Innovation +1 (408) 817 5733 [email protected] professionals and will acquire the skills they need to work with NoSQL document stores as long as NoSQL offers competitive advantage and freedom from relational constraints. However, relational databases for transaction processing will continue to precede document stores in the data pipeline. For healthcare and many other industries, weak ACID compliance is a showstopper for any critical transaction processing purpose. Some document stores claim ACID compliance, but the approach to consistency differs from that of relational databases. Document stores have gained in popularity, and enterprises will need to become more familiar with their capabilities and limitations. Over the next several years, NoSQL document stores are expected to expand their role of ingesting relational data to feed auxiliary systems such as clinical reporting and advanced analytics, and enterprises will gain familiarity with their advantages and limitations for various purposes. About PwC’s Technology Forecast Published by PwC’s Center for Technology and Innovation (CTI), the Technology Forecast explores emerging technologies and trends to help business and technology executives develop strategies to capitalize on technology opportunities. Recent issues of the Technology Forecast have explored a number of emerging technologies and topics that have ultimately become many of today’s leading technology and business issues. To learn more about the Technology Forecast, visit www.pwc.com/ technologyforecast. About PwC PwC US helps organizations and individuals create the value they’re looking for. We’re a member of the PwC network of firms in 157 countries with more than 195,000 people. We’re committed to delivering quality in assurance, tax and advisory services. Find out more and tell us what matters to you by visiting us at www.pwc.com. © 2015 PricewaterhouseCoopers LLP, a Delaware limited liability partnership. All rights reserved. PwC refers to the US member firm, and may sometimes refer to the PwC network. Each member firm is a separate legal entity. Please see www.pwc.com/structure for further details. This content is for general information purposes only, and should not be used as a substitute for consultation with professional advisors. MW-15-1351 LL