Need speed for big data? Think in-memory data management
January 27, 2013 Editor 0
In a forthcoming report we will highlight technologies and solutions that take advantage of the decline in prices of RAM, the popularity of distributed and cloud computing systems, and the need for faster queries on large, distributed data stores. Established technology companies have had interesting offerings, but what initially caught our attention were open source projects that started gaining traction last year.
An example we frequently hear about is the demand for tools that support interactive query performance. Faster query response times translate to more engaged and productive analysts, and real-time reports. Over the past two years several in-memory solutions emerged to deliver 5X-100X faster response times. A recent paper from Microsoft Research noted that even in this era of big data and Hadoop, many MapReduce jobs fit in the memory of a single server. To scale to extremely large datasets several new systems use a combination of distributed computing (in-memory grids), compression, and (columnar) storage technologies.
Another interesting aspect of in-memory technologies is that they seem to be everywhere these days. We’re looking at tools aimed at analysts (Tableau, Qlikview, Tibco Spotfire, Platfora), databases that target specific workloads or data types (VoltDB, SAP HANA, Hekaton, Redis, Druid, Kognitio, and Yarcdata), frameworks for analytics (Spark/Shark, GraphLab, GridGain, Asterix/Hyracks), and the data center (RAMCloud, memory Iocality).
We’ll be talking to companies and hackers to get a sense of how in-memory solutions fit into their planning. Along these lines, we would love to hear what you think about the rise of these technologies, as well as applications, companies and projects we should look at. Feel free to reach out to us on Twitter (Ben is @bigdata and Roger is @rogerm) or leave a comment on this post.
- Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework
- Building applications in Azure
- Three kinds of big data
- Cloud Computing for Development: A Four-Stage Analysis of Public, Private, and Hybrid Solutions
- Governing economic growth in the cloud
- The problem of managing schemas
Subscribe to our stories
- A regional enterprise to commercialize an integrated technology for waste water treatment and biowaste conversion in eastern Africa May 27, 2018
- Dr Peggy Oti-Boateng May 27, 2018
- Three years in a row: Mauritania continues to excel in its Doing Business performance May 27, 2018
- Africa RISING farming technologies reach 60,000 households in Ethiopia May 27, 2018
- 7 questions to ask before you launch an enterprise blockchain project May 27, 2018