Directions in Enterprise Data Storage Systems

Guest Editor's Introduction • Sundara Nagarajan • March 2010

Theme Articles


This month's theme features the following articles:

A Taxonomy and Survey on Distributed File Systems
Read about a comprehensive taxonomy describing distributed file system architectures and a survey of existing distributed file system implementations in very large-scale network computing systems.

Reducing the Storage Burden via Data Deduplication
Many organizations are turning to data deduplication to reduce the huge volumes of data they collect and store, as well as equipment and operational costs.

Using Intradisk Parallelism to Build Energy-Efficient Storage Systems
Intradisk parallelism can significantly reduce server storage systems' power consumption, making it possible to match (and even surpass) a storage array's performance using a single, high-capacity disk drive.

Reliable Distributed Storage
A distributed storage service lets clients abstract a single reliable shared storage device using a collection of possibly unreliable computing units.

Will NoSQL Databases Live Up to Their Promise?
Organizations that collect large amounts of unstructured data are increasingly turning to nonrelational databases, now frequently called NoSQL databases.

Designing Dependable Storage Solutions for Shared Application Environments
A principled automated approach for designing dependable storage solutions for multiple applications in shared environments reduces cost in initial outlays and expected data penalties.

 


 

Businesses' data storage needs are always increasing, and nothing indicates that we will be storing less data in the future. On the contrary, everything points to an era where data management will gain center stage in computing. Data loss is a disaster and can have significant costs to consumers and businesses; if data is unavailable, even for minutes, it can cause productivity losses or cost millions in revenue. CIOs admit that their biggest expenditure is in managing storage systems, and managing storage is the most difficult part of IT management. This month, Computing Now brings together a few articles that deal with the contemporary issues concerning storage systems and their management.

Context

The need for storage arises from the need for persistence of data or memory. File systems evolved in operating systems to be the bridge between volatile and persistent memory devices. Different file systems emerged to address the varying characteristics and capabilities of these devices in the context of application demands for performance.

Ever-increasing demands for access to data have led to different methods of organizing and managing it. The evolution from data or transaction processing to business intelligence is leading to tradeoffs in the way we organize and search raw data. The explosive growth of unstructured data brings new challenges in organization and retrieval of meaningful information.

The implicit demand to eliminate data loss and data unavailability drives the development of highly reliable, highly available storage systems. Distributing storage elements to increase capacity and availability across a wider geographic area is becoming common practice. Additionally, the desire for the lowest cost per Gbyte has resulted in advances in storage devices, systems, and equipment. It has also led to the development of a hierarchy in storage systems in operation. The large-scale growth of data has led system engineers to create storage systems assembled from numerous industry-standard, commodity storage devices. However, these elements must appear as a single, reliable shared storage service to the applications.

Overall, storage systems offer exciting times for engineers and researchers.

Selected Articles

The articles in this month's theme illustrate a variety of challenges storage researchers are addressing. In "A Taxonomy and Survey on Distributed File Systems," Tran Doan Thanh and colleagues introduce a framework to compare architectural approaches to building distributed file systems. David Geer presents an overview of deduplication technology in "Reducing the Storage Burden via Data Deduplication." This technology dramatically reduces the need for storage capacity and hence capital and operating costs. "Using Intradisk Parallelism to Build Energy-Efficient Storage Systems" by Sudhanva Gurumurthi, Mircea R. Stan, and Sriram Sankar covers an important aspect of storage systems: energy efficiency. "Reliable Distributed Storage" by Gregory Chockler and colleagues discusses algorithms that implement the reliable shared storage abstraction and the tradeoffs involved in developing modern storage systems. In "Will NoSQL Databases Live Up to Their Promise?" Neal Leavitt reviews the trend among organizations that collect and search large amounts of unstructured data to say "no" to SQL, or relational, databases. Shravan Gaonkar and colleagues present an automated approach to designing dependable storage systems in "Designing Dependable Storage Solutions for Shared Application Environments."

David H.C. Du's article, "Recent Advancements and Future Challenges of Storage Systems," is a treasure and will give newcomers to the storage domain a quick overview of the concepts and issues, with an excellent collection of references. (IEEE Xplore login is required for this article.)

We are interested to hear your perspectives on the trends, hard problems, and advances in storage in the coming decade. For instance, what do you perceive to be the hardest and most impactful problem to solve in this decade concerning storage systems? What's the strongest trend you're observing about how storage integrates with the rest of the data center elements, such as servers, networking gear, and application software? In your opinion, how will cloud storage services affect the way storage is used in enterprises? Please let us know in the comments below.

 

Sundara Nagarajan ("SN") is director of R&D in Hewlett Packard's Unified Storage Division in Bangalore, India, and a visiting professor at the International Institute of Information Technology, Bangalore. He's also Computing Now's regional liaison to IEEE Computer Society activities in India. Contact him at s.nagarajan@computer.org.

 

