Let s take a look at these latter four.
Hadoop san storage.
Enhanced das hadoop with a proprietary data storage software layer in place of hdfs.
Instead of storing data on direct attached local disks data is in one or more arrays attached to data nodes through a san.
Virtualization hadoop running on virtual machines.
A rapidly growing portion of san deployments leverages all flash storage to gain its high performance consistent low latency and lower total cost when compared to spinning disk.
Looks like local storage to data nodes hadoop still utilizes hdfs.
These two industry factors have led to a partial convergence of nas and san approaches to network storage effectively creating high speed high capacity and centrally located network devices.
Hadoop is first and foremost a general purpose data storage and processing platform designed to scale out to thousands of compute nodes and petabytes of data.
Cloud cloud clustered storage like red hat storage.
Also emerging is the unthinkable replacing das with nas or san as hadoop s primary storage layer.
Look for appliances that shove bits of hadoop down to the hardware layer in a hybrid san server setup to come out in the coming year or two.
All the normal advantages of arrays.
San nas appliance an enterprise storage system or appliance.
Hadoop changes the game for enterprise storage.
When considering hadoop s capabilities for working with structured data or working with data of any type for that matter remember hadoop s core characteristics.
Storage area networks sans are the most common storage networking architecture used by enterprises for business critical applications that need to deliver high throughput and low latency.
When san and nas are joined together into one device in this way it s sometimes referred to as unified san.
We re now seeing the use of san and nas as secondary storage for hadoop clusters storage that essentially functions as a data protection and or archival storage layer in conjunction with hadoop s das based primary storage layer.
We look at how hadoop crunches big data its key storage requirements and survey the vendors that offer hadoop storage products.
The longer answer is that depending on the size of your hadoop implementation and the number of nodes it may not make sense from a cost perspective to use san and network attached storage nas systems as the primary storage layer.
San for hadoop storage.
For now however stick to your guns and keep your hdfs.