View Sidebar
Enterprise Ready Hadoop Infrastructure from EMC – Isilon

Enterprise Ready Hadoop Infrastructure from EMC – Isilon

11/21/2013 10:45 am0 comments

With increased reliance on technology and large scale usage of applications and IT systems, the amount of structured & unstructured data stored and processed by a typical modern-day enterprise has been growing very rapidly. Organizations today, lest they’re okay with the idea of being left-behind in the race, require highly efficient, effective and scalable storage solutions to manage this growth.

Modern day organizations require high-end storage systems also because the latter helps provide powerful analytics; they can draw information of concern from data. EMC Isilon scale-out Network-attached storage (NAS) with native Hadoop Distributed File System (HDFS) provides Hadoop users access to shared storage infrastructure that helps minimize the void between Big Data Hadoop and IT analytics.

The lsilon NAS integrated with HDFS offers customers a solution to accelerate enterprise ready development of Apache Hadoop. Until now, customers of Hadoop have benefited from storage infrastructure solutions that weren’t really optimized for big data storage, thus limiting the scope of Hadoop’s applicability in large enterprises. But, EMC Isilon with native HDFS tackles this challenge well and offers an all-inclusive enterprise ready storage system to collect, protect, analyze and share data in Hadoop environment.

Enterprise Ready Hadoop Infrastructure from EMC - Isilon

By integrating Hadoop natively in an enterprise-class storage solution, Isilon has enabled customers to benefit from a comprehensive data protection system (irrespective of the size of the Hadoop data). By combining EMC Isilon scale-out NAS with native HDFS, EMC will be able to reduce the complications related to Hadoop usage to allow enterprises to extract valuable data from the gigantic heaps of unstructured & structured data.

EMC Isilon provides Hadoop customers a built-in entrance to enterprise data protection; this is made possible with the integration of Isilon scale-out NAS storage system and native HDFS. This integration of Isilon and HDFS eliminates any one point failure with open source Apache Hadoop that enterprises are using; further, the combination allows customers to use a Hadoop system of choice to accelerate their Hadoop adaptation in enterprise ready environment.

Industry’s first scale-out storage system with native HDFS offers the following advantages: 

  • Enterprises can utilize more benefits of Hadoop
  • Reduces risks
  • Increases organization knowledge

The reason why enterprises need to consider ‘HDFS plus Isilon’ is that there’s no ingest necessary anymore. It’s comparatively cheaper and still, the performance is better. With multiple enterprise-features, multi-protocol access and Hadoop multi-tenancy, ‘HDFS on Isilon’ supports nearly everything you’d possible want to work with such as Pivotal, Apache, Cloudera and Hortonworks. NameNode SPOF and 3x Mirroring, two key challenges with DAS Hadoop are eliminated too!

Advantages of EMC Isilon storage implementation over traditional implementation

  • It offers scale-out storage to facilitate multiple workflow and applications
  • No downtime associated, it is distributed in NameNode
  • Provides matchless storage efficiency
  • Offers independent scalability to compute and store separately
  • Provides end-to-end data protection using SnapshotIQ, SynclQ and NDMP Backup

Benefits an enterprise derives from data storage & analytics solution – Hadoop

Hadoop as an enterprise ready big data analytics solution can help store, analyze, structure and visualize big amounts of structured & unstructured data. Hadoop is especially beneficial because it enables users to process unstructured big data, to give it structure so that it can be used for the advantage of the enterprise.

a)   Benefits an enterprise derives

  • Enhanced business agility
  • Easier data management
  • Faster and more convenient data analytics
  • Reduction in time and cost of infrastructure and maintenance
  • Ability to accommodate and analyze irrespective of type or size

b)   Hadoop enterprise ready EMC Isilon advantages:

  • Dependable security
  • Scalable storage solution
  • Continuous availability
  • Existing infrastructure and simple integration
  • Easy deployment and faster administration

EMC Hadoop Starter Kit (HSK)

For extracting insights on customer sentiments and other such information from big data, you will need the Hadoop integration if you are an enterprise that uses VMware Vsphere and/or EMC Isilon . Hadoop with Isilon integration becomes enterprise-ready and helps your data architecture deal with new opportunities provided by data most diligently along with the existing tasks.

Now, to make things even simpler for an organization that uses VMware Vsphere and EMC Isilon, an EMC Hadoop Starter Kit has been developed (video). This HSK step-by-step guide is designed to help enterprises learn and discover the all encompassing potentials of Hadoop.

VMware has also started an open source project (called Serengeti) that can help automate the management and deployment of Hadoop clusters on vSphere. With a virtualized infrastructure, Hadoop can be run as a service.

Whether you are a seasoned Hadoop user or a newbie, all can equally benefit with the HSK because of following reasons:

Rapid provisioning: Most of the Hadoop cluster development can be automated with expertise. Thus, the guide takes you through the process of creation of Hadoop nodes and to set up and start Hadoop service on a cluster, which makes it ever so simple for you to execute.

High availability: High availability protection with use of virtualization platform ensures that single point of failure in Hadoop storage solution can be protected.

Profitability: Enterprises can use and benefit from any Hadoop distribution within the big data application lifecycle; this, with zero data migration.

Elasticity: The same physical infrastructure can be shared amid Hadoop and other application, since, the Hadoop capacity can be scaled to and fro according to demand.

Multi tenancy: Hadoop infrastructure offers multi tenancy option, which means different tenants can have virtual machines provided to them, thus enhancing data security.

EMC Hadoop Starter Kit combines the benefits of VMware vSphere with Isilon scale-out NAS in order to help achieve big data storage goals and added analytics solution.

Some of the reasons why the HSK can be considered as the outright solution have been mentioned above. The merits, especially ‘profitability,’ explains that users can use Hadoop distribution all through the big data application lifecycle with zero data migration that includes, Hortonworks, Pivotal HD, Cloudera and Apache Open Source etc.

This means that starting Hadoop project with EMC Isilon scale-out NAS, enterprises can profit with zero data migration when they have to move from one Hadoop distribution to another. This implies that user can run multiple Hadoop distributions for same data without data duplication.

EMC Isilon’s Notable Collaborations

In addition, Isilon also shares a good collaborative effort with companies like Splunk, Rackspace and Rainstor. EMC Isilon scale-out NAS is no doubt the finest storage system offering users an opportunity to scale capacity and performance of data to meet their needs. To benefit Hadoop users, Isilon has teamed up with Splunk, Rackspace and Rainstor for additional benefits.

Isilon and Splunk: Splunk for Isilon app integrates EMC scale-out NAS with Splunk. The team up of EMC and Splunk helps enterprises manage avalanche of data across virtual, cloud and physical environments to transform this data into real time insight for the user.

Isilon and Rackspace: EMC Isilon helps enterprises to store, consolidate, analyze and use data and applications exceeding 100 TB. Rackspace offers its services to EMC Islion NL400 and X400 high density and large capacity models to perform their tasks diligently for greater benefit of enterprises.

Isilon and RainStor: The combination of EMC and RainStor helps enterprises run the Hadoop distribution anywhere. The RainStor’s unique data compression technique helps enterprises to analyze their large data sets with more efficiency and greater predictability.

Virender Thakur is Subject Matter Expert (SME) and Big data Sales Alliance Manager at EMC corporation. He is currently managing relationship with EMC’s top System Integrators (SIs) and Software providers (Software Providers).

Leave a reply