Tuesday, August 2, 2016

What is YARN?

Find out what #YARN is. Yet Another Resource Negotiator, or YARN for short, is an #Apache #Hadoop technology. It is designed for cluster management and is one of the key features of the second generation of Hadoop, the Apache Software Foundation’s open source distributed processing framework. YARN is described as a large-scale, distributed operating system for big data applications. It is a software rewrite that decouples #MapReduce’s resource management and scheduling capabilities from the data processing component. This enables Hadoop to support more varied processing approaches and a broader array of applications: Hadoop clusters can now run interactive querying and streaming data applications alongside MapReduce batch jobs. The combination of a central resource manager with node manager agents that monitor the processing operations of individual cluster nodes has helped to increase the appeal of YARN and Hadoop. By separating resource management from MapReduce, YARN has made Hadoop more suitable for operational applications that can’t wait for batch jobs to finish.
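To make the split between central resource management and per-node agents more concrete, here is a minimal toy sketch in Python. It is not the YARN API: the `NodeManager` and `ResourceManager` classes, the memory figures, the application names, and the "most free memory first" placement rule are all assumptions made up for illustration. It only shows the general idea that one central component decides where work runs while each node tracks its own usage.

```python
from dataclasses import dataclass, field


@dataclass
class NodeManager:
    """Toy stand-in for a per-node agent that tracks its own memory (hypothetical)."""
    name: str
    total_mb: int
    used_mb: int = 0
    containers: list = field(default_factory=list)

    def free_mb(self) -> int:
        return self.total_mb - self.used_mb

    def launch(self, app: str, mb: int) -> None:
        # Record the container locally and reserve its memory.
        self.used_mb += mb
        self.containers.append((app, mb))


class ResourceManager:
    """Toy central scheduler (hypothetical): picks the node with the most free memory."""

    def __init__(self, nodes):
        self.nodes = list(nodes)

    def allocate(self, app: str, mb: int):
        # Only nodes with enough free memory are candidates.
        candidates = [n for n in self.nodes if n.free_mb() >= mb]
        if not candidates:
            return None  # no node can host this request
        node = max(candidates, key=lambda n: n.free_mb())
        node.launch(app, mb)
        return node.name


def demo():
    # Assumed cluster: two nodes with made-up memory sizes.
    rm = ResourceManager([NodeManager("node-1", 8192), NodeManager("node-2", 4096)])
    return [
        rm.allocate("mapreduce-batch", 4096),
        rm.allocate("interactive-query", 2048),
        rm.allocate("stream-processor", 4096),
        rm.allocate("too-big", 16384),
    ]


if __name__ == "__main__":
    print(demo())
```

In this sketch the batch job, the interactive query, and the streaming job all go through the same scheduler, which mirrors the point that the cluster is no longer tied to MapReduce alone. The real YARN scheduler is far more elaborate than this placement rule.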

http://www.cbronline.com/news/big-data/software/what-is-yarn-4963516
