Hadoop ecosystem

By | October 29, 2014


YARN was introduced in Hadoop 2.0. YARN is a resource manager: it takes over the cluster resource management role that the JobTracker had in Hadoop 1. Different computation models can be implemented on top of YARN, such as MapReduce, Tez, Storm, and Spark. Databases such as HBase and Hive are also supported on YARN.
MapReduce: a batch computation model that uses Map and Reduce steps over <Key, Value> pairs.
Storm: a real-time computation model. Input arrives continuously, and the result is updated whenever new input arrives.
Spark: an in-memory computation model.
Tez: a DAG (directed acyclic graph) computation model.
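
To make the MapReduce model above more concrete, here is a minimal sketch of a word count written in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and the shuffle step only imitates what the framework would do between the map and reduce steps.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all values by key (done by the framework in real Hadoop)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # prints {'yarn': 2, 'runs': 2, 'spark': 1, 'tez': 1}
    print(word_count(["yarn runs spark", "yarn runs tez"]))
```

In a real cluster the map and reduce steps run in parallel on many machines, and YARN decides where each task gets its resources. This sketch runs everything in one process just to show how the <Key, Value> pairs flow.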