May 26, 2021 Hadoop
Resource management and task monitoring for individual jobs
Description of the specific functions:
The ApplicationMaster can be a program written in any language; it communicates with the ResourceManager and the NodeManagers via Protocol Buffers. Per-job resource management and task monitoring used to be the responsibility of a single global JobTracker. Now each job has its own ApplicationMaster, which scales better: a large number of jobs no longer turns the JobTracker into a bottleneck. Because the logic of each job lives in its own ApplicationMaster, the design is also more flexible, and each job can be handled in its own way without being tied to MapReduce's processing model.
How to calculate resource requirements
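A typical MapReduce job determines how many Map and Reduce tasks to run from the number of blocks in its input, and each Map or Reduce task typically occupies one Container. Strictly speaking, the block count drives the number of Map tasks; the number of Reduce tasks is normally set in the job configuration.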
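The arithmetic can be illustrated with a minimal sketch. It assumes one Map task per HDFS block, a 128 MB block size, a Reduce count supplied by the caller, and one Container per task; the function name `estimate_containers` and its parameters are hypothetical and not part of any Hadoop API.

```python
def estimate_containers(file_sizes, block_size=128 * 1024 * 1024, num_reduces=1):
    """Estimate task and Container counts for a MapReduce-style job.

    file_sizes  -- input file sizes in bytes
    block_size  -- assumed HDFS block size in bytes (128 MB here)
    num_reduces -- assumed Reduce task count, supplied by the caller
    """
    # One Map task per block: integer ceiling of size / block_size per file.
    # An empty file contributes no blocks and therefore no Map task.
    num_maps = sum((size + block_size - 1) // block_size for size in file_sizes)

    # One Container per task. The Container that hosts the
    # ApplicationMaster itself is not counted here.
    return {
        "maps": num_maps,
        "reduces": num_reduces,
        "containers": num_maps + num_reduces,
    }


MB = 1024 * 1024
# A 300 MB file spans 3 blocks and a 128 MB file spans 1 block.
print(estimate_containers([300 * MB, 128 * MB]))
# prints {'maps': 4, 'reduces': 1, 'containers': 5}
```

Real jobs work with input splits rather than raw blocks, so the actual Map count can differ from this estimate.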
How to discover data locality
Data locality is determined from HDFS block location information: the hosts that store the replicas of each block indicate where a task can read its input locally.
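As a rough sketch of how block locations could drive placement, the function below prefers a node that holds a replica of the block, then a node on the same rack, then any free node. The fallback order, the function name `pick_node`, and the hard-coded host and rack names are assumptions for illustration. In a real cluster the block hosts would come from HDFS (for example via `FileSystem.getFileBlockLocations` in the Java API), and the scheduler makes the actual placement decision.

```python
def pick_node(block_hosts, available_nodes, rack_of):
    """Pick a node for a task that reads one block.

    block_hosts     -- hosts holding a replica of the block
    available_nodes -- nodes that currently have a free Container
    rack_of         -- mapping from host name to rack name
    Returns (node, locality_level), or (None, None) if nothing is free.
    """
    # 1. Node-local: a free node that stores a replica of the block.
    for node in available_nodes:
        if node in block_hosts:
            return node, "node-local"

    # 2. Rack-local: a free node on the same rack as some replica.
    block_racks = {rack_of[h] for h in block_hosts if h in rack_of}
    for node in available_nodes:
        if rack_of.get(node) in block_racks:
            return node, "rack-local"

    # 3. Off-switch: any free node; the data crosses racks.
    if available_nodes:
        return available_nodes[0], "off-switch"
    return None, None


# Hypothetical cluster layout used only for this example.
rack_of = {"dn1": "/rack1", "dn2": "/rack1", "dn3": "/rack2", "dn5": "/rack3"}
block_hosts = ["dn1", "dn3"]

print(pick_node(block_hosts, ["dn5", "dn3"], rack_of))  # prints ('dn3', 'node-local')
print(pick_node(block_hosts, ["dn5", "dn2"], rack_of))  # prints ('dn2', 'rack-local')
print(pick_node(block_hosts, ["dn5"], rack_of))         # prints ('dn5', 'off-switch')
```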