Hadoop infrastructure – is a popular solution to such tasks as distributed data storage and calculation of Map/Reduce on cluster. High scalability and developed ecosystem make Hadoop one the most popular framework in Information systems. But the more responsibility you put on it, the more important it becomes to provide its fail-safety and high availability. This report will be useful to those, who have already been using Hadoop. For the rest it will be interesting to learn some architectural solutions used in Hadoop.
In my report I will tell you how to provide high availability of Hadoop cluster components. Besides, I will talk on: