Overview
Alibaba Cloud Elastic MapReduce (E-MapReduce) is a system solution for big data processing that runs on the Alibaba Cloud platform. E-MapReduce is built on Alibaba Cloud Elastic Compute Service (ECS) and is based on open-source Apache Hadoop and Apache Spark. It facilitates the use of other peripheral systems (for example, Apache Hive) in the Hadoop and Spark ecosystems to analyze and process data.
You can also easily import data to and export data from other cloud data storage systems and database systems, such as Alibaba Cloud OSS and Alibaba Cloud RDS.
In this ACT82003: Alibaba Cloud E-MapReduce course, you will learn how to use Alibaba Cloud’s E-MapReduce (EMR), a managed Hadoop cluster service. Learn how to work with EMR to create, configure, manage, and scale Hadoop clusters on Alibaba Cloud, allowing you to store and process huge datasets both offline and in real time.
Skills Covered
- Understand the advantages (cost, performance, resilience) of EMR over self-built Hadoop
- See best practices and use-cases for EMR, and learn how data can be migrated to the cloud
- Get hands-on experience creating EMR clusters and running jobs in Hive
Who Should Attend
Data Developers and Engineers Based on Hadoop, that are Looking for a similar solution in Alibaba Cloud.
Course Curriculum
Prerequisites
There are no perquisites to attend this course. If you are new to Alibaba Cloud Big Data, please consider to attend the following course:
ACT82001: Alibaba Cloud Big Data Architecture
This Alibaba Cloud Big Data Architecture training course covers the big data solutions provided by Alibaba Cloud in order to deal with the challenges of enterprises in data management and digital transformation. The course will introduce the best practice of data integration, data development, data quality, data security, and data management, governance and data service in Alibaba Cloud.