top of page

Course 20775A:

Performing Data Engineering on Microsoft HD Insight

About this course

The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Audience profile

The primary audience for this course is data engineers, data architects, data scientists, and data developers who plan to implement big data engineering workflows on HDInsight.

After Completing This Course

After completing this course, students will be able to:

  • Deploy HDInsight Clusters.

  • Authorizing Users to Access Resources.

  • Loading Data into HDInsight.

  • Troubleshooting HDInsight.

  • Implement Batch Solutions.

  • Design Batch ETL Solutions for Big Data with Spark

  • Analyze Data with Spark SQL.

  • Analyze Data with Hive and Phoenix.

  • Describe Stream Analytics.

  • Implement Spark Streaming Using the DStream API.

  • Develop Big Data Real-Time Processing Solutions with Apache Storm.

  • Build Solutions that use Kafka and HBase.

bottom of page