About the course
Perform real-time data analytics with Hadoop; Video Tutorial
This course is your guide to performing real-time data analytics and stream processing with Spark. Use different components and tools such as HDFS, HBase, and Hive to process raw data. Learn how tools such as Hive and Pig aid in this process.
In this course, you will start off by learning data analysis techniques with Hadoop using tools such as Hive. Furthermore, you will learn to apply these techniques in real-world big data applications. Also, you will delve into Spark and its related tools to perform real-time data analytics, streaming, and batch processing on your application.
Finally, you'll learn how to extend your analytics solutions to the cloud.
Style and Approach
This course has a completely practical approach, with real-world examples which will help you to manage and analyze large-volume data with Hadoop.
What You Will Learn
- Store data with HDFS and learn in detail about HBase
- Share and access data in a SQL-like interface for HDFS
- Analyze real-time events using Spark Streaming
- Perform complex big data analytics using MapReduce
- Analyze data to perform complex processing with Hive and Pig
- Explore functional programming using Spark
- Learn to import data using Sqoop
Tomasz Lelek is a Software Engineer who programs mostly in Java and Scala. He is a fan of microservice architectures and functional programming. He dedicates considerable time and effort to being better every day. Recently, he's been delving into big data technologies such as Apache Spark and Hadoop. He is passionate about nearly everything associated with software development.
Tomasz thinks that we should always try to consider different solutions and approaches to solving a problem. Recently, he was a speaker at several conferences in Poland - Confitura and JDD (Java Developer's Day) and also at Krakow Scala User Group. You can find the JDD video here: https://www.youtube.com/watch?v=BnORjQbnZNQ&t - ML Spark talk.
He also conducted a live coding session at Geecon Conference. He is currently working on this website using ML: http://www.allegro.pl.. He conducted workshops about Apache Kafka at the Geecon conference: https://2018.geecon.org/workshops/#Kafka.
Google Sheet Data with AJAX (featuring API and JSON) [Video]