This course gives a high-level, non-technical introduction to the Big Data ecosystem and its common use cases. We will discuss technologies such as Apache Hadoop, Spark, Kafka, NoSQL databases, Samza, Storm, and NiFi.
Video 1: Introducing Big Data Challenges
In this video we will discuss the problem of Big Data, introduce Gartner’s three “V”s of Big Data, and outline some of the ecosystem challenges.
Video 2: A Big Data Use Case: Capture and Real-Time Processing
We will introduce a typical Big Data IoT (Internet of Things) use case and discuss the ecosystem. In this video we will focus on capturing data and introduce Apache Kafka and its alternatives. We will follow that with real-time processing of data using technologies such as Storm, Spark Streaming, Samza, Flink, and NiFi.
Video 3: A Big Data Use Case: Storage
We will continue with our Big Data use case and discuss how to store the data.
There are two distinct storage requirements: batch analytical storage and low-latency real-time storage. We will explore solutions for both cases.
Video 4: Putting it All Together
We will wrap up our Big Data use case with a discussion of analytics and visualization, and unveil our solution to the IoT use case that we have been looking at.