Students will learn how to collect, manage, analyze, and visualize data to deliver clear business insights from raw data sources. This course will cover the Hadoop ecosystem as it is a primary platform for any other tools like Spark or Kafka. This course also covers an example of NoSQL, such as Cassandra which is suited for distributed computing. Emerging tools and technologies may be presented as applicable to course content.