Building a Data Science Team

Data Science teams can provide immense value to an organization if built or it can provide no value at all. Sometime the difference in success comes down to the simple fact that you didn’t actually need a Data Science team to begin with. Other times it comes down to how you hire, manage, grow and nurture the team. In this post we’ll cover all these topics and more.

Read More
/

Wikipedia Data in Spark and Scala

More than you possibly ever wanted to know about parsing various Wikipedia data sources in Spark and Scala.

Read More

Setting Up Spark Notebooks on EC2 (No VPN)

Getting up and running with Spark Notebooks (Part 1)

Read More

Spark EC2 Setup and Workflow

So how do we run and deploy code using Scala and Spark in EC2?

Read More

Setting up a personal VPN to access AWS instances

The goal of this post is to lay the groundwork for running a Spark driver against a Spark cluster in a AWS VPC. Specifically by setting up a VPN to access VPC instances.

Read More