Data Engineering Blog

Welcome to the Data Engineering blog!

Below are my most recent posts. You can navigate to the categories page to see a list of all posts sorted by category.


This post goes over basic Linux commands that every developer should be familiar with.


This post showcases some useful Hive commands that I use on a daily basis.


This post serves as a basic introduction to those getting started with Apache Hive.


This post walks through the process of setting up a Twitter streaming program using a Python Library called Tweepy. I am streaming tweets about Cincinnati and collecting data like when and where they tweeted, the number of followers the user has, the text of the tweet, and then storing that data into a Sqlite database.