Install a WordPress Theme Using the CLI for a Subsite

Hi! So, you need to install a wordpress theme using the CLI for a subsite? Maybe you’re using Cloud Formation to scale your deployment or something and need it done each time your cluster of EC2 scales horizontally? Regardless, there’s a flag for that! I kept this one short and to the point. This should …

Read more

What Happens When Your Code Does Not Follow The Python Rules?

python

So what happens if your code does not follow the Python rules? This article will be fairly short. Most of this will not depend on your version of Python. Tabbing/Spacing So what happens if you get your tab or spacing incorrect? Let’s give it a go. Often times the best way to figure something like …

Read more

Red Tape – Are Your Workflows Worth The Cost?

reddit-netflix-michael-kors-city-of-hope

Who Am I? So, what is red tape? But first a little bit of background about me. My names Carlos and I’m a data guy. I started off as an analyst with a smaller firm that sold non passenger car tires. Think wheelbarrow tires. Next came Michael Kors, a luxury fashion brand valued somewhere around …

Read more

Removing Duplicate Elements From An Array in BigQuery

We’ll be removing duplicate elements from an array in BigQuery in this post. Let’s set the stage. It’s a chilly night in the office and marketing reaches out asking for data. PANIK. But they’ve given you a general location of where that data is located. CALM. You do a quick select * of the table …

Read more

Horizontally Scaling A WordPress Website – Part 1 – The Overview

Wordpress

Let’s start off with saying that horizontally scaling a wordpress website is very easy to do but is kinda expensive. I’d say it’s about $100+ a month to leave this infra consistently standing in AWS. Most of the cost is in the DB and EC2 instances. So, beware of forgetting about infra you stand up …

Read more

Apache Airflow DAG is Failing Silently

python

So your Apache Airflow DAG is failing silently. Are you running an ETL on a huge dataset? This is a symptom of an Airflow instance without sufficient memory. Dig into your instances logs and you’ll probably see an evicted worker if your running your instance’s workers on Kubernetes. You’ll see similar logs wherever you run …

Read more

Listing the Largest N Files or Folders Recursively

bash

Listing the largest N files or folders recursively is handy. Let’s frame a usecase. So, you’re Ubuntu server is tanking. You’re running Jenkins or something else and your job logs have just started piling up. You’re UI doesn’t work anymore and the only thing you can do is SSH into your instance. What do you …

Read more

Functional Annotations in Python 3.x

python

Have you used Functional Annotations in Python 3.x? Maybe you’ve heard them mentioned? Regardless, let’s explore what they are and how they help us. Because if they don’t help us, then we probably shouldn’t care. The Problem You’re programming and don’t know what thisRandomFunction should return. Maybe it’s a bool maybe it’s a string who …

Read more

How To Get Started With Apache Airflow?

When Airbnb was scaling rapidly, they faced the problem of organizing complex data pipelines. To combat this and become a data-driven organization, Airbnb launched Apache Airflow in 2015, their custom-made open-source platform to manage complex workflows. In simple words, Apache Airflow is a platform where you can create, schedule, and monitor complex workflows using simple …

Read more