post

Engaging Hermit Mode to Do Deep Work

Walking down the road

Deep work is the ability to focus without distraction on a cognitively demanding task. It's a skill that allows you to quickly master complicated information  and produce better results in less time.

– Cal Newport, Deep Work

My role as Lead Architecture Engineer at IST Research finds me quite busy. Building fully distributed systems to collect and process offline and online data in real-time is a true challenge, and one that I am fully embracing.

In order to focus on that work as well as my family and health, I am taking a hiatus from all things social media, meetup, and side work (once I finish my co-management of the DDL Incubator). I may, infrequently, post some of the latest public work from both IST Research and myself, however other than that I'm going to be heads down making the awesome.

See you in the future my friend!

post

Incorporating Human In The Loop Processes into Data Pipelines

Human Robot

Even if you're working with 100% machine-created data, more than likely you're performing some amount of manual inspection on your data at different points in the data analysis process, and the output of your machine learning models.

Many companies including Google, GoDaddy, Yahoo! and LinkedIn use what's known as HITL, or Human-In-The-Loop, to improve the accuracy of everything from maps, matching business listings, ranking top search results and referring relevant job postings.

Why are we still at this point? Because the promise of fully-automated end-to-end flows is a false one. So if we have to have a human involved at some point, what’s the best way to go about it?

Join me for a complimentary webinar on Thursday April 14th at 7PM EST where I'll show you multiple ways to implement and leverage HITL processes as part of your data pipelines.

Reserve your seat today >>

post

How to Build a Data Pipeline in Data Science Studio

Join me Thursday, March 24th at 7PM EST for a complimentary webinar where you'll learn how to build a data pipeline for cleaning and standardizing data using Data Science Studio (DSS). We all deal with dirty, messy data. I'll show you how to use DSS to clean it up and get it ready for analysis using the super easy to use drag-and-drop interface DSS provides.

Sign up for the webinar today >>

post

The Next Four Months at Data Wranglers DC

Data Wranglers DC Logo

Following the Black Hat Data Wrangling talk that Travis Hoppe and I did to kick off the 2016 year of Data Wranglers DC, the next four months is going to be awesome. Here's the lineup: [Read more…]

post

Announcing the Data Science Studio User Group

Data Science Studio User Group

On November 13, 2013 I founded Data Wranglers DC (DWDC). The focus of Data Wranglers has and is data engineering, the 80-90% of time spent on data projects that most people don't like to talk about. It includes everything from gathering and cleaning data to engineering the IT systems to gather, store and process all of the data. [Read more…]

post

District Data Labs Incubator Now Accepting Applications

Incubator

During the dotcom boom of 2000 I found myself in a catch 22: I couldn't get a tech job without experience, but I could get experience without a job. Fast forward to 2016. Companies are scrambling to build data teams and can't find enough experienced people. But how can you, someone new to data science, get experience to get one of these jobs? The District Data Labs incubator can help with that. [Read more…]

post

Recruiters: How to Find Good Software and Data People

Search

If you're a software developer, data engineer or data scientist with a LinkedIn profile or website, you're probably hearing from recruiters on a daily basis. No longer are these folks dressing in suits and attending meetup – they're getting a bit more savvy.

If you're a recruiter I've got two tips for finding awesome software and data peeps, without pissing anyone off. Read on! [Read more…]

post

Control Your Outcomes

Outcome

Operating from a victim mentality means you believe that events in your life happen to you, and that you have no control over them. The key to knowing that that isn't the case is, paradoxically, humility. [Read more…]

post

Black Hat Data Wrangling

Black Hat Data Wrangler
Simply put, the goal of any good data wrangler is to make data more accessible.

Consider the antithesis, the idea of hiding and obfuscating your data but still publishing it on the web. Let's learn from the our anti-hero, the black hat data wrangler. [Read more…]

post

Humility

Buddhists believe that only a humble mind allows embarking on the path of enlightenment and liberation.

I used to think humility meant downplaying your own opinion and giving in to others. I've come to learn that isn't what humility is all about. [Read more…]