Join me at November 7th

Robert Dempsey, Speaker, DC

I'm proud and excited to announce that I will be speaking at the conference on November 7th in McLean, VA. I'll be joined by an excellent lineup of speakers including Vicky Fu from Microsoft, Mark McGovern from CA Technologies, and Elizabeth Haubert of OpenSource Connections. It's a full day of machine learning awesome!

My talk is titled Building a Production-Level Machine Learning Pipeline. Here's what it's about: [Read more…]


A Better Way of Learning Machine Learning

Do you find yourself frustrated in your attempts to learn about machine learning? Do you find, like me, that many of the online classes you attend require a masters or PhD to understand, and that the books you read work with data that looks nothing like what you experience in real life?

My philosophy and approach to software engineering is, “keep it simple, make it work, then make it work better”. I use the same approach for learning. And when it came to digging into machine learning, or ML for short, was it simple? Not so much. But it can be with the right framework in place.

Perhaps you want to learn ML because you see that's the direction the technology industry is heading in and you want one of the jobs that the news people tell us will replace hundreds of others. Perhaps you want to learn ML because you would like a new position at your current job or a new job entirely. Perhaps you don't need ML today, but want to be the technology leader that brings it to your organization and creates a metric ton of business value.

I believe I can help you, but you to tell me if you think that's true.

Today I'm happy to announce a new online course for software engineers who want to gain a foundational knowledge of ML, engineers who, like me, subscribe to the philosophy of Keep it Simple and Make it Work™.

“So what Rob?!”, you might say, “There are tons of courses online where I can learn big data things.” And yes there are, but they aren't like this. They won't give you the complete foundation or support you need, or that added ingredient to get you past the finish line.

Visit the Introduction to Data Science course page to find out what I mean >>


DDL Data Science Project Pitchfest 3

DDL Incubator

The Spring cohort of the District Data Labs Data Science Project Incubator is coming to a conclusion with Pitchfest this Friday evening. I'm inviting you to join us and check out the projects that the teams will be presenting! [Read more…]


Application Skeleton for Flask and AngularJS

Flask and AngularJS

A constant challenge we face at IST Research is ensuring we build all of our applications in a way that makes them easy to scale. During my practice of deep work this week and thinking about that challenge, I decided that every application I build needs the following three things:

  1. Logging
  2. Statistics
  3. API (Application Programming Interface)

All three of these are very important when building and scaling fully distributed applications. [Read more…]


Incorporating Human In The Loop Processes into Data Pipelines

Human Robot

Even if you're working with 100% machine-created data, more than likely you're performing some amount of manual inspection on your data at different points in the data analysis process, and the output of your machine learning models.

Many companies including Google, GoDaddy, Yahoo! and LinkedIn use what's known as HITL, or Human-In-The-Loop, to improve the accuracy of everything from maps, matching business listings, ranking top search results and referring relevant job postings.

Why are we still at this point? Because the promise of fully-automated end-to-end flows is a false one. So if we have to have a human involved at some point, what’s the best way to go about it?

Join me for a complimentary webinar on Thursday April 14th at 7PM EST where I'll show you multiple ways to implement and leverage HITL processes as part of your data pipelines.

Reserve your seat today >>


How to Build a Data Pipeline in Data Science Studio

Join me Thursday, March 24th at 7PM EST for a complimentary webinar where you'll learn how to build a data pipeline for cleaning and standardizing data using Data Science Studio (DSS). We all deal with dirty, messy data. I'll show you how to use DSS to clean it up and get it ready for analysis using the super easy to use drag-and-drop interface DSS provides.

Sign up for the webinar today >>


The Next Four Months at Data Wranglers DC

Data Wranglers DC Logo

Following the Black Hat Data Wrangling talk that Travis Hoppe and I did to kick off the 2016 year of Data Wranglers DC, the next four months is going to be awesome. Here's the lineup: [Read more…]


Announcing the Data Science Studio User Group

Data Science Studio User Group

On November 13, 2013 I founded Data Wranglers DC (DWDC). The focus of Data Wranglers has and is data engineering, the 80-90% of time spent on data projects that most people don't like to talk about. It includes everything from gathering and cleaning data to engineering the IT systems to gather, store and process all of the data. [Read more…]


District Data Labs Incubator Now Accepting Applications


During the dotcom boom of 2000 I found myself in a catch 22: I couldn't get a tech job without experience, but I could get experience without a job. Fast forward to 2016. Companies are scrambling to build data teams and can't find enough experienced people. But how can you, someone new to data science, get experience to get one of these jobs? The District Data Labs incubator can help with that. [Read more…]


Recruiters: How to Find Good Software and Data People


If you're a software developer, data engineer or data scientist with a LinkedIn profile or website, you're probably hearing from recruiters on a daily basis. No longer are these folks dressing in suits and attending meetup – they're getting a bit more savvy.

If you're a recruiter I've got two tips for finding awesome software and data peeps, without pissing anyone off. Read on! [Read more…]