post

DDL Data Science Project Pitchfest 3

DDL Incubator

The Spring cohort of the District Data Labs Data Science Project Incubator is coming to a conclusion with Pitchfest this Friday evening. I'm inviting you to join us and check out the projects that the teams will be presenting! [Read more…]

post

The Next Four Months at Data Wranglers DC

Data Wranglers DC Logo

Following the Black Hat Data Wrangling talk that Travis Hoppe and I did to kick off the 2016 year of Data Wranglers DC, the next four months is going to be awesome. Here's the lineup: [Read more…]

post

Announcing the Data Science Studio User Group

Data Science Studio User Group

On November 13, 2013 I founded Data Wranglers DC (DWDC). The focus of Data Wranglers has and is data engineering, the 80-90% of time spent on data projects that most people don't like to talk about. It includes everything from gathering and cleaning data to engineering the IT systems to gather, store and process all of the data. [Read more…]

post

District Data Labs Incubator Now Accepting Applications

Incubator

During the dotcom boom of 2000 I found myself in a catch 22: I couldn't get a tech job without experience, but I could get experience without a job. Fast forward to 2016. Companies are scrambling to build data teams and can't find enough experienced people. But how can you, someone new to data science, get experience to get one of these jobs? The District Data Labs incubator can help with that. [Read more…]

post

Data Science in Five Steps

Special Agent OsoWhen my daughter Palamee was younger she watched a cartoon with a character named Special Agent Oso. Oso would complete his missions using three simple steps. In a recent conversation I was asked to provide my definition of data science. Today I'm going to provide that definition in not three easy steps, but five, and show a real-world implementation. [Read more…]

post

Examples of Data Pipelines You Can Build Today

PipelineOver the past few months I started hearing the term “data pipeline” more and more at the local data meetups. Curious as to just what that meant, I looked it up. In this post I'm going to tell you what I found, and more importantly provide real-world examples of data pipelines you can use for your data projects. [Read more…]

post

Upcoming Webinar: How to Choose a Data Science Tool

ChooseJoin me on November 16th, 2015 at 1:00 PM (New York) and 10 AM (California), for a free webinar – How to Choose a Data Science Tool.

On this webinar you’ll discover:

  • The four phases of selecting the tool that’s right for you and your team
  • 10 key points to consider before you start your evaluation
  • Tips on how to perform your research so you don’t waste your time during the evaluation phase
  • How best to structure your time during the evaluation to keep productivity high and have the time you need to really test the tools

By attending the webinar you’ll receive:

  1. A recording of the webinar
  2. A one-page checklist to use during your evaluation
  3. A presentation template you can use to help “sell” your tool of choice to management

Click here to sign up today >>

post

Create a Simple Python Web Scraper to Get Pricing Data

Python LogoThere are many methods in Python to create a web scraper. One of the simplest is using a combination of the built-in requests library (to obtain web pages) and the Beautiful Soup library (to parse the pages and extract data). With my book – Python Business Intelligence Cookbook – being published soon, I was curious how, or if, the pricing my publisher sets changes over time. In order to track it, I created a simple web scraper. Code below… [Read more…]

post

Python Business Intelligence Cookbook Pre-Order Offer

I know how hard it can be to take what you read in a book, especially a data science book, and apply it to the data you have to work with. I want to save you that experience.

Python Business Intelligence CookbookIf you pre-order my upcoming book – Python Business Intelligence Cookbook – you'll receive a metric ton of bonus materials and resources that will help you take what you learn in the book, and apply it to your data. Specifically, when you pre-order, you'll receive:

  • Pre-webinar videos
  • A seat at the two-hour live webinar
  • Business intelligence project checklist
  • List of additional resources
  • 30-days of post-release email Q&A

Learn more about the book and this offer at pythonbicookbook.com.

Thank you for your support!

post

Python Web Scraper Docker Image

Docker PythonFrankly, I couldn't come up with a cooler title than what you just read 😛

I was thinking to create a long post giving you the step-by-step instructions on how to install everything you need to scrape the web with Python. Then I got a bit smarter about it and made a Docker image you can download and run literally anywhere. [Read more…]