Categories
Data Science Statistics

Principal Components Regression: A Two-Part Series

I’m kicking off a two-part series on Principal Components Regression on the Win-Vector blog today. The first article demonstrates some of the pitfalls of using standard Principal Components Analysis in a predictive modeling context. John Mount has posted an introduction to my first article on the Revolutions blog, explaining our motivation in developing this series. […]

Categories
Musings Science

John Oliver on Scientific Studies

An excellent rant from John Oliver on the way science stories are handled in the media, on the need for some healthy skepticism, and on the need to track down the sources for the studies yourself, to the extent that this is possible. Also, I love the “TODD Talks” skit at the end.

Categories
Data Science

Upcoming Webinar: Data Preparation with R

I’m happy to announce my upcoming webinar, sponsored by Microsoft Data Science: Data Preparation with R, Thursday, March 17, 2016, 10:00 A.M. – 11:00 A.M. (Pacific time). Data quality is the single most important item to the success of your data science project. Preparing data for analysis is one of the most important, laborious and […]

Categories
Data Science

Starting Strong in 2016

We had a busy January here at Win-Vector, and it shows no sign of abating. John and I had the pleasure of attending the first Shiny Developers Conference, held by RStudio and hosted at Stanford University (see here for a review of the conference, by a fellow attendee). The event energized us to resharpen our […]

Categories
Data Science Statistics

Upcoming Appearances

We have two public appearances coming up in the next few weeks. Workshop at ODSC, San Francisco – November 14: John and I will be giving a two-hour workshop called Preparing Data for Analysis using R: Basic through Advanced Techniques. We will cover key issues in this important but often neglected aspect of data science, […]