Categories
Data Science

Upcoming Webinar: Data Preparation with R

I’m happy to announce my upcoming webinar, sponsored by Microsoft Data Science: Data Preparation with R Thursday, March 17, 2016 10:00 A.M. – 11:00 A.M. (Pacific time) Data quality is the single most important item to the success of your data science project. Preparing data for analysis is one of the most important, laborious and […]

Categories
Data Science Statistics

New on Win-Vector: A Simpler Explanation of Differential Privacy

I have a new article up on Win-Vector, discussing differential privacy and the new recent results on applying differential privacy to enable reuse of holdout data in machine learning. Differential privacy was originally developed to facilitate secure analysis over sensitive data, with mixed success. It’s back in the news again now, with exciting results from […]

Categories
Data Science Statistics Writing

New on Win-Vector: Variable Selection for Sessionized Data

Illustration: Boris Artzybasheff photo: James Vaughan, some rights reserved   I’ve just put up the next installment of the new “Working with Sessionized Data” series on Win-Vector. As I mentioned in the previous installment, sessionizing log data can potentially lead to very wide data sets, with possibly more variables than there are rows in the […]

Categories
Data Science Statistics Writing

A Couple Recent Win-Vector Posts

I’ve been neglecting to announce my Win-Vector posts here — but I’ve not stopped writing them. Here are the two most recent: Wanted: A Perfect Scatterplot (with Marginals) In which I explore how to make what Matlab calls a “scatterhist:” a scatterplot, with marginal distribution plots on the sides. My version optionally adds the best […]

Categories
Science Statistics Writing

On Writing Technical Articles for the Nonspecialist

Photo: John Mount I came across a post from Emily Willingham the other day: “Is a PhD required for Good Science Writing?”. As a science writer with a science PhD, her answer is: is it not required, and it can often be an impediment. I saw a similar sentiment echoed once by Lee Gutkind, the […]