Categories
Data Science, Statistics, Writing

New on Win-Vector: Variable Selection for Sessionized Data

Illustration: Boris Artzybasheff; photo: James Vaughan, some rights reserved

I’ve just put up the next installment of the new “Working with Sessionized Data” series on Win-Vector. As I mentioned in the previous installment, sessionizing log data can potentially lead to very wide data sets, with possibly more variables than there are rows in the […]

Categories
Data Science, Statistics, Writing

A Couple Recent Win-Vector Posts

I’ve been neglecting to announce my Win-Vector posts here, but I’ve not stopped writing them. Here are the two most recent:

Wanted: A Perfect Scatterplot (with Marginals)

In which I explore how to make what Matlab calls a “scatterhist”: a scatterplot with marginal distribution plots on the sides. My version optionally adds the best […]