Author Archives: Jonathan Callahan

Improved Python-style Logging in R

This entry is part 21 of 21 in the series Using R

Last August, in Python-style Logging in R, we described using an R script as a wrapper around the futile.logger package to generate log files for an operational R data processing script. Today, we highlight an improved, documented version that can be …

Posted in R

Python-style Logging in R

This entry is part 20 of 21 in the series Using R

We are increasingly using R in “operational” settings that require robust error handling and logging. In this post we describe a quick-and-dirty way to get Python-style multi-level log files by wrapping the futile.logger package.

Posted in R, Toolbox

When k-means Clustering Fails

This entry is part 19 of 21 in the series Using R

Letting the computer automatically find groupings in data is incredibly powerful and is at the heart of “data mining” and “machine learning”. One of the most widely used methods for clustering data is k-means clustering. Unfortunately, k-means clustering can fail …

Posted in R

Visualizing Bikeshare Data

This entry is part 18 of 21 in the series Using R

Seattle’s Pronto bikeshare system recently announced a Data Challenge for data visualization using their first year of trip data. As avid cyclists and data analysis junkies, we of course took the bait. Below is a brief description of our Pronto Databrowser …

Posted in Data Visualization, R