Improved Python-style Logging in R

Last August, in Python-style Logging in R, we described using an R script as a wrapper around the futile.logger package to generate log files for an operational R data processing script. Today, we highlight an improved, documented version that can be sourced by your R scripts or dropped into your package’s R/ directory to provide easy file and console logging.

  read more …

Posted in R | Tagged | Leave a comment

Python-style Logging in R

We are increasingly using R in “operational” settings that require robust error handling and logging. In this post we describe a quick-and-dirty way to get python style multi-level log files by wrapping the futile.logger package.

  read more …

Posted in R, Toolbox | Tagged , | Leave a comment

When k-means Clustering Fails

Letting the computer automatically find groupings in data is incredibly powerful and is at the heart of “data mining” and “machine learning”. One of the most widely used methods for clustering data is k-means clustering. Unfortunately, k-means clustering can fail spectacularly as in the example below.   read more …

Posted in R | Tagged , , | 1 Comment

Visualizing Bikeshare Data

Seattle’s Pronto bikeshare system recently announced a Data Challenge for data visualization using their first year of trip data. As avid cyclists and data analysis junkies, we of course took the bait. Below is a brief description of our Pronto Databrowser submission.   read more …

Posted in Data Visualization, R | Tagged , , | 2 Comments