Sessionizing Log Data Using data.table [Follow-up #2]

Thanks to user dnlbrky, we now have a third way to accomplish sessionizing log data for any arbitrary time out period (see methods 1 and 2), this time using data.table from R along with magrittr for piping: I agree with dnlbrky in that this feels a little better than the dplyr method for heavy SQL users […]

Sessionizing Log Data Using dplyr [Follow-up]

Last week, I wrote a blog post showing how to sessionize log data using standard SQL. The main idea of that post is that if your analytics platform supports window functions (like Postgres and Hive do), you can make quick work out of sessionizing logs. Here’s the winning query:One nested sub-query and two window functions are […]

RSiteCatalyst Version 1.4.3 Release Notes

It’s a new year, so…new version of RSiteCatalyst on CRAN! For the most part, this release fixes a handful of bugs that weren’t noticed with the prior release 1.4.2 (oops!), but there are pieces of additional functionality. New functionality: Data Feed monitoring For those of you having hourly or daily data feeds delivered via FTP, […]

%d bloggers like this: