Sessionization with Temporal Union

Posted in Big-Data Blab, Rax

A common problem in web analytics is sessionization or session reconstruction: taking a series of user events and splitting the series into a set of sessions [1]. Often, it is done based solely on the timestamps of the events: a session ends after a certain time of user inactivity. As an example, let’s take the […]
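The inactivity rule described above can be sketched in a few lines. This is only an illustration in Python, not the Rax code from the post: the function name `sessionize` and the 30-minute timeout are assumptions chosen for the example.

```python
from datetime import datetime, timedelta


def sessionize(timestamps, timeout=timedelta(minutes=30)):
    """Split one user's event timestamps into sessions by inactivity gap.

    `timeout` is a hypothetical default; pick whatever fits your data.
    """
    sessions = []
    for ts in sorted(timestamps):
        # Open a new session on the first event, or when the gap since
        # the previous event is longer than the timeout.
        if not sessions or ts - sessions[-1][-1] > timeout:
            sessions.append([ts])
        else:
            sessions[-1].append(ts)
    return sessions


# Made-up sample events for a single user.
events = [
    datetime(2015, 1, 1, 9, 0),
    datetime(2015, 1, 1, 9, 10),
    datetime(2015, 1, 1, 10, 0),
    datetime(2015, 1, 1, 10, 5),
]
sessions = sessionize(events)
```

With these sample events, the 50-minute gap between 9:10 and 10:00 exceeds the timeout, so the events fall into two sessions.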

Welcome Snowflake

Posted in Big-Data Blab, Coding, Rax

Snowflake Computing has recently emerged from stealth with a bold claim of having reinvented the data warehouse. Ease of use is their main motto, and I dare say they live up to this promise. Since one of Snowflake’s co-founders, Marcin Żukowski, is a good friend of mine, I got a chance to play with […]

Write Once, Run Everywhere – Rax SQL Backends

Posted in Big-Data Blab, Coding, Rax, SQL Rants

Rax runs atop an SQL-based relational database backend, translating data-heavy operations into SQL queries. In fact, there are several SQL backends that Rax can use, and Rax syntax and functionality do not depend on which one is chosen. In other words, Rax/Redshift can execute exactly the same Rax code as Rax/MySQL and Rax/Azure. The subtle differences between […]

What Is Rax?

Posted in Big-Data Blab, Rax

Rax (Relational Algebra with eXtensions) is a scripting language for behavioral-data exploration. The language is a mix of relational, regular, and temporal algebra and is designed to facilitate the transition from ETL to data modeling. Rax empowers data analysts to explore behavioral data in a way that feels intuitive and familiar. Rax shares some features […]

A Swiss-Army Knife

Posted in Big-Data Blab

Data analysts often use a chain of three or four tools to accomplish a single task. For example, first they query the database to extract aggregated data. Then they import it into Excel to do some more processing. Next, they move to R to run some statistical analyses. Then they go back to Excel to […]