A common problem in web analytics is sessionization or session reconstruction: taking a series of user events and splitting the series into a set of sessions . Often, it is done based solely on the timestamps of the events: a session ends after a certain time of user inactivity. As an example, let’s take the […]
Snowflake Computing has recently emerged from stealth with a bold claim of having reinvented the data warehouse. Ease of use is their main motto and I dare to say they live up to this promise. Since one of Snowflake’s co-founders, Marcin Żukowski, is a good friend of mine, I’ve got a chance to play with […]
Rax is running atop an SQL-based relational database backend, translating data-heavy operations into SQL queries. Actually, there are several SQL backends that Rax can use. Rax syntax and functionality are not dependent on the backend used. In other words, Rax/Redshift can execute exactly the same Rax code as Rax/MySQL and Rax/Azure. The subtle differences between […]
Rax (Relational Algebra with eXtensions) is a scripting language for behavioral-data exploration. The language is a mix of relational, regular, and temporal algebra and is designed to facilitate the transition from ETL to data modeling. Rax empowers data analysts to explore behavioral data in a way that feels intuitive and familiar. Rax shares some features […]
Data analysts often use a chain of 3 to 4 tools to achieve a single task. For example, first they query the database to extract aggregated data. Then they import it into Excel to do some more processing. Next, they move to R, to run some statistical analyses. Then they go back to Excel to […]
Programming is hard. There is a lot of complexity that needs to handled. To hide at least some of this complexity, people come up with abstractions. SQL is an example of an abstraction – it provides a way to process data without worrying about all sorts of details like: How to find the relevant data […]
A nice read: http://robklopp.wordpress.com/2013/08/12/thinking-about-bi-infographics-is-the-next-phase/ The company he mentions, Looker, sounds interesting. They claim to have simplified SQL by adding LookML on top of this. I couldn’t find much concrete info, though. Their website is full of buzz and the free trial requires jumping through many hoops.
I like the slogan. It’s a bit along the lines of how we’re trying to sell Rax: Spinnakr — Data Scientist As A Service The ‘Spinnakr’ data scientist is of course quite limited in its scope: it can only analyze web traffic and search for a number of patterns. However, it these patterns happen to […]
I stumbled upon a new buzzword: wide data 🙂 I must say, I’m getting lost in the financial jargon, but it’s an interesting read nevertheless.
“Unstructured data” has become a meaningless buzzword recently, but from time to time I do find something interesting in this category. This platform seems to make sense, for example.