Data science for data-driven startups

Data Warehouse Manifesto

After years of analytics practice, we have arrived at a set of opinions on how data warehouses should be built.

Read more...

Big data benchmark: Impala vs Hawq vs Hive

With the recent release of Pivotal HD, I wanted to check the current state of Hadoop SQL engines. SQL integration is growing in the Hadoop […]

Read more...

Price deal analysis

Price deals are a common way to boost sales. Steam has turned them into an art. There is always a lot of stuff to buy at […]

Read more...

The call for a Modular Data Warehouse

In these days of intense focus on the Big Data movement, it seems that nobody needs a data warehouse anymore, only a huge cloud of […]

Read more...

Big data and mobile BI: New hype but same old issue

As 2011 draws to a close, many around the blogosphere are forecasting what will be hyped next year. I often read that big data and […]

Read more...

About evaluation

When deploying a model, one very important thing is to monitor the results. Does it work as you expected? I’m not talking about pre-production […]

Read more...

Machine learning vs simulation

Lately, I have been thinking about the difference between machine learning and simulation (for prediction). Machine learning uses historical inputs and outputs to find subsequent outputs. […]

Read more...

The cost of reducing costs

Predicting the number of sales representatives at a particular time in a particular store is harder than expected. If you instrument the whole process, you […]

Read more...

What is the value of your work?

It’s a damn good question, one which should be tightly correlated with your salary in a utopian world. In other words, how do you justify your […]

Read more...