Data science for data-driven startups

Tag

data warehouse

PostgreSQL for data science : pro and cons

Is PostgreSQL a good companion for a data scientist at a startup? At which maturity stage should it be used? Let’s find out!

Read more...

The master analytics table for SaaS startups

At the foundation of deep analytics there is a nicely structured master table. Let’s see where a startup need to start.

Read more...

Data Warehouse Manifesto : Enabling agile analytics

After years of analytics pratice, we came to a set of opinions on how data warehouses should be built.

Read more...

The call for a Modular Data Warehouse

In those day of huge focus on the Big Data mouvement, it seems that nobody needs a data warehouse anymore but a huge cloud of […]

Read more...

Fast creation of surrogate keys in Greenplum

Usually we use sequence to generate unique identifier for surrogate keys.  A sequence is simply a database object that return a number every time you […]

Read more...

Big data and mobile BI : New hype but same old issue

For the end of 2011, many around the blogosphere are forecasting what will be on hype next year. I often read that big data and […]

Read more...

Data Manipulation Part 2 : ETL

My last post discuss about SQL queries. Nevertheless, sometimes data came from differents databases. In such cases, it is no longer possible to use SQL. […]

Read more...

Data Manipulation Part 1 : SQL

Data manipulation is a big part of a data mining process. Some authors claims it could take 80% of a data mining project. I could […]

Read more...

Using MySQL as a Data Warehouse

PS : This post is quite old now and isn’t relevant anymore. MySQL 5.6 introduced hash join which basically makes it more suitable to a data […]

Read more...