Data science for data-driven startups

Tag

data warehouse

How to escape the data monkey trap to leverage analytics

Do you feel spending more time getting the data instead of producing impactful insights? You are not alone. Let’s explore 4 ways to solve this.

Read more...

PostgreSQL for data science : pro and cons

Is PostgreSQL a good companion for a data scientist at a startup? At which maturity stage should it be used? Let’s find out!

Read more...

The master analytics table for SaaS startups

At the foundation of deep analytics there is a nicely structured master table. Let’s see where a startup need to start.

Read more...

Data Warehouse Manifesto : Enabling agile analytics

After years of analytics pratice, we came to a set of opinions on how data warehouses should be built.

Read more...

The call for a Modular Data Warehouse

In those day of huge focus on the Big Data mouvement, it seems that nobody needs a data warehouse anymore but a huge cloud of […]

Read more...

Fast creation of surrogate keys in Greenplum

Usually we use sequence to generate unique identifier for surrogate keys.  A sequence is simply a database object that return a number every time you […]

Read more...

Big data and mobile BI : New hype but same old issue

For the end of 2011, many around the blogosphere are forecasting what will be on hype next year. I often read that big data and […]

Read more...

Data Manipulation Part 2 : ETL

My last post discuss about SQL queries. Nevertheless, sometimes data came from differents databases. In such cases, it is no longer possible to use SQL. […]

Read more...

Data Manipulation Part 1 : SQL

Data manipulation is a big part of a data mining process. Some authors claims it could take 80% of a data mining project. I could […]

Read more...

Using MySQL as a Data Warehouse

PS : This post is quite old now and isn’t relevant anymore. MySQL 5.6 introduced hash join which basically makes it more suitable to a data […]

Read more...