data warehouse, mariadb, mysql
Is MySQL or MariaDB suited for the analytics workload of data science? Can it be used as a data warehouse? Let’s find out.
analytics, excel, powerbi
You think Excel is not suitable for analytics? Let me convince you that Excel can be your best tool with PowerBI.
hadoop, spark
To use Spark on Windows, you need to install winutils.exe and set some environment variables. Here is a simple fix.
data warehouse, postgresql
Is PostgreSQL a good companion for a data scientist at a startup? At which maturity stage should it be used? Let’s find out!
data manipulation, hadoop
I’ve spent some time lately digging into the Hadoop ecosystem, both through a product survey and some hands-on work. Here are some remarks about […]
data manipulation, data warehouse
My last post discussed SQL queries. However, data sometimes comes from different databases. In such cases, it is no longer possible to use SQL. […]
Data manipulation is a big part of a data mining process. Some authors claim it can take 80% of a data mining project. I could […]
data mining, statistics
When it comes to data mining, the tool you use is very important. It seems that people use many software packages (see How many software packages […]
data warehouse, mysql, olap
I often use a database not only to store data but also to do some processing before mining and some analysis. I use MySQL as […]