Introduction to R fr...

Google Developers just released a series of videos for getting started with R. Each video is between 2 and 6 minutes long and covers a few topics. This could be a good way to start with R if you have not done so yet. Intro to R from...

Twitter Datafeed wit...

First, get Tweepy and install it. Then follow these steps to set up access to Twitter:
● Create a Twitter account.
● Go to https://dev.twitter.com/apps and log in.
● Click on “Create an application”.
● Fill out the form.
● At the bottom...

Open Data for Africa

The African Development Bank’s Open Data Platform is now operational for the entire African continent. In addition to social and economic statistics, data on key development topics such as climate change, food security, infrastructure, and gender equality can be accessed by researchers,...

Gradient Descent

Gradient descent is an optimization algorithm that finds a local minimum of a function by taking steps proportional to the negative of the gradient of the function at the current point. In a machine learning problem, the function is usually the cost function you want to...
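As a rough illustration of the update rule described above, here is a minimal Python sketch. The function name `gradient_descent`, the parameters `learning_rate` and `n_steps`, and the toy function f(x) = (x - 3)^2 are assumptions chosen for this example, not taken from the original post.

```python
def gradient_descent(grad, x0, learning_rate=0.1, n_steps=100):
    """Repeatedly step against the gradient, starting from x0.

    grad          -- function returning the gradient at a point (assumed known)
    learning_rate -- step size multiplier (hypothetical default)
    n_steps       -- fixed number of iterations (hypothetical default)
    """
    x = x0
    for _ in range(n_steps):
        # Step proportional to the negative of the gradient at the current point.
        x = x - learning_rate * grad(x)
    return x


# Toy example: f(x) = (x - 3)^2 has gradient 2 * (x - 3) and its minimum at x = 3.
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)
```

With a step size that is too large the iterates can overshoot and diverge, so in practice the learning rate usually needs tuning.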

5 skills of a data s...

A data scientist is capable on both the technical and the business side. To that end, they need to be proficient in the following 3 technical and 2 business skills. In terms of techniques, the data scientist is a data hacker, able to extract data from multiple sources, model and...

Install oauth2 for P...

– Install setuptools
Unzip the file and copy the folder setuptools to C:/
Run the following commands from the command prompt window (Start ‣ Accessories)
– Install oauth2
Unzip the file and copy the folder oauth2 to C:/
Run the following commands from the command...

What is a Data Scien...

Data scientists, also called data geeks, are practitioners in the field of Data Science. At the task level, data scientists split their work into 3 phases. In the first phase, data scientists bring together related and unrelated data and explore its content to make sense of it. Then they...

Logistic Regression

Logistic regression is a type of regression used to predict a categorical outcome. It is widely used in biostatistics, where binary responses occur quite frequently, such as whether somebody has cancer or not. In order to keep the outcome between 0 and 1, we...
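The excerpt is cut off before it names the mechanism, but the standard way to keep the outcome between 0 and 1 is to pass a linear combination of the inputs through the logistic (sigmoid) function. The sketch below assumes that approach; the names `sigmoid` and `predict_proba` and the example weights are hypothetical, not fitted values from the post.

```python
import math


def sigmoid(z):
    """Logistic function, written to avoid overflow for large negative z."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(weights, bias, features):
    """Probability of the positive class for one observation."""
    # Linear score, which can be any real number.
    z = bias + sum(w * x for w, x in zip(weights, features))
    # The sigmoid squashes the score into the range 0 to 1.
    return sigmoid(z)


# Made-up coefficients and inputs, for illustration only.
p = predict_proba(weights=[0.8, -0.4], bias=-0.2, features=[1.5, 2.0])
print(p)
```

A common convention is to predict the positive class when the probability exceeds 0.5, although the threshold can be moved depending on the cost of each kind of error.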

Causality Modeling

Causality is the relationship between an event (the cause) and a second event (the effect), where the second event is understood as a consequence of the first. Please find below the basic causality models: In a causal model, circular rules are not allowed. If that rule were to be broken, the...
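One way to check the "no circular rules" condition is to treat the causal model as a directed graph and look for cycles. The sketch below is an assumption about how this could be done with a depth-first search; the function `has_cycle`, the dictionary representation, and the example events are hypothetical and do not come from the original post.

```python
def has_cycle(causes):
    """Return True if the cause -> effects mapping contains a circular rule.

    causes maps each event to the list of events it directly causes.
    """
    UNSEEN, IN_PROGRESS, DONE = 0, 1, 2
    state = {}

    def visit(event):
        state[event] = IN_PROGRESS
        for effect in causes.get(event, []):
            status = state.get(effect, UNSEEN)
            if status == IN_PROGRESS:
                # We came back to an event on the current path: a cycle.
                return True
            if status == UNSEEN and visit(effect):
                return True
        state[event] = DONE
        return False

    return any(state.get(e, UNSEEN) == UNSEEN and visit(e) for e in causes)


# Hypothetical models: the first is a valid chain, the second loops back.
chain = {"rain": ["wet road"], "wet road": ["accident"]}
loop = {"rain": ["wet road"], "wet road": ["accident"], "accident": ["rain"]}
print(has_cycle(chain), has_cycle(loop))
```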

Shrinkage Methods fo...

By discarding some of the predictors or inputs and keeping only a subset of the original predictors, you may obtain a model that is more interpretable. It may also achieve a lower prediction error on new datasets by avoiding overfitting the training dataset. In all...

KPI, KRI, Performanc...

Performance measurement is the collection of criteria that determine whether an organization will be able to prevail, that is, be more successful than its competitors. While many of the concepts that make an organization successful are not directly measurable, in the process of performance measurement...

Raphaël.js

Raphaël.js is a small JavaScript library that simplifies working with interactive visualizations on the web. It is well documented, and you can find many usage examples online. One major advantage of Raphaël.js is that it is vector-based. You can draw vectors and shapes and make...

Risk Assessment

Risk assessment is the determination of the quantitative or qualitative value of risk related to a concrete situation and a recognized threat (also called a hazard). Risk assessment will help you identify the bottlenecks and risks (leading to production halt, loss of clients…) that may prevent the...

Balanced Scorecard

The balanced scorecard (BSC) is a performance measurement framework by Kaplan and Norton. It later evolved into a strategy execution framework. The BSC presents a mixture of financial and non-financial measures, each with targets attached. The original BSC had 4 perspectives...