Machine learning visual experiment
R2D3 is an experiment in expressing statistical thinking or machine learning with interactive visual design. In machine learning, computers apply statistical learning techniques to automatically identify patterns in data. These techniques can be used to make highly accurate predictions.
In this case the authors, Stephanie and Tony, design a machine learning process to predict whether a house of unknown location is in San Francisco or New York, based on other parameters such as altitude, price, square footage,…
They build a decision tree: at each step the data is split at a threshold on one variable, because high or low values of that variable (high altitude, for example) make a house more likely to be in one city than the other. By applying such splits sequentially across the variables, using recorded data, the authors obtain an accurate estimate of the city for any given house, provided the explanatory variables are available.
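The idea of splitting on one variable at a time, and applying those splits in sequence, can be sketched as a very small decision tree in Python. This is only an illustration under stated assumptions, not the authors' code: the six houses, their values, the column order (altitude, then price per square foot) and the helper names `majority`, `best_split`, `build_tree` and `predict` are all invented for the example.

```python
from collections import Counter

# Invented toy rows: (altitude in metres, price per square foot, city).
# These numbers are made up for illustration, not taken from the R2D3 data.
HOUSES = [
    (4, 1300, "NY"),
    (9, 1700, "NY"),
    (7, 1500, "NY"),
    (6, 700, "SF"),
    (55, 900, "SF"),
    (80, 1100, "SF"),
]


def majority(rows):
    """Most common city among the rows."""
    return Counter(r[-1] for r in rows).most_common(1)[0][0]


def best_split(rows, col):
    """Try each observed value of one column as a threshold.

    Each side of the split predicts its majority city; return the
    (threshold, errors) pair with the fewest misclassified rows,
    or None if no threshold separates the rows into two groups.
    """
    best = None
    for t in sorted({r[col] for r in rows}):
        low = [r for r in rows if r[col] <= t]
        high = [r for r in rows if r[col] > t]
        if not low or not high:
            continue
        errors = sum(r[-1] != majority(low) for r in low)
        errors += sum(r[-1] != majority(high) for r in high)
        if best is None or errors < best[1]:
            best = (t, errors)
    return best


def build_tree(rows, cols):
    """Split on each column in turn; stop when a group is pure or columns run out."""
    if len({r[-1] for r in rows}) == 1 or not cols:
        return majority(rows)
    col, rest = cols[0], cols[1:]
    split = best_split(rows, col)
    if split is None:
        return build_tree(rows, rest)
    t = split[0]
    low = [r for r in rows if r[col] <= t]
    high = [r for r in rows if r[col] > t]
    return (col, t, build_tree(low, rest), build_tree(high, rest))


def predict(tree, house):
    """Follow the thresholds down the tree until a city label is reached."""
    while isinstance(tree, tuple):
        col, t, low, high = tree
        tree = low if house[col] <= t else high
    return tree


# Split on altitude first (column 0), then on price per square foot (column 1).
TREE = build_tree(HOUSES, [0, 1])

if __name__ == "__main__":
    print(TREE)
    print(predict(TREE, (70, 1000)))  # a hypothetical house at 70 m altitude
```

On this toy data the altitude split alone leaves one house misclassified, and the second split on price separates the remaining low-altitude houses, which is the "one variable after another" behaviour described above. Real data would need a held-out test set to check that the tree is not simply memorising its training rows.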