A machine learning approach to demographic prediction using geohashes

Roy A., Pebesma E.

Forschungsartikel in Sammelband (Konferenz)

Zusammenfassung

With the rapid proliferation of smartphones, human beings act as social sensors by means of carrying GPS-enabled devices that share location data. This has resulted in an abundance of sensor data gathered over long periods of time. Gaining meaningful insights from such massive amounts of spatio-temporal data accumulated by several disparate sources is offen a challenge for organizations. Identifying demographics of mobile phone users by telecommunication providers is one such example. Demographic information plays a very significant role in targeting online advertisements to focused user groups by gaining insights about user's mobility patterns. However, in practice, demographic information such as age and gender are mostly unavailable to app developers for open access due to privacy concerns. In this paper, we try to address the gap of how to enrich location data with demographics, which could be valuable for app developers. In our study, we use a machine learning approach to predict the gender and age of mobile phone users from a set of 3,252,950 anonymised GPS trajectories with 60,865 unique devices using a predictive model which is based upon the concept of Geohashes. We study to what extent users' demographics could be inferred from their frequently visited locations by encoding by formulating a multi-level classification algorithm to find the most frequently visited Geohashes and associating them with nearest points of interests which would enable predicting age-group and gender of the users who prefer to visit a specific location in a sequential manner. Experiments are conducted on a real dataset of mobile phone users collected and shared by a telecommunication provider. The experimental results show that the proposed algorithm can achieve mean prediction accuracy scores of 71.62% and 96.75% for predicting gender and age groups of the users respectively.

Details zur Publikation

Herausgeber*innen:
Seiten: 6
Veröffentlichungsjahr: 2017
Verlag: Association for Computing Machinery, Inc
ISBN: 9781450349772
Sprache, in der die Publikation verfasst istEnglisch
Link zum Volltext: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85019033654&origin=inward