Phone traffic and french elections

Some correlation plots between mobile phone traffic in the major french cities and french election results

Data Analysis

Data Aggregation and Preparation

Our data sets include mobile traffic data provided at the tile level (100x100 meter tiles) and electoral data available at the polling station level. Considering the prevalence of polling stations, most locations have a station within 300 meters. To facilitate a harmonized analysis, we aggregated both data sets at the iris level.

Mobile traffic data was attributed to the iris intersecting the most with each 100x100 meter tile. For electoral data, we took the centroid of each iris and performed a recursive search. We initially focused on all polling stations within a 300-meter radius of the centroid, averaging their results and attribiting them to the iris. If an iris did not have polling stations within this initial radius, the radius was increased to 600 meters and the process repeated.

Here is a map, at the iris level, of the diversity of voting preferences for the european election (measured by entropy).

Upon observation, it appears that Paris’ central regions exhibit less diverssity in voting preferences compared to their peripheral counterparts, suggesting a more homogeneous voting pattern in the city center.

Polarization of Electoral Results

The first step in our analysis was to calculate the polarization of the electoral results for each iris. Each party was assigned a number between -1 and 1, representing its position on the left-right axis. Then, we computed the average position in an iris: this is the weighted average of party positions, with the weighting based on the number of votes each party received. Finally, polarization is defined as the average distance of a vote from the average position of the iris. The larger this average distance, the more the iris is polarized because people vote farther away from the average (and so more extreme).

Mobile Traffic Data and Polarization

In the next phase of our research, we turned our attention towards mobile traffic data. Our focus was on analyzing the traffic between 7pm and midnight on weekdays, deliberately excluding holidays and anomaly days. This selective approach was taken to minimize the impact of confounding factors on our data.

Our primary metric for assessing the “consumption” of a specific mobile service was its share of the total traffic within an iris. The rationale behind this choice lies in the differing market penetration rates of telecom operators like Orange across various irises.

Absolute usage data can potentially skew results due to variations in the operator’s market presence across different irises. In contrast, comparing the proportional share of a given service in total traffic neutralizes this potential source of bias, making this a more robust measure for our analysis.

Results

New plots controlling for Age, Income and Education

A heat map plot of the relationship between treatment (mobile service usage) and outcome (polarization, entropy, etc.) controlling for age, income and education. The darker the color, the more IRIS fall inside the box with that combination of x-y values. In the inset you can see the regression coefficients for regressions controlling for various parameters. The magenta line is the mean value of the outcome variable for each value of the treatment variable.

Same as above but with scatter plots of the correlation between treatment and outcome for each IRIS. The issue with the scatter plots is that there are too many points and they are too close to each other to be able to see well which areas are dense with points and which are not.

Correlations

First we plot the correlation of turnout, vote diversity (measured be entropy), polarization and party affiliation with shares of mobile service usage for some of the main mobile services:

What do we find? Here are some observations

In Paris, it seems that either people vote for progressive movements or there is high polarization. This can be seen from the correlation of services both with entropy and with votes for far right parties.
Wikipedia is a predictor of progressiveness
People that vote for Macron have Apple stuff, so probably they are economically better off
Weird that e-commerce is highly correlated with populism
Ecologists listen to a lot of music, probably because they are young
People voting for macron and ecology are higher in turnout. Indeed, apps that predict progressive vote also predict turnout,

More Details on Facebook Usage and Electoral Polarization

We explore in greater detail the relationship between Facebook usage and electoral polarization in Paris.

In the map below, we depict the electoral polarization during the 2019 European election in Paris.

Interestingly, peripheral regions of Paris seem to exhibit greater polarization than the city center. Essentially, voting patterns in these outlying areas tend to span a wider range along the right-left axis.

Facebook’s usage data mirrors this trend. In the following map, we display the proportion of time spent on Facebook relative to total online time. Peripheral regions spend a greater fraction of their online time on Facebook compared to central areas.

Our visual intuition from the maps is backed up by a correlation plot, showing a significant association between Facebook usage and electoral polarization. This correlation remains robust even after adjusting for a variety of potential confounding factors, including age, income, education, region’s centrality, immigrant population, and unemployment rates. The variables we controlled for include DEC_MED19 (median income), DEC_GI19 (Gini index), P19_POP1529 (population share aged 15-19), P19_ACT_DIPLMIN (share of population without a Bac), P19_ACT_SUP2 (share of population with Bac +2), P19_CHOM1524 (unemployment rate among 15 to 24-year-olds), and P19_POP_ETR (share of population that is non-French). The correlation plot is presented below: