Facebook gleefully reported earlier this week that their privacy practices are “A-Ok”, in response to the growing federal concerns that the company leaks too much personal information. While it’s all well and good that they are legally in bounds, users still worry about just how much is shared via the popular social networking site. After all, just what does your Facebook activity say about who you are?
A lot, actually.
Michal Kosinski and his colleagues from the University of Cambridge recently investigated just how much Facebook activity reveals about a person. They wanted to know if a user’s age, gender, sexual orientation, or political beliefs could be inferred from their “likes”, even if such personal details were not provided. Using the myPersonality Facebook app, the team recruited 58,466 volunteers who provided demographic data as well as a list of their likes. The average volunteer liked 170 different pages, musicians, etc. The team then used statistical models to test whether those likes predicted a suite of personal variables, from gender to intelligence level. Their results were staggering.
The algorithm they created could predict race and gender with over 90% accuracy. Even religion, political party affiliation, and sexual orientation could be predicted with over 80% accuracy. In the case of sexual orientation, this result was truly unexpected, as fewer than 5% of users were connected to explicitly gay groups.
From the pages you like, the computer could predict whether you drink, smoke, or use drugs with more than 60% accuracy. Likes also strongly correlated with age. In fact, your likes reveal so much about who you are that the team could predict whose parents divorced before they were 21 with 60% accuracy! “Although it is known that parental divorce does have long-term effects on young adults’ well-being,” write the authors, “it is remarkable that this is detectable through their Facebook Likes.”
Even personality traits, which don’t fall into neat A or B categories, were well correlated with likes. And the more likes you have, the better the computer algorithm is at figuring you out.
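To make the idea concrete, here is a minimal sketch of how a yes/no trait might be predicted from likes. It is not the authors' code or data: the users are synthetic, the five "pages" and their like probabilities are made up, and logistic regression is assumed here as one plausible choice of statistical model. All function names (`make_users`, `train_logistic`, `predict`) are hypothetical.

```python
import math
import random


def make_users(n, rng):
    """Synthetic users: a hidden yes/no trait nudges the odds of liking each page."""
    # Made-up like probabilities for five pages (assumption, not real data).
    probs_without_trait = [0.30, 0.50, 0.20, 0.40, 0.50]
    probs_with_trait = [0.60, 0.50, 0.50, 0.20, 0.50]
    rows, labels = [], []
    for _ in range(n):
        trait = rng.randint(0, 1)
        probs = probs_with_trait if trait else probs_without_trait
        # Each row is a 0/1 vector: 1 means the user liked that page.
        rows.append([1 if rng.random() < p else 0 for p in probs])
        labels.append(trait)
    return rows, labels


def train_logistic(rows, labels, epochs=20, lr=0.05):
    """Fit logistic regression with plain stochastic gradient descent."""
    weights = [0.0] * len(rows[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(rows, labels):
            z = bias + sum(w * xi for w, xi in zip(weights, x))
            p = 1.0 / (1.0 + math.exp(-z))  # predicted probability of the trait
            err = p - y
            bias -= lr * err
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
    return weights, bias


def predict(weights, bias, x):
    """Return 1 if the model thinks the trait is more likely than not."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return 1 if z > 0 else 0


rng = random.Random(0)  # fixed seed so the run is repeatable
train_x, train_y = make_users(2000, rng)
test_x, test_y = make_users(1000, rng)

weights, bias = train_logistic(train_x, train_y)
hits = sum(predict(weights, bias, x) == y for x, y in zip(test_x, test_y))
accuracy = hits / len(test_y)
print(f"held-out accuracy: {accuracy:.2f}")
```

No single page in this toy setup gives the trait away; the model only does better than a coin flip by combining several weak signals, and it is scored on users it never saw during training. That is the same general logic behind predicting traits that users never stated outright.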
Simply put by the authors: “A wide variety of people’s personal attributes, ranging from sexual orientation to intelligence, can be automatically and accurately inferred using their Facebook Likes.”
This could have many upsides, including well-tailored ads, apps and such that really do suit you. But the predictability of individual traits may have considerable negative implications as well. “It can easily be applied to large numbers of people without obtaining their individual consent and without them noticing,” caution the authors. “Commercial companies, governmental institutions, or even one’s Facebook friends could use software to infer attributes such as intelligence, sexual orientation, or political views that an individual may not have intended to share.”
In a worst-case scenario, the release of such information could be serious. With online harassment a constant issue, hate groups could use likes to target victims, even when an individual takes measures to be discreet — for example, the predictability of sexual orientation may make users vulnerable, even though individuals don’t have explicitly homosexual likes. And as the digital revolution continues to change how we interact in modern society, Facebook likes are hardly the only information that may be used for such purposes. It is becoming more and more difficult to keep track of our online footprints, and thus the information we reveal about ourselves may be harder and harder to control.
The authors acknowledge that the risks of this may deter people from integrating with digital technologies. “It is our hope, however, that the trust and goodwill among parties interacting in the digital environment can be maintained by providing users with transparency and control over their information,” they conclude, “leading to an individually controlled balance between the promises and perils of the Digital Age.”
Citation: Kosinski M., Stillwell D. & Graepel T. (2013). Private traits and attributes are predictable from digital records of human behavior, Proceedings of the National Academy of Sciences, 110 (15) 5802-5805. DOI: 10.1073/pnas.1218772110
There is an old saying that goes, “Tell me whom you walk with and I will tell you who you are.” The same goes for Facebook and other databases.
This is very interesting – although when you think about it, not so surprising.
It would be interesting to see how their algorithm performs relative to humans. If someone likes Obama and the DNC pages, it’s a safe bet that they’re liberal. If I just say, yes, everyone’s parents were divorced by 21, I’ll be correct in a pretty large number of cases. Taylor Swift? Probably white. FAMU marching band? Probably black. If someone likes Cher and Barbra Streisand fan pages, I’m pretty confident in identifying them as a gay male; if you want to keep that information private, you’re probably aware of and actively avoiding broadcasting those stereotypes. So it’s a much bigger deal if the algorithm is identifying you as a member of a group based on some non-stereotypical trait – that is, something that a human wouldn’t easily be able to piece together.
Also if we’re concerned about privacy, it’s reasonable to ask, how much of this isn’t readily available from the other information on their page that anyone who has access to your Likes would already be able to see? You’ve got a photo of yourself holding a beer, you’re probably a white woman who uses alcohol.
The reason we click the ‘Like’ button IS to tell everyone a bit about who we are. We deliberately elect to give up a bit of privacy with every ‘Like’ click.
What does ‘Like-ing’ say about what we *truly* desire? > Join the discussion on our on.fb.me/RRYoYz