Speed dating predictions

Speed dating is one of the best ways to research human attraction, because it's based on revealed preferences rather than surveys, and everyone has a strong incentive to give truthful answers.

If you'd like to do your own analysis of speed dating data, there's a freely-available spreadsheet from a Columbia University study, with the results of over 4000 dates. The dates occurred at events with (typically) 10 male and 10 female participants, who "dated" everyone of the opposite sex in quick succession.

As well as the outcomes of the dates, the spreadsheet contains (somewhat patchy) demographic data, such as the age, undergraduate major, race, and some personality attributes of the participants.

For an in-depth analysis of the data, have a look at this Kaggle notebook.

A simple analysis

There are lots of ways to analyze the data, but a simple hypothesis to test is: given the results of previous dates, can you predict how people will decide on the next date? So, at an event with 10 dates, can you predict the decisions in round 10 from the decisions in the previous 9 rounds?

It turns out you can.

The idea is to calculate two attributes for each participant: their popularity and their pickiness. Their popularity is the fraction of dates where their partner said yes to them. Their pickiness is the fraction of dates where they said no to their partner. Then in future dates you assume they'll say yes if their partner's popularity is at least as high as their pickiness.

Using this simple rule — and no other demographic data whatsoever — you can predict the decisions in the final round with 75% accuracy. And you can predict the two-way matches with 80% accuracy (although, to be honest, that's not much better than just predicting no for each date, since most dates are unsuccessful).

If attraction were truly in the eye of the beholder, and people's preferences were uncorrelated, the accuracy would only be 50%. Clearly that's not the case, and you could almost certainly improve on 75% accuracy by using demographic data and more rounds of dates.

So, at least when it comes to initial attraction, humans are fairly predictable.

Detailed results

For the data geeks ...

  • Male decisions: accuracy = 76.3%, precision = 0.838, recall = 0.695 (true pos: 98, true neg: 102, false pos: 19, false neg: 43)
  • Female decisions: accuracy = 74.0%, precision = 0.681, recall = 0.521 (true pos: 49, true neg: 145, false pos: 23, false neg: 45)
  • Matches: accuracy = 81.7%, precision = 0.636, recall = 0.259 (true pos: 14, true neg: 200, false pos: 8, false neg: 40)

Comments

Popular posts from this blog

Are average faces more attractive?

The geography of adult tech

Assortative mating