The dark side of AI: metadata and the death of privacy (Ep. 91)

Published: Dec. 23, 2019, 2:30 p.m.

Get in touch with us\n\n\n\n\n\n\nJoin the discussion about data science, machine learning and artificial intelligence\xa0on our Discord server\n\n\n\n\xa0\nEpisode transcript\nWe always hear the word \u201cmetadata\u201d, usually in a sentence that goes like this\n\xa0\n\nYour Honor, I swear, we were not collecting users data, just metadata.\n\n\xa0\nUsually the guy saying this sentence is Zuckerberg, but could be anybody from Amazon or Google.\xa0\u201cJust\u201d metadata, so no problem. This is one of the biggest lies about the reality of data collection.\n\xa0\nF: Ok the first question is, what the hell is metadata?\xa0\n\xa0\nMetadata is data about data.\xa0\n\xa0\nF: Ok\u2026 still not clear.Imagine you make a phone call to your mum. How often do you call your mum, Francesco?F: Every day of course! (coughing)\n\xa0\nGood boy! Ok, so let\u2019s talk about today\u2019s phone call. Let\u2019s call \u201cdata\u201d the stuff that you and your mum actually said. What did you talk about?\xa0\n\xa0\nF: She was giving me the recipe for her famous lasagna.\xa0\nSo your mum\u2019s lasagna is the DATA. What is the metadata of this phone call? The lasagna has data of its own attached to it: the date and time when the conversation happened, the duration of the call, the unique hardware identifiers of your phone and your mum\u2019s phone, the identifiers of the two sim cards, the location of the cell towers that pinged the call, the GPS coordinates of the phones themselves.\xa0\n\xa0\nF: yeah well, this lasagna comes with a lot of data :)\xa0\nAnd this is assuming that this data is not linked to any other data like your Facebook account or your web browsing history. More of that later.\xa0\n\xa0\nF: Whoa Whoa Whoa, ok. Let\u2019s put a pin in that. Going back to the \u201cbasic\u201d metadata that you describe. I think we understand the concept of data about data. I am sure you did your research and you would love to paint me a dystopian nightmare, as always. Tell us why is this a big deal?\xa0\n\xa0\nMetadata is a very big deal. In fact, metadata is far more \u201cuseful\u201d than the actual data, where by \u201cuseful\u201d I mean that it allows a third party to learn about you and your whole life. What I am saying is, the fact that you talk with your mum every day for 15 minutes is telling me more about you than the content of the actual conversations. In a way, the content does not matter. Only the metadata matters.\xa0\n\xa0\nF: Ok, can you explain this point a bit more?\xa0\n\xa0\nImagine this scenario: you work in an office in Brussels, and you go by car. Every day, you use your time in the car while you go home to call your mum. So every day around 6pm, a cell tower along the path from your office to your home pings a call from your phone to your mum\u2019s phone. Someone who is looking at your metadata, knows exactly where you are while you call your mum. Every day you will talk about something different, and it doesn't really matter.\xa0 Your location will come through loud and clear. A lot of additional information can be deduced from this too: for example, you are moving along a motorway, therefore you have a car. The metadata of a call to mum now becomes information on where you are at 6pm, and the way you travel.\xa0\n\xa0\nF: I see. So metadata about the phone call is, in fact, real data about me.\xa0\n\xa0\nExactly. YOU are what is interesting, not your mum\u2019s lasagna.\n\xa0\nF: you say so because you haven\u2019t tried my mum\u2019s lasagna. But I totally get your point.\n\xa0\nNow, imagine that one day, instead of going straight home, you decide to go somewhere else. Maybe you are secretly looking for another job. Your metadata is recording the fact that after work you visit the offices of a rival company. Maybe you are a journalist and you visit your anonymous source. Your metadata records wherever you go, and one of these places is your secret meeting with your source.\xa0Anyone\u2019s metadata can be combined with yours. There will be someone who was with you at the time and place of your secret meeting. Anyone who comes in contact with you can be tagged and monitored. Now their anonymity has been reduced.\xa0\n\xa0\nF: I get it. So, compared to the content of my c