On a trip back to India two years ago, I struck up a conversation with the helper in my family's home, Jayanthi. Jayanthi is from one of the oppressed castes in India. She had been working in other people's homes, sweeping floors, washing clothes, and scrubbing dishes, ever since she was a teenager. That had been her 'family profession'.
One blistering afternoon, as an enormous ceiling fan was whirling over our heads, I asked Jayanthi if her daughters went to school. Would she send them to college some day? I asked, hoping for an affirmative and fearing the opposite.
"No, ma," she said, "What are they going to do with school?"
"Wouldn't you like to see them in an office job?" I asked.
"No, that's never going to happen," she shook her head and dismissed the notion as though it were as laughable as a unicorn in the middle of our living room.
Later that night, as I lay on fresh-smelling sheets, I reflected on our chat. I valued education above most things in life. That's partly because I had the advantage of a father who believed in education. He himself had grown up in a family filled with scholars: one of his grandfathers was a local magistrate, and the other was a doctor. Our family had been steeped in education for generations. I had personally seen how education can elevate a person's life.
Jayanthi did not have the same schemas in her head that equated learning to empowerment and progress. Her options, dreams, and prospects were circumscribed by the norms for her caste and class. I had seen many people like Jayanthi, unable to break their cycles because of history and biases. Jayanthi and I lived worlds apart, even though we inhabited the same space under that ceiling fan. Unless something changed drastically, her family would likely never catch up to mine in their quality of life. Our gap could even, sadly, widen over time.
This little encounter came into my head recently when I read a book about the use of data in mass decision-making systems and how that results in greater inequality in our society. The book is called "Weapons of Math Destruction", written by Cathy O'Neil.
In modern times, mass decision-making algorithms decide many aspects of our lives. They decide who gets a loan, who goes to top schools, who gets locked up in a prison, who is hired for jobs, and so on. They can bring greater fortune or greater ruin to individuals.
They often use heuristics (aka 'rules of thumb') to make such judgements. These heuristics are coded in or learned from data by the machine. They are usually based on the belief that the past can predict the future.
There are three problems with such heuristics.
It is not just you, the individual, on whom the bright gaze of these algorithms falls. The algorithms try to predict your behavior based on behavior of people 'like you' and their past too. People from the same race as you, the same zip code as you, the same economic stratum as you.
In human brain, such pattern matching is not new and not wholly pernicious. It helped our ancestors survive in the Savannahs. Our IQ tests judge us as intelligent if we can group similar objects based on abstract patterns. But it can often lead us astray.
It leads us to believe that a woman who has neatly styled hair, wearing bold lipstick and six-inch heels, does not fit the mold of a scientist. It can cause juries to pre-judge a black man guilty. It can cause school admissions officials to judge Jayanthi's daughter, living in her rickety shack in a squalid slum with illiterate parents, to be less likely to succeed in their schools. These are dangerous biases and stereotypes in our heads. Many of us will steer away from them if we become aware of them. But when they come sheathed in the sophistication, the supposed objectiveness, and the opaqueness of our algorithms, we simply accept them as truths.
Our data systems miss the lone pandas too. Often our data systems make decisions from aggregate wholes and miss the nuances of individuals in the group. For example, if you made a decision about the need of bamboo sticks using the average of this picture, most likely the panda would not get her bamboo. That - indeed - would be sad.
The truth is that what applies to the sum may not apply to the constituent parts. Our mass data systems, however, ignore that. They aggregate data and present conclusions that human decision-makers then turn around and apply to individuals. It becomes more damaging when these algorithms are used over large numbers of heterogenous people, do not use feedback loops to correct their mistaken inferences, and act as black boxes whose conclusions cannot be taken apart with reasoning.
I live and breath data in my work. I have built multiple software systems that use intelligence to aid human decision-making. I am not against the use of data, far from it. In our modern lives, data helps us avoid congested roads when we drive. It prods us to pack an extra sweater for our trip abroad. It tells me that I should try Yuval Noah Harari's book next, because I have liked similar books in the past and my friends have liked his writing. Modern data systems have come to understand us as well as, or even better than, our dearest friends.
But Cathy O'Neil is right. Our data systems can make mistakes. And when they make mistakes at massive scales, the result may be as innocuous as a few drenched and unhappy humans who didn't take an umbrella with them when it rained. But it may also result in the erosion of our democracy, the widening of inequality, and the destruction of justice.
Human societies have been riddled with inequality since pre-historic times. Jayanthi's station in life comes from centuries of oppression of some humans by others. But the arc of history has been bending relentlessly toward justice. As we build our future, as we architect a world driven more and more by math, let us not reverse that. Let's be cognizant of biases in data. With great power comes great responsibility. Data and math are insanely powerful. Let us wield them responsibly. Let's build trust in our systems with transparency, feedback loops, and diversity in the process of training and creating them.
I enjoyed reading the book Weapons of Math Destruction. It was thought-provoking, much needed in these times reverberating with voices of unity as well of division.
Couldn't find the panda? Locate the snowman with a checkered green scarf, move four figures to the right and look at one row above.