Machine Learning for Detecting Risky Conversations on Instagram

Category: Computer Science

tldr #

A team of researchers from four universities has proposed a way to use machine learning to flag risky conversations on Instagram without having to eavesdrop on them. In testing, the new system was 1,000 times more effective at detecting conversations containing cyberbullying and other risk factors than existing software, because it takes context and intent into consideration.


content #

As regulators and providers grapple with the dual challenges of protecting younger social media users from harassment and bullying, while also taking steps to safeguard their privacy, a team of researchers from four leading universities has proposed a way to use machine learning technology to flag risky conversations on Instagram without having to eavesdrop on them.

The discovery could open opportunities for platforms and parents to protect vulnerable, younger users while preserving their privacy.

The team, led by researchers from Drexel University, Boston University, the Georgia Institute of Technology and Vanderbilt University, recently published its timely work in the Proceedings of the Association for Computing Machinery's Conference on Human-Computer Interaction: an investigation of which types of data input, such as metadata, text, and image features, could be most useful for machine learning models to identify risky conversations. Their findings suggest that risky conversations can be detected from metadata characteristics, such as conversation length and how engaged the participants are.

Their efforts address a growing problem on the most popular social media platform among 13-to-21-year-olds in America. Recent studies have shown that harassment on Instagram is leading to a dramatic uptick in depression among its youngest users, particularly a rise in mental health problems and eating disorders among teenage girls.

"The popularity of a platform like Instagram among young people, precisely because of how it makes its users feel safe enough to connect with others in a very open way, is very concerning in light of what we now know about the prevalence of harassment, abuse, and bullying by malicious users," said Afsaneh Razi, Ph.D., an assistant professor in Drexel's College of Computing & Informatics, who was a co-author of the research.

At the same time, platforms are under increasing pressure to protect their users' privacy in the aftermath of the Cambridge Analytica scandal and the European Union's precedent-setting privacy protection laws. As a result, Meta, the company behind Facebook and Instagram, is rolling out end-to-end encryption of all messages on its platforms.
This means that the content of messages is technologically secured and can be accessed only by the people in the conversation. But this added level of security also makes it more difficult for platforms to employ automated technology to detect and prevent online risks, which is why the group's system could play an important role in protecting users.

"One way to address this surge in bad actors, at a scale that can protect vulnerable users, is automated risk-detection programs," Razi said. "But the challenge is designing them in an ethical way that enables them to be accurate but also non-privacy-invasive. It is important to put the younger generation's safety and privacy as a priority when implementing security features such as end-to-end encryption in communication platforms."

The system developed by Razi and her colleagues uses machine learning algorithms in a layered approach that creates a metadata profile of a risky conversation (it is likely to be short and one-sided, for example), combined with context clues, such as whether images or links are sent. In their testing, the program was 1,000 times more effective at detecting conversations that contained cyberbullying and other risk factors than existing software that does not take context, such as the meaning and intent of text, into consideration.
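To make the metadata-only idea concrete, here is a minimal sketch in Python. The message schema, feature names, and hand-set weights are all hypothetical illustrations, not the study's actual features or trained model: it derives privacy-preserving conversation-level features (length, one-sidedness, media rate) that never touch message content, and combines them with a logistic score of the kind a trained classifier would produce.

```python
import math

def extract_features(messages):
    """Compute conversation-level metadata features.

    `messages` is a list of dicts with keys "sender", "has_image",
    "has_link", and "length" (message length in characters) -- a
    hypothetical schema for illustration only. No message text is read.
    """
    n = len(messages)
    senders = [m["sender"] for m in messages]
    # Share of messages sent by the most active participant:
    # close to 1.0 means the conversation is one-sided.
    top_share = max(senders.count(s) for s in set(senders)) / n
    return {
        "n_messages": n,
        "one_sidedness": top_share,
        "avg_length": sum(m["length"] for m in messages) / n,
        "media_rate": sum(1 for m in messages
                          if m["has_image"] or m["has_link"]) / n,
    }

def risk_score(features, weights, bias):
    """Logistic score in (0, 1); in practice the weights and bias
    would be learned from labeled conversations, not set by hand."""
    z = bias + sum(weights[k] * v for k, v in features.items())
    return 1.0 / (1.0 + math.exp(-z))
```

Because the score depends only on metadata such as how long and how lopsided a conversation is, this kind of check could in principle run even when end-to-end encryption keeps the message content itself off limits.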

The research paper was published in the Proceedings of the Association for Computing Machinery's Conference on Human-Computer Interaction.
