Research is underway to create anxiety and depression prediction models, using artificial intelligence (AI) and Twitter, one of the world’s largest social media platforms, that could detect signs of these illnesses before clinical diagnosis, as per researchers.
Researchers at the University of São Paulo (USP) in Brazil said that preliminary findings from the model suggest the possibility of detecting the likelihood of a person developing depression based solely on their social media friends and followers.
The findings are publish in the journal Language Resources and Evaluation.
There are multiple studies involving natural language processing (NLP) focus on depression, anxiety and bipolar disorder, most of these analyse English texts and did not match Brazilians’ profiles, as per researchers said.
The first step in this study involve constructing a database, call as SetembroBR, of information relating to a corpus of 47 million publicly posted Portuguese texts and the network of connections between 3,900 Twitter users.
These users had reportedly diagnose with or treat for mental health problems before the survey.
The tweets were collect during the COVID-19 pandemic.
Ivandre Paraboni, last author of the article and a professor at USP Said :
Because people with mental health problems tend to follow certain accounts such as discussion forums, influencers and celebrities who publicly acknowledge their depression, the study also collect tweets from friends and followers.
The second step, still in progress, has provide some preliminary findings, such as the possibility of detecting the likelihood of a person developing depression based solely on their social media friends and followers, without taking their own posts into account.
Following pre-processing of the corpus to maintain original texts by removing non-standard characters, the researchers deploy deep learning (AI), to create four text classifiers and word embeddings (context-dependent mathematical representations of relations between words) using models based on bidirectional encoder representations from transformers (BERT), a machine learning algorithm employe for NLP.
These models correspond to a neural network that learns contexts and meanings by monitoring sequential data relationships, such as words in a sentence.
The training input consist of a sample of 200 tweets select at random from each user.
Researchers find that among the models, BERT perform best in terms of predicting depression and anxiety.
They said that because the models analyse sequences of words and complete sentences, it was possible to observe that people with depression, for example, tend to write about subjects connect to themselves, using verbs and phrases in the first person, as well as topics such as death, crisis and psychology.
Ivandre Paraboni Said :
Researchers are now extending the database, refining their computational techniques and upgrading the models in order to see if they can produce a tool for future use in screening prospective sufferers from mental health problems and helping families and friends of young people at risk from depression and anxiety.