*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************
Title: Social Computing for Personalization and Credible Information Mining using Probabilistic Graphical Models
Committee:
Dr. Faramarz Fekri, ECE, Chair , Advisor
Dr. Mark Davenport, ECE
Dr. Steven McLaughlin, ECE
Dr. Justin Romberg, ECE
Dr. Yajun Mei, ISyE
Abstract:
In this dissertation, we address challenging social computing problems in personalized recommender systems and social media information mining. We tap into probabilistic graphical models, including directed and undirected graphical models, to model a large number of observed and unobserved variables as well as various dependency relationships between variables, and develop efficient computation algorithms that exploit the graph structure to solve the problems.
In recommender systems, we propose probabilistic graphical models for Collaborative Filtering (CF) algorithms in various problem settings, and solve them using Belief Propagation (BP) algorithms that allow scalable and distributed implementations. Firstly, user similarities are computed in factor graphs. Then unknown ratings are predicted in Pairwise Markov Random Fields (PMRFs). Further, when online social networks of users are provided, a Bayesian Network (BN) recommendation system is constructed based on user relations to improve recommendation for cold-start users or users do not have sufficient ratings. To preserve user privacy, a semi-distributed item-based CF system is developed, which employs semi-distributed BP for item similarity computation in factor graphs, without disclosing ratings to the server or other peer users. Finally, to protect CF recommender systems from shilling attacks, a factor graph is proposed to jointly detect colluding spammers, which significantly improves detection accuracy over classification algorithms based on a single user's rating patterns.
In social media information mining, to detect false information and keep track of information credibility, we propose a generative probabilistic model to predict the credibility of events in Twitter-like social media using streaming tweets. The proposed algorithm predicts credibility much faster than existing offline algorithms and updates prediction online with newly observed tweets. Further, to identify suspicious users that perform malevolent activities such as spamming and phishing, we propose a probabilistic PMRF model for predicting the trustworthiness of social media users. The PMRF model improves prediction accuracy by taking into account user relationships compared to existing prediction algorithms for individual users.