Abstract:
Theproblemofcommunitydetectioninsocialmediahasbeenwidelystudiedinthe social networking community in the context of the structure of the underlying graphs. Mostcommunitydetectionalgorithmsusethelinksbetweenthenodesinordertodeterminethedenseregionsinthegraph. Thesedenseregionsarethecommunitiesofsocial mediainthegraph. Suchmethodsaretypicallybasedpurelyonthelinkagestructureof the underlying social media network. Community detection algorithms are fundamentaltoolsthatallowustouncoverorganizationalprinciplesinnetworks. Whendetecting communities, there are two possible sources of information one can use: the network structure, and the features and attributes of nodes. Even though communities form around nodes that have common edges and common attributes, typically, algorithms have only focused on one of these two data modalities: community detection algorithms traditionally focus only on the network structure, while clustering algorithms mostly consider only node attributes. Inthispaper,weexplorearangeofnetworkcommunitydetectionmethodsinorder tocomparethemandtounderstandtheirrelativeperformanceandthesystematicbiases in the clusters they identify. We evaluate several common objective functions that are usedtoformalizethenotionofanetworkcommunity,andweexamineseveraldifferent classes of approximation algorithms that aim to optimize such objective functions. In addition, rather than simply fixing an objective and asking for an approximation to the best community of any size, we consider a size-resolved version of the optimization problem. Consideringcommunityqualityasafunctionofitssizeprovidesamuchfiner lens with which to examine community detection algorithms, since objective functions and approximation algorithms often have non-obvious size-dependent behavior. And we propose a new algorithm Fast Network Clustering Algorithm (FaNClust) for better performance.
Description:
We are thankful to Almighty Allah for his blessings for the successful completion of our thesis. Ourheartiestgratitude,profoundindebtednessanddeeprespectgotooursupervisor, Md. Mahboob Karim, Instructor Class-A, Department of Computer Science and Engineering, Military Institute of Science and Technology, for his constant supervision, affectionate guidance and great encouragement and motivation. His keen interest on the topic and valuable advices throughout the study was of great help in completing thesis.
We are thankful to our co-supervisor Md. Shamsur Rahman, Doctoral Student (Ph.D. Candidate), Clayton School of Information Technology, Monash University, Clayton, Victoria, Australia for his continuous guidance & support.
We are especially grateful to the Department of Computer Science and Engineering (CSE) of Military Institute of Science and Technology (MIST) for providing their all out support during the thesis work.
Finally, we would like to thank our families and our course mates for their appreciable assistance, patience and suggestions during the course of our thesis.