References & Citations
Statistics > Methodology
Title: Model-based clustering in simple hypergraphs through a stochastic blockmodel
(Submitted on 12 Oct 2022 (v1), last revised 17 May 2024 (this version, v3))
Abstract: We propose a model to address the overlooked problem of node clustering in simple hypergraphs. Simple hypergraphs are suitable when a node may not appear multiple times in the same hyperedge, such as in co-authorship datasets. Our model generalizes the stochastic blockmodel for graphs and assumes the existence of latent node groups and hyperedges are conditionally independent given these groups. We first establish the generic identifiability of the model parameters. We then develop a variational approximation Expectation-Maximization algorithm for parameter inference and node clustering, and derive a statistical criterion for model selection.
To illustrate the performance of our R package HyperSBM, we compare it with other node clustering methods using synthetic data generated from the model, as well as from a line clustering experiment and a co-authorship dataset.
Submission history
From: Catherine Matias [view email][v1] Wed, 12 Oct 2022 07:46:01 GMT (102kb,D)
[v2] Wed, 9 Aug 2023 13:08:42 GMT (165kb,D)
[v3] Fri, 17 May 2024 10:24:11 GMT (244kb,D)
Link back to: arXiv, form interface, contact.