Feb 19, 2020 – 11:45 AM
DIISM, Artificial Intelligence laboratory (room 201), Siena SI
We consider textual interaction data involving two disjoint sets of individuals/objects. An example of such data is given by the reviews on web platforms (e.g. Amazon, TripAdvisor, etc.) where buyers comment on products/services they bought.
We develop a new generative model, the latent topic block model (LTBM), along with an inference algorithm to simultaneously partition the elements of each set, accounting for the textual information. The estimation of the model parameters is performed via a variational version of the expectation maximization (EM) algorithm. A model selection criterion is formally obtained to estimate the number of
partitions. Numerical experiments on simulated data are carried out to highlight the main features of the estimation procedure. Two real-world datasets %from the Amazon e-commerce system are finally employed to show the usefulness of the proposed approach.