Unsupervised broadcast news summarization; a comparative study on maximal marginal relevance (MMR) and latent semantic analysis (LSA)

نویسندگانMajid Ramezani, Mohammad-Salar Shahryari, Amir-Reza Feizi-Derakhshi, Mohammad-Reza Feizi-Derakhshi
همایش28th International Computer Conference, Computer Society of Iran (CSICC)
تاریخ برگزاری همایش2023-1-25
نوع ارائهسخنرانی
سطح همایشبین المللی

چکیده مقاله

The automatic speech summarization methods traditionally are classified into two groups: supervised and unsupervised methods. Supervised methods rely on a set of features, while unsupervised methods perform summarization through a set of rules. Among unsupervised automatic speech summarization methods, Latent Semantic Analysis (LSA) and Maximal Marginal Relevance (MMR) are so famous. This study set out to peruse the overall efficacy of two aforementioned unsupervised methods in summarization of Persian broadcast news transcriptions. The results justify the superiority of LSA to MMR during generic summarization. This is while MMR achieves better results in query-based summarization.