Skip to Main content Skip to Navigation

Exploring Topic Evolution in Large Scientific Archives with Pivot Graphs

Abstract : There is an increasing demand for practical tools to explore the evolution of scientific research published in bibliographic archives such as the Web of Science (WoS), arXiv, PubMed or ISTEX. Revealing meaningful evolution patterns from these document archives has many applications and can be extended to synthesize narratives from datasets across multiple domains, including news archives, legal document archives and works of literature.In this thesis, we propose a data model and query language for the visualization and exploration of topic evolution graphs. Our model is independent of a particular topic extraction and alignment method and proposes a set of semantic and structural metrics for characterizing and filtering meaningful topic evolution patterns. These metrics are particularly useful for the visualization and the exploration of large topic evolution graphs. We also present a prototype implementation of our model on top of Apache Spark and experimental results obtained for four real-world document archives.
Complete list of metadata
Contributor : Bernd Amann <>
Submitted on : Friday, July 23, 2021 - 1:06:09 PM
Last modification on : Monday, July 26, 2021 - 11:27:36 AM


Files produced by the author(s)


  • HAL Id : tel-03297258, version 1


Ke Li. Exploring Topic Evolution in Large Scientific Archives with Pivot Graphs. Databases [cs.DB]. Sorbonne Université, 2021. English. ⟨tel-03297258⟩



Record views


Files downloads