GLAM Workbench
  • radio_button_unchecked

Trove newspapers: data dashboard¶

Trove's collection of digitised newspapers is always changing – new articles and newspapers are being added by the NLA, while Trove users are busy correcting OCRd text and adding tags and comments. The search you run today might produce different results than the same query did a month ago, a year ago, or ten years ago. Researchers need to understand how these changes affect the queries they make, the results they find, and the arguments they construct. This dashboard helps researchers understand the context of their queries by presenting a snapshot of Trove's digitised newspapers, based on weekly data harvests. It shows Trove's current make up from a number of angles, as well as highlighting recent changes. It is updated every Sunday.

  • Total articles and user activity
  • Total articles by publication year
  • Total articles by publication year and state
  • Article categories
  • Newspaper titles
  • Total articles by publication state
  • Significant events
Last updated: Sunday, 19 March 2023

Total articles and user activity¶

Latest data harvest 19 March 2023
First data harvest 24 April 2022
Total harvests 48
Data https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_activity.csv

As of 19 March 2023 there are 236,828,585 digitised newspaper articles in Trove.

  Current total Change since 12 Mar 2023 Change since 19 Feb 2023 Change since 24 Apr 2022 Percentage of total
All articles 236,828,585 29,926 67,746 2,316,495 100.00%
Articles with corrections 14,646,230 17,333 66,150 790,691 6.18%
Articles with tags 4,683,984 5,097 19,357 280,990 1.98%
Articles with comments 259,460 436 1,789 20,769 0.11%

Total articles by publication year¶

Latest data harvest 19 March 2023
First data harvest 19 April 2022
Total harvests 44
Data https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_year.csv

This chart shows the total number of digitised newspaper articles by year of publication. The distribution will change over time as digitisation priorities shift and more newspapers are added. This chart is useful in understanding how digitisation priorities and copyright restraints have shaped the total newspaper corpus.


Total articles by publication year and state¶

Latest data harvest 19 March 2023
First data harvest 19 April 2022
Total harvests 44
Data https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_year_and_state.csv

This interactive chart helps you explore the distribution of articles by both publication year and state. Click on a state in the bar chart or legend to filter the results by year. Click anywhere in the background of the chart to reset. This enables you to compare digitisation patterns in different states and see how they contribute to the whole corpus.

Total articles added since 05 Mar 2023 by publication year and state

This chart shows the total number of digitised newspaper articles added since the previous harvest by year and place of publication. It's useful in seeing how search results from particular time periods and locations might be affected by current digitisation priorities.

Total articles added since 19 Apr 2022 by publication year and state

This chart shows the total number of digitised newspaper articles added since the first harvest by year and place of publication. It's useful in seeing how search results from particular time periods or locations might be affected by recent digitisation priorities.