Trove newspapers: data dashboard¶
Trove's collection of digitised newspapers is always changing – new articles and newspapers are being added by the NLA, while Trove users are busy correcting OCRd text and adding tags and comments. The search you run today might produce different results than the same query did a month ago, a year ago, or ten years ago. Researchers need to understand how these changes affect the queries they make, the results they find, and the arguments they construct. This dashboard helps researchers understand the context of their queries by presenting a snapshot of Trove's digitised newspapers, based on weekly data harvests. It shows Trove's current make up from a number of angles, as well as highlighting recent changes. It is updated every Sunday. To explore past changes, see the dashboard archive.
Total articles and user activity¶
Latest data harvest | 07 July 2024 |
---|---|
First data harvest | 24 April 2022 |
Total harvests | 115 |
Data | https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_activity.csv |
As of 07 July 2024 there are 249,154,934 digitised newspaper articles in Trove.
Current total | Change since 30 Jun 2024 | Change since 09 Jun 2024 | Change since 24 Apr 2022 | Percentage of total | |
---|---|---|---|---|---|
All articles | 249,154,934 | 154,458 | 374,876 | 14,642,844 | 100.00% |
Articles with corrections | 15,793,837 | 17,443 | 70,568 | 1,938,298 | 6.34% |
Articles with tags | 4,948,038 | 2,312 | 20,062 | 545,044 | 1.99% |
Articles with comments | 284,588 | 396 | 2,467 | 45,897 | 0.11% |
Total articles by publication year¶
Latest data harvest | 07 July 2024 |
---|---|
First data harvest | 19 April 2022 |
Total harvests | 111 |
Data | https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_year.csv |
This chart shows the total number of digitised newspaper articles by year of publication. The distribution will change over time as digitisation priorities shift and more newspapers are added. This chart is useful in understanding how digitisation priorities and copyright restraints have shaped the total newspaper corpus.
Total articles by publication year and state¶
Latest data harvest | 07 July 2024 |
---|---|
First data harvest | 19 April 2022 |
Total harvests | 111 |
Data | https://github.com/wragge/trove-newspaper-totals/blob/master/data/total_articles_by_year_and_state.csv |
This interactive chart helps you explore the distribution of articles by both publication year and state. Click on a state in the bar chart or legend to filter the results by year. Click anywhere in the background of the chart to reset. This enables you to compare digitisation patterns in different states and see how they contribute to the whole corpus.
Total articles added since 30 Jun 2024 by publication year and state
This chart shows the total number of digitised newspaper articles added since the previous harvest by year and place of publication. It's useful in seeing how search results from particular time periods and locations might be affected by current digitisation priorities.
Total articles added since 19 Apr 2022 by publication year and state
This chart shows the total number of digitised newspaper articles added since the first harvest by year and place of publication. It's useful in seeing how search results from particular time periods or locations might be affected by recent digitisation priorities.