Skip to content

Access Wikipedia Pagecounts dataset in Jupyter Scala notebook

Sample Wikipedia data sets are available within the DSWB Spark Sandbox.

Wikipedia Pagecounts

This dataset gives the total counts of monthly visitors for every page on Wikipedia and related sites across all languages.

Access the dataset by consulting this Jupyter Scala notebook.
Copy the url above and import it in your Data Scientist Workbench by pasting it in the appropriate box.

Submit an idea if you'd like a different data set to be made available.

Feedback and Knowledge Base