add the sample Wikipedia Page Count Data to DSW
Went thru training this week where we used scala to operate on the wikipedia page count data
sc.textFile("dbfs:/mnt/training-write/essentials/pagecounts/staging/")
have already moved my code just need to the data from the data bricks labs to DSW
3
votes
GB
shared this idea
-
Wikipedia Pagecounts dataset is now available:
http://support.datascientistworkbench.com/knowledgebase/articles/841728Wikipedia Clickstream dataset is also available:
http://support.datascientistworkbench.com/knowledgebase/articles/841716Cheers!
-
Are you looking specifically for pagecounts-20160215-210000.gz (so, pagecounts from February 15 from 9:00 to 10:00 in the evening London time) or a different date range?