Install ggplot2 for SparkR
Install ggplot2 for SparkR. It is a special package just for Spark found at:
https://github.com/PAPL-SKKU/ggplot2.SparkR
Please also reference this talk from Spark Summit East 2016:
https://spark-summit.org/east-2016/events/ggplot2sparkr-rebooting-ggplot2-for-scalable-big-data-visualization/
ggplot2.SparkR is now available for both Jupyter notebook and RStudio users of the DSWB:
JUPYTER NOTEBOOKS
Feature 560 ggplot2.SparkR package is now available
RSTUDIO IDE
Feature 561 ggplot2.SparkR package is now available for plotting huge data sets
-
Darragh Hanley commented
I would suggest just installing all packages in R. I am struggling to find a use case for RStudio in Data Scientist Wbench due to missing packages.
Kaggle offer the same functionality but have all packages installed. See this article for how they do it with docker containers : http://blog.kaggle.com/2016/02/05/how-to-get-started-with-data-science-in-containers/