Data Analysis with Lee Hawthorn

Data Analysis Prep

Topics: R

When I embarked on learning R last year I decided to jump whole heartedly into the open source stack with Linux, using a virtual machine running Linux Mint.

There were various reason for this such as productivity with the terminal and easier access to Linux data sources, not to mention, it’s much easier to deploy R scripts with Linux.

During my Data Analysis adventures I found I had to customise Linux to make it more suitable for Data Analysis with R.

I had to rebuild the VM this week, don’t ask! Of course I had to scratch around for the configs again.

Here are the steps I used to get Mint 17.1/Ubuntu 14.04 LTS, primed for Data Analysis with R.

From the terminal :

  1. Install the latest version of R, Git, sqLite and libraries that I’ve found are required for data & web munging. Be sure to change to the url to your local mirror found on this page.
sudo add-apt-repository "deb http://cran.rstudio.com/bin/linux/ubuntu trusty/"
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install r-base r-base-dev r-cran-rodbc libxml2-dev libcurl4-openssl-dev git-core sqlite3
  1. Set up GIT

    git config --global user.name "Your Name"
    git config --global user.email "your@email.com"
  2. Install R Studio

    cd ~
    wget http://download1.rstudio.org/rstudio-0.98.1091-amd64.deb
    sudo dpkg -i rstudio-0.98.1091-amd64.deb
    sudo rm rstudio-0.98.1091-amd64.deb

    This should get you going. I’ll add to this if I discover any new config on my travels.

Previous PostScraping data with R
Next PostData Analytics modelling, why tune by hand?