Links
-
Recent Posts
- Rough Draft Done!
- I Started Writing a Technical Book... (part 3)
- I Started Writing a Technical Book... (part 2)
- I Started Writing a Technical Book... (part 1)
- Trump + The Scream = ...
- ImageNet CNN Architecture Image
- Displaying Digits of Pi on Raspberry Pi in Python
- Random Buffy the Vampire Slayer Episode Generator
- Uber Taxi Bar Chart Fail
- The Hobbies of the Scripps 2014 Spelling Bee Contestants
- On the Seasonality of Hoppy Beers
- A Small Trick for Big Data in R
Archives
Categories
Meta
Tag Archives: data
The Hobbies of the Scripps 2014 Spelling Bee Contestants
I downloaded text data from http://public.spellingbee.com/public/results/2014/round_results and performed some text mining on the contestant interviews and came up with some interesting results. The hobby with the highest round average was Volunteering followed by Chess and Movies. I filtered for hobbies … Continue reading
A Small Trick for Big Data in R
The other day I was writing a prediction script in R for a very large data set. I was data prepping and needed to create a logical vector from a numeric vector. I didn't want to spend the time loading … Continue reading
Webscraping in R: Part 1
It's hard to believe, but webscraping in R can be done really easily. With no fancy packages either. I recently ran into the need to scrape weather information from the web. After writing the program, I realized this specific task … Continue reading
Using DPLYR in R: A Quicker, Easier Way to Work with Data
As I promised, I thought I would show an example using DPLYR. I decided to create my own data set, instead of going with a canned data set in R. I did this to preface my next few posts. After … Continue reading
Fast Data Frame Modification in R
One of the frequent data structures I use in R is a data frame. Data frames are similar to matrices except they allow different types of variables in each column. I always rejoice when I can reduce the analysis at … Continue reading
The Simplicity of Python (an example)
Previously, I posted on how to forward fill missing data in columns of data in R and Excel. I just figured out how to do this in Python, and was blown away. Ready? data = Series(['A','B','C','D','E','F'], index = [0,2,5,7,12,14]) data.reindex(range(14),method='ffill') … Continue reading
Separating Blocks of Numbers in Columns of Data
More and more I'm using the 'apply' functions in R (apply, mapply, sapply, lappy,...). The help functions on these are hard to decipher. Not till I read a post by Neil Saunders, did I really start using them instead of … Continue reading
Quick Data Tip in R and Excel
Last week, a colleague showed me a trick in excel. And it really bugs me when you can do something in excel that R can not do easily. So I gave it some thought and came up with a solution. … Continue reading