Category Archives: data

Short Circuiting Logical Statements in R

Short circuiting logical statements are the way ‘lazy’ programming languages evaluate logical statements.  To identify whether or not the programming language you are using is lazy or eager, reference the chart on the wiki page: http://en.wikipedia.org/wiki/Short-circuit_evaluation The lazy (and quicker) … Continue reading

Posted in analysis, data, R | Tagged , | Leave a comment

Big Data Introduction

Happy Holidays! For the next few posts, I will be talking about using some tools/tricks on big data. I think that the term 'big data' gets thrown around a bit much these days. The more common definition of big data … Continue reading

Posted in analysis, data | Tagged , , | Leave a comment

The Simplicity of Python (an example)

Previously, I posted on how to forward fill missing data in columns of data in R and Excel. I just figured out how to do this in Python, and was blown away. Ready? data = Series(['A','B','C','D','E','F'], index = [0,2,5,7,12,14]) data.reindex(range(14),method='ffill') … Continue reading

Posted in analysis, data, Python | Tagged , | Leave a comment

Intro to Survey Analysis

Lately, more people have approached me with questions on how to analyze survey data. There are very interesting problems associated with survey analysis. In dealing with surveys, I have seen mistakes made in creating them. Here are some general rules … Continue reading

Posted in analysis, data | Tagged , , | Leave a comment

Separating Blocks of Numbers in Columns of Data

More and more I'm using the 'apply' functions in R (apply, mapply, sapply, lappy,...). The help functions on these are hard to decipher. Not till I read a post by Neil Saunders, did I really start using them instead of … Continue reading

Posted in data, R | Tagged , | Leave a comment

Quick Data Tip in R and Excel

Last week, a colleague showed me a trick in excel.  And it really bugs me when you can do something in excel that R can not do easily.  So I gave it some thought and came up with a solution.  … Continue reading

Posted in data | Tagged , , , | Leave a comment

Approaching Data Analysis

All data analysis projects start with a problem.  We (hopefully) form a hypothesis before exploring the data, and we go about trying to confirm/disprove hypotheses with statistical procedures.  These statistical procedures could be a simple t-test, a more complicated non-parametric … Continue reading

Posted in data | Tagged , , , | Leave a comment