Thursday, September 13, 2012

Is MapReduce on AWS fast?

I have been playing around with Elastic MapReduce on AWS. I ran the wordsplitter example from the AWS tutorial. The job took 3 minutes to complete the word count on 12 files.

I then rewrote the whole thing in plain Python using a dictionary (no MapReduce). That version took 4 seconds to run on the EC2 server. So I am not that impressed with MapReduce. The gap might be due to file access or job-creation overhead, but it is still hard to see what the fuss is about.
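For reference, here is a minimal sketch of what a dictionary-based word count could look like. It is not the exact script I timed. The function names `count_words` and `count_words_in_files` are made up for illustration, and splitting on whitespace is an assumption (the AWS example may tokenize differently).

```python
def count_words(lines):
    """Count whitespace-separated words in an iterable of text lines."""
    counts = {}
    for line in lines:
        # Assumption: words are separated by whitespace; case is left as-is.
        for word in line.split():
            counts[word] = counts.get(word, 0) + 1
    return counts


def count_words_in_files(paths):
    """Accumulate word counts across several text files into one dictionary."""
    totals = {}
    for path in paths:
        # A file object iterates line by line, so each file is streamed
        # rather than loaded into memory in one go.
        with open(path, encoding="utf-8", errors="replace") as handle:
            for word, n in count_words(handle).items():
                totals[word] = totals.get(word, 0) + n
    return totals
```

Calling `count_words_in_files` with a list of local file paths returns a single dictionary of totals. Everything runs in one process on one machine, so there is no job scheduling or cluster start-up to pay for, which is probably where much of the difference on a small input comes from.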
