joshuago’s data-mining Bookmarks

02 JUL 2009
[Communications of the ACM] The End of a DBMS Era (Might be Upon Us)

If we examine the nontrivial-sized DBMS markets, it turns out that current relational DBMSs can be beaten by approximately a factor of 50 in most any market. A good overview of NoSQL alternatives for particular uses.

28 MAY 2009
[Dataspora] The Three Sexy Skills of Data Geeks

The three sexy skills of data geeks are statistics (which requires study), data munging (which demands suffering), and visualization (which favors those with a knack for storytelling).

19 MAY 2009
[Eric Ries] Vanity Metrics vs. Actionable Metrics

The only metrics that entrepreneurs should invest energy in collecting are those that help them make decisions. Unfortunately, the majority of data available in off-the-shelf analytics packages are Vanity Metrics: they might make you feel good, but they don’t offer clear guidance for what to do.

08 APR 2009
[Agile Testing] Experiences deploying a large-scale infrastructure in Amazon EC2

Expect failures and embrace them. Fully automate your infrastructure deployments. Design your infrastructure so that it scales horizontally. Establish clear measurable goals -- for example, response time. Be prepared to quickly identify and eliminate bottlenecks. Then play whack-a-mole for a while, until things get stable.