joshuago’s data-mining Bookmarks
If we examine the nontrivial-sized DBMS markets, it turns out that current relational DBMSs can be beaten by approximately a factor of 50 in most any market. A good overview of NoSQL alternatives for particular uses.
The three sexy skills of data geeks are statistics (which requires study), data munging (which demands suffering), and visualization (which favors those with a knack for storytelling).
The only metrics that entrepreneurs should invest energy in collecting are those that help them make decisions. Unfortunately, the majority of data available in off-the-shelf analytics packages are Vanity Metrics: they might make you feel good, but they don’t offer clear guidance for what to do.
Expect failures and embrace them. Fully automate your infrastructure deployments. Design your infrastructure so that it scales horizontally. Establish clear, measurable goals, such as a target response time. Be prepared to quickly identify and eliminate bottlenecks. Then play whack-a-mole for a while, until things get stable.
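As a rough illustration of what a "measurable goal" for response time might look like in practice, here is a minimal sketch that checks a 95th-percentile latency against a target and points at the slowest component. The function names, the 200 ms target, and the sample data are all hypothetical, chosen only for this example; the excerpt does not prescribe any particular metric or threshold.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    # pct * len is computed first so integer inputs avoid float rounding.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def meets_goal(samples_ms, goal_ms, pct=95):
    """True if the pct-th percentile response time is within the goal."""
    return percentile(samples_ms, pct) <= goal_ms


def slowest_component(timings_ms, pct=95):
    """Given {component name: [durations in ms]}, return the name with
    the worst pct-th percentile, i.e. the first bottleneck to look at."""
    return max(timings_ms, key=lambda name: percentile(timings_ms[name], pct))


if __name__ == "__main__":
    # Hypothetical measurements, in milliseconds.
    responses = [100] * 19 + [900]
    print("goal met:", meets_goal(responses, goal_ms=200))
    per_component = {"db": [10, 12, 300], "cache": [1, 2, 3]}
    print("look at first:", slowest_component(per_component))
```

A percentile is used here rather than a mean so that a single slow outlier does not hide or dominate the result; whether p95 is the right goal depends on the service.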