joshuago’s algorithms Bookmarks

11 FEB 2012
STXXL

The core of STXXL is an implementation of the C++ Standard Template Library (STL) for external-memory (out-of-core) computations, i.e., STXXL implements containers and algorithms that can process volumes of data too large to fit in main memory and that must therefore live on disk.

10 FEB 2012
How the Boehm Garbage Collector Works

The Boehm GC is able to function without any cooperation from the compiler or the runtime environment. In C, the only adjustment one needs to make is to redirect calls to stdlib’s malloc()/free() to equivalent ones supplied by the Boehm GC.

27 JAN 2012
solidDB and the secrets of speed

The most common in-memory database index strategy is the T-tree. IBM solidDB instead uses an index called a trie (or prefix tree), which was originally created for text searching but turns out to be well suited to in-memory indexing.
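
As a reminder of what a trie looks like, here is a minimal sketch in Python. It is a plain character-per-node prefix tree for illustration only, not solidDB's index; the names `Trie`, `TrieNode`, and `keys_with_prefix` are made up for this example.

```python
class TrieNode:
    def __init__(self):
        self.children = {}     # maps one character to the next node
        self.terminal = False  # True if a stored key ends at this node
        self.value = None      # payload attached to that key


class Trie:
    def __init__(self):
        self.root = TrieNode()

    def insert(self, key, value):
        node = self.root
        for ch in key:
            node = node.children.setdefault(ch, TrieNode())
        node.terminal = True
        node.value = value

    def _walk(self, key):
        # Follow the key one character at a time; None if the path is missing.
        node = self.root
        for ch in key:
            node = node.children.get(ch)
            if node is None:
                return None
        return node

    def get(self, key):
        node = self._walk(key)
        return node.value if node is not None and node.terminal else None

    def keys_with_prefix(self, prefix):
        # Collect every stored key that starts with the given prefix.
        start = self._walk(prefix)
        if start is None:
            return []
        found = []
        stack = [(start, prefix)]
        while stack:
            node, path = stack.pop()
            if node.terminal:
                found.append(path)
            for ch, child in node.children.items():
                stack.append((child, path + ch))
        return sorted(found)
```

Lookup cost depends on the length of the key rather than on the number of keys stored, which is part of why the structure is attractive for in-memory indexes.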

18 JAN 2012
Damn Cool Algorithms: Spatial indexing with Quadtrees and Hilbert Curves - Nick's Blog

Spatial indexing is increasingly important as more and more data and applications are geospatially-enabled. Efficiently querying geospatial data, however, is a considerable challenge: because the data is two-dimensional (or sometimes, more), you can't use standard indexing techniques to query on position. Spatial indexes solve this through a variety of techniques.
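
One of the techniques in the title, the Hilbert curve, maps a 2-D cell to a single number so that an ordinary one-dimensional index can be used. The sketch below is the well-known iterative conversion, written in Python; the function name `hilbert_index` is chosen for this example, and the grid side `n` is assumed to be a power of two.

```python
def hilbert_index(n, x, y):
    """Return the position of cell (x, y) along a Hilbert curve
    covering an n-by-n grid, where n is a power of two."""
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        # Each quadrant contributes s*s cells; (3*rx) ^ ry picks its order.
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the sub-curve is oriented correctly.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s //= 2
    return d
```

Cells that are consecutive along the curve are always neighbours in the grid, so sorting points by `hilbert_index` tends to keep nearby points close together in the index.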

19 DEC 2011
[Stack Overflow] What are the lesser known but cool data structures?

A feast for the mind. Excellent for review, exploration, and inspiration.

04 MAY 2011
[Coda Hale] How To Safely Store A Password

Use bcrypt because it's slow as hell. It introduces a work factor which affects how expensive the hash function will be, and can keep up with Moore's law.
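
To illustrate what a tunable work factor means, here is a small Python sketch. It does not use bcrypt: to stay within the standard library it substitutes PBKDF2 (`hashlib.pbkdf2_hmac`), whose iteration count plays a similar role. The function names and the `iterations$salt$digest` storage format are assumptions made up for this example.

```python
import hashlib
import hmac
import os


def hash_password(password, iterations=100_000):
    # A fresh random salt per password defeats precomputed tables.
    salt = os.urandom(16)
    digest = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, iterations
    )
    # Store the work factor alongside the hash so it can be raised later.
    return f"{iterations}${salt.hex()}${digest.hex()}"


def verify_password(password, stored):
    iterations, salt_hex, digest_hex = stored.split("$")
    digest = hashlib.pbkdf2_hmac(
        "sha256",
        password.encode("utf-8"),
        bytes.fromhex(salt_hex),
        int(iterations),
    )
    # Constant-time comparison avoids leaking how many bytes matched.
    return hmac.compare_digest(digest, bytes.fromhex(digest_hex))
```

Because the iteration count is stored with each hash, it can be increased for new passwords as hardware gets faster while old hashes remain verifiable.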

24 MAR 2011
[Joshua Bloch] Nearly All Binary Searches and Mergesorts are Broken

We programmers need all the help we can get, and we should never assume otherwise. Joshua Bloch of Google walks through a binary search implementation to discuss a bug that went undetected for years.
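
The bug is in the midpoint calculation: with fixed-width integers, `(low + high) / 2` overflows once the sum exceeds the largest representable value. Python integers do not overflow, so the sketch below simulates 32-bit signed arithmetic with a helper (`to_int32`, a name invented for this example) to show the failure next to the safe form `low + (high - low) / 2`.

```python
def to_int32(v):
    # Wrap an arbitrary Python int to a 32-bit signed value.
    v &= 0xFFFFFFFF
    return v - 0x100000000 if v >= 0x80000000 else v


def broken_mid(low, high):
    # Mimics `(low + high) / 2` with 32-bit ints: the sum can wrap negative.
    total = to_int32(low + high)
    half = abs(total) // 2  # truncate toward zero, as C and Java do
    return half if total >= 0 else -half


def safe_mid(low, high):
    # The difference never exceeds the range, so no overflow is possible.
    return low + (high - low) // 2


def binary_search(items, target):
    """Return the index of target in the sorted list items, or -1."""
    low, high = 0, len(items) - 1
    while low <= high:
        mid = safe_mid(low, high)
        if items[mid] < target:
            low = mid + 1
        elif items[mid] > target:
            high = mid - 1
        else:
            return mid
    return -1
```

With `low = 2**30` and `high = 2**30 + 2`, `broken_mid` yields a negative index while `safe_mid` yields `2**30 + 1`.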

30 DEC 2010
[Jeff Huang] Best paper awards

The best computer science papers from various top-tier conferences.

27 AUG 2010
10 Books that will Substitute A Computer Science Degree

A good list of books to refresh, replace, or supplement a core computer science education.

23 AUG 2010
Why GNU grep is fast

The original author of GNU grep explains the inner workings that make it fast.