• Data Structures
    • D Comer. The Ubiquitous B-Tree. ACM Computing Surveys. 1979 Jun;11(12).
  • The Burrows-Wheeler Progression
    • U Manber and G Myers. Suffix Arrays: A New Method for On-Line String Searches. SIAM J. Comput. 1993;22(5):935-48.
    • M Burrows and DJ Wheeler. A Block-sorting Lossless Data Compression Algorithm. Technial Report 124. 1994 May 10; Digital Equipment Corporation.
    • P Ferragina and G Manzini. Opportunistic Data Structures with Applications. In Proc. of the 41st Symposium on Foundations of Computer Science (FOCS 2000). Redondo Beach, CA, USA; 2000 Nov: 390-8.
    • B Langmead B, C Trapnell, M Pop, and SL Salzberg. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009 Mar 4;10(3):R25.
    • H Li and R Durbin. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2010 Mar 1;26(5):589-95.
    • R. Durbin. Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT) Bioinformatics. 2014 May 1;30(9):1266-72.
  • Distributed Systems
    • J Dean and S Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In Proc. of the 6th Symposium on Operating Systems Design and Implementation (OSDI 2004). San Francisco, CA, USA; 2004 Dec.
    • G DeCandia et al. Dynamo: Amazon's highly available key-value store. In Proc. of twenty-first ACM SIGOPS symposium on Operating systems principles (SOSP 2007). Stevenson, WA, USA; 2007 Oct:205-20.
    • A Thusoo et al. Hive - A Warehousing Solution Over a Map-Reduce Framework. In Proc. of Very Large Data Bases (VLDB 2009). Lyon, France. 2009 Aug: 1626-9.
    • A Gates et al. Building a High-Level Dataflow System on top of MapReduce: The Pig Experience. In Proc. of Very Large Data Bases (VLDB 2009). Lyon, France. 2009 Aug: 1414-25.
    • K Shvachko et al. The Hadoop Distributed File System. IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010). Incline Village, NV, USA; 2010 May: 1-10.