- Data Structures
-
D Comer. The Ubiquitous B-Tree. ACM Computing Surveys.
1979 Jun;11(12).
- The Burrows-Wheeler Progression
-
U Manber and G Myers. Suffix Arrays: A New Method for On-Line String
Searches. SIAM J. Comput. 1993;22(5):935-48.
-
M Burrows and DJ Wheeler. A Block-sorting Lossless Data Compression
Algorithm. Technial Report 124. 1994 May 10; Digital Equipment
Corporation.
-
P Ferragina and G Manzini. Opportunistic Data Structures with
Applications. In Proc. of the 41st Symposium on Foundations of
Computer Science
(FOCS 2000). Redondo Beach, CA, USA; 2000 Nov: 390-8.
-
B Langmead B, C Trapnell, M Pop, and SL Salzberg. Ultrafast and
memory-efficient alignment of short DNA sequences to the human genome.
Genome Biol. 2009 Mar 4;10(3):R25.
-
H Li and R Durbin. Fast and accurate short read alignment with
Burrows-Wheeler transform. Bioinformatics. 2010 Mar 1;26(5):589-95.
-
R. Durbin. Efficient haplotype matching and storage using the
positional Burrows-Wheeler transform (PBWT) Bioinformatics. 2014
May 1;30(9):1266-72.
- Distributed Systems
-
J Dean and S Ghemawat. MapReduce: Simplified Data Processing on Large
Clusters. In Proc. of the 6th Symposium on Operating Systems Design
and Implementation (OSDI 2004). San Francisco, CA, USA; 2004 Dec.
-
G DeCandia et al. Dynamo: Amazon's highly available key-value
store. In Proc. of twenty-first ACM SIGOPS symposium on Operating
systems principles (SOSP 2007). Stevenson, WA, USA; 2007 Oct:205-20.
-
A Thusoo et al. Hive - A Warehousing Solution Over a Map-Reduce
Framework. In Proc. of Very Large Data Bases (VLDB 2009). Lyon,
France. 2009 Aug: 1626-9.
-
A Gates et al. Building a High-Level Dataflow System on top of
MapReduce: The Pig Experience. In Proc. of Very Large Data Bases
(VLDB 2009). Lyon, France. 2009 Aug: 1414-25.
-
K Shvachko et al. The Hadoop Distributed File System. IEEE 26th
Symposium on Mass Storage Systems and Technologies (MSST 2010). Incline
Village, NV, USA; 2010 May: 1-10.