COSC 7360—Advanced Operating Systems
Spring 2007: File and Storage Systems

Reading List

I. File Systems

M. K. McKusick, W. N. Joy, S. J. Leffler and R. S. Fabry, A fast file system for UNIX, ACM Transactions on Computer Systems, 2(3):181–197, 1984. (PowerPoint)

M. Rosenblum and J. Ousterhout. The design and implementation of a log-structured file system, Proc. 13th Symposium on Operating System Principles (SOSP), pages 1–15, Oct. 1991. (PowerPoint)

M. I. Seltzer, G. R. Ganger, M. K. McKusick, K. A. Smith, C. A. N. Soules, and C. A. Stein. Journaling versus soft updates: Asynchronous meta-data protection in file systems. Proc. 2000 USENIX Annual Technical Conference, 2000. (PowerPoint)

II. Disk Drive Modeling

C. Ruemmler and J. Wilkes. An introduction to disk drive modeling, IEEE Computer, 27(3):17–29, March 1994. (no PowerPoint)

III. Disk Access Optimization

C. R. Lumb, J. Schindler, and G. Ganger, Track-aligned extents: Matching access patterns to disk drive characteristics, Proc. 1st Conference on File and Storage Technologies (FAST), Jan. 2002. (PowerPoint)

IV. Workload Characterization

D. Roselli, J. Lorch, and T. Anderson, A comparison of file system workloads, Proc. 2000 USENIX Technical Conference, June 2000. (PowerPoint)

V. File Access Prediction

G. A. S. Whittle, J.-F. Pâris, A. Amer, D. D. E. Long and R. Burns, Using multiple predictors to improve the accuracy of file access predictions, Proc. 20th IEEE Symposium on Mass Storage Systems (MSST), pages 230–240, Apr. 2003. (PowerPoint)

VI. RAID and MAID

P. M. Chen, E. K. Lee, G. A. Gibson, R. Katz, and D. Patterson, RAID: High-performance, reliable secondary storage, ACM Computing Surveys, 26(2):145–185, 1994. (PowerPoint)

J. Wilkes, R. Golding, C. Staelin, and T. Sullivan, The HP AutoRAID hierarchical storage system. ACM Transactions on Computer Systems, 14(1):1–29, Feb. 1996. (PowerPoint)

D. Colarelli and D. Grunwald. Massive arrays of idle disks for storage archives, Proc. ACM/IEEE SC 2002 Conference, 2002. (PowerPoint)

VII. Virtualization

E. K. Lee and C. A. Thekkath, Petal: Distributed virtual disks, Proc. 7th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 84–92, 1996. (PowerPoint)

G. A. Gibson, D. F. Nagle, K. Amiri, J. Butler, F. W. Chang, H. Gobioff, C. Hardin, E. Riedel, D. Rochberg and J. Zelenka. A cost-effective, high-bandwidth storage architecture, Proc. 8th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 92–103, Oct. 1998. (PowerPoint)

VIII. Distributed File Systems

A. Adya, W. J. Bolosky, M. Castro, G. Cermak, R. Chaiken, J. R. Douceur, J. Howell, J. R. Lorch, M. Theimer and R. P. Wattenhofer, FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment, Proc. 5th Symposium on Operating Systems Design and Implementation (OSDI), Dec. 2002. (PowerPoint)

E. B. Nightingale, P. M. Chen, and J. Flinn, Speculative execution in a distributed file system, Proc. 20th ACM Symposium on Operating Systems Principles (SOSP), Oct. 2005.

S. Rhea, P. Eaton, D. Geels, H. Weatherspoon, B. Zhao, and J. Kubiatowicz, Pond: The OceanStore prototype, Proc. 2nd Conference on File and Storage Technologies (FAST), Apr. 2003. (PowerPoint to come soon)

S. A. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long, and C. Maltzahn, Ceph: a scalable, high-performance, distributed object-based storage system, Proc. 7th Symposium on Operating Systems Design and Implementation (OSDI), Nov. 2006. (PowerPoint)

IX. Security

E. L. Miller, W. E. Freeman, D. D. E. Long, and B. C. Reed. Strong security for network-attached storage. Proc. First USENIX Conference on File and Storage Technologies (FAST), Jan. 2002.

X. Replication

B. Liskov, S. Ghemawat, R. Gruber, P. Johnson, L. Shrira, and M. Williams, Replication in the Harp file system, Proc. 13th ACM Symposium on Operating Systems Principles (SOSP), pages 226–238, Oct. 1991. (PowerPoint)

Y. Saito, C. Karamanolis, M. Karlsson, and M. Mahalingam, Taming replication in the Pangaea wide-area file system, Proc. 5th Symposium on Operating Systems Design and Implementation (OSDI), December 2002.

Q. Xin, E. L. Miller, D. D. E. Long, S. A. Brandt, T. Schwarz, and W. Litwin, Reliability mechanisms for very large storage systems, Proc. 20th IEEE / 11th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST), pages 146–156, April 2003.

XI. Data Compaction

A. Muthitacharoen, B. Chen, and D. Mazières, A low-bandwidth network file system, Proc. 18th ACM Symposium on Operating Systems Principles (SOSP), pages 174–187, Oct. 2001. (PowerPoint)

D. Bhagwat, K. Pollack, D. D. E. Long, E. L. Miller, T. J. Schwarz and J.-F. Pâris. Providing high reliability in a minimum redundancy archival storage system, Proceedings of the 14th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), pages 213–221, Sep. 2006. (PowerPoint)

XII. Novel Storage Devices

M. Wu and W. Zwaenepoel, eNVy: a non-volatile, main memory storage system, Proc. 6th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 86–97, Oct. 1994.

T. Bisson, S. Brandt, and D. D. E. Long. A hybrid disk-aware spin-down algorithm with I/O subsystem support. Proceedings of the International Performance, Computing, and Communications Conference (IPCCC '07), Apr. 2007. (Best Paper Award)

XIII. Large File Systems

S. Ghemawat, H. Gobioff, and S. Leung, The Google file system, Proc. 19th ACM Symposium on Operating Systems Principles (SOSP), Oct. 2003.