CMSC 818:  Peer-to-Peer, Grid and Cloud Computing

CMSC 818




Note: for each class (after 2/1), you must send me email ( with one or more discussion questions on the readings for that day.

For each student lecture, 3 students will evaluate the presentation, using this form [Word] [PDF].
Completed forms should be emailed to the instructor, or hard copies turned over in class.

Introduction - What and Why? 

2/1 - A. Sussman

  • Computational Grids
    I. Foster and C. Kesselman. Chapter 2 of The Grid: Blueprint for a New Computing Infrastructure, Morgan Kaufmann, 1999. [PDF]

2/3-8 - A. Sussman

  • Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications
    Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, and Hari Balakrishnan. In Proceedings of the ACM SIGCOMM '01 Conference, August 2001. [PDF]

2/8-10 - A. Sussman

  • Cloud computing
    Brian Hayes,
    Communications of the ACM, 51(7), pp. 911, July 2008. [PDF]

  • MapReduce: simplified data processing on large clusters
    Jeffrey Dean and Sanjay Ghemawat,
    Communications of the ACM, 51(1), pp. 107113, Jan. 2008. [PDF]

2/15 - A. Sussman

  • Grids in Context
    L. Smarr. Chapter 1 of The Grid 2: Blueprint for a New Computing Infrastructure, Elsevier/Morgan Kaufmann, 2004. [from instructor]
  • MPI - A message passing standard for MPP and workstations
    J. Dongarra, S. W. Otto, M. Snir, and D. Walker, CACM, 39(7), 1996, pp. 84-90. [PDF]


2/17 - N. Crowell

  • Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems
    A. Rowstron and P. Druschel.In Proceedings of IFIP/ACM International Conference on Distributed Systems Platforms (Middleware), November 2001. [PDF]

  • Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility
    A. Rowstron and P. Druschel. In Proceedings of ACM Symposium on Operating Systems Principles (SOSP'01),  October 2001. [PDF]

2/22 - P. Sharma

  • The Anatomy of the Grid: Enabling Scalable Virtual Organizations
    I. Foster, C. Kesselman, S. Tuecke. International J. Supercomputer Applications, 15(3), 2001. [PDF]
  • The Open Grid Services Architecture
    I. Foster, C. Kesselman, and S. Tuecke.  Chapter 17 of The Grid 2 [from instructor]
    • and/or The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration
      I. Foster, C. Kesselman, J. Nick, S. Tuecke, Open Grid Service Infrastructure Working Group, Global Grid Forum, June 22, 2002. [PDF]

2/24 - A. Sopan

  • A Scalable Content-Addressable Network
    Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp and Scott Shenker. In Proceedings of ACM SIGCOMM 2001, August 2001. [PDF]
  • Tapestry: A Resilient Global-Scale Overlay for Service Deployment
    Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, and John D. Kubiatowicz. IEEE Journal on Selected Areas in Communications, Vol. 22, No. 1, January 2004. [PDF]

3/1 - Y. Kim

  • Globus Toolkit Version 4: Software for Service-Oriented Systems
    I. Foster. IFIP International Conference on Network and Parallel Computing, Springer-Verlag LNCS 3779, 2006. [PDF]
  • OceanStore: An Architecture for Global-Scale Persistent Storage
    John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Westley Weimer, Chris Wells, and Ben Zhao. Proceedings of the Ninth international Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2000), Nov. 2000. [PDF]

Distributed File Systems and Data Grids

3/3 - C. Hill

  • The Google File System
    Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung.  In Proceedings of 19th ACM Symposium on Operating Systems Principles, October 2003. [PDF]
  • The Hadoop Distributed File System
    Tom White. Chapter 3 of Hadoop: The Definitive Guide, O'Reilly, 2009.

3/8 - T. Long

  • BigTable: A Distributed Storage System for Structured Data
    Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. In Proceedings of 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI'06) , 2006. [PDF]
  • MapReduce and parallel DBMSs: friends or foes?
    Michael Stonebraker, Daniel Abadi, David J. DeWitt, Sam Madden, Erik Paulson, Andrew Pavlo, and Alexander Rasin. Communications of the ACM, Vol. 53, No. 1, Jan. 2010. [PDF]

3/15 - E. Elsaka

  • The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets
    A. Chervenak, I. Foster, C. Kesselman, C. Salisbury, S. Tuecke. Journal of Network and Computer Applications, 23:187-200, 2001. [PDF]
  • Data Access, Integration, and Management
    M. Atkinson, A.L. Chervenak, et al.  Chapter 22 of The Grid 2, Elsevier/Morgan Kaufmann, 2004. [from instructor]

3/17 - A. Balasubramanian

  • The Farsite project: a retrospective
    William J. Bolosky, John R. Douceur, and Jon Howell.  ACM SIGOPS Operating Systems Review, Vol. 41, No. 2, April 2007.  [PDF]
    • and/or FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment
      Atul Adya, William J. Bolosky, Miguel Castro, Gerald Cermak, Ronnie Chaiken, John R. Douceur, Jon Howell, Jacob R. Lorch, Marvin Theimer, and Roger P. Wattenhofer.  In Proceedings of 5th Symposium on Operating Systems Design and Implementation, December 2002.  [PDF]
  • Scale and Performance in a Distributed File System
    J.H Howard, M.L. Kazar, S.G. Menees, D.A. Nichols, M. Satyanarayanan, R.N Sidebotham., and M.J. West.  ACM Transactions on Computer Systems, Vol. 6, No. 1, 1988. [PDF]

3/29 - R. Liu

  • Data Management in an International Data Grid Project
    W. Hoschek, J. Jaen-Martinez, A. Samar, H. Stockinger, and K.Stockinger. In Proceedings of the First IEEE/ACM International Workshop on Grid Computing, 2000. [PDF]
  • The Globus Striped GridFTP Framework and Server
    W. Allcock, J. Bresnahan, R. Kettimuthu, M. Link, C. Dumitrescu, I. Raicu, and I. Foster. In Proceedings of SC'05, Nov. 2005. [PDF]

More Cloud Data Management

3/31 - S. Krishnamoorthy

  • Eventually consistent
    Werner Vogels. Communications of the ACM 52, 1, Jan. 2009. [PDF]
  • Dremel: Interactive Analysis of Web-Scale Datasets
    Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, and Matt Tolton. Proceedings of the 36th International Conference on Very Large Data Bases (VLDB), Sept. 2010. [PDF]

4/5 - K. Zhai

  • Automatic Optimization for MapReduce Programs
    Eaman Jahani, Michael J. Cafarella, and Christopher Re. Proceedings of the VLDB Endowment, Volume 4, Number 6, March 2011. [PDF]
  • A High-Performance Computing Forecast: Partly Cloudy
    Thomas Sterling and Dylan Stark. Computing in Science & Engineering, 11(4), July/August 2009. [PDF]

Resource Management and Desktop Grids

4/7 - A. Sopan

  • On Death, Taxes, and the Convergence of Peer-to-Peer and Grid Computing
    Ian Foster and Adriana Iamnitchi.  In Proceedings of the 2nd International Workshop on Peer-to-Peer Systems (IPTPS '03), February 2003. [PDF]
  • A peer-to-peer approach to resource location in Grid environments
    A. Iamnitchi and  I. Foster.  In Grid Resource Management: State of the Art and Future Trends, J. Nabrzyski, J. M. Schopf, and J. Weglarz, Eds. Kluwer Academic Publishers, 2004. [PDF]

4/12 - G. Kothari

  • Condor - A Hunter of Idle Workstations
    Michael J. Litzkow, Miron Livny, and Matt W. Mutka. In Proc. 8th Intl. Conf. Distributed Computing Systems, June 1988. [PDF]
  • Condor and the Grid
    Douglas Thain, Todd Tannenbaum, and Miron Livny.  In Grid Computing: Making The Global Infrastructure a Reality, Fran Berman, Anthony J.G. Hey and Geoffrey Fox editors. John Wiley, 2003. [PDF]

4/14 - T. Long

  • SETI@home: an experiment in public-resource computing
    D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky, and D. Werthimer. Communications of the ACM, Vol. 45, No. 11, Nov. 2002. [PDF]
  • Designing a Runtime System for Volunteer Computing
    David P. Anderson, Carl Christensen and Bruce Allen. In Proceedings of SC'06, November 2006. [PDF]

4/19 - R. Liu

  • Cluster Computing on the Fly: resource discovery in a cycle sharing peer-to-peer system
    Dayi Zhou and Virginia Lo.  In Proceedings of IEEE International Symposium on Cluster Computing and the Grid (CCGrid), April 2004. [PDF]
  • CompuP2P: An Architecture for Internet Computing Using Peer-to-Peer Networks
    Rohit Gupta and Varun Sekhri and Arun K. Somani.  IEEE Transactions on Parallel and Distributed Systems, Vol. 17, No. 11, November 2006. [PDF]

4/21 - A. Balasubramanian and P. Sharma

  • Creating a Robust Desktop Grid using Peer-to-Peer Services
    Jik-Soo Kim, Beomseok Nam, Michael Marsh, P.ete Keleher, Bobby Bhattacharjee, Derek Richardson, Dennis Wellnitz and Alan Sussman.  Proceedings of the 2007 NSF Next Generation Software Workshop, March 2007. [PDF]
  • Resource Discovery Techniques in Distributed Desktop Grid Environments
    Jik-Soo Kim, Beomseok Nam, Pete Keleher, Michael Marsh, Bobby Bhattacharjee and Alan Sussman.   In Proceedings of the 7th IEEE/ACM International Conference on Grid Computing - GRID 2006, September 2006. [PDF]

Theory, Applications, etc.

4/26 - A. Kumar

  • Network Applications of Bloom Filters: A Survey
    Andrei Broder and Michael Mitzenmacher. Internet Mathematics, Vol. 1, No. 4, 2003. [PDF]
  • Practical Byzantine Fault Tolerance
    Miguel Castro and Barbara Liskov. Proceedings of the Third USENIX Symposium on Operating Systems Design and Implementation (OSDI), Feb. 1999. [PDF]

4/28 - S. Ng Zeng

  • The Virtual Microscope
    U. Catalyurek, M. Beynon, C. Chang, T. Kurc, A. Sussman and J. Saltz. IEEE Transactions on Information Technology in Biomedicine, Vol. 7, No. 4, December 2003. [PDF]
  • Scientific Data Federation: The World Wide Telescope
    Alexander S. Szalay and Jim Gray. Chapter 7 of The Grid 2. [from instructor]

Research Project Presentations

5/3, 5/5