Readings
CMSC 818: Parallel and Distributed Data Intensive Computing


Sept. 5 - Chapter 2, Computational Grids, of The Grid: Blueprint for a New Computing Infrastructure

Sept. 10 & 12 - Chapter 11, The Globus Toolkit, of The Grid: Blueprint for a New Computing Infrastructure
                
Supplemental - Globus: A Metacomputing Infrastructure Toolkit, Intl J. Supercomputer Applications, 11(2):115-128, 1997

Sept. 19 - Message Passing
                PVM - The PVM Concurrent Computing System: Evolution, Experiences, and Trends. 
               MPI - J. Dongarra, S. W. Otto, M. Snir, and D. Walker, A message passing standard for MPP and workstations, CACM, 39(7), 1996, pp. 84-90.

Sept. 24 - Sensor data processing applications
                Satellite data - C. Chang, B. Moon, A. Acharya, C. Shock, A. Sussman and J. Saltz. Titan: A High Performance Remote-Sensing Database. In Proceedings of the 1997 International Conference on Data Engineering. April 1997. pages 375-384. IEEE Computer Society Press.
              An extended version of the Titan paper, with additional figures and text, appeared in the journal Parallel Computing, Jan. 1998.

Sept. 26 - More applications
               Medical imagery - U. Catalyurek, M. Beynon, C. Chang, T. Kurc, A. Sussman and J. Saltz. The Virtual Microscope. To appear in IEEE Transactions on Information Technology in Biomedicine. 2003.

Oct. 1 - Applications
             Petroleum reservoir simulation - Exploration and Visualization of Oil Reservoir Simulation Data, by R. Martino et al.  Submitted for publication.

Oct. 4 - ADR and DataCutter
             T. Kurc, C. Chang, R. Ferreira, A. Sussman and J. Saltz. Querying Very Large Multi-dimensional Datasets in ADR. In Proceedings of the 1999 ACM/IEEE SC99 Conference. November 1999.  IEEE Computer Society Press.
             M. Beynon, T. Kurc, U. Catalyurek, C. Chang, A. Sussman and J. Saltz.  Distributed Processing of Very Large Datasets with DataCutter.  Parallel Computing, 27(11), October 2001.

Oct. 8 - Clustering/Declustering
              B. Moon and J. Saltz.  Scalability Analysis of Declustering Methods for Multidimensional Range Queries.  IEEE Transactions on Knowledge and Data Engineering 10(2), 1998.
              B. Moon, H. Jagadish, C. Faloutsos and J. Saltz.  Analysis of the Clustering Properties of the Hilbert Space-filling Curve.  IEEE Transactions on Knowledge and Data Engineering 13(1), 2001.
     Supplemental reading: B. Moon, A. Acharya and J. Saltz.  Study of Scalable Declustering Algorithms for Parallel Grid Files.  In Proceedings of the 10th Int. Parallel Processing Symposium.  April 1996.

Oct. 10-15 - Indexing
               Douglas Comer.  The Ubiquitous B-Tree.  ACM Computing Surveys 11(2), 1979.
               Antonin Guttman.  R-Trees: A Dynamic Index Structure for Spatial Searching.  In Proceedings of SIGMOD'84.  May 1984.  ACM Press.
               N. Beckmann, H. Kriegel, R. Schneider and B. Seeger.  The R*-tree: An Efficient and Robust Access Method for Points and Rectangles.  In Proceedings of SIGMOD'90.  May 1990.  ACM Press.

Oct. 17 - Legion
              A. Grimshaw, A. Ferrari, F. Knabe and M. Humphrey.  Wide-Area Computing: Resource Sharing on a Large Scale.  IEEE Computer 32(5), 1999.
              B. Walker, M. Walker, M. Humphrey and A. Grimshaw.  LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications.  In Proceedings of SC2001.  November 2001.  ACM Press.

Oct. 24 - MQO
             Henrique Andrade, Tahsin Kurc, Alan Sussman and Joel Saltz. Scheduling Multiple Data Visualization Query Workloads on a Shared Memory Machine. In Proceedings of the Fifth Merged IPPS/SPDP (International Parallel Processing Symposium & Symposium on Parallel and Distributed Processing). April 2002. IEEE Computer Society Press.
             Henrique Andrade, Tahsin Kurc, Alan Sussman and Joel Saltz. Exploiting Functional Decomposition for Efficient Parallel Processing of Multiple Data Analysis Queries. Technical Report CS-TR-4404 and UMIACS-TR-2002-84. University of Maryland, Department of Computer Science and UMIACS. October 2002. Submitted to IPDPS 2003.

Oct. 29-31 - Distributed File Systems
             E. Levy and A. Silberschatz.  Distributed file systems: Concepts and examples.  ACM Computing Surveys 22(4), 1990.  Read the Introduction and Sections 1-6, 9, 11-13.
             John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Westley Weimer, Chris Wells, and Ben Zhao. OceanStore: An Architecture for Global-Scale Persistent Storage. In Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2000), November 2000.  ACM Press.

Nov. 5 - Parallel File Systems
             P. Corbett and D. Feitelson.  The Vesta Parallel File System.  ACM Transactions on Computer Systems 14(3), 1996.  You don't have to read Section 5, on Performance.
             Rajeev Thakur, William Gropp, and Ewing Lusk. On Implementing MPI-IO Portably and with High Performance.  In Proceedings of the Sixth Workshop on I/O in Parallel and Distributed Systems (IOPADS), May 1999.

Nov. 7-12 - Data Grids
            W. Allcock, A. Chervenak, I. Foster, C. Kesselman, C. Salisbury, and S. Tuecke. The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets.  Journal of Network and Computer Applications. Vol. 23, 2001.
             Chaitanya Baru, Reagan Moore, Arcot Rajasekar, and Michael Wan.  The SDSC Storage Resource Broker.  In Proceedings of CASCON'98 Conference, December 1998.
             Arcot Rajasekar, Michael Wan and Reagan Moore.  MySRB & SRB - Components of a Data Grid.  In Proceedings of the 11th International Symposium on High Performance Distributed Computing (HPDC-11), July 2002.  IEEE Computer Society Press.

Nov. 14 - Grid tools
              D. Arnold, H. Casanova and J. Dongarra. Innovation of the NetSolve Grid Computing System. To appear in Concurrency and Computation: Practice and Experience, 2002.

Nov. 27 - Grid tools, continued
              R. Wolski, N. Spring and J. Hayes.  The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing.  Journal of Future Generation Computing Systems, Vol. 15, Nos. 5-6, 1999.
              H. Casanova and F. Berman. Parameter Sweeps on the Grid with APST, chapter 33 in Grid Computing: Making the Global Infrastructure a Reality, 2002.

Dec. 3 - Open Grid Services Architecture
            I. Foster, C. Kesselman, J. M. Nick and S. Tuecke.  The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration. 2002.

Dec. 10-12 - Streaming databases and online query processing
           S. Chandrasekaran, O. Cooper, A. Deshpande, M.J. Franklin, J.M. Hellerstein, W. Hong, S. Krishnamurthy, S.R. Madden, V. Raman, F. Reiss, and M.A. Shah. TelegraphCQ: Continuous Dataflow Processing for an Uncertain World. To appear in 1st CIDR Conf., Jan 2003.
           S. Madden and M.J. Franklin. Fjording the Stream: An Architecture for Queries over Streaming Sensor Data. Proceedings of ICDE Conference, February, 2002.
           V. Raman and J.M. Hellerstein. Partial Results for Online Query Processing. Proceedings of SIGMOD 2002, June 2002.

  Last updated Tuesday, 10 December 2002 11:07 AM