CMSC 838B : Information Visualization

Application Project : Disk I/O Access Pattern

Yoo Ah Kim


Data set : Disk I/O trace from HP lab

Visualization Tool : Spotfire 5.1


Data Set Description

This data was collected by Storage System team in HP research lab. It consists of the disk I/O events generated by three HP9000 computer systems over a period of about 3 months during the summer of 1992. In this project, I used the traces for a week at a single-user workstation. Total number of records is about 44,000 and each record has the following attributes.

The traced system is a single-user workstation. The main uses of the system were electronic mail and editing the papers. Due to the file buffer cache, there was not much disk activity on this system. The following two tables show the detail of the system.

Processor MIPS HP-UX version Physical memory File buffer cache size Fixed Storage Users Usage Type Disk Type
HP 9000/845 23 8.00 32MB 3MB 0.3GB 1 Workstation HP 2200A

Table 1. The Computer System Traced

ID

Disk Type

Partition

Size

A

HP 2200A

/(root)

278MB

swap

24MB

swap

16MB

B

HP 2200A

swap

16MB

Table 2. The File System Configuration


Analysis of Data

Overview : Total Number of Read/Write

Figure 1. Hourly Value for a Week 

 

Timings

Figure 2-1 and 2-2 show the queueing delay distribution for read and write. This is caused by system load and I/O burstiness. We can find out that write operations are more bursty, therefore cause more delay.

Figure 2-1. Start Time Distribution - Read

Figure 2-2. Start Time Distribution - Write

Figure 3 shows processing time distribution. Most of processing times are between 10,000us - 50, 000us. I excluded some erroneous results.

Figure 3. Processing Time Distribution

 

Types

As we can see in figure 4-2, user data percentage is only 27% in I/O access and the remaining parts are metadata, especially inode access.

Figure 4-1. Read/Write                                                                             Figure 4-2. Types of Blocks

 

The HP-UX file system generates both synchronous and asynchronous requests. Most of reads and half of writes are synchronous.

Figure 4-3. Sync/Async

Sequential Access

This data is disk level I/O traces, not file system level. Due to UNIX buffer cache, many of events don't arrive at disk level at all. In a result, traces are not really sequential, especially read operations. There are small areas accessed sequentially.

Figure 5. Enqueue time vs. Address

I/O Sizes 

Most of I/O sizes are multiple of 2k(2k, 4k, 8k), but there are some writes for a large amount of blocks.

Figure 6. I/O Sizes (Read/Write)


Critique


Written by Yoo Ah Kim  --  Feb. 26, 2001