Visualizing and Analyzing Web Browser History Data
using Eureka and SpotFire

CMSC 838B: Information Visualization Application Project

Matthias Mayer
Research Visitor at HCIL
mayer@informatik.uni-hamburg.de
March 6th, 2001

 

Motivation

About 80% of all accessed web pages have been visited befor by the same user (Cockburn & McKenzie 2001). Revisiting these pages, however, is no easy task. Today Back Buttons and Favorites are used to revisit pages in current or previous sessions. These mechanisms could be enhanced using better and more adequate interfaces. The patterns with which users access web pages should help to find how these new interfaces should look like.

This small project uses two commercial data visualization tools in order to find patterns in my own web browsing behaviour. Therefore I use a Netscape history file which documents about 7 months of my browsing activities. Thus, Visualization here is used to reveal characteristics after browsing has already taken place - not to support the user during his sessions.

In my PhD project I develop a tool to visually support web users during their sessions by showing their history. More about these 'Browsing Icons' can be found on the web (Mayer 2001).

 

Questions

What Questions must be answered in order to develop better tools for revisiting web pages?
Some questions are:

 

Data

I wrote a simple Java application to convert the Netscape file in input to Eureka and SpotFire. This step includes the calculation of derived data such as duration between first and last visit. Each row in the file contains information about one URL:

index Simple linecount
URL the URL of the page
Title the title of the page
Kind

I assigned each etry one of the following kinds:
file (on my laptop)
server homepage
query
comp. sci. department site
our lab's website
secure accesses
unknown

Host the host part of the URL
FirstVisit date fo the first visit
FirstDaytime time in the day of the first visit
LastVisit date fo the last visit
LastDaytime time in the day of the last visit
DiffFirstLast 1 difference between first and last visit in milliseconds
DiffFirstLast 2 difference between first and last visit in days, hours, minutes
numOfVisits number of visits to this page

 

Visualization Tools

I used InXight Eureka 1.1 (www.inxight.com, formerly Table Lens) and SpotFire 5.1 (www.spotfire.com).

 

Restrictions

There are several major restrictions due to the kind of data:

 

 

Selected Visualizations

 

Fig. 1a Eureka
   
Fig. 1b SpotFire

Getting rid of errors: An accidentally inserted outlier was obvious in both tools at first sight (one data point was dated early 2001).

 

 


Fig. 2

Fig. 2 shows the data first sorted by kind, then by number of visits. Findings:

 

 


Fig. 3

Fig 3. shows the data sorted by 'number of visits' and then by 'kind'. Findings:

 

 


Fig. 4

Fig. 4 shows a SpotFire Scatterplot. Mapping: x - number of visits, y - difference between visits (in 'artificial' dates beginning at 1.1.1970), size - number of visits, color - kind, markers are jittered. The homepage would be further on the right. Findings:

 

 


Fig. 5

Fig. 5 shows a SpotFire Histogram View. Mapping: x - host, y - sum of entries with this host. Findings:

 

 


Fig. 6

Fig. 6 shows a SpotFire profile chart (a parallel coordinates visualization developed by Inselberg). Mapping: Six parallel axes show values of six attributes, each in it's own scale. Entities are connected by lines.
I filtered the view to show just items that were visited more than 20 times and over a period of at least five days. Findings:

 

 


Fig.7

Fig. 7 shows a SpotFire 3D ScatterPlot. Mapping: right to left - date of first visit, bottom to top - number of visits, depth - hosts. Number of visits was redundantly coded to size, kind to color. Findings:

 

Conclusion

 

Critique of Tools

Eureka

SpotFire

Both tools could be enhanced by

References

We currently build a bibliography for history visualizations on the HCIL wiki web:
http://jazz.cs.umd.edu:8080/hcil/historyviz.wiki?cmd=get&anchor=HistoryViz

(Cockburn & McKenzie 2001)
Cockburn, Andy and Bruce McKenzie (2001). What Do Web Users Do? An Empirical Analysis of Web Use. International Journal of Human-Computer Studies, In Press. 2001.
http://www.cosc.canterbury.ac.nz/~andy/papers/ijhcsAnalysis.pdf

(Mayer 2001)
Mayer, Matthias (2001). Browsing Icons. Website which accompanies the PhD project.
http://www.hfbk.uni-hamburg.de/lem/mayer/phd/

 

[go to top]