Browsers with Changing Parts: a Catalog Explorer for Philip Glass’ Website
Harry Hochheiser[1]
Human-Computer Interaction
Lab &
Department of Computer
Science
University of Maryland
College Park, MD 20742
+1 301 405 2725
hsh@cs.umd.edu
The development of navigational tools for a web site devoted to a
catalog of musical compositions offers a variety of design challenges. A
combination of techniques developed from information visualization research -
including starfield displays, dynamic queries, and zoomable user interfaces
(ZUIs) - was used to construct a prototype browser for possible use in a web
site dedicated to the work of Philip Glass. After a discussion of the goals and
potential users of the site, this paper describes several prototypes that were
developed and how they informed the design of a zoomable starfield browser.
Unresolved design challenges and possibilities for future work are also
discussed.
Interactive Visualization, Music, Biography, Catalog Browsing, Zoomable User Interfaces (ZUIs)
The design of web-based tools for browsing catalogs of musical works
involves numerous problems. These tools require the integration of music into
the primarily visual environment of the Web, in a manner that supports users of
varying musical backgrounds in pursuit of a wide range of goals. Information
visualization techniques such as dynamic queries, semantic zooming, and
starfield displays may be useful, but application of these techniques to musical
data has been limited, and their utility for tasks involving significant
non-visual components is unknown.
These concerns and others arose during
investigations of possible designs for browsing and navigation tools to be used
within a web site devoted to the music of Philip Glass. Known for his operas (Einstein on the Beach), film scores (Kundun), symphonies and other works,
Glass has a catalog of several hundred compositions.
Recently, Glass and his production staff have been exploring possible
designs and functionality for a Philip Glass web site. In addition to serving
as a web presence for Philip Glass, this site would provide educational
materials and promote Glass recordings, performances and other products. Another, more innovative proposed application
of the site involves licensing. Glass currently licenses his music to a wide
range of performers, filmmakers, and other artists. The web site is seen as a potential tool for supporting the
identification of works to be licensed, and for completion of the eventual
transactions.
In collaboration with Glass’ staff and the E-Commerce Intelligence Group of IBM’s T.J. Watson Research Labs, we defined potential users and user tasks and developed prototype components for this site. Although primarily focused on the design and development of components that could be used in a production web site, these efforts also provided an opportunity to address some of the challenges involved in a novel application of information visualization techniques. This paper describes several prototypes that were developed, and their influence on the design of GlassEye: an interactive visualization tool that combines musical playback with a starfield display [1] in a Zoomable User Interface [4].
Our exploration of possible designs for the Glass
site began with an informal survey of web sites devoted to musicians and
composers. As there are hundreds (if not thousands) of such sites, this effort
was not intended to be comprehensive. Rather, we hoped to understand the range
of currently existing sites in a manner that might help frame our discussion of
the goals of the Glass site. To that end, we examined several existing sites
for both classical and pop musicians, including both officially-sanctioned and
"fan" sites. This survey also included several
"uofficial" Philip Glass
sites. This exploration led to a set of categories that describe the existing
sites:
Encyclopedic sites
present information in a reference format designed (hopefully) for ease of search
and retrieval, usually for use in answering specific questions. Examples
include lists of an artist’s work, timelines, lists of songs by album/year,
list of performances, etc. Appropriate navigation tools for encyclopedic sites
may include searches, indices, and hierarchical menu traversal.
Commercial sites
are strongly oriented towards selling merchandise. Navigation through a catalog
of items available for purchase might involve browsing through a store-like
setup or possibly browsing content pages with interspersed commercial/product
links.
Educational
sites contain traditional educational material, designed to teach users about
the musician. Usually, they include images and linear explanatory text with
minimal links. Timelines might be
particular useful for these functions.
Fan-oriented
sites often contain pictures, links to other sites, personal comments, and
similar "tribute"-like information.
Random browsing and exploration are usually appropriate navigation techniques
for these sites, particularly if these pages are used as "hooks"
aimed at enticing users into further, deeper exploration of a site.
Contextual
sites provide biographical information that places an individual in a
historical, social, and artistic context.
For example, a site dedicated to contemporary music might have a single
page on Glass, along with pages describing other modern composers. This
contextual information could be provided through judicious use of side links
and sidebars.
These
explorations did not narrow the range of goals for the Glass site: instead, it
became clear that the ideal site would fill all
of these roles. However, this characterization did help focus discussions
of the impact of various goals upon the eventual site design. Specifically, we
realized that the open-ended browsing characteristic of fan-based sites might
not be compatible with the presentation of content needed for a commerce
site. On the other hand, the intrusion
of commercial content might detract from the experience of exploring
educational material.
Based
on our discussion with Glass’ staff and our examinations of related web sites,
we developed an informal set of requirements for the site, along with a
framework for describing users tasks. These requirements provided a challenging
and often inconsistent set of goals for the site.
E-Commerce: Although
CD, book, and video sales would be supported, the main E-commerce function of
the Glass site would be to arrange for licensing of recordings and rights.
Support for these goals would be provided by an electronic version of Glass'
catalog, along with tools that potential licensees could use to search the
catalog for appropriate pieces. Relevant information would include
instrumentation; length of pieces; background details about the origin, themes,
and context of the works; and (if available) descriptions of tempo, mood, and
other criteria that may be provided by Glass's production team.
Education: The
Glass site might provide a wide range of possible educational functions. Facilities might provide support for users
unfamiliar with Glass's work and interested in learning more, knowledgeable
fans looking for specific details, students working on academic studies related
to Glass and his music, and others. Guided tours, frequently asked questions
(FAQs), and suggestions ("if you want to find…") might be used to help orient
new users. This introductory material might also create of an inviting and
accessible site and provide additional navigation support for more advanced
users.
Design
of educational materials for a wide range of users presents significant
navigational challenges: a properly designed system should provide experts with
the facilities needed to identify necessary details, while providing novices
with an approachable and accessible introduction to the material. One way to
achieve that goal might be a "layered" interface, which would enable
users to progressively reveal greater levels of detail.
Exploration: Some
users of a Glass web site are likely to come to the site with goals that are
incomplete or otherwise not well specified.
For such users, developing a better understanding of their goals may be
a desirable end unto itself. Browsing
facilities might encourage users to explore while providing cues that help
users avoid disorientation. In addition
to helping users clarify goals, these tools might add to user satisfaction by
providing the enjoyable experience of "discovering" data on the site.
Exploratory components might combine e-commerce and educational functionality,
but with an added emphasis on interactive facilities. Possibilities include
musical scrollbars, musical construction kits ("build your own Philip
Glass work"), and facilities for identification of appropriate pieces for
licensing. Implementation of such
facilities in a web context may be limited by browser capabilities.
Searching: Users interested in licensing
Glass music for use in a particular context may be interested in identifying
music that meets appropriate subjective criteria. In addition to searching for
a piano piece or a chamber work, these users may be interested in music that is
particularly intense, fast, or triumphant. Support for searching along these
lines may be appealing, but generating the necessary descriptions of the pieces
in the Glass catalog may be labor-intensive. However, when combined with tools
that support exploration and readily available audio samples, subjective
searches could provide a powerful tool for generating licensing arrangements.
Generality: While
e-commerce is a primary function of the site, users should be able to enjoy and
learn from the site without necessarily making a purchase.
Inclusion of Visual
Arts: Visually, the site should incorporate aspects
that reflect Glass' collaboration with visual artists in writing music for
dance, film, and stage.
Novelty: Ordinarily,
use of new technologies strictly for their novelty value is not encouraged.
However, the Glass site should be in harmony with Glass’ reputation as an
innovator. The site should be "cool", and should strive to go past
standard linear forms often found in web designs. A site that makes
constructive use of new technologies to support users in their exploration of
the Glass catalog would ideally generate an enthusiastic user community while
extending the range of Glass’ innovations into a new realm.
Distinctive Identity:
Glass’ music has a distinctive, recognizable style. The Glass site should be
similarly distinctive: Ideally, the site will create a "brand" which will be
easily recognized as belonging to Philip Glass. This might be achieved through appropriately designed icons and
backgrounds, images from stage productions of Glass works or movies, pictures
of the artist, or other visual design techniques.
Focus on the music -
Access and Understanding: The web is still a primarily visual
medium. Streaming audio and MP3 provide audio content that is often completely
disconnected from the visual content of web sites. Despite these trends, the
site should emphasize Glass’ music as primary. Site layout, navigation tools,
Java applets, and other features should serve the goal of connecting users to
the music, in order to generate (or extend) their understanding for and
appreciation of Glass’ works.
3.3 User Tasks
As expected, navigation of the Glass catalog and selection of
items of interest were identified as the main user tasks that must be
supported. This simple classification was augmented by the definition of a
series of dimensions that could be used to describe user tasks:
·
User expertise: how much does the user know about Glass and his music?
·
User perspective: is the user a
fan? A student? A musician? All of the above?
·
Level of focus: is the user engaging in free-form exploration, or searching
for answers to highly specific questions?
·
What is the user’s goal? Finding information? Buying a CD?
These categories were used to develop more detailed lists of potential
user perspectives and tasks, which then influenced our discussion of design
alternatives. In particular, this analysis led to the explicit decision that a
single interface supporting all of these tasks would be ideal.
4. PROTOTYPES
Development of a series of prototype components proceeded in parallel with the site exploration and system requirements work described above. These prototypes provided perspective on ideas suggested by Glass’ staff and informed the development of the GlassEye browser.
4.1 Timeline Browsing
A chronological view of the catalog as a history of Glass’ career led us to consider the application of existing tools designed for browsing of temporal data. LifeLines [19] and the National Digital Library collection browser [21], both developed at the University of Maryland’s Human-Computer Interaction Lab (HCIL), provide timeline displays to facilitate browsing data sets. Both Lifelines and the collection browser support dynamic queries – "interactive user control of visual query parameters that generate a rapid (100 msec update) animated visual display of database search results"[23]. The collection browser, originally developed for the Library of Congress’ American Memory project, includes simultaneous menus [12] for selection of categorical information, along with tightly coupled displays of title and summary information.
The collection browser was adapted to display a subset of the detail from the Glass catalog, using subjective measures of mood and tempo, along with the objective categorization of the type of work (opera, orchestral music, etc.). These attributes were used as input to a series of simultaneous, dynamic query [23] menus that can be used to quickly add and remove works from the list being viewed.
The combination of a linear, categorized list of compositions with a temporal view of the catalog facilitates browsing through the data. Tightly coupled highlighting of selected pieces helps users identify correlations between the timelines, categories, and the linear list.
4.2 Sample Web Pages
A fully developed Glass site might contain individual HTML
pages describing each of the works in the catalog. A prototype home page for the opera Akhnaten was developed to illustrate potential content and layout
possibilities (Figure 2). At the top of the page, navigational links providing
access to information about related works, products, commentary, performances,
and audio clips are centered on an image of a Tibetan mandala. Further down the
page, summary texts, excerpts from liner notes, lists of collaborators, and
other details are given, along with links that can be used to drill down into
the site for greater detail. The page ends with global navigation links aimed
at helping users find their way to other appropriate areas on the site. The
mandala at the top of the page was chosen based on Glass’ interest in Buddhism
(and his work on the soundtrack for Kundun),
and might serve as a navigational theme that could be used throughout the Glass
site.
4.3 Musical Scrollbar
Inspired
by a suggestion from Glass’ staff, the Musical Scrollbar was designed as an
interactive browser that combines display of detailed information about various
Glass pieces with audio playback of selected segments. Implemented as a Java applet, the musical
scrollbar contains detail fields describing the name of the piece, the work
from which it was taken, the year, and the premier location and date. Additional text areas provide background
information, instrumentation details, and comments.
The
applet is controlled through a control at the top of the window. While the name "scrollbar" has
stuck, the control is actually implemented as a slider: users can drag the
thumb across the horizontal range, or they can click on the boxes at either end
to move left or right by one selection.
As the user drags, the detail display areas are updated dynamically,
based on the position of the slider.
There
are two possible modes for the musical playback. If the user leaves the thumb
on a given piece, or if the increment/decrement boxes at the end of the slider
are chosen, a short audio clip from the selected piece is played. If the user
drags the thumb, a special "dragging music" selection will be played
until the user releases the thumb, at which point the selected piece will begin
playing. Audio playback can be muted through selection of a checkbox provided
on the applet.
This
demonstration presents a basic model of a system that can be deployed using
current browser technology. Extensions such as multiple scrollbars for
scrolling in other dimensions might be used to provide additional navigational
power.
Zoomable
User Interfaces (ZUIs) are 2.5 D spaces that use magnification to manage object
representations that differ with scale. ZUIs support the use of "semantic zooming" – the revelation of progressively greater levels of detail with
increased magnification [4] – for the construction of overview and detail
displays, in which users can access information on topics of interest by
magnifying appropriate areas of the zoomable space.
As
a means for exploring the application of zooming to the Glass catalog, we
implemented a zoomable visualization of selected elements from the Glass
catalog. Implemented with Pad++[4],
this visualization is based on a layered timeline: compositions are displayed
in chronological order in the horizontal direction. Vertical stratification is
used to separate the pieces by type: operas on the bottom, symphonies in the
middle, chamber music above, etc. Graphical representations of the compositions
are scattered throughout the 2-dimensional display space, in effect creating a
starfield display [1].
Initially,
a 30-year timeline displays an overview of Glass' career. Users can manually
zoom or click on a hyperlink for any of the pieces to change the scale to
present details about a given piece, including information regarding
recordings, premieres, and liner notes.
Zooming further, users might be presented with audio samples or more
extended textual descriptions. At any stage, hyperlinks are available to return
users to previous levels of detail.
The
example Pad++ visualization developed for the Glass catalog also includes menus
arranged around the perimeter of the zoomable space. These menus provide
additional dynamic query filtering mechanisms that might be used to identify
music by type, popularity, instrumentation, or other criteria. While not
functional in the Pad++ prototype, these menus provide a hint of how dynamic
queries might be combined with a zoomable visualization.
The prototype web pages provided a vivid demonstration of a difficulty
that is likely to be faced by any system for browsing a complex data set such
as that of the Glass compositions. Each
of the works in the catalog has a wide range of auxiliary data, including
performance details, product details, explanatory texts, instrumentation
information, and other data that must be navigated.
Furthermore, larger compositions such as operas
might be considered as collections of individual pieces, each of which might
have a related data set. This clearly
presents a challenge for interaction tools: in addition to supporting
navigation to each of the components
in the catalog, these tools must also provide assistance for navigating within the individual pieces.
Several of the prototypes described above were shown to members of Glass’ production staff, in order to generate discussion and comments on the various designs. Since the site is likely to present a very public image of Glass and his work, we felt that the success of any system would be contingent upon the approval of Glass and his staff.
Two of the prototypes – the collection browser
and the Pad++ browser - generated particularly enthusiastic responses. The view
of the catalog provided by the collection browsers’ tight-coupling between the
catalog data and the timeline was particularly attractive to Glass’ staff, as
it supported highly interactive browsing of the catalog. They also found the
zooming and the large browsable space of the Pad++ prototype to be both
intriguing and fun. Based on these reactions, and on our interest in exploring
the possibilities of building a zoomable starfield display, we decided to build
a new prototype based on a combination of the temporal display of catalog data
in the collection browser and semantic zooming in the Pad++ prototype.
Building on the Pad++ prototype, the GlassEye browser combines a starfield display for visualization of the catalog browser with a zoomable interface that provides greater detail at higher levels of magnification. The browser is implemented using Java 1.2 and the Swing interface toolkits. Zooming support is provided by Jazz [5], a descendant of Pad++ written in Java.
In the spirit of continuous prototyping, GlassEye
was designed to provide a vision of what a fully operational browser might look
like. As such, it demonstrates possibilities, and does not address all of the
issues above. The use of Jazz also raises significant issues for eventual
deployment: the networking and processing overhead associated with the GlassEye
code and the Jazz libraries would likely render the applet unusable for users
with slow connections or slower processors.
As a result, this prototype should be viewed as a demonstration of
concepts, and not as a production system.
In
the initial GlassEye display, the horizontal axis is used as a timeline
displaying the year of composition. The vertical axis is used to display a
rough count of the number of instruments required to perform a piece, thus providing
users with one approximate perspective on the complexity of the piece – in
Glass’ case, ranging from solo piano to orchestral pieces. This initial display was chosen as a
representative guess as to which views might be most useful for users: issues
regarding semantics of starfield axes in this context will be discussed below.
Within
the starfield display, each composition has a single representation. For titled
pieces, the name of the work is used. For untitled pieces, a circle is used. In
either case, color-coding is used to display the type of the piece (orchestral,
stage, or chamber music). Size coding based on an arbitrary "popularity"
measure is used to emphasize works that users are most likely to find
interesting. The resulting display provides a single-screen overview of the
entire catalog.
Figure 3: GlassEye browser with overview
window |
Dynamic query filters [23] are provided for easy filtering based on additional attributes: checkboxes allow for selection of pieces by their type, while sliders for popularity, length, and other attributes line the bottom of the display.
The browser also provides a smaller, secondary
window containing an overview of the starfield space (Figure 3). For assistance in navigation, this window
provides a view that remains constant even as the magnification in the main
window changes.
GlassEye uses zooming to make detailed information
about each piece available at higher levels of magnification. The initial
implementation provides two levels of detail about each piece. First,
background information describing the work, collaborators, premiere date,
related works, are provide, possibly along with one or more images. At the next level, extended excerpts from
liner notes or other further details are given (Figure 4).
Magnification hyperlinks provide the simplest
approach to zooming in the GlassEye browser. Each composition is a hyperlink:
clicking on the textual or iconic representation of a work will cause an
animated magnification, resulting in the appropriately magnified display of
detail about the selected piece in the center of the screen. All displays that are targets of hyperlinks
include links that can be used to return to the previous magnification.
Currently, these links are depicted as up-arrows, indicating the upward motion
of the viewpoint that would lead to a decreased level of magnification.
Hyperlinks in the GlassEye browser are not limited to change of scale: translation links are used to connect related compositions. Links are presented to users as outline borders that appear around linked objects on mouse-over. At the main level, further emphasis on links is provided by a changed color for the object that has the current focus.
Figure 4: Detailed display for a specific composition.
Zooming is also used as a navigational tool. As GlassEye’s starfield display can become
somewhat crowded, magnification of areas of interest might help users
distinguish between labels that are located near each other.
Furthermore, semantic zooming is used to bring
individual detail items into view as magnification levels are increased,
providing some residue or "scent" that might guide users in their continued
navigation [11].
Users can choose between two possible approaches for control of general magnification. Clicking on the background of the display adjusts the magnification, with the left mouse button increasing the magnification, while the right button decreases magnification. In both cases, magnification changes continuously throughout the duration of the mouse press and is centered on the location of the click. Alternatively, a slider in the GlassEye tool bar can be used to directly adjust the magnification.
Animation in visual interfaces can be useful
for illustrating system state and for making interfaces more comprehensible
[2]. In zoomable interfaces, animation has been shown to be potentially
beneficial for tasks involving the reconstruction of tree structures based on
exploration experience [3]. GlassEye
uses animation to illustrate the transitions associated with change of
magnification (or change of position through hyperlinks). In addition to providing users with
orienting information that may assist navigation, these animations provide an
engaging visual effect that may encourage exploration.
The use of varying magnification in the GlassEye
browser presents some challenges for effective navigation. As the magnification
level increases, labels for the individual compositions grow larger, eventually
disappearing once the magnification level exceeds their maximum magnification.
In essence, the view "zooms past" the objects.
This may lead to a view in which the user does not have any navigational
cues – a situation known as "desert fog"[13].
The GlassEye browser uses an overview window to
address this problem. This display contains a miniaturized version of the
initial contents of the main GlassEye window. However, unlike the display in
the main window, the overview does not change magnification: as the user zooms
in or out in the main window, the scale of the overview remains constant. A field-of-view box [20] in the overview
illustrates the position of the current main window view in the x-y space of the
original display: as the magnification increases, the size of the outline in
the overview window decreases (and vice-versa). The overview can be used for
navigation – clicking anywhere in the outline view will cause the view in the
main window to move to a view centered on the position selected.
The overview window provides users with sufficient
context to avoid the "lost in zooming space" problem. The combination of the
miniaturized representation of the overall space and single-click navigation
provides users with a tool that can always be used to find some area of
potential interest. As screen space constraints limit the size of the overview
display, items in the overview are generally not legible: they are simply
intended as indicators of the presence of an item of potential interest at a
given location. As such, they provide additional, if incomplete, navigational
residue [11].
A "reset" button on the GlassEye tool bar provides
an additional navigational aid, in the form of a single button that can be pressed
to restore the browser to the original view.
A simple search facility provides support for
finding compositions by name. When a
user searches for targets that match a given string, matching labels are
outlined and highlighted while all others are grayed out. This provides a
display that supports quick visual location of the objects of interest while
retaining appropriate context.
Currently, search is limited to simple textual prefixes: a production
system would be likely to support more sophisticated searching.
Spotfire [25] supports starfield displays based on
arbitrary axes: given an n-dimensional
data set, starfield displays can be built using any two of those dimensions as
the spatial axes. This feature provides significant flexibility for users
interested in identifying potentially interesting visualizations. However, many
of the possible views of a given data set are not useful, leaving users with
the challenge of sorting through a potentially large space in search of appropriate
views.
The GlassEye browser provides a small set of
pre-defined views to provide the flexibility of multiple views of the data set
without the potential for confusion associated with arbitrary choices. As mentioned above, the default display of
the GlassEye browser uses the date of composition on the x-axis and the number
of instruments on the y-axis. A
list-box at the top of the screen can be used to select one of the alternative
views: currently, three choices are provided.
The current implementation of GlassEye uses the alternative views to explore a potentially interesting approach to visualization of a catalog of music: the use of subjective criteria to arrange items in a visualization. Given the potentially evocative nature of music, we hypothesized that some users might want to use criteria such as intensity, mood, or other less concrete descriptors in order to identify pieces of music.
Toward that end, we assigned ordinal values of
"mood" and "intensity" to each of the compositions in the GlassEye browser.
These attributes formed the basis for the second view of the data, which we
labeled the "emotion finder": mood on the x-axis and intensity on the y-axis.
This might be used to identify compositions with a desired mix of these
features: for example, "which pieces are upbeat but not too intense?"
A view displaying popularity (x-axis), vs.
length (y-axis) is provided as a third alternative. Popularity values might be
useful for individuals interested in locating pieces that might be most familiar.
GlassEye uses a generalization of the Musical
Scrollbar to support navigation through Glass’ music. Each composition has an
associated audio sample, which is played when the user mouses over any
component of the piece, including both the top-level label and the details
presented at greater levels of magnification. A text label at the top of the
GlassEye browser displays the name of the piece being played, providing users a
means of identifying pieces of interest.
The use of mouse-over to initiate audio playback
provides a highly interactive, if potentially confusing, model. Moving quickly
throughout the system, users might hear only a few seconds of any single piece,
but this might be sufficient for identifying items of potential interest.
Of course, incessant playback
might be annoying in certain circumstances. Musical playback in GlassEye is
turned off by default and must be activated through a checkbox selection.
6. DISCUSSION
The GlassEye browser
synthesizes a variety of information visualization techniques for use in a
novel domain. Where past starfield visualizations such as the HomeFinder [28]
and the FilmFinder [1] have focused on largely data driven tasks, with known
goals, GlassEye attempts to provide support for more open-ended exploration.
Towards that end, animated zooming, color highlighting, and digitized audio are
used to entice users. Our hope is that the combination of these features with
the dynamic querying and spatial representation of the starfield view will help
users build an understanding of the data while enjoying the experience of
exploring it.
In many ways, this system might be most valuable as a test platform for
ideas. Several of our design decisions
arose out of intuition, and were therefore not justified by user studies or
existing research. Empirical evaluation of these aspects of the design might
prove interesting and constructive for deployment
of future systems based on the GlassEye browser. Furthermore, the current
design identified numerous issues that must be addressed in any eventual
production versions of the browser.
6.1
Navigability
Although providing an enjoyable user experience is one of
the goals of the Glass site, the GlassEye browser will be most successful if users are consistently able to locate items
of interest. The starfield display,
query filters, overview window, and hyperlinks combine to provide navigation
support that is potentially both powerful and confusing. Our initial use of the
browser identified several design issues that affect navigability: further
examination of these issues might inform subsequent revisions to the system.
6.1.1
Representation and Layout of Data
As described
above, the top-level representation for a composition is currently is either
the text of its title, or a circle if no title exists. The Glass catalog
contains a large number of untitled works: inclusion of these works in the
browser might lead to a display containing large numbers of circles. In this
case, additional measures or representations might be needed to support
navigation and searching.
The use of title
strings to represent compositions raises additional concerns relating to space
utilization and overlap. Currently, these strings are simply placed by
calculating the appropriate (x, y) position
of the composition in the currently selected axes, and then using that position
as the starting point for drawing the text. This simple approach creates two
potential problems. The first – interpretation of a text string occupying
significant area in an x-y plot – is likely to be a problem for any use of
textual representations. For example,
if the text of a label covers the horizontal space corresponding to the years
1973-1978 inclusive, what conclusions can the user draw about the date of
composition?
The second, more
serious problem involves label overlap. Even with the small subset of the Glass
catalog used in the current prototype, many of the labels overlap in a manner
that obscures text and inhibits comprehension. Although the extent of this
problem might be lessened through algorithms that attempt to avoid overlap,
such approaches would have the detrimental side effect of moving labels away
from the x-y position implied by the values of the attributes used in a given
view of the data. This would reduce the information conveyed by the position of
the label in the plane. Labeling approaches such as excentric labels [9], label
connectors, or semantic labeling [15] might provide alternative possibilities
that would avoid this loss of information.
A final layout concern involves the display of detailed
information for a given composition. In
the current implementation, detailed data about each composition is displayed
when the user zooms to an appropriate magnification. However, the layout of this
information is somewhat informal, and might be improved through appropriate
graphic design.
6.1.2
Searching
Search facilities
in the browser are currently limited to simple text prefix matching based on
titles of individual compositions. Extensions of this facility to include
searches through detailed data and/or structured searches restricted to
particular data fields would significantly increase the utility of the
searching tools.
Such extensions would also bring additional design
challenges relating to the display of search results in a multi-scale
interface. Specifically, how should
search results be displayed if a matching target is not visible at the current
magnification? If the user can zoom in to see the target, a "critical zone"
indicator [13] might be displayed to indicate the presence of material that
would not be visible without further magnification. In the dual case – if the
target is not visible because magnification is too great and the view has
"zoomed past" it – appropriate strategies for indicating the search match are
not obvious, and would be an interesting area of further investigation.
6.1.3
Aggregation
Musical works can
often be viewed as aggregations of subcomponents: symphonies consist of
movements, operas of arias, etc. Display of the hierarchical nature of many of
the pieces in the Glass catalog in a manner that supports effective navigation
raises many additional issues that are not addressed in the current prototype.
Magnification and
zooming might be used to illustrate hierarchical relationships. From the
beginning overview screen, users might zoom in to a display that shows each of
the components of a given work, along with background description that pertains
to that work as a whole. Further zooming might lead to detailed information
about each of the individual components. This approach might be fruitful, but
users may become disoriented if cues are not provided to help place the
selected component within the greater whole.
Appropriate
handling of hierarchical works is also a concern for the layout of the
starfield display. The coordinate axes used by the GlassEye browser rely on the
simplifying notion that a given work can be accurately categorized along two
dimensions. However, for works that
have multiple subcomponents that differ in length, instrumentation, mood, and
other criteria that might be used on a starfield display axis, it is not at all
clear how the work as a whole should be described.
One possibility
might be to present the work as occupying some an area in the 2-D space,
corresponding to the range of values that might be appropriate for the work in
a given attribute. This approach is likely to be effective only if the values
are continuous, as discontinuous values might require multiple areas. Another
possibility is the provision of an option that would allow display of
individual sub-components as works in their own right. This promotion of the
components of a given work would allow for identification of individual
components, with the possible costs of increased screen clutter due to a vastly
larger set of data. Furthermore, it is not clear how such a display would
indicate which components belonged to the same work, thus potentially
increasing user disorientation and confusion.
The goal of any such approach would be to maximize the
amount of data displayed without hindering comprehension or navigation. As this
is a general tradeoff faced by information visualization systems, further
examination in the context of the Glass catalog browser may be an interesting
area of investigation.
6.2 Subjective
Data
Starfield displays generally
use well-defined attributes for the primary x-y axes. Numeric, alphabetic, or other familiar values are preferred, as
they likely to be most meaningful to users and thus easily interpreted.
Following this model, the GlassEye prototype uses attributes such as year,
number of instruments, and length for some of the starfield views. However,
other views – such as the mood vs. intensity display – break from this
approach, relying instead upon subjective data.
Our interest in presenting
views based upon subjective attributes was motivated by our belief that certain
users or classes of users might be interested in identifying music that meet
certain emotional or otherwise hard to quantify criteria. Some listeners might
be interested in identifying pieces that were particularly light and "up-beat",
while others might be looking for pieces that were relaxing and "not very
intense". These subjective attributes
were seen as particularly relevant for users interested in identifying music
for licensing. For example, filmmakers
might use these attributes to identify compositions with desired mood or tempo.
Attributes such as these
might be displayed using technical measures or established musical terminology.
While these measurements might be useful for providing accurate information to
musicians and other skilled users, they might prove confusing for others. Subjective ratings on artificial scales
were chosen in the hopes of avoiding confusion that might be caused by overly
technical terminology.
Further investigation will be
needed to understand the issues related to the use of attributes that are not
well defined or understood as the bases for starfield visualizations. For instance, such axes may prove useful for
establishing ordinal relationships, but users might find the meaning of
distance on a particular axis harder to comprehend. More plainly, on a time-based scale, most users- will quickly
infer that an item is roughly 45 minutes long if it is midway between the
30-minute item and the 60-minute item. However, the same distance might be much
harder to interpret if the scale is "mood" or "intensity". Determination of
when – if at all – such attributes are useful in starfield visualizations would
be an interesting area of further study.
From a purely pragmatic
viewpoint, these subjective ratings present a challenge when working with a
catalog that may contain over one thousand entries. Generating these ratings
and insuring their consistency is likely to be a daunting task. Ideally, those
most familiar with Glass’ music – his production staff - would perform this
task. Alternatively, future versions of a Glass site might use a collaborative
filtering approach [22] to support visualizations based on ratings submitted by
site visitors.
6.3
Music
Despite the
inclusion of audio clips, the GlassEye prototype is primarily a visual system.
This is not surprising, given the influence of information visualization
techniques on the system design and the overwhelmingly visual nature of most
current web environments.
Ideally, the
musical content would be an integral component of a browser for the Glass
catalog. Additional audio might be added to the current system in the form of
longer audio clips presented along with the details for individual pieces.
Other possibilities include additional facilities that support navigation
through the catalog in terms of musical similarity. For example, the Glass
catalog might be arranged by trends in the composer’s career, separating early
repetitive pieces from more recent melodic compositions.
As browser
capabilities and formats for storage and manipulation of music evolve,
additional facilities might support more active forms of exploring the Glass
catalog. For example, appropriately
designed interfaces might support display of musical notation and investigation
of the effects of changes to a score – perhaps providing a "write your own
Philip Glass music" tool.
Information visualization tools attempt to provide users with graphic
tools for visual exploration data sets, in the hopes of building understanding
of patterns and detail. In some sense,
the GlassEye browser represents a primitive attempt at "audio visualization":
the tools and techniques required to build a mental model of a body of music.
6.4
Scalability
The prototype browser currently includes a small subset of
the Glass catalog, with all relevant data hard-coded into the Java applet. Although a production system would clearly
require a more scalable data representation – perhaps a relational database or
XML implementation – the visualization aspect of the browser faces scalability
concerns. As the data set grows, crowding
and overlap in the visualization space will increase, leading to decreased
utility and increased confusion.
Segmenting the data into different data sets – perhaps by type of
composition –may lessen this problem, at the expense of eliminating the comprehensive
overview of the complete catalog.
6.5
Performance
Response times of 100ms have
been suggested as requirements for effective dynamic query systems [23]. Generalizing this requirement to the
GlassEye browser, we have a goal of 100ms response times for zooming operations,
initiation of musical playback, and other interactions. This may be a difficult goal to reach for an
applet-based visualization. While performance of components such as Swing and
Jazz [5] are constantly improving, zooming operations remain computationally
expensive and potentially slow.
Retrieval of the
audio clips required for browsing of the Glass audio may further hinder
performance. The current prototype uses Sun .au files for audio samples. Future
implementations may want to use more modern technologies such as MP3 or
streaming audio to overcome the limitations of .au files. Decisions regarding
audio formats should consider the bandwidth requirements of various digital
audio formats, and their impact upon users with low speed connections. A tiered
interface with differing bandwidth requirements might be provided to avoid
excluding users who do not have fast Internet connections.
6.7
Next Steps
This prototype
provides a demonstration of the potential of combining starfield displays with
Zoomable User Interfaces. We were particularly encouraged by the interactive,
animated zooming and "fly-over" musical playback, which work together to create
an enjoyable user experience. Unfortunately, the prototype raises almost as
many questions (described above) as it answers. Additional efforts in several
areas would be necessary for a production system base on the GlassEye
prototype:
Visual
design and placement within an overall site design: The
look-and-feel of the browser window and controls, and the detailed display view
(Figure 4) would clearly benefit from collaboration with a visual designer.
Furthermore, this browser is likely to be used as one component of a larger
site: as such, it should have a visual design that is consistent with the rest
of the site. Ideally, this visual design would help users identify the browser
with the site, and to associate both with Philip Glass, thus maximizing the
potential for "branding."
Performance,
and other technical issues. As described above, GlassEye’s extensive use of Jazz and Java
raises significant concerns regarding deployment for general Web usage.
Although appropriate design and implementation improvements might reduce some
of this overhead, it is not at all clear that acceptable performance for users
of medium and slow connections will be achievable. Audio output presents
another challenge, as streaming audio or MP3 would almost certainly be
preferable to low-quality .au files. Re-implementation in a browser plug-in
environment such as MacroMedia’s Flash [16] or equivalent might be one approach
to increasing performance and audio quality.
Navigability
and usability: Concerns raised above regarding navigability and scalability are
only two examples of the general concern of usability. Although we have been
encouraged by our use of the system, we have not conducted any formal or
informal usability testing. Such testing might help us understand the usability
of the Zoomable starfield space and identify limits of information density, in
order to inform future redesign.
7. RELATED WORK
The chronological, starfield
display used in the GlassEye browser has its roots in earlier projects such as
the FilmFinder [1], LifeLines [19], and the National Digital Library Collection
browser [21]. GlassEye differs from FilmFinder and LifeLines in its approach to
the use of zooming for display of overview and detail information. These
earlier systems use zooming to narrow focus in to a particular point of
interest, with details presented in a separate window. GlassEye uses zooming to
navigate in scale to the desired detailed information, with a separate,
dedicated window providing the overview.
Kullberg’s Dynamic Timelines
[14] used timelines and zooming in a 3D environment to present a history of
photography. The GlassEye browser and Dynamic Timelines make similar use of
zooming for access to increased levels of detail. Kullberg used a 3D display to
provide additional contextual information that might enrich the user
experience, at the possible expense of additional processing overhead that is
likely to be prohibitive for web-based environments. Dynamic Timelines does not address the issue of overviews,
leaving the possibility that users might become "lost in scale space".
Fernström and Bannon’s BROWSE
project [10] was the first to examine the possibility of starfield displays for
browsing an abstract space of musical works. BROWSE uses Macromedia Authorware
in conjunction with custom-built sound-drivers, to support exploration of the
Fleischmann collection of Irish traditional music. BROWSE uses audio
localization as a tool for navigation: up to eight objects close to the cursor
can play simultaneously, with objects represented by sources arranged in an
audio space according to their position relative to the cursor. This use of
spatial playback of audio represents both intriguing possibilities and
significant technical challenges for use in a web-based browser for Philip
Glass compositions.
Perhaps closest to the goals
of the Glass site, Sony Music’s licensing web site [24] uses Thinkmap [26]
technology to present music that might be licensed in an associative net of
concepts supporting search, query, and song samples.
Other recent efforts aimed at navigation and visualization of music
include the ISEE3D musical sound browser [27], and Open Hypermedia for
hypertext linking within audio streams [8]. Possibilities for navigating
musical spaces include query by humming and other approaches to the use of
audio input as search queries [6][17]. Finally, systems like the Brain Opera’s
Singing Tree [18] and the WorldBeat baton [7] suggest innovative approaches for
the use of new input modalities for interaction with music-based systems.
8. CONCLUSIONS
& FUTURE WORK
The GlassEye prototype
browser uses a range of techniques including starfield displays, multi-scale
zoomable interfaces, dynamic querying, and audio output to support the
exploration of a catalog of musical compositions. This design represents a
synthesis of experience with information visualization systems and analysis of
requirements and possible users tasks for a Philip Glass web site.
The success of any future
production use of a system based on this prototype will be dependent upon work
that evaluates the appropriateness and efficacy of the interface design while
resolving the technical issues of scalability and performance. Without
appropriate user studies and further analysis of tasks and needs, it will be
difficult to conclude that the design actually meets user needs.
In any case, the GlassEye
browser does not comprehensively address systems goals and user tasks described
above. Additional background data,
tighter integration of Glass’ music, structured search, and other interactive
tools for exploring and understanding music would be necessary for fully
meeting the stated goals. Of course, further understanding of users and their
needs may motivate creation of distinct tools for different needs: although a
single interface for all classes of users has a certain appeal, the result may
be a "lowest common denominator" that fails to meet the needs of any user
population.
Continuing improvements in digital music formats and available
bandwidth will undoubtedly lead to increased possibilities for web-based
interfaces to musical content. Current interfaces tend to treat musical content
as a "second-class" citizen, focusing instead on the more tractable problems of
visual display. Extensions of the GlassEye prototype to include novel
mechanisms for interacting with and creating music [7][18], or exploring the
mathematical structure of Glass’ compositions [29] present intriguing
possibilities that might help old fans and new listeners learn about and
appreciate the music of Philip Glass.
This work was done at IBM T.J. Watson Research Center, and follows from work done at University of Maryland under the support of IBM’s University Partnership Program. Thanks to Mark Podlaseck for making this project possible, to Edith Schonberg, John Karat, Kurt Munkacsi, Don Christensen, and Jim Keller, for their support and assistance, and Ben Bederson, Ben Shneiderman, and Judith Yanowitz for their comments on early versions of this paper.
[1]
Ahlberg
C., and B. Shneiderman, Visual Information Seeking: Tight Coupling of Dynamic
Query Filters With Starfield Displays,Proceedings
of ACM CHI '92 Conference, 313-317 (May 1992).
[2]
Baecker,
R., I Small, I, and R. Mander,
Animation at the Interface, In B. Laurel, ed. The Art of Human-Computer Interface Design, 251-267.
Addison-Wesley, San Francisco, CA (1990).
[3]
Bederson,
B.B., and A. Boltman, Does Animation Help Users Build Mental Maps of Spatial
Information? University of Maryland CS-TR-3964, UMIACS-TR-98-73. ftp://ftp.cs.umd.edu/pub/hcil/Reports-Abstracts-Bibliography/98-11html/98-11.html
(1998).
[4]
Bederson
B.B., J.D. Hollan, K. Perlin, J. Meyer, D. Bacon, and G. Furnas, Pad++: A
Zoomable Graphical Sketchpad for Exploring Alternate Interface Physics, Journal of Visual Languages and Computing,
7, 3-31 (1996).
[5]
Bederson,
B., and B. McAlister, Jazz: An Extensible 2D+Zooming Graphics Toolkit in
Java, University of Maryland CS-TR-4015, UMIACS-TR-99-24. URL: ftp://ftp.cs.umd.edu/pub/hcil/Reports-Abstracts-Bibliography/99-07html/99-07.html
(1999).
[6]
Blackburn,
S. and D. DeRoure, A Tool for Content-Based Navigation of Music, Proceedings of ACM Multimedia ’98 Conference,
361-368 (September, 1998).
[7]
Borchers,
J., World Beat: Designing a Baton-Based Interface for an Interactive Music
Exhibit, Proceedings of ACM CHI ’97
Conference, 131-138 (March, 1997).
[8]
DeRoure,
D., S. Blackburn, L. Oades, J. Read, and N. Ridgway. Applying Open Hypermedia to Audio, Proceedings of ACM Hypertext ’98 Conference, 285-286 (June, 1998).
[9]
Fekete,
J.D. and C. Plaisant, Excentric Labeling: Dynamic Neighborhood Labeling for
Data Visualization, Proceedings of ACM
CHI ’99 Conference, 512-519 (May, 1999).
[10]
J.M.
Fernström, J. M., and L.J. Bannon,, Explorations in Sonic Browsing, Proceedings of BCS HCI '97, Bristol, UK.
http://www.ul.ie/%7Eidc/library/papersreports/MikaelFernstrom/hciuk97/
sonicbrowse.html.
[11]
Furnas,
G., Effective View Navigation, Proceedings
of ACM CHI ’97 Conference, 367-374 (March, 1997).
[12]
Hochheiser,
H., and B. Shneiderman, Performance Benefits of Simultaneous over Sequential
Menus as Task Complexity Increases, International
Journal of Human Computer Interaction, forthcoming.
[13]
Jul,
S., and G. Furnas, Critical Zones in Desert Fog: Aids to Multiscale
Navigation, Proceedings of ACM UIST ’98
Conference, 97-108, (May, 1998).
[14]
Kullberg,
R., Dynamic Timelines: Visualizing the History of Photography, Proceedings of ACM CHI ’96 Conference
Companion, 386-387 (April, 1996).
[15]
Li,
J., C. Plaisant, C., and B. Shneiderman B., Data Object and Label Placement
for Information Abundant Visualizations, Proceedings ACM Conference on New
Paradigms for Information Visualization, 41-48 (November, 1998).
[16]
Macromedia
Flash http://www.macromedia.com/software/flash/
[17]
McNab,
R., L. Smith, I. Witten, C. Henderson, and S. Cunningham, Towards the Digital
Music Library: Tune Retrieval from Acoustic Input, Proceedings of ACM Digital Libraries ’96 Conference, 11-18 (March,
1996).
[18]
Oliver,
W., J. Yu, and E. Metois, The Singing Tree: Design of an Interactive Musical
Interface, Proceedings of ACM DIS ’97
Conference, 261-264 (August, 1997).
[19]
Plaisant,
C., B. Milash, A. Rose, S. Widoff, and B. Shneiderman, LifeLines: Visualizing
Personal Histories, Proceedings of ACM
CHI '96 Conference, 221-227 (April, 1996).
[20]
Plaisant,
C., D. Carr, and B. Shneiderman, Image Browsers: Taxonomy, Guidelines, and
Informal Specifications, IEEE Software 11(1),
33-52 (March, 1995).
[21]
Plaisant,
C., G. Marchionini, T. Bruns, A. Komlodi, and L. Cambpell, Bringing Treasures
to the Surface: Iterative Design for the Library of Congress National Digital
Library, Proceedings of ACM CHI '97
Conference, 518-525 (March, 1997).
[22]
Resnick,
P. and H. Varian, Recommender Systems; Communications
of the ACM, 40(3), 56-58 (March 1997).
[23]
B.
Shneiderman, B, Dynamic Queries for Visual Information Seeking, IEEE Software 11(6), 70-77 (November,
1994).
[24]
Sony Music Licensing http://www.sonymusic.com/licensing.
[25]
Spotfire http://www.spotfire.com
[26]
Thinkmap
http://www.plumbdesign.com/projects/thinkmap.html
[27] Vertegall, R. and B. Eaglestone, Looking for Sound? Selling Perceptual Space in Hierarchically Nested Boxes, Proceedings of ACM CHI ’98 Conference, 295-296 (April, 1998).
[28] Williamson, C., and B. Shneiderman, The Dynamic HomeFinder: Evaluating Dynamic Queries in a Real-Estate Information Exploration System, Proceedings of ACM SIGIR ’92 Conference, 338-346 (June, 1992).
[29] York, W. Form and process, Sonus I/2 (1981) reprinted in Kostelantz, R, ed. Writings on Glass. University of California Press, Berkeley CA (1997).