|
1
|
- Extract from Catherine Plaisant talk at the Human-Computer Interaction
Lab (HCIL) annual symposium (www.cs.umd.edu/hcil/soh)
- About FeatureLens (a MONK prototype)
- Interface developed at Maryland using
D2K frequent pattern analysis from NCSA
|
|
2
|
Anthony Don, Catherine Plaisant, Tanya Clement
- University of Maryland
- Loretta Auvil, NCSA
- With the help of others from the MONK project
|
|
3
|
|
|
4
|
|
|
5
|
- Study of Gertrude Stein’s “The
Making of Americans” (MoA)
- Tanya Clement, PhD student from English Department
|
|
6
|
- [1086] Always from the beginning there was to me all living as
repeating. This is now a description of my feeling. As I was saying
listening to repeating is often irritating, always repeating is all of
living, everything in a being is always repeating, more and more
listening to repeating gives to me completed understanding.
|
|
7
|
- [1086] Always from the beginning there was to me all living as repeating.
This is now a description of my feeling. As I was saying listening to
repeating is often irritating, always repeating is all of living,
everything in a being is always repeating, more and more listening to
repeating gives to me completed understanding.
|
|
8
|
- [1086] Always from the beginning there was to me all living as repeating.
This is now a description of my feeling. As I was saying listening to repeating
is often irritating, always repeating is all of living, everything in a
being is always repeating, more and more listening to repeating gives to
me completed understanding.
|
|
9
|
- [1086] Always from the beginning there was to me all living as
repeating. This is now a description of my feeling. As I was saying
listening to repeating is often irritating, always repeating is all of
living, everything in a being is always repeating, more and more
listening to repeating gives to me completed understanding.
|
|
10
|
- [1086] Always from the beginning there was to me all living as
repeating. This is now a description of my feeling. As I was saying listening
to repeating is often irritating, always repeating is all of living,
everything in a being is always repeating, more and more listening to
repeating gives to me completed understanding.
|
|
11
|
- Do the changes in repetition
correspond to the novel’s evolving
theories about identity and representation? And how?
|
|
12
|
- What text features are highly repeated in the text?
- Frequent words
- Frequent n-grams (consecutive words)
- Frequent patterns of n-grams (more “fuzzy” non consecutive matches)
- How do they change over time (i.e. along the text)?
- Locate features in text
- Compare features
- Distribution over time
- Find features that exhibit specific distributions (e.g. spike)
|
|
13
|
|
|
14
|
|
|
15
|
|
|
16
|
|
|
17
|
|
|
18
|
|
|
19
|
|
|
20
|
|
|
21
|
|
|
22
|
|
|
23
|
|
|
24
|
|
|
25
|
|
|
26
|
|
|
27
|
|
|
28
|
- Define metrics on distributions and rank features accordingly
- increase/decrease topics evolution
|
|
29
|
- Define metrics on distributions and rank features accordingly
- spikes/sinks specific
events
|
|
30
|
|
|
31
|
|
|
32
|
|
|
33
|
|
|
34
|
|
|
35
|
- Ongoing longitudinal case study
- Tanya Clement and « The Making of Americans »
- Pilot user study with 8 users
- 3 tasks then free exploration (30 min)
- think aloud protocol - gather insights about text
|
|
36
|
|
|
37
|
|
|
38
|
|
|
39
|
|
|
40
|
|
|
41
|
|
|
42
|
|
|
43
|
|
|
44
|
|
|
45
|
|
|
46
|
|
|
47
|
|
|
48
|
- What text features are highly repeated in the text?
- Frequent words
- Frequent n-grams (consecutive words)
- Frequent patterns of n-grams (more “fuzzy” non consecutive matches)
- How do they change over time (i.e. along the text)?
- Locate features in text
- Compare features
- Distribution over time
- Find features that exhibit specific distributions (e.g. spike)
|
|
49
|
- Text mining requires good UI to analyze results
- Live demo and technical report at: www.hcil.cs.umd.edu/hcil/textvis/featurelens
- Support from HCIL and Andrew W. Mellon Foundation
|