Information Visualization

Shared by: yurtgc548
Categories
Tags
-
Stats
views:
1
posted:
11/25/2012
language:
simple
pages:
74
Document Sample
scope of work template
							why does this suck?
                             Information
                            Visualization
                               Jeffrey Heer
                   UC Berkeley | PARC, Inc.
                       CS160 – 2004.11.22



(includes numerous slides from Marti Hearst, Ed Chi, Stuart Card, and Peter Pirolli)
Basic Problem



We live in a
new ecology.
Scientific Journals
Journals/person increases 10X every 50 years
 1000000
  100000
   10000         Journals
    1000
                                   Journals/People x106
     100
      10
       1
     0.1
    0.01
                      Darwin              V. Bush   You
        1750   1800    1850        1900    1950     2000
                            Year
     Web Ecologies
            10000000

             1000000

              100000
  Servers




               10000

                1000                                        1 new server every 2 seconds
                 100
                                                              7.5 new pages per second
                  10

                   1
                   Aug-92 Feb-93 Aug-93 Feb-94 Aug-94 Feb-95 Aug-95 Feb-96 Aug-96 Feb-97 Aug-97 Feb-98 Aug-98


Source: World Wide Web Consortium, Mark Gray, Netcraft Server Survey
Human Capacity
 1000000
  100000
   10000
    1000
     100
      10
       1
     0.1
    0.01
                      Darwin          V. Bush   You
        1750   1800    1850    1900    1950     2000
Attentional Processes
 “What information consumes is rather obvious: it
 consumes the attention of its recipients. Hence a
 wealth of information creates a poverty of attention, and
 a need to allocate that attention efficiently among the
 overabundance of information sources that might
 consume it.”

                                          ~Herb Simon
                                  as quoted by Hal Varian
                                      Scientific American
                                         September 1995
Human-Information Interaction
   The real design problem is not increased
    access to information, but greater efficiency
    in finding useful information.
   Increasing the rate at which people can find
    and use relevant information improves
    human intelligence.
                          Amount of
                          Accessible
                          Knowledge


                                       Cost [Time]
Information Visualization
   Leverage highly-developed human visual
    system to achieve rapid understanding of
    abstract information.


                   1.2 b/s (Reading)
                   2.3 b/s (Pictures)
Information Visualization
   “Transformation of the symbolic into the geometric”
    (McCormick et al., 1987)

   “... finding the artificial memory that best supports
    our natural means of perception.'‘ (Bertin, 1983)

   The depiction of information using spatial or
    graphical representations, to facilitate comparison,
    pattern recognition, change detection, and other
    cognitive skills by making use of the visual system.
    (Hearst, 2003)
Why Visualization?
   Use the eye for pattern recognition; people good at
         scanning
         recognizing
         remembering images
   Graphical elements facilitate comparisons via
         length
         shape
         orientation
         texture
   Animation shows changes across time
   Color helps make distinctions
   Aesthetics make the process appealing
Visualization
Success Stories
Visualization Success Story

  Mystery: what is causing a cholera
  epidemic in London in 1854?
Visualization Success Story

                         Illustration of John Snow’s
                         deduction that a cholera
                         epidemic was caused by a
                         bad water pump, circa
                         1854.

                         Horizontal lines indicate
                         location of deaths.




                                  From Visual
                                  Explanations by
                                  Edward Tufte,
                                  Graphics Press,
                                  1997
Visualization Success Story
Illustration of John
Snow’s deduction that a
cholera epidemic was
caused by a bad water
pump, circa 1854.

Horizontal lines indicate
location of deaths.




                            From Visual Explanations by Edward Tufte,
                            Graphics Press, 1997
        A Visualization
            Expedition
(a tour through past and present)
Perspective Wall
Starfield Displays




             Slide adapted from Chris North   18
Film Finder
Table Lens
Distortion Techniques
Indented Hierarchy Layout


               Places all items along vertically
               spaced rows
               Uses indentation to show parent
               child relationships
               Breadth and depth end up
               fighting for space resources
Reingold-Tilford Layout




                                              Top-down layout
                                              Uses separate
                                              dimensions for
                                              breadth and depth



tidier drawing of trees - reingold, tilford
TreeMaps


                                     Space-filling
                                     technique that divides
                                     space recursively
                                     Segments space
                                     according to ‘size’ of
                                     children nodes

map of the market – smartmoney.com
SpaceTree
Cone Trees

                                             Tree layout in three
                                             dimensions
                                             Shadows provide 2D
                                             structure
                                             Can also make
                                             “Balloon Trees” – 2D
                                             version of ConeTree
cone tree – robertson, mackinlay, and card
Degree-of-Interest Trees
Hyperbolic Trees
Network visualization

                        Often uses physics
                        models (e.g., edges
                        as springs) to
                        perform layout.
                        Can be animated
                        and interacted with.
Network Visualization




                        Skitter, www.caida.org
WebBook
Web Forager
Document Lens
Data Mountain

                Supports document
                organization in a 2.5
                dimensional
                environment.
      Designing
  Visualizations
(some tricks of the trade)
Graphical Excellence [Tufte]
   the well-designed presentation of interesting
    data – a matter of substance, of statistics, and of
    design
   consists of complex ideas communicated with
    clarity, precision and efficiency
   is that which gives to the viewer the greatest
    number of ideas in the shortest time with the
    least ink in the smallest space
   requires telling the truth about the data.
Interactive Tasks [Shneiderman]
1.   Overview: Get an overview of the collection
2.   Zoom: Zoom in on items of interest
3.   Filter: Remove uninteresting items
4.   Details on demand: Select items and get
     details
5.   Relate: View relationships between items
6.   History: Keep a history of actions for undo,
     replay, refinement
7.   Extract: Make subcollections
Proposed Data Types
1.   1D: timelines,…
2.   2D: maps,…
3.   3D: volumes,…
4.   Multi-dimensional: databases,…
5.   Hierarchies/Trees: directories,…
6.   Networks/Graphs: web,…
7.   Document collections: digital libraries,…

This is useful, but what’s wrong here?
Basic Types of Data
   Nominal (qualitative)
       (no inherent order)
       city names, types of diseases, ...
   Ordinal (qualitative)
       (ordered, but not at measurable intervals)
       first, second, third, …
       cold, warm, hot
       Mon, Tue, Wed, Thu …
   Interval (quantitative)
       integers or reals
Ranking of Applicability of Properties
for Different Data Types
(Mackinlay 88, Not Empirically Verified)



  QUANT               ORDINAL              NOMINAL

  Position            Position             Position
  Length              Density              Color Hue
  Angle               Color Saturation     Texture
  Slope               Color Hue            Connection
  Area                Texture              Containment
  Volume              Connection           Density
  Density             Containment          Color Saturation
  Color Saturation    Length               Shape
  Color Hue           Angle                Length
Visualization Design Patterns
   Pre-Attentive Patterns
     Leverage things that automatically “pop-out” to human attention
     Stark contrast in color, shape, size, orientation
   Gestalt Properties
     Use psychological theories of visual grouping
     proximity, similarity, continuity, connectedness, closure,
       symmetry, common fate, figure/ground separation
   High Data Density
     Maximize number of items/area of graphic
     This is controversial! Whitespace may contribute to good visual
       design… so balance appropriately.
   Small Multiples
     Show varying visualizations/patterns adjacent to one another
     Enable Comparisons
Visualization Design Patterns
   Focus+Context
     Highlight regions of current interest, while de-emphasizing but
      keeping visible surrounding context.
     Can visually distort space, or use degree-of-interest function to
      control what is and isn’t visualized.
   Dynamic Queries
     Allow rapid refinement of visualization criteria
     Range sliders, Query sliders
   Panning and Zooming
     Navigate large spaces using a camera metaphor
   Semantic Zooming
     Change content presentation based on zooming level
     Hide/reveal additional data in accordance with available space
Software Architectures
   The Information Visualization Reference
    Model [Chi, Card, Mackinlay, Shneiderman]
   Evaluating
Visualizations
Evaluating Visualizations
   Visualizations are user interfaces, too…established
    methodologies can be used.
   Questions to ask
       What tasks do you expect people to perform with the
        visualization?
       What interfaces currently exist for this task?
       In what ways do you expect different visualizations to help
        or hurt aspects of these tasks?
   Metrics: task time, success rate, information gained
    (e.g., test the user, or exploit priming effects), eye
    tracking.
Evaluating Hyperbolic Trees
   The Great CHI’97 Browse-Off: Individual
    browsers race against the clock to perform
    various retrieval and comparison tasks.
   Hyperbolic Tree won against M$ File Explorer
    and others.
   Can we conclude that it is the better
    browser?
                              vs.
Evaluating Hyperbolic Trees
   No!
     Different people operating each browser.
     Tasks were not ecologically valid.
         Can’t say what is better for what.
   PARC researchers did extensive eye-tracking studies uncovering
    very nuanced visual psychology.
   Found Hyperbolic Tree is better when underlying information
    design (e.g., tree structure and labeling) is better.
     In case of CHI Browse Off, the Hyperbolic Tree had a quicker
       human user “behind the wheel”.
   Moral: Exercise judicious study design, but also don’t feel let
    down if task times are not being radically improved… subtleties
    abound.
                                            Questions?




Jeffrey Heer   jheer@cs.berkeley.edu
prefuse        http://prefuse.sourceforge.net
    Accuracy Ranking of Quantitative Perceptual Tasks
Estimated; only pairwise comparisons have been validated
         (Mackinlay 88 from Cleveland & McGill)
Interpretations of Visual Properties
 Some properties can be discriminated more
 accurately but don’t have intrinsic meaning

     Density (Greyscale)
         Darker -> More
     Size / Length / Area
         Larger -> More
     Position
         Leftmost -> first, Topmost -> first
     Hue
         ??? no intrinsic meaning
     Slope
         ??? no intrinsic meaning
   Micro-Aspects of
Visualization Design
(aka fun with visual psychology)
Preattentive Processing
   A limited set of visual properties are
    processed preattentively
       (without need for focusing attention).
   This is important for design of visualizations
     what can be perceived immediately

     what properties are good discriminators

     what can mislead viewers

         All Preattentive Processing figures from Healey 97
         http://www.csc.ncsu.edu/faculty/healey/PP/PP.html
Example: Color Selection




   Viewer can rapidly and accurately determine
   whether the target (red circle) is present or absent.
   Difference detected in color.
Example: Shape Selection




   Viewer can rapidly and accurately determine
   whether the target (red circle) is present or absent.
   Difference detected in form (curvature)
Pre-attentive Processing
   < 200 - 250ms qualifies as pre-attentive
       eye movements take at least 200ms
       yet certain processing can be done very quickly,
        implying low-level processing in parallel
   If a decision takes a fixed amount of time
    regardless of the number of distractors, it is
    considered to be preattentive.
Example: Conjunction of
Features




 Viewer cannot rapidly and accurately determine
 whether the target (red circle) is present or absent when
 target has two or more features, each of which are
 present in the distractors. Viewer must search sequentially.
            All Preattentive Processing figures from Healey 97
            http://www.csc.ncsu.edu/faculty/healey/PP/PP.html
Example: Emergent Features




    Target has a unique feature with respect to
    distractors (open sides) and so the group
    can be detected preattentively.
Example: Emergent Features




    Target does not have a unique feature with respect to
    distractors and so the group cannot be detected
    preattentively.
Asymmetric and Graded
Preattentive Properties
   Some properties are asymmetric
       a sloped line among vertical lines is preattentive
       a vertical line among sloped ones is not
   Some properties have a gradation
       some more easily discriminated among than others
Use Grouping of Well-Chosen
Shapes for Displaying Multivariate
Data
SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXO
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
GOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREM
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
GOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXO
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
               Text NOT Preattentive


SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXO
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
GOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREM
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
GOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXO
CERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEM
SCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC
 Preattentive Visual Properties
 (Healey 97)
length                Triesman & Gormican [1988]
width                 Julesz [1985]
size                  Triesman & Gelade [1980]
curvature              Triesman & Gormican [1988]
number                 Julesz [1985]; Trick & Pylyshyn [1994]
terminators            Julesz & Bergen [1983]
intersection           Julesz & Bergen [1983]
closure               Enns [1986]; Triesman & Souther [1985]
colour (hue)           Nagy & Sanchez [1990, 1992]; D'Zmura [1991]
                                Kawai et al. [1995]; Bauer et al. [1996]
intensity              Beck et al. [1983]; Triesman & Gormican [1988]
flicker               Julesz [1971]
direction of motion     Nakayama & Silverman [1986]; Driver & McLeod
    [1992]
binocular lustre       Wolfe & Franzel [1988]
stereoscopic depth      Nakayama & Silverman [1986]
3-D depth cues         Enns [1990]
lighting direction     Enns [1990]
Gestalt Principles

   Idea: forms or patterns transcend the stimuli
    used to create them.
       Why do patterns emerge?
       Under what circumstances?


   Principles of Pattern Recognition
       “gestalt” German for “pattern” or “form,
        configuration”
       Original proposed mechanisms turned out to be
        wrong
Gestalt Properties
Proximity




                     Why perceive pairs vs. triplets?
 Gestalt Properties
    Similarity




Slide adapted from Tamara Munzner
Gestalt Properties
Continuity




Slide adapted from Tamara Munzner
Gestalt Properties
Connectedness




Slide adapted from Tamara Munzner
Gestalt Properties
Closure




Slide adapted from Tamara Munzner
Gestalt Properties
Symmetry




Slide adapted from Tamara Munzner
Gestalt Laws of Perceptual
Organization (Kaufman 74)

   Figure and Ground
       Escher illustrations are good examples
       Vase/Face contrast
   Subjective Contour
More Gestalt Laws
   Law of Common Fate
       like preattentive motion property
            move a subset of objects among similar ones and
             they will be perceived as a group
Colors for Labeling
   Ware recommends to take into account:
       Distinctness
       Unique hues
         Component process model
       Contrast with background
       Color blindness
       Number
         Only a small number of codes can be rapidly perceived
       Field Size
         Small changes in color are difficult to perceive
       Conventions
Ware’s Recommended Colors for
Labeling




Red, Green, Yellow, Blue, Black, White, Pink, Cyan, Gray, Orange, Brown, Purple.
The top six colors are chosen because they are the unique colors that mark the ends
of the opponent color axes. The entire set corresponds to the eleven color names found
to be the most common in a cross-cultural study, plus cyan (Berlin and Kay)

						
Related docs
Other docs by yurtgc548
Nincs diacím
Views: 0  |  Downloads: 0
NickandKate-Forest.ppt - Wikispaces
Views: 0  |  Downloads: 0
NewProduct powerpointsToxic Environment.PPT
Views: 0  |  Downloads: 0
Newly Industrialised Countries
Views: 0  |  Downloads: 0
NEW WASHINGTON MARKETING RULES
Views: 0  |  Downloads: 0
New Twists on Traditional Marketing
Views: 0  |  Downloads: 0
New Struggle
Views: 0  |  Downloads: 0
New Products Portfolio Review
Views: 0  |  Downloads: 0
New Products Management
Views: 0  |  Downloads: 0