Bio Research+Teach Publications Press FAQ Personal | |
Rosalind W. Picard, Sc.D., FIEEE Director of Affective Computing Research MIT Media Lab, E14-348A 75 Amherst Street Cambridge, MA 02139; USA picard (you can make the "at") media (dot) mit (dot) edu download Curriculum Vitae (CV) Follow @RosalindPicard Assistant: R-admin (you can make the "at") media (dot) mit (dot) edu Accessibility |
Research in Video and Image Libraries: Browsing, Retrieval, AnnotationThe average person with a computer will soon have access to the world's collections of digital video and images. However, unlike text that can be alphabetized or numbers that can be ordered, image and video has no general language to aid in its organization. Although tools which can ``see'' and ``understand'' the content of imagery are still in their infancy, they are now at the point where they can provide substantial assistance to users in navigating through visual media.This research is application-oriented, and couples the results of a society of models with new tools that allow computers to help people browse, annotate, and retrieve digital images and video. A new learning system, FourEyes , has been developed, and equipped with a society of vision texture models to annotate part of a database of vacation photos. The user labels portions of some of the images in the database "building" and "street." The system infers which models best automated this labeling process, and then uses those to label the rest of the database. Once labeled, the Photobook system can retrieve images not just based on content, but also based on learned feedback from the user's input about content. Selected Publications
OLD Vision & Modeling Group Home Page |