Rosalind W. Picard, Sc.D., FIEEE
Director of Affective Computing Research
Co-Director of Things That Think
Professor of Media Arts and Sciences
M.I.T. Media Laboratory, E15-448
20 Ames Street
Cambridge, MA 02139; USA
picard (you can make the "at") media (dot) mit (dot) edu

A: Daniel Bender
M.I.T. Media Laboratory, E15-331
Phone: (617) 253-0369
Fax: (866) 806-7264
danielb (you can make the "at") media (dot) mit (dot) edu
Research in Video and Image Libraries: Browsing, Retrieval, Annotation

The average person with a computer will soon have access to the world's collections of digital video and images. However, unlike text, which can be alphabetized, or numbers, which can be ordered, images and video have no general language to aid in their organization. Although tools that can "see" and "understand" the content of imagery are still in their infancy, they are now at the point where they can provide substantial assistance to users in navigating through visual media.

This research is application-oriented, and couples the results of a society of models with new tools that allow computers to help people browse, annotate, and retrieve digital images and video. A new learning system, FourEyes, has been developed and equipped with a society of vision texture models to annotate part of a database of vacation photos. The user labels portions of some of the images in the database as "building" and "street." The system infers which models best automate this labeling process, and then uses those models to label the rest of the database. Once the database is labeled, the Photobook system can retrieve images not just based on content, but also based on learned feedback from the user's input about content.

Selected Publications