IS 202 : Information Organization and Retrieval

August 29 : Tuesday

1. Course Overview 

Lecturer: Bob Glushko

Required Readings

"As We May Think" Vannevar Bush [Reader]

"MyLifeBits: a personal database for everything" Jim Gemmell, Gordon Bell, and Roger Lueder [Reader]

Communications of the ACM (January 2006)

"Scan This Book" Kevin Kelly [Reader]

New York Times (14 May 2006)

"The Library of Babel, from Labyrinths: Selected Stories & Other Writings" Jorge Luis Borges [Reader]

Resources

August 31 : Thursday

2. How to Think About Information 

Lecturer: Bob Glushko

Required Readings

Preface, Chapter 1 of The Intellectual Foundation of Information Organization [Textbook]

"Fundamental Forms of Information" Marcia Bates [Reader, 1033 - 1045]

Journal of the American Society for Information Science and Technology 57(8) (June 2006)

"Conduit Metaphor: A Case of Frame Conflict in our Langauge About Language (In Andrew Ortony (ed.), Metaphor and Thought) (skip "semantic pathology" from 176-184)" Michael Reddy [Reader]

Resources

September 5 : Tuesday

3. Information Organization {and,or,vs} Search 

Lecturer: Bob Glushko

Required Readings

Chapter 2 of The Intellectual Foundation of Information Organization [Textbook]

"Human Information Behavior: Integrating Diverse Approaches and Information Use" Amanda Spink and Charles Cole [Reader, 25-35 (skim or skip after page 29)]

Journal of the American Society for Information Science and Technology 57(1) (January 2006)

"Searching to Eliminate Personal Information Management" Edward Cutrell, Susan Dumais, and Jaime Teevan [Reader]

Communications of the ACM (January 2006)

"From IR to Search and Beyond" Ramana Rao [Reader]

ACM Queue (May 2004)

"Enterprise Search: Tough Stuff" Rajat Mukherjee and Jianchang Mao [Reader]

ACM Queue (April 2004)

Resources

September 7 : Thursday

4. Concepts and Categories 

Lecturer: Bob Glushko

Required Readings

"The Vocabulary Problem in Human-System Communication" George W. Furnas, Thomas K. Landauer, Louis M. Gomez, and Susan T. Dumais [Reader]

Communications of the ACM, 30(11), 964-971 (1987)

"Women, Fire, and Dangerous Things. (preface through p 67)" George Lakoff [Reader]

Resources

September 12 : Tuesday

5. Metadata and Metadata Standards [1] 

Lecturer: Bob Glushko

Required Readings

Chapter 3, Chapter 4 (62 - 66) of The Intellectual Foundation of Information Organization [Textbook]

"Using Dublin Core" Diane Hillman [Reader]

"Using Dublin Core - The Elements" Diane Hillman [Reader]

"Understanding MARC Bibliographic: Machine-Readable Cataloging" Betty Furrie [Reader]

Resources

September 14 : Thursday

6. Metadata and Metadata Standards [2] 

Lecturer: Bob Glushko

Required Readings

Chapter 5 of The Intellectual Foundation of Information Organization [Textbook]

"Metadata Interoperability and Standardization – A Study of Methodology Part I. Achieving Interoperability at the Schema Level (skip or skim "Sources and References")" Lois Chan and Marcia Zeng [Reader]

"Metacrap: Putting the torch to seven straw-men of the meta-utopia" Cory Doctorow [Reader]

Resources

September 19 : Tuesday

7. Controlled Names and Controlled Vocabularies 

Lecturer: Bob Glushko

Required Readings

Chapter 6, Chapter 8 (127-132) of The Intellectual Foundation of Information Organization [Textbook]

"What is a controlled vocabulary?" Karl Fast, Fred Liese, and Mike Steckel [Reader]

"Creating a controlled vocabulary" Karl Fast, Fred Liese, and Mike Steckel [Reader]

"Document Engineering, Chapter 12, Analyzing Document Components" Glushko and McGrath [Reader]

Resources

September 21 : Thursday

8. Classification 

Lecturer: Bob Glushko

Required Readings

Chapter 8 (139-146), Chapter 9 (159-171), Chapter 10 of The Intellectual Foundation of Information Organization [Textbook]

"How to make a faceted classification and put it on the web" William Denton [Reader]

"Faceted metadata for image search and browsing" Yee, Swearington, Li, and Hearst [Reader]

Resources

September 26 : Tuesday

9. Metadata for Multimedia and Non-text Information 

Lecturer: Andrea Moed

Required Readings

"The Language of Images: Enhancing Access to Images by Applying Metadata Schemas and Structured Vocabularies. Introduction to Art Image Access.(Martha Baca, Ed)" Patricia Harping [Reader]

"Context Data in Geo-Referenced Digital Photo Collections " Mor Naaman, Susumu Harada, QianUing Wang, Hector Garcia-Molina, and Andreas Paepcke [Reader]

Proceedings of the 12th annual ACM international conference on Multimedia (2004)

"Editing Out Video Editing" Marc Davis [Reader]

IEEE Multimedia (April-June 2003)

Resources

September 28 : Thursday

10. Social / Distributed Classification 

Lecturer: Bob Glushko

Required Readings

"Folksonomies - Cooperative Classification and Communication Through Shared Metadata" Adam Mathes [Reader]

"Social Bookmarking Tools (I) A General Review" Tony Hammond, Timo Hannay, Ben Lund, and Joanna Scott [Reader]

D-Lib Magazine (April 2005)

Resources

October 3 : Tuesday

11. Ontologies 

Lecturer: Bob Glushko

Required Readings

"Ontology 101 (1-20, through section 4)" Natalya Noy and Deborah McGuinness [Reader]

"Ontology is Overrated: Categories, Links, and Tags" Clay Shirky [Reader]

Resources

October 5 : Thursday

12. The Semantic Web 

Lecturer: Bob Glushko

Required Readings

"The Semantic Web" Tim Berners-Lee, James Hendler, and Ora Lassila [Reader]

Scientific American (May 2001)

"OWL Web Ontology Language: Use Cases and Requirements. [Sections 1 and 2, p 1-7] " [Online]

"Which Semantic Web?" Catherine Marshall and Frank Shipman [Reader]

ACM conference on Hypertext and hypermedia (2003)

Resources

October 10 : Tuesday

13. Documents and Document Models 

Lecturer: Bob Glushko

Required Readings

"Document Engineering, Chapter 2, XML Foundations" Glushko and McGrath [Reader]

"SylViA: The Syllabus Viewing Application (Overview and The SylViA Data Model)" Lisa de Larios-Heiman and Carolyn Cracraft [Reader]

"On Language Creation" Tim Bray [Reader]

XML 2005

"What's the Next Big Thing on the Web? It May Be a Small, Simple Thing -- Microformats" Knowledge@Wharton [Reader]

Resources

October 12 : Thursday

14. Databases and Data Models 

Guest Lecturer: Ray Larson of the iSchool

Required Readings

"Introduction (part of SQL for Web Nerds)" Philip Greenspun [Reader]

"Introduction to Relational Databases" Ian Gilfillan [Reader]

Database Journal (Note: You are encouraged to follow the links at the end of this article and explore further, especially "Simple SQL") (24 June 2002)

"Database Normalization" Ian Gilfillan [Reader]

Database Journal (22 March 2002)

Resources

October 17 : Tuesday

15. Personal Information Management 

Lecturer: Bob Glushko

Required Readings

"A Few Thoughts on Cognitive Overload. (pages 19-33 [Part 1 of Article])" David Kirsh [Reader]

Intellectica

"Data Unification in Personal Information Management" David Karger and William Jones [Reader]

Communications of the ACM (January 2006)

"Digital memories in an era of ubiquitous computing and abundant storage" Mary Czerwinski, Douglas W. Gage, Jim Gemmell, Catherine C. Marshall, Manuel A. Prez-Quiones, Meredith M. Skeels, and Tiziana Catarci [Reader]

Communications of the ACM (January 2006)

Resources

October 19 : Thursday

16. Enterprise Information Management [1] 

Lecturer: Bob Glushko

Required Readings

"U.S. National Archives and Records Administration. Agency Recordkeeping Requirements: A Management Guide" [Reader]

"Information Technology Controls" [Reader]

"XML-centric workflow offers benefits to scholarly publishers" Alexander B. Schwarzman, Hyunmin Hur, Shu-Li Pai, and Carter Glass [Reader]

IdeAlliance XML 2004 Conference

Resources

October 24 : Tuesday

17. Enterprise Information Management [2] 

Lecturer: Bob Glushko

Required Readings

"Bringing together content and data management: Challenges and opportunities" A. Somani, D. Choy, and J. C. Kleewein [Reader]

IBM Systems Journal 41(4) (2002)

"Semantic Integration: Tapping the Full Potential of Enterprise Data" Neil Raden [Reader]

"Chapter 4, The Information Supply Chain" Larry Downes [Reader]

The Strategy Machine

Resources

October 26 : Thursday

18. Information Architecture 

Lecturer: Bob Glushko

Required Readings

"User Interface Design Patterns: Strengths, Challenges and Future of Design Patterns" Mano Marks and Kelly Snow [Reader]

"Chapter 2, Organizing the Content: Information Architecture and Application Structure. " Jennifer Tidwell [Reader]

Designing Interfaces

"Chapter 4, Organizing the Page: Layout of Page Elements" Jennifer Tidwell [Reader]

Designing Interfaces

"Globalization, Localization, Internationalization and Translation" [Reader]

Resources

October 31 : Tuesday

19. The Searcher's Perspective on IR; User Interfaces for IR 

Guest Lecturer: Marti Hearst of the iSchool

Required Readings

"Best Practices and Future Visions for Search User Interfaces" Marc Resnick and Misha Vaughan [Reader, 781-787]

Journal of the American Society for Information Science and Technology 57(6) (2006)

"Studying User Strategies and Characteristics for Developing Web Search Interfaces (Chapters 3 and 4)" Anne Aula [Reader]

PhD Thesis, University of Tampere (December 2005)

Resources

November 2 : Thursday

20. Multimedia Search and Retrieval 

Lecturer: Bob Glushko

Required Readings

"The Image User and the Search for Images. Introduction to Art Image Access (Martha Baca, Ed)" Christine Sundt [Reader]

"Bridging the Semantic Gap in Content Management Systems: Computational Media Aesthetics" Chitra Dorai and Svetha Venkatesh [Reader]

Proceedings of COSIGN 2001: Computational Semiotics

"Multimedia Information Retrieval: What is it, and why isnt anyone using it?" Alejandro Jaimes, Mike Christel, Sbastien Gilles, Ramesh Sarukkai, and Wei-Ying Ma [Reader]

Proceedings of the 7th ACM SIGMM (2005)

Resources

November 7 : Tuesday

21. Text Processing for IR; Boolean Models 

Lecturer: Bob Glushko

Required Readings

Chapters 1 and 2 of Introduction to Information Retrieval (draft chapters from upcoming book) [Online]

Resources

November 9 : Thursday

22. Vector Models 

Lecturer: Bob Glushko

Required Readings

Chapters 6 and 7 of Introduction to Information Retrieval (draft chapters from upcoming book) [Online]

Resources

November 14 : Tuesday

23. Structure-based Models [1] 

Lecturer: Bob Glushko

Required Readings

"The Anatomy of a Large-Scale Hypertextual Search Engine" Sergey Brin and Lawrence Page [Reader]

Chapters 20 (skip section 20.2) and 21 of Introduction to Information Retrieval (draft chapters from upcoming book) [Online]

Resources

November 16 : Thursday

24. Structure-based Models [2] 

Lecturer: Bob Glushko

Required Readings

"Search Engine Optimization" [Reader]

Chapter 10 of Introduction to Information Retrieval (draft chapters from upcoming book) [Online]

"Native XML Databases in the Real World. XML 2005 (sections 1-6)" Ron Bourret [Reader]

XML 2005

Resources

November 21 : Tuesday

25. Dimensionality Reduction & Latent Semantic Indexing 

Lecturer: Bob Glushko

Required Readings

Chapter 18 of Introduction to Information Retrieval (draft chapters from upcoming book) [Online]

"How LSI Works" [Reader]

Recommended Readings

"Data-driven approaches to information access" Susan Dumais [Reader]

Cognitive Science, 27(3), 491-524 (2003)

November 23 : Thursday : Thanksgiving

Holiday: Thanksgiving 

November 28 : Tuesday

26. Applied IR and Natural Language Processing [1] 

Lecturer: Bob Glushko

Required Readings

"Introduction to the Special Issue on the Web as Corpus" Adam Kilgarriff and Gregory Grefenstette [Reader]

Computational Linguistics 29(3) (2003)

"From Babel to Knowledge Data Mining Large Digital Collections" Daniel Cohen [Reader]

D-Lib Magazine (March 2006)

"The Elusive Goal of Machine Translation" Gary Stix [Reader]

Scientific American (March 2006)

Resources

November 30 : Thursday

27. Applied IR and Natural Language Processing [2] 

Lecturer: Bob Glushko

Required Readings

"A Plan for Spam" Paul Graham [Reader]

"Mining and Summarizing Customer Reviews" Minquing Hu and Bing Liu [Reader]

Proceedings of the tenth ACM SIGKDD (2004)

"Web question answering: Is more always better?" Susan Dumais, Michele Banko, Eric Brill, Jimmy Lin, and Andrew Ng [Reader]

Proceedings of the 25th annual international ACM SIGIR (2002)

Resources

December 5 : Tuesday

28. The Business and Professions of IO and IR (Alumni Guests) 

December 7 : Thursday

29. Course Review 

Lecturer: Bob Glushko

Resources

December 12 : Tuesday

Final Exam 

9:00am-12:00pm, 202 South Hall

last updated on 2006-12-07 by RJG