LearningBitbyBitS10
Search:
Syllabus / LearningBitbyBitS10

Learning Bit by Bit

Instructor: Heather Dewey-Hagborg
Contact: hdh216@nyu.edu
Class: Thursdays 3:30 - 6pm
Room 447
Office Hours: Tuesdays 2:30 - 4:30 ~ or by appointment ~

From mailing a letter to shopping online to walking down a city street, applications of machine learning have penetrated our daily experience. Our faces, our voices, the emails we write, the products we buy, the content we choose, all constitute our data portrait: aggregates of information that are meticulously sifted, sorted and searched by algorithms behind the scenes.

This class will take a critical tour of the technologies that learn from this data. We will look at the information that defines us and how it is analyzed using techniques common to biology, computer science, robotics and surveillance.

We will cover both the theory and the implementation of machine learning techniques that are commonly used today in applications of text analysis, web search, face recognition, speech recognition, hand writing analysis, and content suggestion. We will discuss the concept of a data portrait and how heuristics and inductive bias shape the way we are seen. Finally, we will apply these techniques to create projects of our own.

This class will involve weekly readings, as well as in and out of class work on individual and group projects engaging with the concepts. Students will be encouraged to implement projects in a variety of media including electronics, robotics, performance, installation, writing, websites, or software.


Course Goals

Students will learn the concepts behind common machine learning techniques and apply these ideas to projects of their own design.

Expectations

Assignments will include weekly readings and projects. Project mediums will be left open to student interest. Students will be expected to collaborate, to document their work, to make presentations and to discuss their ideas regularly in class.

Grading

Homework/preparedness 50% Class Participation 20% Final Project 30%

Topics Include

  1. Content Suggestion - Collaborative Filtering - Clustering
  2. Speech Recognition - Natural Language Processing - Markov Models
  3. Neural Networks, Supervised and Unsupervised Learning Techniques
  4. Face Recognition - Principle Component Analysis

Format

We will spend 3 weeks on each topic grouping. Class time will include a lecture component as well as a lab. There will be weekly technical and critical readings. For each topic students will complete a project either individually, in groups or as a class, depending on student interest. The class will conclude with final project presentations.

Books

Our Readings will come from the following books. All books will be available on reserve in BOBST library or will be distributed by me as a handout.
Some books are less expensive and should be purchased if possible. These books are denoted with *.
Some books I have found to be fantastic references. These books are denoted with ~.
If you are interested in pursuing these topics beyond this class I recommend purchasing one or more of these typically more expensive text books.
Link to course reserves list on Bobst: http://tinyurl.com/ydh9z9j

--technical
Handbook of Fingerprint Recognition
Author: Davide Maltoni, Dario Maio, Anil K. Jain, Salil Prabhakar
Publisher: Springer, Copyright Date: 2003

*Programming Collective Intelligence
Author: Toby Segaran
Publisher: O'Reilly, Copyright: 2007

*~Natural Language Processing With Python
Author: Ewan Klein, Edward Loper, Steven Bird
Publisher: O'Reilly, Copyright Date: 2009
Online here:
http://www.nltk.org/book

~Speech and Language Processing
Author: Daniel Jurafsky and James H. Martin
Publisher: Pearson, Prentice Hall, Copyright: 2009, 2nd Edition
Ch. 1, 7, 9 online
Ch. 4 online

~Artificial Intelligence
Author: George F. Luger
Publisher: Addison Wesley, Copyright Date: 2009, 6th Edition
(there is a also a copy of this book in the ITP reading room)

Reliable Face Recognition Methods
Author: Harry Wechsler
Publisher: Springer, Copyright date: 2006

Handbook of Face Recognition
Edited By: Stan Z. Li, Anil K. Jain
Publisher: Springer, Copyright Date: 2005

Introduction to Machine Learning
Author: Ethem Alpaydin
Publisher: MIT Press, Copyright date: 2010

--critical
Surveillance: Power, Problems, and Politics
Edited by: Sean P. Hier and Josh Greenberg√ Publisher: UBC Press, Copyright Date: 2009

*Niche Envy: Marketing Discrimination in the Digital Age
Author: Joseph Turow
Publisher: MIT Press, Copyright Date: 2006
This is available as an ebook from Bobst
Also pdf online

Mind Design II
Edited By: John Haugeland
Publisher: MIT Press, Copyright Date: 1997

~The Philosophy of Mind
Edited By: Brian Beakley and Peter Ludlow
Publisher: MIT Press, Copyright Date: 2006, 2nd Edition

*Does Technology Drive History?
Edited By: Merritt Roe Smith, Leo Marx
Publisher: MIT Press, Copyright Date: 1994

Classes

Please bring your laptop to class (if you have one) as we will need them for in-class labs.
To make lab time as efficient as possible make sure that you have the following software packages installed before class:

Python 2.x the latest version
You can try the academic download of the Enthought Python distribution which bundles most of the libraries we will be using in this class:
http://www.enthought.com/products/edudownload.php
Included Packages:
http://www.enthought.com/products/epdlibraries.php

Java JDK 6 Update 18 or better
Ant 1.6.0 or better

1/21 Class 1. Introduction

-Introductions, what we hope to accomplish, go over syllabus
-Fingerprint Lab

Homework:
Collective Intelligence Ch. 1-2
(Feel free to skim technical sections on programming with Python if you already feel comfortable)
To follow along with the pydelicious examples you may need to install python setuptools:
http://pypi.python.org/pypi/setuptools#cygwin-mac-os-x-linux-other
also, please not there is a typo on p. 11 of the book. The result of the sim_distance calculation should be
0.29429805508554946 NOT .148148148148
Also on references to the key "href" should be changed to "url"

Surveillance: power problems and politics Ch. “bio-benefits”
Optional: Handbook of Fingerprint Recognition Ch. 1
Also, If you don’t have Python installed on your laptop please install by next class.

Topic 1. Collaborative Filtering

•1/28 Class 2. Collaborative Filtering
-Discuss readings
-Lecture on collaborative filtering
-Students will be assigned to collaborative filtering groups
Filtering the Netflix dataset Lab

Homework:
-Group Project:
Come up with a group project proposal, begin initial work

Collective Intelligence Ch. 3
Niche Envy Ch. 1-3

•2/4 Class 3. Collaborative Filtering continued
-Discuss readings
-Lecture on clustering
-Lab: in-class time to work on group projects

Homework:
Niche Envy Ch. 4, 7, 8
-Group project:
Complete projects

•2/11 Class 4. Collaborative Filtering continued
-Share collaborative filtering projects
-Discuss reading

Homework:
Speech And Language Processing Ch. 1
Natural Language Processing With Python Ch. 1, 2, 5
Follow along with the book installing the software and corpora. Make sure you have NLTK, its dependencies, and the corpora installed before next class.
Feel free to skim technical sections on programming with Python (ch. 1-2) if you already feel comfortable.

Topic 2. Speech and Language

•2/18 Class 5. Natural Language
-Discuss readings
-Introductory lecture on Natural Language Processing.
-Natural Language Lab using NLTK

Homework:
-Group project:
Come up with a group project proposal, begin initial work

Speech And Language Processing Ch. 4.0 - 4.3 (inclusive), Ch. 9
Note: This is especially difficult reading, it's OK to get the big picture and not all the details.

Also, If you don’t have the latest Java JDK and Ant installed on your laptop please install by next class.
For mac you must nstall developer tools from the disc that came with your computer or download from developer.apple.com
can use Software Update to update to the latest Java:
http://support.apple.com/kb/HT1338?viewlocale=en_US

All other platforms:
http://java.sun.com/javase/downloads/index.jsp
http://ant.apache.org/

Please download the source distribution of Sphinx4:
http://cmusphinx.sourceforge.net/sphinx4/#source
That would be sphinx4-{version}-src.zip: provides the sources, documentation, demos, unit tests and regression tests.

•2/25 Class 6. Speech Recognition
-Discuss readings
-Introductory Lecture on Speech recognition
-Speech Lab using Sphinx4

Homework:
-Group project:
Complete projects
Alan Turing, "Computing Machinery and Intelligence":
http://cogprints.org/499/0/turing.html
Optional: Take a look at Collective Intelligence Ch. 6 (Spam filtering)

•3/4 Class 7. Projects
-Share Speech and Language projects
-Discuss reading
-Choose groups for next week

Homework:
Artificial Intelligence Ch. 11.1 – 11.3 (History, Perceptrons & Backpropagation)
John Searle, "Minds, Brains, and Programs"
Available online through MIT Cognet: in Mind Design II (scroll down and click on Ch. 7 for PDF)

Next weeks lab will require each group to have the following set of components:

-9v power supply or battery with connection to breadboard
-Breadboard & hookup wire
-Voltmeter
-LM741 op amp
-3 CDS photocells
-4 10k potentiometers
-Resistors : 470, 4.7k, 10k ohm
-2n2222 NPN transistor (or similar)
-LED or an output device of your choice (ie. Buzzer, relay controlling x, etc.) note: different output devices may require modifications of resistor values listed above so please come prepared

Topic 3. Neural Networks

•3/11 Class 8. Neural Nets
-Discuss readings
-Intro lecture on neural nets, perceptrons
-Neural circuits lab

Homework:
Artificial Intelligence Ch. 11.4 - 11.7 (Competition, Coincidence & Attractors)
The Philosophy of Mind Ch. 78 & 83

-Group project:
Come up with a group project proposal, begin initial work

•3/25 Class 9. Neural Networks continued
-Discuss readings
-Unsupervised neural networks lecture
-Discuss project ideas
-Lab: work on group projects

Homework:
-Group project:
Complete projects
- Watch Ray Kurzweil Video:
http://mitworld.mit.edu/video/422/
Optional: Skim Collective Intelligence Ch. 4 (Neural Net search engine)

•4/1 Class 10. Neural Networks continued
-Students present neural network projects
-Guest Speaker Phillip Stearns

Homework:
-Download, uncompress and install images.zip and pyfacesCmdLineVer.zip eigenface library from pyfaces: http://code.google.com/p/pyfaces/downloads/list The pyFaces module depends on Python Imaging Library and Numpy

-Readings for the next class:
Handbook of Face Recognition Ch. 1

-Final projects:
Prepare to propose an final or facial recognition project idea next week

Topic 4. Face Recognition

•4/8 Class 11. Face Recognition
-intro lecture on Face Recognition using PCA
-discuss final project ideas
-ITP Faces Lab

Homework:
Reading of your choice from Reliable Face Recognition Methods or Handbook of Face Recognition
Be prepared to present a 5 min. discussion of what you read to the class

Optional:
Really great tutorial on Principal Components Analysis. Python code of example in the text posted to the code wiki.
How Stuff Works on Facial Recognition:
http://electronics.howstuffworks.com/gadgets/high-tech-gadgets/facial-recognition.htm
Introduction to Machine Learning Ch. 6 technical discussion of dimensionality reduction techniques
-Final projects:
Work on final or facial recognition projects

•4/15 Class 12. Face Recognition continued
-Each student presents 5 min. discussion of chapter read for homework
-Ensemble Methods
-discuss final projects
-lab: work on final projects

Homework:
-Final projects:
Work on final or facial recognition projects

•4/22 Class 13. Face Recognition continued
-Guest speaker Todd M. Gureckis
-Lecture on Analyzing and Testing
-In class discussion about progress and problems and lab on final projects

Homework:
-Final projects:
Finish final or facial recognition projects

•4/29 Class 14. Final Presentations

Search
  Page last modified on April 22, 2010, at 11:23 AM