Here are some ideas I stole from a friend of mine, including some actual titles for projects that students in his class did.
Given a web page that (probably) contains glossary entries and definitions, extract the fields.
Given multiple database with addresses, create a unified database of places.
Create a more accurate battery power indicator.
Extract titles, authors, references from pdf files.
Self organization of a peer-to-peer network.
Predict server response time for nodes in a wireless network.
In RL, there are several algorithms that trade off exploration and exploitation in a theoretically motivated way. Evaluate them empirically.
Compare existing RL techniques for "mountain car" or Tetris.
Figure out how to beat a fixed set of TAC agents.
Compare techniques for merging probability distributions theoretically.
To solve multiple choice synonym questions, we've shown that multiple experts is a smart way to do this. Training is done using supervised data. Can the multiple modules be used to train each other? ("Labeling via collectives of sufficiently accurate modules").
How about modules for RL? Is there an advantage for doing policy search, table, neural net all together?
Applications:
Natural language dialog
robotics
financial trading
network diagnosis
object recognition
combining speech and commands and images
problem solving (Sokoban)
video games
"Comparing Kernel-based Learning Method with Application to Face Recognition"
"Image-based Stress Recognition from a Model-based Face Tracking System"
"Chessman Position Recognition Using Artificial Neural Networks"
"Latent Learning in Agents"
"LLE and ISOMAP Analysis of Robot Images"
"Human Identification using Silhouette Gait Data"
"Reconstruction of Walking People Images by Principal Component Analysis"
"Empirical Analysis of Predictive Algorithms for Recommender Systems"
"Estimating Constraint Costs using Regression Trees"
"Extending Implicit Negotiation to Repeated Grid Games"
"A New Evolutionary Algorithm for Multi-objective Optimization Problems"
"Evolutionary Learning Networks"
"Learning Change Patterns in Software Engineering, Practicable or Not?"
"An Empirical Comparison between ANN and SVM as Classifiers"
"Dynamic Topic Analysis: Classification Without Established Classes using Distance Thresholds"
"Evaluation of Kernel function Modification in Text Classification Using SVM"
"Using TF-IDF to Determine Word Relevance in Document Queries"
"Stock Price Prediction from Natural Language Understanding of News Headlines"
"An Evaluation of Kea: An Automatic Keyphrase Extraction Algorithm"
"ID Identification in Online Communities"
"Document Quality Prediction with Statistical Textual Features"