LIN 353C course projects will usually be done in groups of 2 students. If you want to work in a larger or smaller group, you need prior
approval
of the instructor. Section "Project topic suggestions" gives a list of possible project topics. I would suggest that you choose something from this list, but you can also choose something different. All projects need to address an NLP problem and involve programming. All projects need to be evaluated in some form. As you need some initial results to discuss in your intermediate progress report, I suggest that you start out with some simple rule-based approach, then improve on it, maybe with some change in technique, in the second half of the semester.
Project topic suggestions
Previous projects have also included:
Working in groupsIf you are working in a group:
RequirementsYou will write two documents about your course project, the intermediate report and the final report. You will also have a chance to discuss your project in the last week of classes. But this discussion will not be graded. Intermediate reportAt the time of the intermediate report, you need to have some system that addresses your problem. This can be a very simple, rule-based system, It need not be the final system.
Final reportThe final report is about your final system. This should improve over the system as it was at the time of the intermediate report, either by using a different technique, or by using the same technique in a more sophisticated way.
ResourcesAre you building a supervised classification system? Then check the NLTK chapter on classification, chapter 6.Whatever kind of system you build, you will need to do an error analysis. Counter to its somewhat negative-sounding name, an error analysis is not just a sad list of errors, but an in-depth look at how your system deals with the language data it sees: both what it does right and where it does something wrong. For a discussion of the general spirit of error analysis, check Emily Bender's blog post on "putting the linguistics in computational linguistics" |