Assignments: LIN350 Computational Semantics


Assignments

Assignments will be made available here. There will be 4 assignments.

Assignments are due at the end of their due date (midnight).


Assignments


to be added.


Course Project Information

See below for examples of previous course projects!

By default, course projects should be done by teams of 2 students; however, projects done by 1 or 3 students are possible with prior approval of the instructor.

Initial project description

This is a 1-2 page document (single-spaced, single column) that describes what your project will be about. It needs to contain the following information:

  • Research questions: What are the main questions that you want to answer, the main language phenomena you want to address, or the main ideas you want to explore?
  • Method: What distributional model will you use, or what kinds of rules are you planning to state? Be as detailed as you can. (Yes, I know you will not have worked out every detail at this point, but strive to work out as many as you can.)
  • Data: If you do a distributional project, it is vital that you figure out as early as possible what data you can use to learn your model. Is there enough data? Is it freely available? Do you have to contact someone to get it? 

Intermediate report

This is a 1-2 page document (single-spaced, single column) that describes what the status of your project is at this point. This is a revised version of your initial project description. It needs to contain the following information:

  • Research questions: any changes?
  • Method: any changes?
  • Status:
    • Describe the data that was obtained: source, size, anything else that is relevant
    • Describe at least two (smaller, and preliminary) concrete results that you have at this point

You also need to take into account the feedback that you got on the Initial project description.

Short presentation

This is a short presentation to the class. You should discuss:

  • Research questions/linguistic phenomena/main ideas you wanted to model
  • Why is this relevant? (Spend a lot of time on the research questions and their relevance. Describing the big picture is important!)
  • Data, if you are using a data-driven approach: source, size
  • Results

You will need to prepare slides for this, which you submit to the instructor ahead of time.

Final report

This is a 4-5 page document (single-spaced, single column) that describes the results of your project. This is a revised version of your intermediate project description. It needs to contain the following information:

  • Research questions/linguistic phenomena covered/main ideas pursued
  • Data: source, size, other relevant statistics
  • Method
  • Findings

If you build on previous work, you need to discuss it, and give references. Published papers (at conferences, in journals) go into the references list at the end of the paper. Links to blog posts and the like go in a footnote. Also, links to websites containing data go in a footnote, not in the references list.

You need to take into account the feedback that you got on the Initial project description and Intermediate report. 


Some previous course projects

If you need inspiration for your course projects, here is what some people have previously done:
  • Word representations in different contexts, for example:
    • Are words used differently in structurally complex sentences than in structurally simple sentences?
    • Are words used differently in metaphor-rich contexts (such as literature) compared to more literal contexts?
  • How do words differ in their "geographic" representations, that is, in which places on earth they are used?
  • How could we represent words distributionally such that their representations combine into phrase representations in a reasonable way?
  • Can idioms be assigned a reasonable distributional representation?
  • Positive and negative product reviews seen through the lense of vector space models
  • Comparing general and specific terms (hyponyms and hypernyms), like "animal" and "dog", in vector spaces
  • Prominent words in top-10 songs over the decades: What has changed?
  • How do people use emojis? That is, how do their contexts compare?
  • Encoding and solving a logic puzzle (using logical form)
  • How to solve anaphora resolution (Sandy hit Kim. He was angry.Who is "he"?) in Discourse Representation Structures

Comments