LIN392 Analyzing Linguistic Data: Syllabus

Course Information

Instructor Contact Information

  • Katrin Erk

    • office hours: Tues 1-2, Thurs 9:30-10:30 and 1-2

  • office: Liberal Arts Building 4.734

  • email: katrin dot erk at mail dot utexas dot edu


Graduate standing.

Syllabus and Text

This page serves as the syllabus for this course.

Official course textbooks:

R.H. Baayen (2008): Analyzing Linguistic Data: A Practical Introduction to Statistics Using R. Cambridge University Press.

P. R. Hinton (2004): Statistics Explained: A Guide for Social Science Students. Psychology Press; 3rd edition.

We will also make use of other readings, which will be made available on the Schedule page.

Exams and Assignments

Assignments will be updated on Canvas. A tentative schedule for the entire semester is posted on the Schedule page. Readings and exercises may change up a week in advance of their due dates. There is an end-of-term project for the course, where students will be expected to choose a dataset that they intend to analyze. Details on the requirements for the project are given on the Course Project page.

Philosophy and Goal

Many research topics in linguistics can benefit from sophisticated statistical analysis of language datasets. This course will introduce fundamental concepts that will enable students to formulate quantitatively-oriented research questions and answer them with appropriate visualization, modeling and testing. Students will learn these techniques, apply them to data sets in class, and generalize them to a dataset of their own choice.

We use the R programming language, which allows much more flexible and customizable ways of performing such exploration and analysis, compared to statistical packages based on point-and-click interfaces. It also forms a strong basis for using more complex modeling techniques than are covered in this course—including writing one's one code to do so.

Content overview

This course provides hands-on introduction to statistics for language, using the R programming language. Using data from existing linguistic studies, we will study the following topics:

  • data exploration through visualization

  • probability distributions

  • mean and standard deviation of a single dataset

  • comparing pairs of datasets and hypotheses:testing for statistical significance

  • regression modeling

  • clustering for data exploration

Course Requirements

  • Homeworks (60% overall, 12% each): 5 homeworks will be assigned during the course

  • Project (40% overall):

    • Project proposal (5%)

    • Project progress report (5%)

    • Project presentation (5%)

    • Project final report (25%)

For information on the homework assignments, see Canvas. For information on project requirements, see the Course Project page.

Extension Policy

If you turn in your assignment late, expect points to be deducted. Extensions will be considered on a case-by-case basis. If you anticipate that you will need an extension for some assignment, let me know in advance.

By default, 5 points (out of 100) will be deducted for lateness, plus an additional 1 point for every 24-hour period beyond 2 that the assignment is late. For example, an assignment due at 2pm on Tuesday will have 5 points deducted if it is turned in late but before 2pm on Thursday. It will have 6 points deducted if it is turned in by 2pm Friday, etc.

Even if you are late for some assignment, you should definitely turn it in, and you will get some credit for your work, even though some points may be deducted. But it is crucial for your learning progress that you do all the coursework.

Academic Dishonesty Policy

You are encouraged to discuss assignments with classmates. But all written work must be your own. Students caught cheating will automatically fail the course. If in doubt, ask the instructor.

Notice about students with disabilities

The University of Texas at Austin provides upon request appropriate academic accommodations for qualified students with disabilities. Please contact the Division of Diversity and Community Engagement, Services for Students with Disabilities, 512-471-6259,

Notice about missed work due to religious holy days

By UT Austin policy, you must notify me of your pending absence at least fourteen days prior to the date of observance of a religious holy day. If you must miss a class, an examination, a work assignment, or a project in order to observe a religious holy day, you will be given an opportunity to complete the missed work within a reasonable time after the absence.

Emergency Evacuation Policy

    • Occupants of buildings on The University of Texas at Austin campus are required to evacuate buildings when a fire alarm is activated. Alarm activation or announcement requires exiting and assembling outside.

    • Familiarize yourself with all exit doors of each classroom and building you may occupy. Remember that the nearest exit door may not be the one you used when ent ering the building.

    • Students requiring assistance in evacuation shall inform their instructor in writing during the first week of class.

    • In the event of an evacuation, follow the instruction of faculty or class instructors.

    • Do not re-enter a building unless given instructions by the following: Austin Fire Department, The University of Texas at Austin Police Department, or Fire Prevention Services office.

    • Link to information regarding emergency evacuation routes and emergency procedures can be found at:

Behavior Concerns Advice Line (BCAL)

If you are worried about someone who is acting differently, you may use the Behavior Concerns Advice Line to discuss by phone your concerns about another individual's behavior. This service is provided through a partnership among the Office of the Dean of Students, the Counseling and Mental Health Center (CMHC), the Employee Assistance Program (EAP), and The University of Texas Police Department (UTPD). Call 512-232-5050 or visit