CSCI 444 Spring 2026: NLP
🐝 Spring 2026 ⏰ Mon / Wed 10:00 - 11:50a 📍 DMC 151
Instructor: Xiang Ren
Announcements
See Brightspace.
Syllabus is here, see below for most updated schedule.
Summary
This class is all about language models: the fundamentals, spanning from simple architectures to modern Transformer-based neural architectures underlying large language models.
Calendar + Syllabus
| Week | Date | Class Topics | Readings | Work Due |
|---|---|---|---|---|
| 1 | Jan 12 | Introduction and Course Overview; n-gram models | ||
| Jan 14 | n-gram models (cont.) | J&M, Chap 3; | ||
| 2 | Jan 19 | Martin Luther King’s Birthday Holiday | HW1 Released | |
| Jan 21 | n-grams (cont.) & Word Embeddings | J&M, Chap 4; | ||
| 3 | Jan 26 | Project Pitches | ||
| Jan 28 | Word Embeddings |
J&M, Chap 4; J&M, Chap 5; |
||
| 4 | Feb 2 | Word2Vec (contd.) and Logistic Regression | J&M, Chap 5; word2vec Explained; | Group Formation Deadline; |
| Feb 4 | Feedforward Neural Nets | J&M, Chap 6; | HW1 Due | |
| 5 | Feb 9 | Recurrent Neural Networks | J&M, Chap 6; J&M, Chap 13; | HW2 Released |
| Feb 11 | Seq2Seq and Attention | J&M, Chap 13; | Project Proposal Due | |
| 6 | Feb 16 | Presidents Day | ||
| Feb 18 | Transformer and Pretraining | J&M, Chap 8; | ||
| 7 | Feb 23 | Language Generation | J&M, Chap 7.4; J&M, Chap 12.4; | |
| Feb 25 | Language Generation | J&M, Chap 7.4; J&M, Chap 12.4; | ||
| 8 | Mar 2 | Post-training (cont.) & Prompting | J&M, Chap 7.3; | |
| Mar 4 | Pytorch for Transformers and Huggingface | HW2 Due | ||
| 9 | Mar 9 | Flipped Classroom - Project Discussions | J&M, Chap 12; | HW3 Released |
| Mar 11 | Model Evaluation |
Bloom - BigScience Clip- Radford et al., 2021 |
||
| 10 | Mar 16 | Spring Break | ||
| Mar 18 | Spring Break | Hovy & Spruit, 2016 | ||
| 11 | Mar 23 | Cutting Edge Topics in NLP (Guest Lecture by Brihi Joshi: "Introduction to LLM Explainability" ) | ||
| Mar 25 | Paper Presentation and Discussion I | Project Midterm Report Due | ||
| 12 | Mar 30 | Paper Presentation and Discussion II |
Bloom - BigScience Clip- Radford et al., 2021 |
|
| Apr 1 | Paper Presentation and Discussion III | |||
| 13 | Apr 6 | Cutting Edge Topics in NLP | ||
| Apr 8 | Cutting Edge Topics in NLP (Guest Lecture by Matt Finlayson: "Accountability and Forensics in the Era of Closed Language Models" ) | HW 3 Due | ||
| 14 | Apr 13 | Paper Presentation and Discussion IV | ||
| Apr 15 | Paper Presentation and Discussion V | |||
| 15 | Apr 20 | Semester Project Presentations I | Project Presentations due | |
| Apr 22 | Semester Project Presentations II | |||
| 16 | Apr 27 | Semester Project Presentations III | ||
| Apr 29 | Semester Project Presentations IV | |||
| FINAL | May 11 | Project Final Report due Serves as final exam |
This calendar is subject to change. More details, e.g. lecture slides will be added as the semester continues. All work (except the project final report) is due on the specified date by 11:59 PM PT. See the syllabus for more details.
Assignments and Grading
There will be three components to course grades:
- Homeworks (18%).
- 6% X 3: There will be three coding homework assignments based on the topics of the class.
- Semester Project (55%).
- Each student will do a group class project based on the topics covered in the class. Students will propose their own project, do the research and build a proof-of-concept, create a demonstration (e.g. video) of the proof-of-concept, and present the project in their report.
- Proposal + Pitch: 10%
- Midterm Report: 15%
- Final Project Presentation: 15%
- Project + Final Write-up: 15%
- Paper Review
(12%).
- Students will write a research paper review to explain concepts underlying natural language processing in their own words (Learning Objective O2) and present the paper in class. The course explores topics through a series of assigned readings in the form of research papers and book chapters. Also, the semester project would require a literature review. Students will select one reading option and submit a two-page summary of that reading and present the papers in teams of 3 and drive discussion in the class. Reviews will be assessed based on answering a small set of questions, to be released at the time of the paper assignment, clearly and correctly. In most cases, each question will warrant at minimum a paragraph to answer.
- Class Participation (15%)
- Each student’s engagements in course discussions during class and during project discussions.
All written assignments related to the final project should use the standard *ACL paper submission template.
Late Days
The course will allow for a budget of 5 Late Day Tokens per student. These tokens can be expended on homeworks, the paper review, and project deliverables (NOT presentations or final project report) to extend the deadline, one day at a time, for a student without incurring a late penalty. These tokens should be used with no justification or explanation for taking the late time required (i.e., you do not need to explain your reason). Going over budget (e.g., turning things in late with no Late Day Tokens to expend) will incur grade penalties of 5% per day late. To ensure reasonable grading turnarounds and discussions of solutions, any assignment turned in 8 days late or more will receive an automatic zero regardless of the use of Late Day Tokens. For project teams, Late Day Token expenditures are on a per-student basis (i.e., if a team of 2 turns in their midterm report one day late, a member expending a Late Day Token will receive a 0% late penalty, while a member not expending a Late Day Token will receive a 5% late penalty). There are no refunds for late days: unused late days cannot be converted into credit of any kind.
Note: Please familiarize yourself with the academic policies and read the note about student well-being.
Pre-Requisites
Students are required to have taken
- CSCI 170 and
- 1 from (CSCI 104 or CSCI 114) and
- 1 from (MATH 225 or EE 141) and
- 1 from (EE 364 or MATH 407 or BUAD 310 or ISE 225) Recommended Preparation: Fluency with Python programming on the level of ITP 216 or TAC 216