I'm a PhD candidate in Linguistics at the University of Maryland, focusing on computational psycholinguistics.
I am advised by Profs. Philip Resnik and Colin Phillips.
My research is supported by the NSF GRFP.
Originally from the Bay Area, I graduated from UC Berkeley with bachelor's degrees in Cognitive Science and Computer Science.
There, I closely collaborated with Dr. Stephan Meylan on projects in Profs. Mahesh Srinivasan and Tom Griffiths' groups.
Afterwards, I worked as a software engineer at Amazon Web Services in Boston and decided to stay on the East Coast for grad school. I generally accept he/him pronouns.
Prediction is a key principle of human language processing. It has also been key to developing language models that produce fluent text and effectively estimate next-token probabilities.
My research involves developing and evaluating models of prediction in humans, based on what we know about machines. I'm focusing on three major directions: estimating probability above (and below) the level of individual "words,"
comparing human and model-based estimates of predictability, and modeling the cognitive mechanisms behind predictive processing.
Publications
Chiebuka Ohams*, Sathvik Nair*, Shohini Bhattasali, & Philip Resnik. A predictive coding model for online sentence processing. (* denotes equal contribution) Journal of Memory and Language, 2026[link][preprint]
Katherine Howitt, Sathvik Nair, Allison Dods, & Robert Hopkins. Generalizations across filler-gap dependencies in neural language models. CoNLL 2024[link][pdf]
Eun-Kyoung Rosa Lee, Sathvik Nair & Naomi Feldman. A Psycholinguistic Evaluation of Language Models' Sensitivity to Argument Roles. EMNLP Findings 2024[link][pdf]
Sathvik Nair & Philip Resnik. Words, Subwords, and Morphemes: What Really Matters in the Surprisal-Reading Time Relationship?
EMNLP Findings 2023[link][pdf]
Stephan Meylan, Sathvik Nair, & Tom Griffiths. Evaluating Models of Robust Word Recognition with Serial Reproduction. Cognition, 2021[link][pdf]
Sathvik Nair, Mahesh Srinivasan, & Stephan Meylan. Contextualized Word Embeddings Encode Aspects of Human-Like Word Sense Knowledge. CogALex @ COLING 2020[link][pdf]
Peer-Reviewed Conference Presentations
Eun-Kyoung Rosa Lee, Sathvik Nair & Naomi Feldman. (Un)likely words in context: A divergence between humans and large language models.
[abstract] [talk recording]
Sathvik Nair., Cassandra Jacobs, Philip Resnik & Colin Phillips. Words, Subwords, and Morphemes: Surprisal Theory and Units of Prediction.
Poster at HSP 2025 [abstract]
Sathvik Nair, Katherine Howitt, Allison Dods & Robert Hopkins. LMs are not good proxies for human language learners.
Talk at BUCLD 2024
Sathvik Nair & Philip Resnik. Words, Subwords, and Morphemes: What Really Matters in the Surprisal-Reading Time Relationship?
Talk at SciL 2024, [abstract]
Sathvik Nair., Colin Phillips, & Philip Resnik. Words, Subwords, and Morphemes: Surprisal Theory and Units of Prediction.
Poster at HSP 2024 [abstract]
Katherine Howitt, Sathvik Nair, Allison Dods & Robert Hopkins (2024) Acquiring generalizations across unbounded dependencies: How language models can provide insight into first language acquisition. Poster at MASC-SLL 2024
Sathvik Nair, Konstantine Kahadze & Philip Resnik. The Impacts of Subword Tokenization on Psycholinguistic Modeling. Poster at MASC-SLL 2024
Sathvik Nair, Shohini Bhattasali, Philip Resnik & Colin Phillips. How far does probability take us when measuring psycholinguistic fit? Evidence from Substitution Illusions and Speeded Cloze Data.
Poster at HSP 2023[abstract]
Collaborators, Mentors, Friends, and other Co-Conspirators
Research is never done in a vacuum, and publications don't reflect everyone who's intellectually influenced me. Here are some of those people.
Many are connected with UMD's CLIP Lab and Language Science Center, which bring together researchers approaching computation and language (more broadly) from all sorts of perspectives.
Other projects (not just academic) and information.
If you're applying to grad school in (computational) cognitive/language sciences and/or the GRFP, please reach out! I'm also happy to share my materials upon request.
I've helped out with a couple software tools for open and reproducible science:
How Biases in Language get Perpetuated by Technology
– Towards Data Science article on personal project investigating gender, racial, and religious bias through analogy evaluation with static word embeddings (GloVe)
Letters for Black Lives- I was involved with writing & curating resources for the South Asian community on anti-Blackness, including Hindi translation.
Outside of work, I enjoy (listening to, practicing, performing, composing, curating showcases with) music of all kinds, making and sharing home-cooked meals, and exploring historical and natural spots in the DC area and the East Coast via foot and public transit.