Allyson Ettinger

Section links:

Current projects
Relevant publications (with PDFs)
Descriptions of computational work

Bridging linguistics and computation

My work bridges linguistics and language technology in both directions:


Current projects

My two threads of current research include not-yet-published work to be included in my dissertation. (Numbering is for reference in computational project descriptions below.)

(1) Computational psycholinguistic modeling drawing on current engineering tools
I am currently integrating psycholinguistic hypotheses with computational methods like machine learning and vector space models (cutting-edge methods popular in current NLP systems) to create rich computational frameworks for modeling human language processing. My interest is in processing at syntactic and semantic levels, and my current focus is on understanding the processing of semantic role reversals, and what this phenomenon tells us about the mechanisms underlying human language comprehension.

(2) Probing for compositional information in sentence representations from neural networks
I am also applying a method (introduced in (4) below) that uses carefully controlled syntactic and semantic variation, together with analysis methodology from cognitive neuroscience, to analyze opaque (vector) sentence representations produced by neural network systems in NLP. Specifically, I am designing tests to probe these sentence representations for compositional meaning information, in order to guide toward language technologies that produce accurate, compositional sentence representations.


Relevant publications

Below are publications reflecting work that I have completed in computational psycholinguistic modeling, in NLP/engineering, or that otherwise involved using computational methods for linguistic research. (For additional publications, see here.)

(3) Using word vectors to account for associative effects in sentence processing
"Modeling N400 amplitude using vector space models of word representation". Ettinger, A., Feldman, N.H., Resnik, P., & Phillips, C. (2016). Proceedings of the 38th Annual Conference of the Cognitive Science Society. [PDF]

(4) Testing for compositional information in sentence vector representations
"Probing for semantic evidence of composition by means of simple classification tasks". Ettinger, A., Elgohary, A., & Resnik, P. (2016) Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP, ACL 2016. Recipient of Best Proposal Award. [PDF]

(5) Leveraging translation information to learn sense-specific representations of polysemous words
"Retrofitting sense-specific word vectors using parallel text". Ettinger, A., Resnik, P., & Carpuat, M. (2016) Proceedings of NAACL-HLT. [PDF]

(6) Testing capacity of word vectors to simulate human semantic priming
"Evaluating vector space models using human semantic priming results" Ettinger, A. & Linzen, T. (2016) Proceedings of the First Workshop on Evaluating Vector Space Representations for NLP, ACL 2016. [PDF]

(7) Using information theory to analyze morphological processing and prediction
"The role of morphology in phoneme prediction: Evidence from MEG". Ettinger, A., Linzen, T., Marantz, A. (2014). Brain and Language 129, 14-23. [PDF]

(8) Using discourse insights to improve automatic zero pronoun resolution
"Dialogue focus tracking for zero pronoun resolution". Rao, S., Ettinger, A., Daumé III, H., Resnik, P. (2015). Proceedings of NAACL HLT 2015. [PDF]


Descriptions of computational work

Nearly all of my work employs computational methods, with the particular methods covering a broad range.

Neural networks: My current work in (1) involves low-level designing, building, and analysis of neural network models to test psycholinguistic hypotheses.

General machine learning: In addition to design and development of neural networks in project (1), a majority of my work leverages machine learning in some way.

Working with corpus data: Many of my projects involve working with corpora.

Development of sentence generation system: One of the more challenging computational projects that I have undertaken was designing and building a sentence generation system to flexibly produce controlled sentence sets for the analysis in (2). This system allows for specification of semantic, syntactic, and lexical constraints, and it produces diverse sentences based on those constraints.

Brain data analysis and information theory: My earlier experience with computational methods came from my work on (7), analyzing complex datasets of recorded brain activity. This work also involved analysis using information-theoretic measures of surprisal and entropy.