Research Activities


Collaborating with Dr Sumeet Agarwal at IIT Delhi on computational linguistics projects in Hindi and Marathi.

Also started a collaboration with Dr Eva Wittenberg on light verb constructions in Hindi.

I have recently begun working on DCOMA (Digital Corpus of Old Marathi).


2009-2013: Graduate Research Assistant for the National Science Foundation project “A Multi-Representational and Multi-Layered Treebank for Hindi/Urdu”. Contributed to linguistic analysis of Hindi syntax and semantics for the development of annotation guidelines, specifically empty categories, complex predicates, mapping of dependency labels to semantic roles, and conversion of dependency structure to phrase structure. Also worked on annotation management, specifically pipeline management for annotation tools and error checking.

Sep 2012: Graduate Research Assistant on the DARPA project BOLT (Broad Operational Language Translation). Contributed to the annotation pipeline for Arabic and Arabic dialects.

Other Research Activities

December 2012: Conference Tutorial Presenter at COLING 2012 with Owen Rambow and Dipti Misra Sharma. Tutorial title: New Frontiers in Hindi and Urdu Natural Language Processing.

Sep-Nov 2005: Part-time Lecturer for “Media and Semiotics” at the Department of Applied Linguistics, Shreemati Nathibai Damodar Thackersey Women’s University, Mumbai, India.

May-June 2008: Student Organizer at the IIIT-Hyderabad Advanced School on Natural Language Processing. Helped coordinate participants, and organized and mentored morphological analysis groups.

Invited Talks

April 2017: Linguistic Structure Prediction: the case of light verbs in Hindi. Invited talk at SIGML, IIT Kanpur.

November 2016: Light verb constructions in Computational Grammars: LFG and TAG. Guest Lecture in LING 6250: Comparative Topics in Linguistics (CU-Boulder).

January 2016: Complex predicates in Hindi. Presentation at ‘Mind Language and Society’ - Cognitive Perspectives on human behaviour, workshop at IIT Delhi.

September 2015: Exploring Hindi Treebank and PropBank. Talk given at the Centre for Sanskrit Studies, JNU.

July 2011: Analysis of the Hindi Proposition Bank using Dependency Structure. Talk given at the Workshop on South Asian Languages: Formal Approaches and Computational Resources, Linguistic Society of America Summer Institute, Boulder, Colorado.

July 2011: Exploring PropBanks for English and Hindi. Guest Lecturer at ‘Treebanking and the Theoretical Linguist’, Linguistic Society of America Summer Institute, Boulder, Colorado.


  • International Conference on Natural Language Processing 2010, 2011, 2012, 2013
  • International Conference on Computational Linguistics, 2012

Professional Memberships

2011-present: The Association for Computational Linguistics

2013: Linguistic Society of America

2014: Association for Women in Science