Chinnadhurai Sankar

Email / Google Scholar / LinkedIn / Twitter

Research Scientist at Meta.

- Prev: Research Lead at SliceX AI
- Recent works: ELM-Turbo - Efficient and Decomposable LLMs, Project CAIRaoke [CNET]
- Research Focus: NLP, Conversational AI, On-device AI, Deep Learning, Deep RL.
- Best Paper Award, SIGDIAL 2019 and Best Paper Honorable Mention, EACL 2021 & ACL 2019.
- Area Chair for CoLLAs 2025.

[Education] I completed my Ph.D. at the University of Montreal (MILA lab), advised by Prof. Yoshua Bengio. I earned my bachelor's degree at IIT Madras, where I majored in Electrical Engineering and minored in Physics, and my master's degree in ECE at Purdue University.

News

[Jan 2025] Serving as a Senior PC for CoLLAs 2025.
[Jul 2024] 📣 Excited to launch Llama3.1-ELM-Turbo models - 3B, 4B, 6B model sizes with MMLU - 56 @ 16000 tokens/sec. [Blog post]
[Apr 2024] Launched ELM-V1 model collection - LLMs with a custom efficient transformer architecture that, once pre-trained, yields multiple smaller models for inference. [Blog post]
[Jan 2024] Serving as a Senior PC for CoLLAs 2024.
[Dec 2023] Co-organizing NILLI 2023 at EMNLP2023! Check out our 2022 edition here.
[Dec 2023] Our Continual Dialogue State Tracking paper, a novel way of combining continual learning with in-context learning to solve dialog state tracking, is accepted to EMNLP-23.
[Jul 2023] Serving as a Senior PC for CoLLAs 2023.
[Dec 2022] Started a new journey at SliceX AI!
[Oct 2022] Our CheckDST paper on comprehensive Dialogue State Tracking metrics is accepted to EMNLP-22, findings.
[Oct 2022] Our paper on data efficiency of instruction tuning vs prompting is accepted to ENLSP @ NeurIPS2022.
[Aug 2022] Our work - Data Augmented Invariant Regularization (DAIR) - is accepted to TMLR2022.
[Apr 2022] Two of our papers have been accepted to NAACL 2022 - DSR-ambiguity to model ambiguities in task oriented dialogs and KETOD to combine Task-Oriented and Chit-Chat dialogs grounded in external knowledge.
[Mar 2022] Serving as a Senior PC for CoLLAs 2022.
[Feb 2022] Glad to have incepted and worked on foundational conversational AI tech - Project CAIRaoke, presented during the Meta AI day 2022. [Video] [CNET]
[Dec 2021] Two preprints out - DSR-ambiguity to model ambiguities in task oriented dialogs and CheckDST, a checklist to measure real-world generalization of dialog models.
[Nov 2021] Co-organizing Learning to learn through interaction (NILLI @ EMNLP-22).
[Nov 2021] Serving as an area chair for ACL ARR 2021.
[Oct 2021] Check out our work - a novel regularizing loss function to improve model robustness beyond Data Augmentation.
[Jul 2021] Two of our recent works on dialog - DialogStitch: a framework to synthetically create longer dialogs by stitching existing ones and an improved version of the MultiWOZ2.2 dataset by fixing inconsistent annotations - are accepted to SIGDIAL2021.
[May 2021] Our work on multi-step reasoning in video grounded dialogue has been accepted to ACL2021.
[Apr 2021] Our paper on on-device LSH based Transformers won Best Paper Award, Honorable Mention at EACL 2021 (oral presentation).
[Jan 2021] Two of our recent works on the robustness of LSH based text representations and on-device LSH based Transformers are accepted to EACL2021.
[Jan 2021] I am co-organizing the ICLR 2021 Workshop on Neural Conversational AI (CAIR @ ICLR-21).
[Oct 2020] Our paper, which proposes a new RL based data augmentation for modelling open domain dialogs, has been accepted to JAIR.
[Mar 2020] Joined as a Research Scientist at Facebook AI to work on conversational AI.
[Jan 2020] Two of our research efforts cited as important conversational AI papers from 2019. Check out this Forbes article.
[Sep 2019] Our paper on modelling chit-chat dialog with discrete attributes using Deep RL won Best Paper Award at SIGDIAL 2019 (oral presentation).
[Aug 2019] Our paper, TaskMaster Dialog Corpus: Toward a Realistic and Diverse Dataset, accepted to EMNLP 2019 (oral presentation). Check out Google AI blog post, Venturebeat.
[Jul 2019] Our paper on analyzing context representation in neural dialog systems has been accepted to ACL 2019 (oral presentation, Best Paper Award nomination).
[Mar 2019] Our paper on training embedding-less word representations has been accepted to NAACL 2019.
[Jan 2019] Started internship with Google Brain to work on Task oriented dialog systems. Papers in review at EMNLP 2019 and NeurIPS 2019.
[Jan 2019] Gave a contributed talk at Deep-Dial, AAAI 2019 about our work on modelling chit-chat dialog. Also, accepted to Conv-AI, NeurIPS 2018.
[Oct 2018] Our paper, which proposes a new simple recurrent architecture for modelling long term dependencies, has been accepted to AAAI 2019 (spotlight presentation).
[Dec 2017] Our dialog bot, MILABOT, won 2nd prize in NeurIPS 2017 demonstration track.

Recent Publications

Know Thy Strengths: Comprehensive Dialogue State Tracking Diagnostics [EMNLP 2022, findings][paper]

DAIR: Data Augmented Invariant Regularization [TMLR 2022][paper]

Database Search Results Disambiguation for Task-Oriented Dialog Systems [NAACL 2022][paper]

KETOD: Knowledge-Enriched Task-Oriented Dialogue [NAACL 2022, findings][paper]

Annotation Inconsistency and Entity Bias in MultiWOZ [SIGDIAL 2021][paper]

DialogStitch: Synthetic Deeper and Multi-Context Task-Oriented Dialogs [SIGDIAL 2021][paper]

DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue. [ACL 2021][paper]

ProFormer: Towards On-Device LSH Projection Based Transformers. [EACL 2021, oral, Best Paper Award, Honorable Mention][paper]

On-Device Text Representations Robust To Misspellings via Projections. [EACL 2021][paper]

Deep Reinforcement Learning For Modeling Chit-Chat Dialog With Discrete Attributes. [SIGDIAL 2019, oral, Best Paper Award][arxiv]

TaskMaster-1 Dialog Corpus: Toward a Realistic and Diverse Dataset. [EMNLP 2019, oral][Google AI blog post][arxiv][data][Media Coverage]

Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study. [ACL 2019, oral, Best Paper Award nomination][arxiv][code]

Transferable Neural Projection Representations. [NAACL 2019][arxiv]

Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies. [AAAI 2019, spotlight][arxiv]

Please refer to my Google Scholar page for a detailed list of my publications.