Can Transformers Code?virtual
Tue 14 Jun 2022 01:30 - 02:15 at Boardroom - Afternoon
In recent years, large language models based on the Transformer architecture were trained on large corpora of text from the internet. Despite being trained for the simple task of predicting the next word, these models show astonishing capabilities. They can generate long coherent text and solve tasks given only a description and possibly a few examples. I will describe the architecture, training process and results that come from these models which lead many people to ask: are they indeed capable of a form of understanding, or are they just stochastic parrots? I will then argue that coding tasks provide a good testbed for this question. I will show how large Transformers have been adapted to code and demonstrate both their successes and failures. I will then discuss a few recently introduced techniques that significantly improve coding and reasoning performance of large language models.
Mon 13 JunDisplayed time zone: Pacific Time (US & Canada) change
13:30 - 15:00 | |||
13:30 45mKeynote | Can Transformers Code?virtual MAPS Łukasz Kaiser OpenAI | ||
14:15 15mTalk | Syntax-Guided Program Reduction for Understanding Neural Code Intelligence Modelsvirtual MAPS Md Rafiqul Islam Rabin University of Houston, Aftab Hussain University of Houston, Amin Alipour University of Houston DOI Pre-print | ||
14:30 15mTalk | A Systematic Evaluation of Large Language Models of Codevirtual MAPS Frank F. Xu Carnegie Mellon University, Uri Alon Carnegie Mellon University, Graham Neubig Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
14:45 15mPoster | Poster Session MAPS |
Tue 14 JunDisplayed time zone: Pacific Time (US & Canada) change
01:30 - 03:00 | |||
01:30 45mKeynote | Can Transformers Code?virtual MAPS Łukasz Kaiser OpenAI | ||
02:15 15mTalk | Syntax-Guided Program Reduction for Understanding Neural Code Intelligence Modelsvirtual MAPS Md Rafiqul Islam Rabin University of Houston, Aftab Hussain University of Houston, Amin Alipour University of Houston DOI Pre-print | ||
02:30 15mTalk | A Systematic Evaluation of Large Language Models of Codevirtual MAPS Frank F. Xu Carnegie Mellon University, Uri Alon Carnegie Mellon University, Graham Neubig Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
02:45 15mPoster | Poster Session MAPS |