TitleNatural language is a programming language: Applying natural language processing to software development
Publication TypeConference Paper
Year of Publication2017
AuthorsErnst MD
Conference NameSNAPL 2017: the 2nd Summit oN Advances in Programming Languages
Pagination4:1–4:14
Date or Month PublishedMay
Conference LocationAsilomar, CA, USA
AbstractA powerful, but limited, way to view software is as source code alone. Treating a program as a sequence of instructions enables it to be formalized and makes it amenable to mathematical techniques such as abstract interpretation and model checking. \par A program consists of much more than a sequence of instructions. Developers make use of test cases, documentation, variable names, program structure, the version control repository, and more. I argue that it is time to take the blinders off of software analysis tools: tools should use all these artifacts to deduce more powerful and useful information about the program. \par Researchers are beginning to make progress towards this vision. This paper gives, as examples, four results that find bugs and generate code by applying \emphnatural language processing techniques to software artifacts. The four techniques use as input error messages, variable names, procedure documentation, and user questions. They use four different NLP techniques: document similarity, word semantics, parse trees, and neural networks. \par The initial results suggest that this is a promising avenue for future work.
Citation KeyErnst2017