Master's thesis proposal: Natural Language Processing of Textual Use Cases

Advisor: Vladimir Mencl
Student: Jaroslav Dražan

The design of a software system or component starts with specifying its requirements; traditionally, use cases written in natural language (English) are used for this task. Based on the simple and uniform sentence structure used in textual use cases [10], a conversion scheme [1, 3] has been proposed in the Procasor project [12] to derive behavior specifications from textual use cases. The scheme has been implemented in a prototype tool, employing a suite of readily available natural language processing tools [7, 8, 9].

In this preliminary work, certain issues remain open, such as evaluating the quality of the parse tree provided by the linguistic tools. Recent advances in natural language processing tools [5] make it possible to obtain several candidate parse trees for a sentence; furthermore, several different parsers are available, and they may yield different parse trees.

The goal of the thesis is to build on the conversion scheme described in [1, 3] and to propose a metric for evaluating the quality of a parse tree. The thesis should address the issue of evaluating several parse trees of a sentence specifying a use case step and possibly also the issue of combining the information available in these parse trees. The thesis should also address the issue of constructing matching event tokens for complementary send/receive actions in use case models of communicating entities. The thesis should be supported by a proof-of-concept implementation.


