The controlled language ASD-STE100 (www.asd-ste100.org ) is used for safety-critical maintenance documentation. To make text as clear as possible, each approved term in ASD-STE100 usually has only one part of speech. For example, the word 'oil' is permitted as a noun, but it is not permitted as a verb.
Mike shows how sets of patterns in text can be used to identify the part of speech that a term has (noun or verb). For example, in the structure ARTICLE+X+BE, X is a noun. Most text is more complex, and rules can be very complex. Although disambiguation is not always possible, it is sufficiently good to make an effective checker for ASD-STE100.
Although the presentation is primarily about part-of-speech disambiguation, Mike includes an overview of how the patterns are implemented in XML in a customized version of the open-source software LanguageTool (www.languagetool.org).