2.3. Lattice Example

2.4. A Demonstration of the Utility of MUSE CSP Parsing

To demonstrate the utility of MUSE CSP for simultaneously parsing multiple CSP instances, consider the problem of determining which strings of length 3n consisting of a's, b's, and c's are in the language anbncn. For the value of n=3, this problem can be represented as the single MUSE CSP problem shown in Figure 9 (the roles and role values are not depicted to simplify the figure). We have devised constraints for this language (see Figure 10) which eliminate all role values for all sentences not in the language as well as all ungrammatical role values for a sentence in the language. When these constraints are applied followed by MUSE arc consistency to a lattice like that in Figure 9 with a length divisible by three, then only the grammatical sentence will remain with a single parse. For lattices containing only sentences with lengths that are not divisible by three, all role values are eliminated by MUSE arc consistency (there is no grammatical sentence). Hence, there is no search required to extract a parse if there is one. For the n=3 case of Figure 9, the parse appears in Figure 11. A single parse will result regardless of the n chosen. Note that the modifiees for the role values in the parse are used to ensure that for each a, there is a corresponding c; for each b, there is a corresponding a; and for each c, there is a corresponding b. Figure 12 examines the time needed to extract a parse for sentences in the language anbncn from MUSE CSPs representing all strings of length 3n, tex2html_wrap_inline19201 , containing a, b, and c. The time to perform MUSE AC-1 and extract the solution is compared to the time to extract the solution without any preprocessing. The time to perform MUSE AC-1 and extract the parse is stable as sentence length grows, but the time to extract a parse grows quickly for sentence lengths greater than 15 when MUSE arc consistency is not used.

   figure15535
Figure 9: A single MUSE CSP can simultaneously test all possible orderings of a's, b's, and c's for membership in the language anbncn, n=3.

   figure15542
Figure 10: tex2html_wrap_inline18645 = tex2html_wrap_inline19229 accepts the language anbncn, n tex2html_wrap_inline18651 0.

   figure15563
Figure 11: The single parse remaining in the network depicted in Figure 9 after the applying the constraints in tex2html_wrap_inline18645 and enforcing MUSE arc consistency.

   figure15585
Figure 12: This graph depicts the time to extract the parse for the language anbncn from a MUSE CSP representing all sentences of length 3n, where n varies from 1 to 7. The time to extract the parse without MUSE arc consistency is compared to the time to perform MUSE AC-1 and extract the parse.

The previous example involves a grammar where there can only be one parse for a single sentence in the lattice; however, it is a simple matter to provide similar demonstrations for more complex cases. For example, the constraint grammar shown in Figure 13 can be to parse all possible sentences of a given length in the the language ww, such that w is in tex2html_wrap_inline19247 . Consider the MUSE CSP in Figure 14 (the roles and role values are not depicted to simplify the figure). After applying the constraints and performing MUSE arc consistency on this MUSE CSP, there are precisely 81 strings that are in ww, and their parses are compactly represented in the constraint network. The constraints plus MUSE arc consistency eliminate every value that cannot appear in a parse. For lattices containing odd length sentences, no role values remain after MUSE arc consistency. Figure 15 shows the time needed to extract all of the parses for sentences in the language ww from the MUSE CSPs as we vary the length of w from 1 to 8. The time to perform MUSE AC-1 and extract the parses grows slowly as sentence length increases because the number of parses increases with sentence length; however, it grows more slowly than the time to extract the parses when MUSE arc consistency is not used.

   figure15595
Figure 13: tex2html_wrap_inline18661 = tex2html_wrap_inline19265 accepts the language ww.

   figure15613
Figure 14: A single MUSE CSP can simultaneously test all possible orderings of a's, b's, and c's for membership in the language ww where |w|=4 .

   figure15618
Figure 15: This graph depicts the time to extract all parses for the language ww from a MUSE CSP representing all sentences of length 2 to 16 such that tex2html_wrap_inline19281 . The time to extract all parses without MUSE arc consistency is compared to the time to perform MUSE AC-1 and extract all parses.


Similar results have also been obtained with grammars used to parse word graphs constructed from spoken sentences in the resource management and ATIS domains [Harper, Jamieson, Zoltowski, & Helzerman, 1992, Zoltowski, Harper, Jamieson, & Helzerman, 1992, Harper & Helzerman, 1995a].



3. The MUSE CSP Arc Consistency Algorithm