CSE 662 Fall 2019

1. Parse: Derive the semantic structure of the sentence.
2. Mapping: Map the nodes of the sentence parse tree onto concepts.
3. Structure: Shoehorn the nodes into a query-friendly structure.
4. Iterate: Confirm interpretation with the user and repeat from step 2 as needed.
5. SQL: Generate and evaluate the query.

Precise question → Precise answer

Vague question → Imprecise answer

Vague question → Precise answer

Recover sentence structure as a tree of concepts

Given a parse tree node ($node$) and every schema element ($elem$), find the best match.

$Sim(node, elem) = \textbf{max}(Jaccard(node, elem), Word2vec(node, elem))$

All $elem$ s.t. $Sim(node, elem) > \tau$ (for a similarity threshold $\tau$): Candidate nodes (potential matches)
$\textbf{argmax}_{elem}(Sim(node, elem))$: The best node
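
A minimal sketch of the matching step, under stated assumptions: Jaccard is computed here over character bigrams (the notes do not say which sets are compared), and `embed_sim` is a hypothetical stand-in for a Word2vec similarity function that the caller supplies. The function names `qgrams`, `sim`, `candidates`, and `best_match` are illustrative only.

```python
from typing import Callable, List, Optional, Set, Tuple

# Hypothetical signature for a Word2vec-style similarity in [0, 1].
EmbedSim = Callable[[str, str], float]


def qgrams(s: str, q: int = 2) -> Set[str]:
    """Character q-grams of s (assumption: bigrams, case-insensitive)."""
    s = s.lower()
    if len(s) < q:
        return {s} if s else set()
    return {s[i:i + q] for i in range(len(s) - q + 1)}


def jaccard(a: str, b: str, q: int = 2) -> float:
    """Jaccard coefficient |A ∩ B| / |A ∪ B| over the two q-gram sets."""
    ga, gb = qgrams(a, q), qgrams(b, q)
    union = ga | gb
    return len(ga & gb) / len(union) if union else 0.0


def sim(node: str, elem: str, embed_sim: Optional[EmbedSim] = None) -> float:
    """Sim(node, elem) = max(Jaccard, Word2vec); Word2vec term is 0 if absent."""
    semantic = embed_sim(node, elem) if embed_sim is not None else 0.0
    return max(jaccard(node, elem), semantic)


def candidates(node: str, elems: List[str], tau: float,
               embed_sim: Optional[EmbedSim] = None) -> List[Tuple[str, float]]:
    """All schema elements whose similarity to node exceeds tau."""
    scored = [(e, sim(node, e, embed_sim)) for e in elems]
    return [(e, s) for e, s in scored if s > tau]


def best_match(node: str, elems: List[str],
               embed_sim: Optional[EmbedSim] = None) -> str:
    """argmax over elems of Sim(node, elem); elems must be non-empty."""
    return max(elems, key=lambda e: sim(node, e, embed_sim))
```

For example, `best_match("authors", ["author", "title", "year"])` picks `"author"` on spelling similarity alone. Passing an `embed_sim` lets semantically related but differently spelled words match as well.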

Every node that matches at least one schema element is labeled either NN (name node) or VN (value node).

Every node that matches more than one schema element is ambiguous.
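
The two labeling rules above can be sketched as follows. This assumes that NN marks a match against a relation or attribute name and VN a match against a stored value, and that the matches passed in are already filtered by the threshold and sorted by descending similarity. `SchemaElem` and `label_node` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class SchemaElem:
    text: str
    kind: str  # assumption: "name" (relation/attribute) or "value" (stored data)


def label_node(matches: List[SchemaElem]) -> Tuple[Optional[str], bool]:
    """Return (label, ambiguous) for one parse tree node.

    matches: schema elements above the threshold, best match first.
    label is None when nothing matched, else "NN" or "VN" by the best match.
    ambiguous is True when more than one schema element matched.
    """
    if not matches:
        return None, False
    label = "NN" if matches[0].kind == "name" else "VN"
    return label, len(matches) > 1
```

An ambiguous node is a natural thing to surface to the user in the Iterate step, since the system cannot tell on its own which candidate was intended.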