CSE-4/562 Spring 2019


                SELECT COUNT(*) 
                FROM R NATURAL JOIN ZipCodeLookup 
                WHERE State = 'NY'

Idea 1: Sample. Pick (e.g.) 10 random possible worlds and compute results for each.

$\mathcal R$	Name	ZipCode	$\mathcal{ID}$
1	Alice	10003	1
2	Bob	14260	1
3	Carol	13201	1
4	Alice	10003	2
5	Bob	19260	2
6	Carol	18201	2
7	Alice	10003	3
8	Bob	14260	3
9	Carol	18201	3
10	Alice	10003	4
11	Bob	14260	4
12	Carol	18201	4
13	Alice	10003	5
14	Bob	19260	5
15	Carol	18201	5

$\mathcal R$	Name	ZipCode	$\mathcal W$
1	Alice	10003	11111
2	Bob	[14260, 19260, 14260, 14260, 19260]	11111
3	Carol	[13201, 18201, 18201, 18201, 18201]	11111

$\mathcal R$	Name	ZipCode	$\mathcal W$
1	Alice	10003	11111
2	Bob	[14260, 19260, 14260, 14260, 19260]	10110
3	Carol	[13201, 18201, 18201, 18201, 18201]	10000

$P[\texttt{x} \wedge \texttt{y}] = P[\texttt{x}] \cdot P[\texttt{y}]$
(iff $\texttt{x}$ and $\texttt{y}$ are independent)

$P[\texttt{x} \wedge \texttt{y}] = 0$
(iff $\texttt{x}$ and $\texttt{y}$ are mutually exclusive)

$P[\texttt{x} \vee \texttt{y}] = 1- (1-P[\texttt{x}]) \cdot (1-P[\texttt{y}])$
(iff $\texttt{x}$ and $\texttt{y}$ are independent)

$P[\texttt{x} \vee \texttt{y}] = P[\texttt{x}] + P[\texttt{y}]$
(iff $\texttt{x}$ and $\texttt{y}$ are mutually exclusive)

Good enough to get us the probability of any boolean formula over mutually exclusive or independent variables

... and otherwise?