Finite Factored Sets: Polynomials and Probability

Scott Garrabrant

In this post, given a finite factored set $F = (S, B)$ , we will show how to associate each $E \subseteq S$ with a characteristic polynomial, $Q_{E}^{F}$ . We will discuss how to factor these characteristic polynomials, and use them to build up to the fundamental theorem of finite factored sets, which associates conditional orthogonality with conditional independence in probability distributions.

5.1. Characteristic Polynomials

Definition 28. Given a finite factored set $F = (S, B)$ , let ${Poly}^{F}$ denote the ring of polynomials with coefficients in $R$ and variables in $P (S)$ .

Definition 29. Given a finite factored set $F = (S, B)$ , a $p \in {Poly}^{F}$ , and an $f : P (S) \to R$ , we write $p (f) \in R$ for the evaluation of $p$ at $f$ , computed by replacing each $E \subseteq S$ with $f (E)$ .

Definition 30. Given a finite factored set $F = (S, B)$ , and a polynomial $p \in {Poly}^{F}$ , $supp (p) \subseteq P (S)$ denotes the set of all variables $v \in P (S)$ that appear in $p$ . $supp (p)$ is called the support of $p$ .

Definition 31. Given a finite factored set $F = (S, B)$ and an $E \subseteq S$ , let $Q_{E}^{F} \in {Poly}^{F}$ be given by $Q_{E}^{F} = \sum_{s \in E} \prod_{b \in B} [s]_{b}$ . $Q_{E}^{F}$ is called the characteristic polynomial of $E$ (in $F$ ).
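
As a small illustration (a hypothetical example, not taken from the post), let $S = \{0, 1\}^{2}$ and let $B = \{b_{1}, b_{2}\}$ , where $b_{i}$ partitions $S$ according to the $i$-th coordinate. Write $x_{j}$ for the part of $b_{1}$ on which the first coordinate is $j$ , and $y_{j}$ for the part of $b_{2}$ on which the second coordinate is $j$ ; these four subsets of $S$ are the only variables that occur. Writing elements of $S$ as strings such as $01$ , Definition 31 then gives

\begin{matrix} Q_{S}^{F} & = x_{0} y_{0} + x_{0} y_{1} + x_{1} y_{0} + x_{1} y_{1} , \\ Q_{\{00, 01\}}^{F} & = x_{0} y_{0} + x_{0} y_{1} , \\ Q_{\{00, 11\}}^{F} & = x_{0} y_{0} + x_{1} y_{1} . \end{matrix}

The first two factor as $(x_{0} + x_{1}) (y_{0} + y_{1})$ and $x_{0} (y_{0} + y_{1})$ respectively, while the third does not factor into polynomials of lower degree.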

We will be building up to an understanding of how to factor $Q_{E}^{F}$ into irreducibles. For that, we will first need to give some basic notation for manipulating polynomials in ${Poly}^{F}$ .

Definition 32. Given a finite factored set $F = (S, B)$ , an $s \in S$ , and a $C \subseteq B$ , let ${mono}_{C}^{F} (s) \in {Poly}^{F}$ be given by ${mono}_{C}^{F} (s) = \prod_{b \in C} [s]_{b}$ .

Definition 33. Given a finite factored set $F = (S, B)$ , an $E \subseteq S$ , and a $C \subseteq B$ , let ${monos}_{C}^{F} (E) \in P ({Poly}^{F})$ be given by ${monos}_{C}^{F} (E) = {{mono}_{C}^{F} (s) ∣ s \in E}$ .

Definition 34. Given a finite factored set $F = (S, B)$ , an $E \subseteq S$ , and a $C \subseteq B$ , let ${poly}_{C}^{F} (E) \in {Poly}^{F}$ be given by ${poly}_{C}^{F} (E) = \sum_{m \in {monos}_{C}^{F} (E)} m$ .

Proposition 26. Let $F = (S, B)$ be a finite factored set, and let $E \subseteq S$ . Then $Q_{E}^{F} = {poly}_{B}^{F} (E)$ .

Proof. We start by showing that for all $s \neq t \in S$ , ${mono}_{B}^{F} (s) \neq {mono}_{B}^{F} (t)$ .

Let $s \neq t \in S$ be arbitrary. By Proposition 3, if $s \neq t$ , there must be some $b \in B$ such that $[s]_{b} \neq [t]_{b}$ . Then, note that $[s]_{b} \in supp ({mono}_{B}^{F} (s))$ . If $[s]_{b}$ were also in $supp ({mono}_{B}^{F} (t))$ , then $t$ would be in both $[s]_{b}$ and $[t]_{b}$ , contradicting the fact that these two sets are disjoint. Therefore ${mono}_{B}^{F} (s) \neq {mono}_{B}^{F} (t)$ .

Thus ${monos}_{B}^{F} (E)$ has exactly one element for each element of $E$ , so we have that $\sum_{m \in {monos}_{B}^{F} (E)} m = \sum_{s \in E} {mono}_{B}^{F} (s) = Q_{E}^{F}$ . $□$
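
The identity in Proposition 26 can be checked mechanically on a small case. The sketch below is illustrative only: the toy factored set ( $S = \{0, 1\}^{2}$ with one factor per coordinate), the encoding of the variable $[s]_{b}$ as a pair `(i, s[i])`, and the helper names `mono`, `monos`, `poly`, and `Q` are all assumptions made for this example rather than anything defined in the post.

```python
from itertools import product
from collections import Counter

# Hypothetical toy factored set: S = {0,1}^2, one factor per coordinate.
S = list(product(range(2), repeat=2))
B = (0, 1)  # factor i partitions S by the value of coordinate i


def mono(C, s):
    """mono_C(s); the pair (i, s[i]) stands for the variable [s]_b of factor i."""
    return tuple(sorted((i, s[i]) for i in C))


def monos(C, E):
    """monos_C(E): the set of distinct monomials mono_C(s) for s in E."""
    return {mono(C, s) for s in E}


def poly(C, E):
    """poly_C(E), stored as a map from monomials to integer coefficients."""
    return Counter({m: 1 for m in monos(C, E)})


def Q(E):
    """Characteristic polynomial: sum over s in E of the product of [s]_b over b in B."""
    out = Counter()
    for s in E:
        out[mono(B, s)] += 1
    return out


E = [(0, 0), (1, 1)]
print(Q(E) == poly(B, E))
```

Representing a polynomial as a map from sorted variable tuples to coefficients keeps equality checks exact. On the smaller index set `(0,)` the four elements of `S` collapse to two monomials, which is why the proof needs the distinctness argument for the full set $B$ .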

Proposition 27. Let $F = (S, B)$ be a finite factored set, and let $E_{0}, E_{1} \subseteq S$ be subsets of $S$ . Let $C_{0}, C_{1} \subseteq B$ be disjoint subsets of $B$ . Let $E_{2} = χ_{C_{0}}^{F} (E_{0}, E_{1})$ , and let $C_{2} = C_{0} \cup C_{1}$ . Then ${poly}_{C_{2}}^{F} (E_{2}) = {poly}_{C_{0}}^{F} (E_{0}) \cdot {poly}_{C_{1}}^{F} (E_{1})$ .

Proof. For $i \in {0, 1, 2}$ , let $M_{i} = {monos}_{C_{i}}^{F} (E_{i})$ . We will start by showing that $f : M_{0} \times M_{1} \to M_{2}$ , given by $f (m_{0}, m_{1}) = m_{0} m_{1}$ , is a well-defined function and a bijection.

First, observe that it follows immediately from the definition that for all $s_{0}, s_{1} \in S$ , if $s_{2} = χ_{C_{0}}^{F} (s_{0}, s_{1})$ we have that ${mono}_{C_{0}}^{F} (s_{0}) = {mono}_{C_{0}}^{F} (s_{2})$ , ${mono}_{C_{1}}^{F} (s_{1}) = {mono}_{C_{1}}^{F} (s_{2})$ , and ${mono}_{C_{0}}^{F} (s_{2}) \cdot {mono}_{C_{1}}^{F} (s_{2}) = {mono}_{C_{2}}^{F} (s_{2})$ . Combining these, we get that ${mono}_{C_{0}}^{F} (s_{0}) \cdot {mono}_{C_{1}}^{F} (s_{1}) = {mono}_{C_{2}}^{F} (χ_{C_{0}}^{F} (s_{0}, s_{1}))$ .

For all $(m_{0}, m_{1}) \in M_{0} \times M_{1}$ , there exists some $s_{0} \in E_{0}$ such that $m_{0} = {mono}_{C_{0}}^{F} (s_{0})$ , and some $s_{1} \in E_{1}$ such that $m_{1} = {mono}_{C_{1}}^{F} (s_{1})$ , and this gives us that $m_{0} m_{1} = {mono}_{C_{0}}^{F} (s_{0}) {mono}_{C_{1}}^{F} (s_{1}) = {mono}_{C_{2}}^{F} (χ_{C_{0}}^{F} (s_{0}, s_{1})) \in M_{2}$ . Thus, $f$ is well-defined.

To see that $f$ is surjective, observe that for all $m_{2} \in M_{2}$ , there exists an $s_{2} \in E_{2}$ such that $m_{2} = {mono}_{C_{2}}^{F} (s_{2})$ , and there exist $s_{0} \in E_{0}$ and $s_{1} \in E_{1}$ such that $s_{2} = χ_{C_{0}}^{F} (s_{0}, s_{1})$ , and we have $f ({mono}_{C_{0}}^{F} (s_{0}), {mono}_{C_{1}}^{F} (s_{1})) = m_{2}$ .

To see that $f$ is injective, observe that for $i \in {0, 1}$ , for all $m_{i} \in M_{i}$ , $supp (m_{i}) \subseteq ⋃_{b \in C_{i}} b$ . Further, $⋃_{b \in C_{0}} b$ and $⋃_{b \in C_{1}} b$ are disjoint. Thus, for all $m_{0} \in M_{0}$ and $m_{1} \in M_{1}$ , $supp (m_{i}) = supp (m_{0} m_{1}) \cap ⋃_{b \in C_{i}} b$ .

This means that for all $m_{0}, m_{0}^{'} \in M_{0}$ and $m_{1}, m_{1}^{'} \in M_{1}$ , if $m_{0} m_{1} = m_{0}^{'} m_{1}^{'}$ , then $supp (m_{0}) = supp (m_{0}^{'})$ and $supp (m_{1}) = supp (m_{1}^{'})$ . However, every monomial in $M_{0}$ or $M_{1}$ is just equal to the product of all variables in its support. Thus $m_{0} = \prod_{v \in supp (m_{0})} v = m_{0}^{'}$ and $m_{1} = \prod_{v \in supp (m_{1})} v = m_{1}^{'}$ . Thus $f$ is injective, and thus a bijection between $M_{0} \times M_{1}$ and $M_{2}$ .

Now, we have that

\begin{matrix} {poly}_{C_{0}}^{F} (E_{0}) \cdot {poly}_{C_{1}}^{F} (E_{1}) & = \left( \sum_{m_{0} \in M_{0}} m_{0} \right) \left( \sum_{m_{1} \in M_{1}} m_{1} \right) \\ & = \sum_{m_{0} \in M_{0}} \sum_{m_{1} \in M_{1}} m_{0} m_{1} \\ & = \sum_{(m_{0}, m_{1}) \in M_{0} \times M_{1}} m_{0} m_{1} \\ & = \sum_{(m_{0}, m_{1}) \in M_{0} \times M_{1}} f (m_{0}, m_{1}) \\ & = \sum_{m_{2} \in M_{2}} m_{2} \\ & = {poly}_{C_{2}}^{F} (E_{2}) . \end{matrix}

$□$
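
Proposition 27 can also be sanity-checked numerically. The following is a minimal sketch under stated assumptions: the toy set $S = \{0, 1\}^{3}$ with one factor per coordinate, the pair `(i, s[i])` standing in for the variable $[s]_{b}$ , and the helper names `mono`, `poly`, `mul`, and `chi` are all hypothetical choices for this example.

```python
from itertools import product
from collections import Counter

# Hypothetical toy factored set: S = {0,1}^3, one factor per coordinate.
N = 3
S = list(product(range(2), repeat=N))


def mono(C, s):
    # The pair (i, s[i]) stands for the variable [s]_b of factor i.
    return tuple(sorted((i, s[i]) for i in C))


def poly(C, E):
    # poly_C(E): each distinct monomial of monos_C(E) appears exactly once.
    return Counter({mono(C, s): 1 for s in E})


def mul(p, q):
    # Product of two polynomials stored as {monomial: coefficient}.
    out = Counter()
    for m0, a in p.items():
        for m1, c in q.items():
            out[tuple(sorted(m0 + m1))] += a * c
    return out


def chi(C, E0, E1):
    # chi_C(E0, E1): agree with an element of E0 on C and with an element of E1 elsewhere.
    return {tuple(s[i] if i in C else t[i] for i in range(N))
            for s in E0 for t in E1}


C0, C1 = (0,), (1,)          # disjoint sets of factors
C2 = C0 + C1                 # their union
E0 = [(0, 0, 0), (1, 0, 1)]
E1 = [(0, 1, 0), (1, 1, 1), (0, 0, 1)]
E2 = chi(C0, E0, E1)

lhs = poly(C2, E2)
rhs = mul(poly(C0, E0), poly(C1, E1))
print(lhs == rhs)
```

Here `E2` has six elements but only four distinct monomials over `C2`, matching the four products of a monomial from `poly(C0, E0)` with one from `poly(C1, E1)`.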

Proposition 28. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . If $p$ divides $Q_{E}^{F}$ , then $p = r \cdot {poly}_{C}^{F} (E)$ , for some $r \in R$ and $C \subseteq B$ .

Proof. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . Let $p, q \in {Poly}^{F}$ satisfy $p q = Q_{E}^{F}$ . We thus must have $supp (p) \cup supp (q) = supp (Q_{E}^{F})$ .

If there were some $T \in supp (p) \cap supp (q)$ , then the degree of $T$ in $Q_{E}^{F}$ would be at least 2, contradicting the definition of $Q_{E}^{F}$ and Corollary 1. Thus, $supp (p) \cap supp (q) = {}$ .

There can be no combining like terms, then, in the product $p q$ . The monomial terms in $Q_{E}^{F}$ are in bijective correspondence to the pairs of monomial terms in $p$ and monomial terms in $q$ .

In particular, this means that since all the coefficients in $p q$ are equal to $1$ , all the coefficients in $p$ must be equal to some $r \in R$ , and all of the coefficients in $q$ must be equal to $1 / r$ .

Further, for all $b \in B$ , if $b \cap supp (p)$ is nonempty, $b \cap supp (q)$ must be empty, since otherwise $Q_{E}^{F}$ would contain a term with two factors in $b$ , which clearly never happens according to the definition of $Q_{E}^{F}$ .

Since $E$ is nonempty, for each $b \in B$ there must be some $T \in b \cap supp (Q_{E}^{F})$ . Thus at least one of $b \cap supp (p)$ and $b \cap supp (q)$ must be nonempty, so exactly one of $b \cap supp (p)$ and $b \cap supp (q)$ must be nonempty.

Let $C$ be the set of all $b \in B$ such that $b \cap supp (p)$ is nonempty.

For every $b \in C$ , every term of $Q_{E}^{F}$ has exactly one factor in $b$ . Thus, every term in $p$ has exactly one factor in $b$ . These cover all variables in the support of $p$ , so each term in $p$ must have total degree $| C |$ .

For each $m \in {monos}_{C}^{F} (E)$ , $m$ divides a term in $Q_{E}^{F}$ . Since $m$ has no common support with $q$ , $m$ must also divide a term in $p$ . Thus $r \cdot m$ must be a term in $p$ . Conversely, every term in $p$ divides a term in $Q_{E}^{F}$ , and thus must be in ${monos}_{C}^{F} (E)$ . Thus every term in $p$ is of the form $r \cdot m$ for some $m \in {monos}_{C}^{F} (E)$ . Thus $p = \sum_{m \in {monos}_{C}^{F} (E)} r \cdot m = r \cdot {poly}_{C}^{F} (E)$ . $□$

5.2. Factoring Characteristic Polynomials

We will now show how to factor characteristic polynomials into irreducibles.

Definition 35. Given a finite factored set $F = (S, B)$ , and a nonempty subset $E \subseteq S$ , let ${Irr}^{F} (E) \subseteq P (B)$ denote the set of all $C \subseteq B$ such that:

$C$ is nonempty,
$χ_{C}^{F} (E, E) = E$ , and
there is no nonempty strict subset $D \subset C$ such that $χ_{D}^{F} (E, E) = E$ .

Proposition 29. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . Then ${Irr}^{F} (E) \in Part (B)$ .

Proof. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . It suffices to show that the sets in ${Irr}^{F} (E)$ are pairwise disjoint and cover $B$ .

We start by showing that the set of all $C \subseteq B$ satisfying $χ_{C}^{F} (E, E) = E$ is closed under intersection. Indeed, if $χ_{C_{0}}^{F} (E, E) = E$ and $χ_{C_{1}}^{F} (E, E) = E$ , then $χ_{C_{0} \cap C_{1}}^{F} (E, E) = χ_{C_{0}}^{F} (E, χ_{C_{1}}^{F} (E, E)) = χ_{C_{0}}^{F} (E, E) = E$ .

Next, observe that $χ_{B}^{F} (E, E) = E$ . Thus, for all $b \in B$ , we can consider $C_{b} = ⋂_{C \subseteq B, b \in C, χ_{C}^{F} (E, E) = E} C$ . Since $C_{b}$ is an intersection of a finite nonempty collection of sets $C$ satisfying $χ_{C}^{F} (E, E) = E$ , we have that $χ_{C_{b}}^{F} (E, E) = E$ . Further, $b \in C_{b}$ , so $C_{b}$ is nonempty.

Assume for the purpose of contradiction that there is some nonempty strict subset $D \subset C_{b}$ such that $χ_{D}^{F} (E, E) = E$ . If $b \in D$ , then we have a contradiction by the definition of $C_{b}$ . If $b \notin D$ , then note that $χ_{B ∖ D}^{F} (E, E) = E$ , so $χ_{C_{b} ∖ D}^{F} (E, E) = E$ , and $C_{b} ∖ D$ is a nonempty strict subset of $C_{b}$ that contains $b$ , contradicting the definition of $C_{b}$ .

Thus $C_{b} \in {Irr}^{F} (E)$ for all $b \in B$ , and since $b \in C_{b}$ , this means that the sets in ${Irr}^{F} (E)$ cover $B$ .

Next, we need to show that the sets in ${Irr}^{F} (E)$ are pairwise disjoint. Let $C_{0}, C_{1} \in {Irr}^{F} (E)$ be arbitrary distinct elements. We have that $χ_{C_{0} \cap C_{1}}^{F} (E, E) = E$ , and $C_{0} \cap C_{1}$ is a subset of $C_{0}$ and $C_{1}$ , and thus a strict subset of at least one of them. Thus $C_{0} \cap C_{1}$ is empty.

Thus ${Irr}^{F} (E) \in Part (B)$ . $□$
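
Definition 35 and Proposition 29 can be explored by brute force on a small example. This is a sketch only: the toy set $S = \{0, 1\}^{3}$ with one factor per coordinate and the helper names `chi`, `closed`, `irr`, and `is_partition` are assumptions introduced here, and the enumeration over all subsets of $B$ is exponential, so it is suitable only for tiny cases.

```python
from itertools import product, combinations

# Hypothetical toy factored set: S = {0,1}^3, one factor per coordinate.
N = 3
S = list(product(range(2), repeat=N))
B = tuple(range(N))


def chi(C, E0, E1):
    # chi_C(E0, E1): agree with an element of E0 on C and with an element of E1 elsewhere.
    return {tuple(s[i] if i in C else t[i] for i in range(N))
            for s in E0 for t in E1}


def closed(C, E):
    # True when chi_C(E, E) = E.
    return chi(C, E, E) == set(E)


def irr(E):
    # Nonempty C with chi_C(E, E) = E and no nonempty strict subset with that property.
    nonempty = [frozenset(c) for r in range(1, N + 1) for c in combinations(B, r)]
    good = [C for C in nonempty if closed(C, E)]
    return {C for C in good
            if not any(D != C and D.issubset(C) for D in good)}


def is_partition(parts, base):
    # Parts are nonempty, pairwise disjoint, and together cover `base`.
    parts = list(parts)
    return (all(parts)
            and sum(len(p) for p in parts) == len(base)
            and set().union(*parts) == set(base))


# Coordinates 0 and 1 are tied together in E; coordinate 2 is free.
E = [(0, 0, 0), (1, 1, 0), (0, 0, 1), (1, 1, 1)]
print(sorted(sorted(C) for C in irr(E)))   # [[0, 1], [2]]
print(is_partition(irr(E), B), is_partition(irr(S), B))
```

For the full set `S` every factor can be varied independently, so `irr(S)` consists of the three singletons.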

The following two propositions constitute a factorization of $Q_{E}^{F}$ into irreducibles.

Proposition 30. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . Then $Q_{E}^{F} = \prod_{C \in {Irr}^{F} (E)} {poly}_{C}^{F} (E)$ .

Proof. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . Let $n = | {Irr}^{F} (E) |$ , and let ${Irr}^{F} (E) = {C_{0}, \dots, C_{n - 1}}$ . For $0 \leq k < n$ , let $C_{\leq k} = ⋃_{i = 0}^{k} C_{i}$ .

We will show by induction on $k$ that $\prod_{i = 0}^{k} {poly}_{C_{i}}^{F} (E) = {poly}_{C_{\leq k}}^{F} (E)$ for all $0 \leq k < n$ .

If $k = 0$ , the result is trivial, as $\prod_{i = 0}^{0} {poly}_{C_{i}}^{F} (E) = {poly}_{C_{0}}^{F} (E) = {poly}_{C_{\leq 0}}^{F} (E)$ .

For $k > 0$ , observe that $C_{k}$ and $C_{\leq k - 1}$ are disjoint, and that $E = χ_{C_{k}}^{F} (E, E)$ , thus by Proposition 27, we have ${poly}_{C_{k}}^{F} (E) \cdot {poly}_{C_{\leq k - 1}}^{F} (E) = {poly}_{C_{\leq k}}^{F} (E)$ . Thus, by induction, we get $\prod_{i = 0}^{k} {poly}_{C_{i}}^{F} (E) = {poly}_{C_{\leq k}}^{F} (E)$ .

In the case where $k = n - 1$ , this gives that $\prod_{C \in {Irr}^{F} (E)} {poly}_{C}^{F} (E) = {poly}_{B}^{F} (E) = Q_{E}^{F}$ . $□$

Proposition 31. Let $F = (S, B)$ be a finite factored set, and let $E$ be a nonempty subset of $S$ . Then ${poly}_{C}^{F} (E)$ is irreducible for all $C \in {Irr}^{F} (E)$ .

Proof. Let $F = (S, B)$ be a finite factored set, let $E$ be a nonempty subset of $S$ , and let $C \in {Irr}^{F} (E)$ .

Assume for the purpose of contradiction that $p_{0} \cdot p_{1} = {poly}_{C}^{F} (E)$ , and that both $p_{0}$ and $p_{1}$ have nonempty support. By Proposition 30, ${poly}_{C}^{F} (E)$ divides $Q_{E}^{F}$ , so each $p_{i}$ divides $Q_{E}^{F}$ . Thus, by Proposition 28, we have that $p_{i} = r_{i} \cdot {poly}_{C_{i}}^{F} (E)$ , for some $r_{0}, r_{1} \in R$ , and $C_{0}, C_{1} \subseteq B$ .

We will first need to show that $C_{0}$ and $C_{1}$ are nonempty and disjoint. They must be nonempty, because $p_{0}$ and $p_{1}$ have nonempty support. Assume for the purpose of contradiction that $b \in C_{0} \cap C_{1}$ . Let $s$ be an element of $E$ , and note that for $i \in {0, 1}$ , we have $[s]_{b} \in supp ({poly}_{C_{i}}^{F} (E))$ . Thus $[s]_{b}$ must have degree at least 2 in ${poly}_{C}^{F} (E)$ , which contradicts the fact that every variable has degree at most 1 in ${poly}_{C}^{F} (E)$ .

Next, we need to show that $C_{0} \cup C_{1} = C$ . We already know that

\begin{matrix} supp ({poly}_{C}^{F} (E)) & = supp (r_{0} r_{1} {poly}_{C_{0}}^{F} (E) {poly}_{C_{1}}^{F} (E)) = supp ({poly}_{C_{0}}^{F} (E)) \cup supp ({poly}_{C_{1}}^{F} (E)) . \end{matrix}

Let $s$ be an element of $E$ . Given an arbitrary $b \in B$ , we have that $b \in C$ if and only if $[s]_{b} \in supp ({poly}_{C}^{F} (E))$ if and only if $[s]_{b} \in supp ({poly}_{C_{i}}^{F} (E))$ for some $i \in {0, 1}$ if and only if $b \in C_{0} \cup C_{1}$ .

We now have that $C_{0}$ and $C_{1}$ are disjoint and that $C = C_{0} \cup C_{1}$ . Thus, by Proposition 27, we have that ${poly}_{C_{0}}^{F} (E) \cdot {poly}_{C_{1}}^{F} (E) = {poly}_{C}^{F} (χ_{C_{0}}^{F} (E, E))$ . Thus ${poly}_{C}^{F} (E) = r_{0} r_{1} {poly}_{C}^{F} (χ_{C_{0}}^{F} (E, E))$ , so ${monos}_{C}^{F} (E) = {monos}_{C}^{F} (χ_{C_{0}}^{F} (E, E))$ .

Let $s_{0}, s_{1} \in E$ be arbitrary, and let $s_{2} = χ_{C_{0}}^{F} (s_{0}, s_{1})$ . Note that ${mono}_{C}^{F} (s_{2}) \in {monos}_{C}^{F} (χ_{C_{0}}^{F} (E, E)) = {monos}_{C}^{F} (E)$ , so there is some $s_{3} \in E$ such that ${mono}_{C}^{F} (s_{2}) = {mono}_{C}^{F} (s_{3})$ . Thus $s_{2} \sim_{b} s_{3}$ for all $b \in C$ . However, we also have that $s_{2} \sim_{b} s_{1}$ for all $b \in B ∖ C$ , so $s_{2} = χ_{C}^{F} (s_{3}, s_{1})$ . Since $C \in {Irr}^{F} (E)$ , $χ_{C}^{F} (E, E) = E$ , so $s_{2} = χ_{C_{0}}^{F} (s_{0}, s_{1}) \in E$ . Since $s_{0}$ and $s_{1}$ were arbitrary elements of $E$ , we have that $χ_{C_{0}}^{F} (E, E) = E$ . Since $C_{0}$ is a nonempty strict subset of $C$ , this contradicts the fact that $C \in {Irr}^{F} (E)$ .

Thus, ${poly}_{C}^{F} (E)$ is irreducible for all $C \in {Irr}^{F} (E)$ . $□$
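
To make Propositions 30 and 31 concrete, here is a hypothetical worked example (not taken from the post). Let $S = \{0, 1\}^{3}$ with $B = \{b_{1}, b_{2}, b_{3}\}$ , where $b_{i}$ partitions $S$ by the $i$-th coordinate, and write $x_{j}$ , $y_{j}$ , $z_{j}$ for the parts of $b_{1}$ , $b_{2}$ , $b_{3}$ on which the relevant coordinate equals $j$ . Take $E = \{000, 110, 001, 111\}$ , so that the first two coordinates always agree and the third is free. Checking Definition 35 directly gives ${Irr}^{F} (E) = \{\{b_{1}, b_{2}\}, \{b_{3}\}\}$ , and the factorization reads

\begin{matrix} Q_{E}^{F} & = x_{0} y_{0} z_{0} + x_{1} y_{1} z_{0} + x_{0} y_{0} z_{1} + x_{1} y_{1} z_{1} \\ & = (x_{0} y_{0} + x_{1} y_{1}) (z_{0} + z_{1}) \\ & = {poly}_{\{b_{1}, b_{2}\}}^{F} (E) \cdot {poly}_{\{b_{3}\}}^{F} (E) . \end{matrix}

Neither factor splits further: $z_{0} + z_{1}$ has degree 1, and $x_{0} y_{0} + x_{1} y_{1}$ is not a product of two polynomials of degree 1.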

5.3. Characteristic Polynomials and Orthogonality

We can now give an alternate characterization of conditional orthogonality in terms of divisibility of characteristic polynomials.

Lemma 3. Let $F = (S, B)$ be a finite factored set, and let $X, Y, Z \in Part (S)$ be partitions of $S$ . The following are equivalent.

1. $X ⊥^{F} Y | Z$ .
2. $Q_{z}^{F}$ divides $Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ for all $x \in X$ , $y \in Y$ , and $z \in Z$ .
3. $Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F} = Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ for all $x \in X$ , $y \in Y$ , and $z \in Z$ .

Proof. Clearly condition 3 implies condition 2. We will first show that condition 1 implies condition 3, and then show that condition 2 implies condition 1.

Let $F = (S, B)$ , and let $X, Y, Z \in Part (S)$ satisfy $X ⊥^{F} Y | Z$ . Consider an arbitrary $x \in X$ , $y \in Y$ , and $z \in Z$ . We want to show that $Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F} = Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ .

Let $C = h^{F} (X | z)$ . Clearly $C ⊢^{F} X | z$ . We thus have that $χ_{C}^{F} (z, z) = z$ , so $χ_{B ∖ C}^{F} (z, z) = z$ . We also have that $h^{F} (Y | z) \subseteq B ∖ C$ , so $Y | z \leq_{z} (⋁_{S} (B ∖ C)) | z$ . These two together give that $B ∖ C ⊢^{F} Y | z$ .

Since $C ⊢^{F} X | z$ , we have that $χ_{C}^{F} (x \cap z, z) = x \cap z$ . Thus, by Proposition 27, we have that ${poly}_{C}^{F} (x \cap z) \cdot {poly}_{B ∖ C}^{F} (z) = Q_{x \cap z}^{F}$ . Similarly, since $B ∖ C ⊢^{F} Y | z$ , we have that ${poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (y \cap z) = Q_{y \cap z}^{F}$ .

Since $χ_{C}^{F} (x \cap z, y \cap z) \subseteq χ_{C}^{F} (x \cap z, z) = x \cap z$ , and $χ_{C}^{F} (x \cap z, y \cap z) \subseteq χ_{C}^{F} (z, y \cap z) = y \cap z$ , we have $χ_{C}^{F} (x \cap z, y \cap z) \subseteq x \cap y \cap z$ . We also have that

\begin{matrix} χ_{C}^{F} (x \cap z, y \cap z) & \supseteq χ_{C}^{F} (x \cap y \cap z, x \cap y \cap z) \supseteq x \cap y \cap z . \end{matrix}

Thus $χ_{C}^{F} (x \cap z, y \cap z) = x \cap y \cap z$ .

By Proposition 27, this gives that ${poly}_{C}^{F} (x \cap z) \cdot {poly}_{B ∖ C}^{F} (y \cap z) = Q_{x \cap y \cap z}^{F}$ .

Finally, since $χ_{C}^{F} (z, z) = z$ , we have that ${poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (z) = Q_{z}^{F}$ .

Thus, $Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F}$ and $Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ are both equal to ${poly}_{C}^{F} (x \cap z) \cdot {poly}_{B ∖ C}^{F} (y \cap z) \cdot {poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (z)$ .

Thus, condition 1 implies condition 3. It remains to show that condition 2 implies condition 1.

Fix $F = (S, B)$ and $X, Y, Z \in Part (S)$ , and suppose that $Q_{z}^{F}$ divides $Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ for all $x \in X$ , $y \in Y$ , and $z \in Z$ . Assume for the purpose of contradiction that it is not the case that $X ⊥^{F} Y | Z$ . Thus, there exists some $z \in Z$ such that $h^{F} (X | z) \cap h^{F} (Y | z) \neq {}$ . Let $z \in Z$ and $b \in B$ satisfy $b \in h^{F} (X | z) \cap h^{F} (Y | z)$ .

Let $C \subseteq B$ be such that $b \in C$ and $C \in {Irr}^{F} (z)$ , and let $p = {poly}_{C}^{F} (z)$ . Thus, $p$ is an irreducible factor of $Q_{z}^{F}$ .

Either $p$ divides $Q_{x \cap z}^{F}$ for all $x \in X$ or $p$ divides $Q_{y \cap z}^{F}$ for all $y \in Y$ , since otherwise there would exist an $x \in X$ and a $y \in Y$ such that $p$ divides neither $Q_{x \cap z}^{F}$ nor $Q_{y \cap z}^{F}$ , but does divide their product (because $p$ divides $Q_{z}^{F}$ , which divides that product), contradicting the fact that $p$ is irreducible, and thus prime.

Assume without loss of generality that $p$ divides $Q_{x \cap z}^{F}$ for all $x \in X$ . Fix an $x \in X$ . Let us first restrict attention to the case where $x \cap z$ is nonempty.

Let $Q_{x \cap z}^{F} = p \cdot q$ . By Proposition 28, $p = r_{0} \cdot {poly}_{C_{0}}^{F} (x \cap z)$ and $q = r_{1} \cdot {poly}_{C_{1}}^{F} (x \cap z)$ for some $r_{0}, r_{1} \in R$ and $C_{0}, C_{1} \subseteq B$ . We will show that $C_{0} = C, C_{1} = B ∖ C$ , and $r_{0} = r_{1} = 1$ .

Let $s$ be an element of $x \cap z$ . Then for all $b \in B$ , $b \in C$ if and only if $[s]_{b} \in supp (p)$ if and only if $[s]_{b} \in supp ({poly}_{C_{0}}^{F} (x \cap z))$ if and only if $b \in C_{0}$ . Thus $C_{0} = C$ .

For all $b \in B ∖ C$ , we have $[s]_{b} \in supp (Q_{x \cap z}^{F})$ and $[s]_{b} \notin supp (p)$ , so $[s]_{b} \in supp (q)$ , so $b \in C_{1}$ . Similarly, for all $b \in C_{1}$ , $[s]_{b} \in supp (q)$ , so $[s]_{b} \notin supp (p)$ , so $b \in B ∖ C$ . Thus $C_{1} = B ∖ C$ .

Since $p$ and ${poly}_{C_{0}}^{F} (x \cap z)$ both have all coefficients equal to $1$ , we have $r_{0} = 1$ . Thus, $p = {poly}_{C}^{F} (x \cap z) .$

Similarly, since all the coefficients of $p$ are $1$ and all the coefficients of $Q_{x \cap z}^{F}$ are $1$ , all the coefficients of $q$ are $1$ , so $r_{1} = 1$ . Thus, $q = {poly}_{B ∖ C}^{F} (x \cap z)$ .

We thus have that $Q_{x \cap z}^{F} = {poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (x \cap z)$ .

In the case where $x \cap z$ is empty, we also have $Q_{x \cap z}^{F} = {poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (x \cap z)$ , since both sides are $0$ .

By Proposition 27, $Q_{x \cap z}^{F} = {poly}_{B}^{F} (χ_{C}^{F} (z, x \cap z))$ . Thus, ${monos}_{B}^{F} (x \cap z) = {monos}_{B}^{F} (χ_{C}^{F} (z, x \cap z))$ , so $x \cap z = χ_{C}^{F} (z, x \cap z) = χ_{B ∖ C}^{F} (x \cap z, z)$ .

Since $x \cap z = χ_{B ∖ C}^{F} (x \cap z, z)$ for all $x \in X$ , we have that $B ∖ C ⊢^{F} X | z$ . However, this contradicts the fact that $b \notin B ∖ C$ , and $b \in h^{F} (X | z)$ .

Thus, condition 2 implies condition 1. $□$
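
Condition 3 of Lemma 3 is a finite polynomial identity, so it can be tested directly. The sketch below is a hedged illustration: the toy set $S = \{0, 1\}^{2}$ with one factor per coordinate, the three example partitions, and the helper names `Q`, `mul`, `parts`, and `condition3` are all assumptions made here. It tests only condition 3 and relies on the lemma for the link to orthogonality; it does not compute histories.

```python
from itertools import product
from collections import Counter

# Hypothetical toy factored set: S = {0,1}^2, one factor per coordinate.
S = list(product(range(2), repeat=2))
B = (0, 1)


def Q(E):
    # Characteristic polynomial as {monomial: coefficient}; (i, v) encodes [s]_b.
    return Counter(tuple((i, s[i]) for i in B) for s in E)


def mul(p, q):
    # Product of two polynomials stored as {monomial: coefficient}.
    out = Counter()
    for m0, a in p.items():
        for m1, c in q.items():
            out[tuple(sorted(m0 + m1))] += a * c
    return out


def parts(key):
    # Partition of S induced by the function `key`.
    blocks = {}
    for s in S:
        blocks.setdefault(key(s), set()).add(s)
    return list(blocks.values())


def condition3(X, Y, Z):
    # Q_z * Q_{x∩y∩z} == Q_{x∩z} * Q_{y∩z} for all x in X, y in Y, z in Z.
    return all(mul(Q(z), Q(x & y & z)) == mul(Q(x & z), Q(y & z))
               for x in X for y in Y for z in Z)


first = parts(lambda s: s[0])          # value of coordinate 0
second = parts(lambda s: s[1])         # value of coordinate 1
xor = parts(lambda s: s[0] ^ s[1])     # parity of the two coordinates
trivial = [set(S)]                     # the one-block partition

print(condition3(first, second, trivial))
print(condition3(first, xor, trivial))
print(condition3(first, second, xor))
```

In this toy case the identity holds for the two coordinate partitions given the trivial partition, and fails both for a coordinate against the parity and for the two coordinates conditioned on the parity. By Lemma 3, these correspond respectively to orthogonality holding and failing.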

5.4. Probability Distributions on Finite Factored Sets

The primary purpose of all this discussion of characteristic polynomials has been to build up to thinking about the relationship between orthogonality and probabilistic independence. We will now discuss probability distributions on finite factored sets.

Recall the definition of a probability distribution.

Definition 36. Given a finite set $S$ , a probability distribution on $S$ is a function $P : P (S) \to R$ such that

$P (E) \geq 0$ for all $E \subseteq S$ ,
$P ({}) = 0$ ,
$P (S) = 1$ , and
$P (E_{0} \cup E_{1}) = P (E_{0}) + P (E_{1})$ whenever $E_{0}, E_{1} \subseteq S$ satisfy $E_{0} \cap E_{1} = {}$ .

A probability distribution on a finite factored set $F$ is a probability distribution on its underlying set that also satisfies another condition, which represents the probability distribution coming from a product of distributions on the underlying factors.

Definition 37. Given a finite factored set $F = (S, B)$ , a probability distribution on $F$ is a probability distribution $P$ on $S$ such that for all $s \in S$ , we have $P ({s}) = \prod_{b \in B} P ([s]_{b})$ .

Proposition 32. Given a finite factored set $F = (S, B)$ , a probability distribution $P$ on $S$ is a probability distribution on $F$ if and only if $P (E) = Q_{E}^{F} (P)$ for all $E \subseteq S$ .

Proof. If $P (E) = Q_{E}^{F} (P)$ for all $E \subseteq S$ , in particular this means that $P ({s}) = Q_{{s}}^{F} (P) = (\prod_{b \in B} [s]_{b}) (P) = \prod_{b \in B} P ([s]_{b})$ for all $s \in S$ .

Conversely, if $P ({s}) = \prod_{b \in B} P ([s]_{b})$ for all $s \in S$ , then for all $E \subseteq S, P (E) = \sum_{s \in E} \prod_{b \in B} P ([s]_{b}) = (\sum_{s \in E} \prod_{b \in B} [s]_{b}) (P) = Q_{E}^{F} (P)$ . $□$
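
A small numerical check of Proposition 32, offered as a sketch under stated assumptions: the toy set $S = \{0, 1\}^{2}$ with one factor per coordinate, the made-up marginal weights, and the helper names `P`, `block`, and `Q_at` are hypothetical. Exact rational arithmetic is used so that the equalities are not blurred by floating-point rounding.

```python
from itertools import product
from fractions import Fraction

# Hypothetical toy factored set: S = {0,1}^2, one factor per coordinate.
S = list(product(range(2), repeat=2))
B = (0, 1)

# Assumed marginals: weight[i][v] is the probability that coordinate i equals v.
weight = [{0: Fraction(1, 3), 1: Fraction(2, 3)},
          {0: Fraction(1, 4), 1: Fraction(3, 4)}]


def P(E):
    # Product distribution on S built from the marginals above.
    return sum((weight[0][s[0]] * weight[1][s[1]] for s in E), Fraction(0))


def block(i, s):
    # [s]_b for the factor b given by coordinate i.
    return [t for t in S if t[i] == s[i]]


def Q_at(E, f):
    # Evaluate Q_E at f: sum over s in E of the product over b of f([s]_b).
    total = Fraction(0)
    for s in E:
        term = Fraction(1)
        for i in B:
            term *= f(block(i, s))
        total += term
    return total


E = [(0, 0), (1, 1)]
print(P(E), Q_at(E, P))   # 7/12 7/12
```

Because `P` assigns each factor's blocks their marginal weights, evaluating the characteristic polynomial at `P` reproduces `P(E)` term by term.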

5.5. The Fundamental Theorem of Finite Factored Sets

We are now ready to state and prove the fundamental theorem of finite factored sets.

Theorem 3. Let $F = (S, B)$ be a finite factored set, and let $X, Y, Z \in Part (S)$ be partitions of $S$ . Then $X ⊥^{F} Y | Z$ if and only if for all probability distributions $P$ on $F$ and all $x \in X$ , $y \in Y$ , and $z \in Z$ , we have $P (x \cap z) \cdot P (y \cap z) = P (x \cap y \cap z) \cdot P (z)$ .

Proof. We already have by Lemma 3 that if $X ⊥^{F} Y | Z$ , then for all $x \in X$ , $y \in Y$ , and $z \in Z$ , $Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F} = Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ . Thus for any probability distribution $P$ on $F$ , we have

\begin{matrix} P (z) \cdot P (x \cap y \cap z) & = Q_{z}^{F} (P) \cdot Q_{x \cap y \cap z}^{F} (P) = Q_{x \cap z}^{F} (P) \cdot Q_{y \cap z}^{F} (P) = P (x \cap z) \cdot P (y \cap z) . \end{matrix}

Conversely, assume that for all probability distributions $P$ on $F$ , and all $x \in X$ , $y \in Y$ , and $z \in Z$ , we have $P (x \cap z) \cdot P (y \cap z) = P (x \cap y \cap z) \cdot P (z)$ .

If $S$ is empty, then ${}$ is the unique partition of $S$ , and we have ${} ⊥^{F} {} | {}$ . Thus, we can restrict our attention to the case where $S$ is nonempty.

Fix an arbitrary $x \in X$ , $y \in Y$ , and $z \in Z$ . Let $q = Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F} - Q_{x \cap y \cap z}^{F} \cdot Q_{z}^{F}$ . We will first show that $q (f) = 0$ for all $f : P (S) \to R^{> 0}$ .

Given an arbitrary $f : P (S) \to R^{> 0}$ , we can define $P_{f} : P (S) \to R$ by $P_{f} (E) = Q_{E}^{F} (f) / Q_{S}^{F} (f)$ , and we will show that $P_{f}$ is a distribution on $F$ .

$P_{f}$ is well-defined because $Q_{S}^{F} (f)$ is a nonempty sum of products of positive real numbers, and thus positive. Further, since $Q_{E}^{F} (f)$ is a sum of products of positive real numbers, $P_{f} (E) \geq 0$ for all $E \subseteq S$ . Since $Q_{{}}^{F} = 0$ , we also have $P_{f} ({}) = 0$ . Clearly $P_{f} (S) = 1$ . Finally, for all $E_{0}, E_{1} \subseteq S$ with $E_{0} \cap E_{1} = {}$ , we have

\begin{matrix} P_{f} (E_{0} \cup E_{1}) & = Q_{E_{0} \cup E_{1}}^{F} (f) / Q_{S}^{F} (f) = (Q_{E_{0}}^{F} (f) + Q_{E_{1}}^{F} (f)) / Q_{S}^{F} (f) = P_{f} (E_{0}) + P_{f} (E_{1}) . \end{matrix}

Therefore $P_{f}$ is a distribution on $S$ . We still need to show that $P_{f}$ is a distribution on $F$ .

Observe that for all $s \in S$ and $b \in B$ , since $χ_{{b}}^{F} ([s]_{b}, S) = [s]_{b}$ , Propositions 26 and 27 give that $Q_{[s]_{b}}^{F} = {poly}_{{b}}^{F} ([s]_{b}) \cdot {poly}_{B ∖ {b}}^{F} (S)$ , and since $χ_{{b}}^{F} (S, S) = S$ , we have that $Q_{S}^{F} = {poly}_{{b}}^{F} (S) \cdot {poly}_{B ∖ {b}}^{F} (S)$ . Thus, evaluating at $f$ , we have that

\begin{matrix} P_{f} ([s]_{b}) & = {poly}_{{b}}^{F} ([s]_{b}) (f) / {poly}_{{b}}^{F} (S) (f) = f ([s]_{b}) / {poly}_{{b}}^{F} (S) (f) . \end{matrix}

Thus, for all $s \in S$ ,

\begin{matrix} \prod_{b \in B} P_{f} ([s]_{b}) & = (\prod_{b \in B} f ([s]_{b})) / (\prod_{b \in B} {poly}_{{b}}^{F} (S) (f)) = Q_{{s}}^{F} (f) / Q_{S}^{F} (f) = P_{f} ({s}) . \end{matrix}

Thus $P_{f}$ is a distribution on $F$ .

It follows that $P_{f} (x \cap z) \cdot P_{f} (y \cap z) = P_{f} (x \cap y \cap z) \cdot P_{f} (z)$ . We therefore have that

\begin{matrix} q (f) & = Q_{x \cap z}^{F} (f) \cdot Q_{y \cap z}^{F} (f) - Q_{x \cap y \cap z}^{F} (f) \cdot Q_{z}^{F} (f) = (P_{f} (x \cap z) \cdot P_{f} (y \cap z) - P_{f} (x \cap y \cap z) \cdot P_{f} (z)) \cdot Q_{S}^{F} (f)^{2} = 0 \cdot Q_{S}^{F} (f)^{2} = 0. \end{matrix}

Thus, $q$ is a polynomial that is zero on a nonempty open subset of inputs, so $q$ is the zero polynomial. Thus $Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F} - Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F} = 0$ , so $Q_{z}^{F} \cdot Q_{x \cap y \cap z}^{F} = Q_{x \cap z}^{F} \cdot Q_{y \cap z}^{F}$ . Since $x \in X$ , $y \in Y$ , and $z \in Z$ were arbitrary, by Lemma 3, we have $X ⊥^{F} Y | Z$ . $□$
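
The construction of $P_{f}$ used in the proof can be tried out on a small case. This is a minimal sketch, not the post's own code: the toy set $S = \{0, 1\}^{2}$ with one factor per coordinate, the arbitrary positive values chosen for `f`, and the helper names `Q_at_f`, `P_f`, and `block` are assumptions. Only the values of $f$ on blocks $[s]_{b}$ affect $Q_{E}^{F} (f)$ , so `f` is stored per (factor, value) pair.

```python
from itertools import product
from fractions import Fraction

# Hypothetical toy factored set: S = {0,1}^2, one factor per coordinate.
S = list(product(range(2), repeat=2))
B = (0, 1)

# An arbitrary positive assignment f on the blocks; these numbers are made up.
f = {(0, 0): Fraction(2), (0, 1): Fraction(5),
     (1, 0): Fraction(1), (1, 1): Fraction(3)}


def Q_at_f(E):
    # Q_E(f) = sum over s in E of the product over b of f([s]_b).
    total = Fraction(0)
    for s in E:
        term = Fraction(1)
        for i in B:
            term *= f[(i, s[i])]
        total += term
    return total


def P_f(E):
    # P_f(E) = Q_E(f) / Q_S(f), as in the proof of Theorem 3.
    return Q_at_f(E) / Q_at_f(S)


def block(i, s):
    # [s]_b for the factor b given by coordinate i.
    return [t for t in S if t[i] == s[i]]


# Definition 37: P_f({s}) equals the product of P_f([s]_b) over the factors b.
factorizes = all(P_f([s]) == P_f(block(0, s)) * P_f(block(1, s)) for s in S)

# Independence of the two coordinate partitions, with Z trivial (so z = S).
x, y = block(0, (0, 0)), block(1, (0, 0))
xy = [s for s in x if s in y]
independent = P_f(x) * P_f(y) == P_f(xy) * P_f(S)
print(factorizes, independent)
```

Although `f` itself is not normalized, dividing by $Q_{S}^{F} (f)$ yields a distribution on $F$ , which is the step the proof uses to pass from distributions to arbitrary positive inputs.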

In the next two posts, we will introduce temporal inference using finite factored sets, and discuss future potential research directions.

Diffractor (4y):

In the proof of Lemma 3, it should be

"Finally, since $χ_{C}^{F} (z, z) = z$ , we have that ${poly}_{C}^{F} (z) \cdot {poly}_{B ∖ C}^{F} (z) = Q_{z}^{F}$ ."

Scott Garrabrant (4y):

Fixed, thanks.
