Python: Advanced Guide to Artificial Intelligence

Direct sampling

With direct sampling, our goal is to approximate the full joint probability through a sequence of samples drawn from each conditional distribution. If we assume that the graph is well-structured (without unnecessary edges) and that we have N variables, indexed so that every variable comes after its predecessors, the algorithm is made up of the following steps:

Initialize the variable N_Samples.
Initialize a vector S with shape (N, N_Samples).
Initialize a frequency vector F_Samples with one counter for each combination of values. In Python, it's better to employ a dictionary whose keys are the combinations (x₁, x₂, x₃, ..., x_N).
For t=1 to N_Samples:
1. For i=1 to N:
  1. Sample from P(X_i|Predecessors(X_i))
  2. Store the sample in S[i, t]
2. If F_Samples contains the sampled tuple S[:, t]:
  1. F_Samples[S[:, t]] += 1
3. Else:
  1. F_Samples[S[:, t]] = 1 (both these operations are immediate with Python dictionaries)
Create a vector P_Sampled with one entry for each combination stored in F_Samples.
Set P_Sampled[i, 0] = F_Samples[i]/N_Samples (the relative frequency of the i-th combination).
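
The steps above can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the three-variable binary network (Rain, Sprinkler, WetGrass) and its conditional probabilities are made up, the variables are listed so that parents precede their children, and the names NETWORK and direct_sampling are hypothetical rather than taken from the text. A Counter plays the role of the F_Samples dictionary, and each sample is kept only long enough to update its count instead of being stored in S.

```python
import random
from collections import Counter

# Hypothetical network, listed so that parents come before children:
#   index 0: Rain, index 1: Sprinkler, index 2: WetGrass
# Each entry is (parent indices, table mapping parent values -> P(X=1 | parents)).
# All probabilities below are invented for illustration.
NETWORK = [
    ((), {(): 0.2}),
    ((0,), {(0,): 0.4, (1,): 0.01}),
    ((0, 1), {(0, 0): 0.0, (0, 1): 0.9, (1, 0): 0.8, (1, 1): 0.99}),
]


def direct_sampling(network, n_samples, seed=None):
    """Estimate the joint probability of every observed combination."""
    rng = random.Random(seed)
    f_samples = Counter()  # frequency of each sampled tuple

    for _ in range(n_samples):
        sample = []
        for parents, table in network:
            # Look up P(X_i = 1 | Predecessors(X_i)) using the values
            # already drawn for the parents in this pass.
            p_one = table[tuple(sample[j] for j in parents)]
            sample.append(1 if rng.random() < p_one else 0)
        f_samples[tuple(sample)] += 1

    # Normalize the counts by the number of samples.
    return {combo: count / n_samples for combo, count in f_samples.items()}


p_sampled = direct_sampling(NETWORK, n_samples=10000, seed=1000)
for combo in sorted(p_sampled):
    print(combo, p_sampled[combo])
```

Because P(WetGrass=1 | Rain=0, Sprinkler=0) is set to 0 in this toy table, the tuple (0, 0, 1) never appears among the keys, and the estimate for (0, 0, 0) should approach 0.8 · 0.6 · 1.0 = 0.48 as n_samples grows.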

From a mathematical viewpoint, we are first creating a frequency vector F_Samples(x₁, x₂, x₃, ..., x_N; N_Samples) and then we approximate the full joint probability considering N_Samples → ∞:
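
Written with the symbols defined above, and assuming that F_Samples counts how many times each tuple has been drawn, the limit can be expressed in LaTeX notation as:

```latex
P(x_1, x_2, \ldots, x_N) = \lim_{N_{Samples} \to \infty} \frac{F_{Samples}(x_1, x_2, \ldots, x_N; N_{Samples})}{N_{Samples}}
```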