BAYESIAN LEARNING : Tom Mitchell

Machine Learning, by Tom Mitchell, McGraw-Hill, 1997, pages 154~199

1. INTRODUCTION

2. BAYES THEOREM

(1) An Example

3. BAYES THEOREM AND CONCEPT LEARNING

(1) Brute-Force Bayes Concept Learning

(2) MAP Hypotheses and Consistent Learners

4. MAXIMUM LIKELIHOOD AND LEAST-SQUARED ERROR HYPOTHESES

5. MAXIMUM LIKELIHOOD HYPOTHESES FOR PREDICTING PROBABILITIES

(1) Gradient Search to Maximize Likelihood in a Neural Net

6. MINIMUM DESCRIPTION LENGTH PRINCIPLE

7. BAYES OPTIMAL CLASSIFIER

8. GIBBS ALGORITHM

9. NAIVE BAYES CLASSIFIER

(1) An Illustrative Example

(1.1) ESTIMATING PROBABILITIES

10. AN EXAMPLE : LEARNING TO CLASSIFY TEXT

(1) Experimental Results

11. BAYESIAN BELIEF NETWORKS

(1) Conditional Independence

(2) Representation

(3) Inference

(4) Learning Bayesian Belief Networks

(5) Gradient Ascent Training of Bayesian Networks

(6) Learning the Structure of Bayesian Networks

12. THE EM ALGORITHM

(1) Estimating Means of Gaussians

(2) General Statement of EM Algorithm

(3) Derivation of the Means Algorithm

13. SUMMARY AND FURTHER READING

p154

1. INTRODUCTION

p155

p156

2. BAYES THEOREM

Bayes theorem :

$P(h|D) = \dfrac{P(D|h)\,P(h)}{P(D)}$ (1)

p157

(2)

(3)

(1) An Example
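As a rough illustration of how Bayes theorem turns a prior and a likelihood into a posterior, the sketch below handles two mutually exclusive hypotheses after a positive lab test. The function name `posterior` and the numbers (a 0.8% prior, a 98% true-positive rate, a 3% false-positive rate) are illustrative assumptions, not values recorded in these notes.

```python
def posterior(priors, likelihoods):
    """Return P(h|D) for each hypothesis h via Bayes theorem.

    priors:      dict mapping hypothesis -> P(h)
    likelihoods: dict mapping hypothesis -> P(D|h)
    """
    # Unnormalized posteriors P(D|h) * P(h)
    joint = {h: likelihoods[h] * priors[h] for h in priors}
    # P(D) by the theorem of total probability
    evidence = sum(joint.values())
    return {h: joint[h] / evidence for h in joint}


# Illustrative (assumed) numbers for a positive test result
priors = {"cancer": 0.008, "not_cancer": 0.992}
likelihoods = {"cancer": 0.98, "not_cancer": 0.03}  # P(positive | h)

post = posterior(priors, likelihoods)
h_map = max(post, key=post.get)  # maximum a posteriori hypothesis
```

With these assumed numbers the posterior for `cancer` comes out at about 0.21, so the MAP hypothesis is still `not_cancer` despite the positive test.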

p158

3. BAYES THEOREM AND CONCEPT LEARNING

p159

Table 1

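Product rule : probability $P(A \wedge B)$ of a conjunction of two events A and B

$P(A \wedge B) = P(A|B)\,P(B) = P(B|A)\,P(A)$

Sum rule : probability of a disjunction of two events A and B

$P(A \vee B) = P(A) + P(B) - P(A \wedge B)$

Bayes theorem : the posterior probability $P(h|D)$ of $h$ given $D$

$P(h|D) = \dfrac{P(D|h)\,P(h)}{P(D)}$

Theorem of total probability : if events $A_1, \ldots, A_n$ are mutually exclusive with $\sum_{i=1}^{n} P(A_i) = 1$, then

$P(B) = \sum_{i=1}^{n} P(B|A_i)\,P(A_i)$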

(1) Brute-Force Bayes Concept Learning

(i.e., )

BRUTE-FORCE MAP LEARNING algorithm

1.

2.
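The two steps of brute-force MAP learning, as usually stated, are to compute the posterior of every hypothesis and then output the one with the highest posterior. A minimal sketch of that idea follows. The threshold hypothesis space, the uniform prior, and the noise-free likelihood (1 if the hypothesis agrees with every example, 0 otherwise) are assumptions chosen for illustration, and all names are hypothetical.

```python
def brute_force_map(hypotheses, prior, likelihood, data):
    """Score every hypothesis with P(D|h) * P(h) and return the best one.

    P(D) is the same for every h, so it only matters for normalizing.
    """
    scores = {name: likelihood(h, data) * prior(name)
              for name, h in hypotheses.items()}
    evidence = sum(scores.values())
    posteriors = {name: s / evidence for name, s in scores.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors


def noise_free_likelihood(h, data):
    # P(D|h) = 1 if h agrees with every labeled example, else 0
    return 1.0 if all(h(x) == label for x, label in data) else 0.0


# Hypothetical hypothesis space: "x >= t" for thresholds t = 0..4
hypotheses = {f"x>={t}": (lambda x, t=t: x >= t) for t in range(5)}


def uniform_prior(name):
    # No hypothesis is preferred a priori: P(h) = 1 / |H|
    return 1.0 / len(hypotheses)


data = [(1, False), (3, True), (4, True)]
best, posteriors = brute_force_map(hypotheses, uniform_prior,
                                   noise_free_likelihood, data)
```

In this toy run only `x>=2` and `x>=3` are consistent with the data, so each receives posterior 0.5 and every inconsistent hypothesis receives 0. The cost is one likelihood evaluation per hypothesis, which is why the approach is only practical for small hypothesis spaces.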

p160

1.

2.

3.

$P(h) = \dfrac{1}{|H|}$ for all $h$ in $H$

p161

$P(h|D) = \dfrac{0 \cdot P(h)}{P(D)} = 0$ if $h$ is inconsistent with $D$

(5)

p162

(2) MAP Hypotheses and Consistent Learners

Figure 1

p163

p164

4. MAXIMUM LIKELIHOOD AND LEAST-SQUARED ERROR HYPOTHESES

Figure 2

p165

Probability density function: $p(x_0) \equiv \lim_{\epsilon \to 0} \dfrac{1}{\epsilon} P(x_0 \le x < x_0 + \epsilon)$

p166

(6)

p167

5. MAXIMUM LIKELIHOOD HYPOTHESES FOR PREDICTING PROBABILITIES

p168

(7)

p169

(8)

(9)

(10)

(11)

(12)

p170

(13)

(1) Gradient Search to Maximize Likelihood in a Neural Net

(14)

th th

p171

(15)

(16)

6. MINIMUM DESCRIPTION LENGTH PRINCIPLE

p172

(16)

p173

(17)

p174

7. BAYES OPTIMAL CLASSIFIER

p175

(18)
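The Bayes optimal classification of a new instance is the value $v$ that maximizes $\sum_{h} P(v|h)\,P(h|D)$. The sketch below shows one way to compute it for a small hypothesis set. The posteriors, the votes, and the helper names are assumptions made for illustration only.

```python
def bayes_optimal(values, posteriors, predict_proba):
    """Return argmax_v sum_h P(v|h) * P(h|D), plus the per-value scores."""
    scores = {
        v: sum(predict_proba(h, v) * p for h, p in posteriors.items())
        for v in values
    }
    return max(scores, key=scores.get), scores


# Assumed posteriors P(h|D) and deterministic votes for one new instance
posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}
votes = {"h1": "+", "h2": "-", "h3": "-"}


def predict_proba(h, v):
    # P(v|h): each hypothesis here predicts a single label with certainty
    return 1.0 if votes[h] == v else 0.0


label, scores = bayes_optimal(["+", "-"], posteriors, predict_proba)
```

Here the single most probable hypothesis `h1` votes `+`, yet the posterior-weighted vote favors `-` (0.6 against 0.4), which is the case where the Bayes optimal classification differs from the MAP hypothesis's prediction.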

p176

8. GIBBS ALGORITHM

1.

2.
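The two steps of the Gibbs algorithm, as usually stated, are to draw one hypothesis at random according to the posterior distribution over H and then use that hypothesis to classify the next instance. A minimal sketch, assuming the posteriors are already available as a dictionary. The hypothesis names, the posteriors, and the `rng` parameter are illustrative assumptions.

```python
import random


def gibbs_classify(posteriors, hypotheses, x, rng=random):
    """Draw one hypothesis according to P(h|D) and use it to classify x."""
    names = list(posteriors)
    weights = [posteriors[n] for n in names]
    # Step 1: sample a single hypothesis in proportion to its posterior
    chosen = rng.choices(names, weights=weights, k=1)[0]
    # Step 2: classify the instance with the sampled hypothesis only
    return hypotheses[chosen](x), chosen


# Hypothetical threshold hypotheses with assumed posteriors P(h|D)
hypotheses = {
    "x>=2": lambda x: x >= 2,
    "x>=3": lambda x: x >= 3,
}
posteriors = {"x>=2": 0.5, "x>=3": 0.5}

label, chosen = gibbs_classify(posteriors, hypotheses, x=4,
                               rng=random.Random(0))
```

Unlike the Bayes optimal classifier, this evaluates only one hypothesis per query, trading some accuracy for much lower cost.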

p177

9. NAIVE BAYES CLASSIFIER

(19)

(20)

p178

(1) An Illustrative Example

(21)
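The naive Bayes classifier picks the target value $v$ that maximizes $P(v) \prod_i P(a_i|v)$, with each probability estimated from training-set frequencies. The sketch below is one minimal way to do this for categorical attributes. The tiny weather-style data set and all function names are hypothetical and are not taken from these notes.

```python
from collections import Counter


def train_naive_bayes(examples):
    """examples: list of (attribute_tuple, label). Returns frequency tables."""
    class_counts = Counter(label for _, label in examples)
    # attr_counts[(i, value, label)] = number of examples of class `label`
    # whose i-th attribute equals `value`
    attr_counts = Counter()
    for attrs, label in examples:
        for i, value in enumerate(attrs):
            attr_counts[(i, value, label)] += 1
    return class_counts, attr_counts


def classify(attrs, class_counts, attr_counts):
    """Return argmax_v P(v) * prod_i P(a_i|v) using raw relative frequencies."""
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        score = n / total                      # P(v)
        for i, value in enumerate(attrs):
            score *= attr_counts[(i, value, label)] / n   # P(a_i | v)
        scores[label] = score
    return max(scores, key=scores.get), scores


# Hypothetical training data: (outlook, wind) -> play?
examples = [
    (("sunny", "strong"), "no"),
    (("sunny", "weak"), "no"),
    (("rain", "weak"), "yes"),
    (("overcast", "weak"), "yes"),
    (("rain", "strong"), "no"),
    (("overcast", "strong"), "yes"),
]
class_counts, attr_counts = train_naive_bayes(examples)
label, scores = classify(("rain", "weak"), class_counts, attr_counts)
```

The scores are unnormalized. Dividing each by their sum gives the conditional probability of each label under the independence assumption. Raw frequencies yield a zero score whenever an attribute value never co-occurs with a class, which is the problem smoothed estimates address.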

p179

(1.1) ESTIMATING PROBABILITIES

(22)

p180

m-estimate
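The m-estimate of probability is commonly written as $(n_c + m\,p) / (n + m)$, where $n_c$ of $n$ examples match, $p$ is a prior estimate, and $m$ is the equivalent sample size. A short sketch with made-up counts follows. The function name and the numbers are assumptions for illustration.

```python
def m_estimate(n_c, n, p, m):
    """m-estimate of probability: (n_c + m*p) / (n + m).

    n_c: count of matching examples      n: total examples of the class
    p:   prior estimate of the probability
    m:   equivalent sample size (weight given to the prior)
    """
    return (n_c + m * p) / (n + m)


# An attribute value never seen with this class in 5 examples
raw = 0 / 5                                      # relative frequency
smoothed = m_estimate(n_c=0, n=5, p=1 / 3, m=3)  # uniform prior over 3 values
```

Setting `m=0` recovers the plain relative frequency `n_c / n`, while larger `m` pulls the estimate toward the prior `p`.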

10. AN EXAMPLE : LEARNING TO CLASSIFY TEXT

p181

th

p182

(1) Experimental Results

p183

1.

2.

p184

11. BAYESIAN BELIEF NETWORKS

p185

(1) Conditional Independence

(23)

(24)

p186

Figure 3

(2) Representation

p187

(3) Inference

p188

(4) Learning Bayesian Belief Networks

(5) Gradient Ascent Training of Bayesian Networks

(25)

p189

p190

(26)

(6) Learning the Structure of Bayesian Networks

p191

12. THE EM ALGORITHM

(1) Estimating Means of Gaussians

p192

Figure 4

(27)

(28)

th

p193

Step 1 :

Step 2 :

th
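The two EM steps for estimating Gaussian means can be sketched as follows: Step 1 computes the expected membership of each point in each Gaussian under the current means, and Step 2 replaces each mean with the membership-weighted average of the data. The sketch assumes equal mixing weights and a known shared standard deviation. The synthetic data, the starting means, and the function name `em_means` are illustrative assumptions.

```python
import math
import random


def em_means(xs, mu, sigma=1.0, iterations=50):
    """EM for the means of k equal-weight Gaussians with known, shared sigma."""
    mu = list(mu)
    for _ in range(iterations):
        # Step 1 (E): expected membership E[z_ij] of point i in Gaussian j
        resp = []
        for x in xs:
            w = [math.exp(-((x - m) ** 2) / (2 * sigma ** 2)) for m in mu]
            total = sum(w)
            resp.append([wj / total for wj in w])
        # Step 2 (M): each mean becomes the membership-weighted average
        mu = [
            sum(r[j] * x for r, x in zip(resp, xs)) / sum(r[j] for r in resp)
            for j in range(len(mu))
        ]
    return mu


# Synthetic data drawn from two Gaussians with assumed true means 0 and 6
rng = random.Random(0)
xs = [rng.gauss(0.0, 1.0) for _ in range(200)] + \
     [rng.gauss(6.0, 1.0) for _ in range(200)]
estimated = sorted(em_means(xs, mu=[1.0, 5.0]))
```

EM only finds a local maximum of the likelihood, so the result can depend on the starting means. The sketch also does not guard against numerical underflow when a point lies very far from every current mean.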

p194

(2) General Statement of EM Algorithm

p195

Step 1 :

Step 2 :

(3) Derivation of the Means Algorithm

p196

(29)

(30)

(31)

p197

13. SUMMARY AND FURTHER READING

p198

EXERCISES

1. ¬cancer

2.

3. (a)

(b)

(c)

5.

(c)