Probability is the branch of mathematics concerned with describing the relative likelihood of things. Probabilitic methods have proven fruitful in characterizing patterns in natural language. Here are some starting points for teaching yourself the math.

- One of the best books on the subject of probability and natural language is Foundations of Statistical Natural Language Processing.

- John Goldsmith has a good tutorial called Probability for Linguists that uses examples from natural language.

- The
*Probability Tutorial Using Dice*explains the mathematical concepts using a familiar real world example.

- The
*Maximum Mutual Information Criterion Tutorial*explains how to use mutual information in feature reduction. A sample script shows how this works.

-- BillMcNeill - 21 Jan 2005

I | Attachment | Action | Size | Date | Who | Comment |
---|---|---|---|---|---|---|

mmic.pdf | manage | 35.8 K | 2005-04-13 - 19:08 | UnknownUser | Maximum Mutual Information Criterion Tutorial | |

two_dice.pdf | manage | 57.3 K | 2005-01-25 - 05:42 | UnknownUser | Two dice | |

txt | mmic.py.txt | manage | 5.3 K | 2005-04-12 - 23:46 | UnknownUser | Example script. Save as mmic.py |

Topic revision: r6 - 2005-04-13 - 19:08:07 - BillMcNeill

Copyright &© by the contributing authors. All material on this collaboration platform is the property of the contributing authors.

Ideas, requests, problems regarding TWiki? Send feedback

Privacy Statement Terms & Conditions

Ideas, requests, problems regarding TWiki? Send feedback

Privacy Statement Terms & Conditions