Skip to content

Latest commit

 

History

History
39 lines (23 loc) · 1.04 KB

README.md

File metadata and controls

39 lines (23 loc) · 1.04 KB

Go Summer

Summarization (sentence extraction) module based on Hidden Markov Models. Written mostly in summer 2013.

Solutions for three classic problems of HMM

  • Evaluation (forward algorithm)
  • Decoding (Viterbi algorithm)
  • Learning (counting)

Resources

  • Implementation fo Hidden Markov Models in C# link.
  • Presentation about Hidden Markov Models link to PDF.

Dependencies

  • Redis client for Go link.

What's new

  • Multiple features for emissions.
  • Integration with Redis DB (model parameters).
  • Go concurrency features for better performance.

Basic usage

Running learning process:

$ ./summer <path to full textfiles> <path to texts summaries>

Summarizing a text:

$ ./summer <path to textfile>

In progress

  • updating model with unsupervised learning (Baum-Welch algorithm)
  • estimating emission distribution (functions instead of slices)