OpenGrm NGram Library Version 1.3.16 is now available for download. NGram is now available on conda-forge. This allows Linux (x86) and Mac OS X users who already...
Specialty operators This describes specialty FST functions for grammar compilation. fst::Cross The cross product operation generates a transducer from two acceptors...
Pynini: Finite-state grammar development in Python Version 2.1.6.post1 is now available for download. Pynini is now available on conda-forge. Users who already...
Baum-Welch expectation maximization library Version 0.3.9 is now available for download. Baum-Welch is now available on conda-forge. Linux (x86) and Mac OS X users...
Baum-Welch documentation Training Training is performed using the Train function. It takes as arguments a FAR or WFST representing the plaintext, and channel model...
OpenGrm Thrax Grammar Development Tools Version 1.3.9 is now available for download. Thrax is now available on conda-forge. Linux (x86) and Mac OS X users who already...
SFst: Stochastic Finite State Transducer Library OpenGrm SFst version 1.2.1 is now available for download. SFst is a library for normalizing, sampling, combining...
String (de)compilation This directory contains functions useful for mapping strings into FSAs (compilation) and for mapping string FSTs onto strings (printing...
OpenGrm NGram Library Quick Tour This tour is organized around the stages of n-gram model creation, modification and use: corpus I/O (ngramsymbols, farcompilestrings...
NGramSymbols Description Command line utility to produce a symbol table from an input text corpus. Creates a symbol entry for every type in the corpus, as well as...
NGramRead Description It has flags for specifying the format of the text input, currently one of two options: By default, the text file is read as a sorted...
FST optimization There are several ways to "optimize" a weighted finite-state transducer (WFST). The C++ template function fst::Optimize underlies Pynini's optimize...
OpenFst Quick Tour Below is a brief tutorial on the OpenGrm SFST library based on a running example. We use the command line SFST utilities for this; we could have...
OpenGrm SFst Available Operations The following operations are provided for SFSTs. Care must be taken that the input FSTs meet the specified requirements (e.g. canonical...
OpenGrm SFst Background Material The following is provided as background reading about stochastic finite state transducers and related material. For material...
OpenGrm Libraries OpenGrm is a collection of open source libraries for constructing, combining, applying and searching formal grammars and related representations...
Extended libraries The following are additional free libraries built using Pynini: pynini rewrite: Python module for applying rewrite rules edit transducer...
OpenGrm SFST Glossary backoff-complete FST: a canonical FST for which each state s that has a failure transition to a state s' and another transition with...
OpenGrm Advanced Usage Below are a variety of topics covered in greater depth or of more specialized interest than found in the Quick Tour. Reading the Quick...
OpenGrm Thrax Grammar Compiler The OpenGrm Thrax Grammar Compiler is a set of tools for compiling grammars expressed as regular expressions and context-dependent...
Path iteration This describes classes for iterating over paths in an FST. fst::PathIterator This template class provides a basic iterator over paths. It is constructed...
Thrax Release 0.1 (Alpha version.) Thrax Release 1.0 Removed dependency on ICU for UTF8 string parsing: --with-icu configuration flag no longer needed and...
OpenGrm SFst COPYING Licensed under the Apache License, Version 2.0 (the "License"); you may not use these files except in compliance with the License. You may obtain...
OpenGrm NGram README OpenGrm NGram Release 1.3 The OpenGrm NGram library is used for making and modifying n-gram language models encoded as weighted finite-state...
Thrax README Thrax Release 1.2 Thrax is a toolkit for compiling grammars based on regular expressions and context-dependent rewrite rules into weighted finite-state...
Known Bugs Temporary bug in Thrax 1.2.2 where AssertNull and AssertEmpty are not being properly registered. This will get fixed soon, but in the meantime as a...
NGramMarginal Description (Available in versions 1.1.0 and higher.) This operation re-estimates smoothed n-gram models by imposing marginalization constraints...
Thrax COPYING Licensed under the Apache License, Version 2.0 (the "License"); you may not use these files except in compliance with the License. You may obtain a copy...
OpenGrm NGram COPYING Licensed under the Apache License, Version 2.0 (the "License"); you may not use these files except in compliance with the License. You may obtain...
NGramShrink Description This operation shrinks or prunes an n-gram language model in one of three ways: count pruning: prunes based on count cutoffs for...
NGramMake Description This operation produces a smoothed, normalized language model from an input n-gram count FST. It smooths the model in one of six ways: witten...
NGramCount Description This utility counts n-grams from an input FST archive. This produces a count FST with the same topology as the eventual normalized model,...
NGramMerge Description This operation merges two n-gram language models or two n-gram count FSTs. The operation provides options for weighting the two input FSTs...