This operation randomly generates a set of successful sentences in the input FST and outputs them in finite-state archive format. Backoff transitions, represented as epsilon transitions, are correctly interpreted as failure transitions, which distinguishes this operation from OpenFst RandGen (with the LogProbArcSelector). The (maximum) number of sentences generated, their maximum length, and the random seed can all be specified as options.


ngramrandgen [--opts] [[in.fst [out.far]]
   --max_sents: type = int64, default = 1
   --max_length: type = int, default = INT_MAX
   --seed: type = int, default = time()


$ ngramrandgen in.mod >out.far



Topic revision: r5 - 2012-02-23 - BrianRoark
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2017 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback