TWiki
>
GRM Web
>
NGramLibrary
>
NGramQuickTour
>
NGramSymbols
(2022-08-06,
KyleGorman
)
(raw view)
E
dit
A
ttach
---+ NGramSymbols ---++ Description Command line utility to produce a symbol table from an input text corpus. Creates a symbol entry for every type in the corpus, as well as for _<epsilon>_ (index 0) and an out-of-vocabulary symbol (last in the symbol table). Command line options _--epsilon_symbol_ and _--OOV_symbol_ permit the specification of the labels wanted for those special symbols. ---++ Usage |<verbatim> ngramsymbols [--options] [in.txt [out.txt]] --epsilon_symbol: type = string, default = <epsilon> --OOV_symbol: type = string, default = <UNK> </verbatim> | | ---++ Examples <verbatim> $ ngramsymbols <earnest.txt >earnest.syms </verbatim> ---++ Caveats
E
dit
|
A
ttach
|
Watch
|
P
rint version
|
H
istory
: r6
<
r5
<
r4
<
r3
<
r2
|
B
acklinks
|
V
iew topic
|
WYSIWYG
|
M
ore topic actions
Topic revision: r6 - 2022-08-06
-
KyleGorman
GRM
Log In
or
Register
GRM Web
Create New Topic
Index
Search
Changes
Notifications
Statistics
Preferences
Webs
Contrib
FST
Forum
GRM
Kernel
Main
Sandbox
TWiki
Main
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback