A
C
D
E
F
G
H
I
L
M
N
P
R
S
T
W
A
add(char[])
- Method in class monad.mmseg.
CharNode.KeyTree
addChunk(Chunk)
- Method in class monad.mmseg.rule.
LargestAvgLenRule
addChunk(Chunk)
- Method in class monad.mmseg.rule.
LargestSumDegreeFreedomRule
addChunk(Chunk)
- Method in class monad.mmseg.rule.
MaxMatchRule
addChunk(Chunk)
- Method in class monad.mmseg.rule.
Rule
Adds a chunk.
addChunk(Chunk)
- Method in class monad.mmseg.rule.
SmallestVarianceRule
addChunks(List<Chunk>)
- Method in class monad.mmseg.rule.
Rule
addOffset(int)
- Method in class monad.mmseg.
Sentence
addWordTail(char[])
- Method in class monad.mmseg.
CharNode
C
CharNode
- Class in
monad.mmseg
All words are recorded under the node of their first character.
CharNode()
- Constructor for class monad.mmseg.
CharNode
CharNode.KeyTree
- Class in
monad.mmseg
CharNode.KeyTree()
- Constructor for class monad.mmseg.
CharNode.KeyTree
Chunk
- Class in
monad.mmseg
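A key concept in the MMSeg segmentation algorithm. A Chunk contains a group of words segmented according to context, together with their related attributes: length (Length), average length (Average Length), the square of the standard deviation (Variance), and degree of morphemic freedom (Degree Of Morphemic Freedom).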
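The first three attributes can be illustrated with a minimal sketch that works on word lengths only. `ChunkStats` and its methods are hypothetical names, not part of monad.mmseg, and the sketch assumes a non-empty chunk and the population variance (dividing by the number of words); the library's own `Chunk` may compute these differently.

```java
import java.util.Arrays;

/** Hypothetical helper, not part of monad.mmseg: chunk statistics from word lengths. */
final class ChunkStats {
    /** Length: the sum of the lengths of all words in the chunk. */
    static int length(int[] wordLens) {
        return Arrays.stream(wordLens).sum();
    }

    /** Average length: total length divided by the number of words. */
    static double averageLength(int[] wordLens) {
        return (double) length(wordLens) / wordLens.length;
    }

    /** Variance: mean squared deviation of each word length from the average. */
    static double variance(int[] wordLens) {
        double avg = averageLength(wordLens);
        double sum = 0;
        for (int len : wordLens) {
            sum += (len - avg) * (len - avg);
        }
        return sum / wordLens.length;
    }

    public static void main(String[] args) {
        int[] lens = {1, 2, 3}; // a chunk of three words with 1, 2 and 3 characters
        System.out.println(length(lens));        // prints 6
        System.out.println(averageLength(lens)); // prints 2.0
        System.out.println(variance(lens));
    }
}
```

A chunk whose words all have the same length has a variance of zero, which is why a smallest-variance rule favors evenly sized words.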
Chunk()
- Constructor for class monad.mmseg.
Chunk
chunks
- Variable in class monad.mmseg.rule.
Rule
clear(String)
- Static method in class monad.mmseg.
Dictionary
clear(File)
- Static method in class monad.mmseg.
Dictionary
Removes the instance from the singleton cache.
close()
- Method in class monad.mmseg.analysis.
CutLetterDigitFilter
Complex
- Class in
monad.mmseg.example
Complex()
- Constructor for class monad.mmseg.example.
Complex
ComplexAnalyzer
- Class in
monad.mmseg.analysis
The mmseg complex analyzer.
ComplexAnalyzer()
- Constructor for class monad.mmseg.analysis.
ComplexAnalyzer
ComplexAnalyzer(String)
- Constructor for class monad.mmseg.analysis.
ComplexAnalyzer
ComplexAnalyzer(Dictionary)
- Constructor for class monad.mmseg.analysis.
ComplexAnalyzer
ComplexAnalyzer(File)
- Constructor for class monad.mmseg.analysis.
ComplexAnalyzer
ComplexSeg
- Class in
monad.mmseg
Segmentation by forward maximum matching plus four filtering rules.
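How a chain of filtering rules narrows down candidate chunks can be sketched as follows. This is an illustration under assumptions, not the ComplexSeg implementation: `RuleChain` is a hypothetical class, chunks are reduced to arrays of word lengths, the rule order (maximum matching, largest average word length, smallest variance) follows the published MMSeg algorithm rather than anything stated on this page, and the fourth rule (largest sum of degree of morphemic freedom) is left out because it needs character frequencies.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.ToDoubleFunction;

/** Hypothetical sketch, not part of monad.mmseg: filters candidate chunks rule by rule. */
final class RuleChain {
    /** Keeps only the chunks whose score equals the best (largest) score. */
    static List<int[]> keepBest(List<int[]> chunks, ToDoubleFunction<int[]> score) {
        double best = Double.NEGATIVE_INFINITY;
        for (int[] c : chunks) {
            best = Math.max(best, score.applyAsDouble(c));
        }
        List<int[]> kept = new ArrayList<>();
        for (int[] c : chunks) {
            if (score.applyAsDouble(c) == best) {
                kept.add(c);
            }
        }
        return kept;
    }

    static double sum(int[] lens) {
        double s = 0;
        for (int l : lens) s += l;
        return s;
    }

    static double avg(int[] lens) {
        return sum(lens) / lens.length;
    }

    static double variance(int[] lens) {
        double a = avg(lens);
        double s = 0;
        for (int l : lens) s += (l - a) * (l - a);
        return s / lens.length;
    }

    /** Applies the rules in order and stops as soon as a single chunk remains. */
    static List<int[]> filter(List<int[]> chunks) {
        List<ToDoubleFunction<int[]>> rules = new ArrayList<>();
        rules.add(RuleChain::sum);        // maximum matching: largest total length
        rules.add(RuleChain::avg);        // largest average word length
        rules.add(c -> -variance(c));     // smallest variance (negated so larger is better)
        List<int[]> remaining = chunks;
        for (ToDoubleFunction<int[]> rule : rules) {
            if (remaining.size() <= 1) break;
            remaining = keepBest(remaining, rule);
        }
        return remaining;
    }

    public static void main(String[] args) {
        List<int[]> chunks = new ArrayList<>();
        chunks.add(new int[]{3, 1, 2});
        chunks.add(new int[]{2, 2, 2});
        chunks.add(new int[]{2, 2});
        for (int[] c : filter(chunks)) {
            System.out.println(Arrays.toString(c)); // prints [2, 2, 2]
        }
    }
}
```

In the example, the first rule drops the chunk of total length 4, the second rule cannot separate the two remaining chunks (both average 2.0), and the variance rule picks the evenly split one.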
ComplexSeg(Dictionary)
- Constructor for class monad.mmseg.
ComplexSeg
createComponents(String)
- Method in class monad.mmseg.analysis.
MMSegAnalyzer
createComponents(String)
- Method in class monad.mmseg.analysis.
NameAnalyzer
CutLetterDigitFilter
- Class in
monad.mmseg.analysis
A filter that splits tokens in which letters and digits are mixed together. For example, mb991ch is split into "mb 991 ch".
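The splitting behavior in the mb991ch example can be reproduced with a small standalone sketch. `LetterDigitSplit` is a hypothetical class that works on plain strings; the real filter operates on a Lucene TokenStream, and its exact character classes are not documented here, so the digit test below is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch, not part of monad.mmseg: splits a token at letter/digit boundaries. */
final class LetterDigitSplit {
    static List<String> split(String token) {
        List<String> parts = new ArrayList<>();
        StringBuilder run = new StringBuilder();
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            // Start a new run whenever the class (digit vs. non-digit) changes.
            if (run.length() > 0
                    && Character.isDigit(c) != Character.isDigit(run.charAt(run.length() - 1))) {
                parts.add(run.toString());
                run.setLength(0);
            }
            run.append(c);
        }
        if (run.length() > 0) {
            parts.add(run.toString());
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(String.join(" ", split("mb991ch"))); // prints mb 991 ch
    }
}
```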
CutLetterDigitFilter(TokenStream)
- Constructor for class monad.mmseg.analysis.
CutLetterDigitFilter
D
dic
- Variable in class monad.mmseg.analysis.
MMSegAnalyzer
dic
- Variable in class monad.mmseg.example.
Complex
dic
- Variable in class monad.mmseg.
Seg
Dictionary
- Class in
monad.mmseg
Dictionary class.
Dictionary.FileLoading
- Interface in
monad.mmseg
E
end()
- Method in class monad.mmseg.analysis.
CutLetterDigitFilter
F
finalize()
- Method in class monad.mmseg.
Dictionary
G
getAvgLen()
- Method in class monad.mmseg.
Chunk
Largest Average Word Length
getCount()
- Method in class monad.mmseg.
Chunk
The number of words, at most 3.
getDefalutPath()
- Static method in class monad.mmseg.
Dictionary
When words.dic is loaded from a jar, the default path may not exist.
getDegree()
- Method in class monad.mmseg.
Word
getDicPath()
- Method in class monad.mmseg.
Dictionary
Note: when words.dic is loaded from a jar, this File may not exist.
getDict()
- Method in class monad.mmseg.analysis.
MMSegAnalyzer
getDict()
- Method in class monad.mmseg.
Dictionary
Only for inspecting the dictionary.
getEndOffset()
- Method in class monad.mmseg.
Word
getFreq()
- Method in class monad.mmseg.
CharNode
getInstance()
- Static method in class monad.mmseg.
Dictionary
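Loads the dictionary files from the default directory.
The default directory is looked up in this order: the directory specified by the system property mmseg.dic.path, then the classpath/data directory, then the user.dir/data directory.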
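The lookup order can be sketched as a simple fallback chain. `DicPathLookup` is a hypothetical class written for illustration; how the real `Dictionary.getInstance()` resolves the classpath entry, or handles a directory that does not exist, is not shown on this page and may differ.

```java
import java.io.File;
import java.net.URISyntaxException;
import java.net.URL;

/** Hypothetical sketch, not part of monad.mmseg: resolves the default dictionary directory. */
final class DicPathLookup {
    static File resolve() {
        // 1. The directory named by the system property mmseg.dic.path, if set.
        String configured = System.getProperty("mmseg.dic.path");
        if (configured != null) {
            return new File(configured);
        }
        // 2. A "data" directory on the classpath (only when it is a plain directory, not in a jar).
        URL url = DicPathLookup.class.getClassLoader().getResource("data");
        if (url != null && "file".equals(url.getProtocol())) {
            try {
                return new File(url.toURI());
            } catch (URISyntaxException e) {
                // Fall through to the next candidate.
            }
        }
        // 3. The "data" directory under the current working directory.
        return new File(System.getProperty("user.dir"), "data");
    }

    public static void main(String[] args) {
        System.out.println(resolve());
    }
}
```

Running with `-Dmmseg.dic.path=/some/dir` makes the first branch win regardless of what is on the classpath.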
getInstance(String)
- Static method in class monad.mmseg.
Dictionary
getInstance(File)
- Static method in class monad.mmseg.
Dictionary
getLastLoadTime()
- Method in class monad.mmseg.
Dictionary
The time the dictionary was last loaded.
getLen()
- Method in class monad.mmseg.
Chunk
Word Length
getLength()
- Method in class monad.mmseg.
Word
getMaxLen()
- Method in class monad.mmseg.
CharNode
getOffset()
- Method in class monad.mmseg.
Sentence
The offset at which processing of the sentence starts.
getSeg()
- Method in class monad.mmseg.example.
Complex
getSeg()
- Method in class monad.mmseg.example.
MaxWord
getSeg()
- Method in class monad.mmseg.example.
Simple
getSen()
- Method in class monad.mmseg.
Word
getStartOffset()
- Method in class monad.mmseg.
Sentence
The offset of the sentence within the text.
getStartOffset()
- Method in class monad.mmseg.
Word
The offset of this word within the whole text.
getString()
- Method in class monad.mmseg.
Word
getSumDegree()
- Method in class monad.mmseg.
Chunk
Sum of Degree of Morphemic Freedom of One-Character
getText()
- Method in class monad.mmseg.
Sentence
getType()
- Method in class monad.mmseg.
Word
getVariance()
- Method in class monad.mmseg.
Chunk
Variance of Word Lengths, that is, the square of the standard deviation.
getWordOffset()
- Method in class monad.mmseg.
Word
The offset of the word within char[] sen.
getWords()
- Method in class monad.mmseg.
Chunk
H
head(char)
- Method in class monad.mmseg.
Dictionary
I
incrementToken()
- Method in class monad.mmseg.analysis.
CutLetterDigitFilter
incrementToken()
- Method in class monad.mmseg.analysis.
MMSegTokenizer
indexOf(char[], int, int)
- Method in class monad.mmseg.
CharNode
isFinish()
- Method in class monad.mmseg.
Sentence
Whether processing of the sentence is finished.
isRemove(Chunk)
- Method in class monad.mmseg.rule.
LargestAvgLenRule
isRemove(Chunk)
- Method in class monad.mmseg.rule.
LargestSumDegreeFreedomRule
isRemove(Chunk)
- Method in class monad.mmseg.rule.
MaxMatchRule
isRemove(Chunk)
- Method in class monad.mmseg.rule.
Rule
Determines whether the chunk should be removed.
isRemove(Chunk)
- Method in class monad.mmseg.rule.
SmallestVarianceRule
isShowChunk()
- Static method in class monad.mmseg.
ComplexSeg
isUnit(Character)
- Method in class monad.mmseg.
Dictionary
isUnit(int)
- Method in class monad.mmseg.
Seg
L
LargestAvgLenRule
- Class in
monad.mmseg.rule
Largest Average Word Length.
Length divided by the number of words.
LargestAvgLenRule()
- Constructor for class monad.mmseg.rule.
LargestAvgLenRule
LargestSumDegreeFreedomRule
- Class in
monad.mmseg.rule
Largest Sum of Degree of Morphemic Freedom of One-Character.
LargestSumDegreeFreedomRule()
- Constructor for class monad.mmseg.rule.
LargestSumDegreeFreedomRule
listWordsFiles()
- Method in class monad.mmseg.
Dictionary
Only files named wordsXXX.dic are listed.
load(InputStream, Dictionary.FileLoading)
- Static method in class monad.mmseg.
Dictionary
Template for loading a word file.
M
main(String[])
- Static method in class monad.mmseg.example.
Complex
main(String[])
- Static method in class monad.mmseg.example.
MaxWord
main(String[])
- Static method in class monad.mmseg.example.
Simple
match(char[], int, int)
- Method in class monad.mmseg.
CharNode.KeyTree
match(String)
- Method in class monad.mmseg.
Dictionary
Whether the word can be found in the dictionary.
maxMatch(char[], int)
- Method in class monad.mmseg.
CharNode.KeyTree
maxMatch(ArrayList<Integer>, char[], int)
- Method in class monad.mmseg.
CharNode.KeyTree
maxMatch(char[], int)
- Method in class monad.mmseg.
CharNode
maxMatch(ArrayList<Integer>, char[], int)
- Method in class monad.mmseg.
CharNode
maxMatch(char[], int)
- Method in class monad.mmseg.
Dictionary
maxMatch(CharNode, char[], int)
- Method in class monad.mmseg.
Dictionary
maxMatch(CharNode, ArrayList<Integer>, char[], int)
- Method in class monad.mmseg.
Dictionary
maxMatch(CharNode[], int, char[], int)
- Method in class monad.mmseg.
Seg
Maximum matching.
Matches starting from chs[offset], and stores the character node of chs[offset] in cns[cnIdx].
maxMatch(CharNode[], int, char[], int, ArrayList<Integer>[], int)
- Method in class monad.mmseg.
Seg
Matches and also finds the lengths.
MaxMatchRule
- Class in
monad.mmseg.rule
Maximum Matching.
The sum of the lengths of all words in the chunk.
MaxMatchRule()
- Constructor for class monad.mmseg.rule.
MaxMatchRule
MaxWord
- Class in
monad.mmseg.example
MaxWord()
- Constructor for class monad.mmseg.example.
MaxWord
MaxWordAnalyzer
- Class in
monad.mmseg.analysis
Max-word segmentation mode.
MaxWordAnalyzer()
- Constructor for class monad.mmseg.analysis.
MaxWordAnalyzer
MaxWordAnalyzer(String)
- Constructor for class monad.mmseg.analysis.
MaxWordAnalyzer
MaxWordAnalyzer(Dictionary)
- Constructor for class monad.mmseg.analysis.
MaxWordAnalyzer
MaxWordAnalyzer(File)
- Constructor for class monad.mmseg.analysis.
MaxWordAnalyzer
MaxWordSeg
- Class in
monad.mmseg
Max-word segmentation.
MaxWordSeg(Dictionary)
- Constructor for class monad.mmseg.
MaxWordSeg
MMSeg
- Class in
monad.mmseg
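Segments a Reader stream (which may contain letters, digits, and so on): extracts the Chinese (actually CJK) text into a Sentence, then segments it with the mmseg algorithm.
Not thread-safe.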
MMSeg(Reader, Seg)
- Constructor for class monad.mmseg.
MMSeg
MMSegAnalyzer
- Class in
monad.mmseg.analysis
Uses max-word by default.
MMSegAnalyzer()
- Constructor for class monad.mmseg.analysis.
MMSegAnalyzer
MMSegAnalyzer(String)
- Constructor for class monad.mmseg.analysis.
MMSegAnalyzer
MMSegAnalyzer(File)
- Constructor for class monad.mmseg.analysis.
MMSegAnalyzer
MMSegAnalyzer(Dictionary)
- Constructor for class monad.mmseg.analysis.
MMSegAnalyzer
MMSegTokenizer
- Class in
monad.mmseg.analysis
MMSegTokenizer(Seg)
- Constructor for class monad.mmseg.analysis.
MMSegTokenizer
monad.mmseg
- package monad.mmseg
monad.mmseg.analysis
- package monad.mmseg.analysis
monad.mmseg.example
- package monad.mmseg.example
monad.mmseg.rule
- package monad.mmseg.rule
N
NameAnalyzer
- Class in
monad.mmseg.analysis
Segmentation algorithm for personal names.
NameAnalyzer()
- Constructor for class monad.mmseg.analysis.
NameAnalyzer
newSeg()
- Method in class monad.mmseg.analysis.
ComplexAnalyzer
newSeg()
- Method in class monad.mmseg.analysis.
MaxWordAnalyzer
newSeg()
- Method in class monad.mmseg.analysis.
MMSegAnalyzer
newSeg()
- Method in class monad.mmseg.analysis.
SimpleAnalyzer
next()
- Method in class monad.mmseg.
MMSeg
nextToken(TokenStream, PackedTokenAttributeImpl)
- Static method in class monad.mmseg.analysis.
TokenUtils
normalizeFile(File)
- Static method in class monad.mmseg.
Dictionary
P
printChunk(List<Chunk>)
- Method in class monad.mmseg.
Seg
Prints the chunks; for debugging.
R
reinit(char[], int)
- Method in class monad.mmseg.
Sentence
reload()
- Method in class monad.mmseg.
Dictionary
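Reloads the dictionary from scratch; rolls back if loading does not succeed.
Note: reloading requires enough memory for two copies of the dictionary tree structure (the default dictionary takes about 50M per copy). Otherwise an OOM is thrown.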
remainChunks()
- Method in class monad.mmseg.rule.
Rule
reset()
- Method in class monad.mmseg.analysis.
CutLetterDigitFilter
reset()
- Method in class monad.mmseg.analysis.
MMSegTokenizer
reset(Reader)
- Method in class monad.mmseg.
MMSeg
reset()
- Method in class monad.mmseg.rule.
LargestAvgLenRule
reset()
- Method in class monad.mmseg.rule.
LargestSumDegreeFreedomRule
reset()
- Method in class monad.mmseg.rule.
MaxMatchRule
reset()
- Method in class monad.mmseg.rule.
Rule
reset()
- Method in class monad.mmseg.rule.
SmallestVarianceRule
row(String, int)
- Method in interface monad.mmseg.
Dictionary.FileLoading
Rule
- Class in
monad.mmseg.rule
Abstract class for filtering rules.
Rule()
- Constructor for class monad.mmseg.rule.
Rule
run(String[])
- Method in class monad.mmseg.example.
Complex
S
search(CharNode, char[], int, int)
- Method in class monad.mmseg.
Dictionary
Whether a word of length tailLen following sen[offset] exists.
search(char[], int, int)
- Method in class monad.mmseg.
Seg
Checks whether the tailLen chars following chs[offset] form a word.
search(CharNode, char[], int, int)
- Method in class monad.mmseg.
Seg
Does not copy the array.
seg(Sentence)
- Method in class monad.mmseg.
ComplexSeg
seg(Sentence)
- Method in class monad.mmseg.
MaxWordSeg
Seg
- Class in
monad.mmseg
Abstract class for segmentation.
Seg(Dictionary)
- Constructor for class monad.mmseg.
Seg
seg(Sentence)
- Method in class monad.mmseg.
Seg
Segments the sentence sen.
seg(Sentence)
- Method in class monad.mmseg.
SimpleSeg
segWords(Reader, String)
- Method in class monad.mmseg.example.
Complex
segWords(String, String)
- Method in class monad.mmseg.example.
Complex
Sentence
- Class in
monad.mmseg
A sentence: a run of consecutive Chinese text cut out of a long string of characters.
Sentence()
- Constructor for class monad.mmseg.
Sentence
Sentence(char[], int)
- Constructor for class monad.mmseg.
Sentence
setDegree(int)
- Method in class monad.mmseg.
Word
setFreq(int)
- Method in class monad.mmseg.
CharNode
setMaxLen(int)
- Method in class monad.mmseg.
CharNode
setOffset(int)
- Method in class monad.mmseg.
Sentence
The offset at which processing of the sentence starts.
setShowChunk(boolean)
- Static method in class monad.mmseg.
ComplexSeg
setStartOffset(int)
- Method in class monad.mmseg.
Sentence
The offset of the sentence within the text.
setType(String)
- Method in class monad.mmseg.
Word
setWords(Word[])
- Method in class monad.mmseg.
Chunk
Simple
- Class in
monad.mmseg.example
Simple()
- Constructor for class monad.mmseg.example.
Simple
SimpleAnalyzer
- Class in
monad.mmseg.analysis
The mmseg simple analyzer.
SimpleAnalyzer()
- Constructor for class monad.mmseg.analysis.
SimpleAnalyzer
SimpleAnalyzer(String)
- Constructor for class monad.mmseg.analysis.
SimpleAnalyzer
SimpleAnalyzer(Dictionary)
- Constructor for class monad.mmseg.analysis.
SimpleAnalyzer
SimpleAnalyzer(File)
- Constructor for class monad.mmseg.analysis.
SimpleAnalyzer
SimpleSeg
- Class in
monad.mmseg
Segmentation by forward maximum matching.
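Forward maximum matching in general can be sketched as below. This is a generic illustration, not the SimpleSeg implementation: `ForwardMaxMatch` is a hypothetical class, the dictionary is a plain `Set<String>` rather than the library's Dictionary tree, and `maxLen` (the longest word to try) is an assumed parameter.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch, not part of monad.mmseg: greedy forward maximum matching. */
final class ForwardMaxMatch {
    static List<String> segment(String text, Set<String> dict, int maxLen) {
        List<String> words = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int end = Math.min(text.length(), i + maxLen);
            // Try the longest candidate first and shrink until a dictionary word matches;
            // a single character is emitted as a word when nothing longer matches.
            while (end > i + 1 && !dict.contains(text.substring(i, end))) {
                end--;
            }
            words.add(text.substring(i, end));
            i = end;
        }
        return words;
    }

    public static void main(String[] args) {
        Set<String> dict = new HashSet<>(Arrays.asList("研究", "研究生", "生命", "起源"));
        System.out.println(segment("研究生命起源", dict, 3)); // prints [研究生, 命, 起源]
    }
}
```

The example also shows the known weakness of the greedy approach: it commits to the longest first word (研究生) and never reconsiders it, which is the kind of ambiguity that chunk-based filtering rules are meant to resolve.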
SimpleSeg(Dictionary)
- Constructor for class monad.mmseg.
SimpleSeg
SmallestVarianceRule
- Class in
monad.mmseg.rule
Smallest Variance of Word Lengths.
The square of the standard deviation.
SmallestVarianceRule()
- Constructor for class monad.mmseg.rule.
SmallestVarianceRule
subToken(PackedTokenAttributeImpl, int, int)
- Static method in class monad.mmseg.analysis.
TokenUtils
T
toFactorString()
- Method in class monad.mmseg.
Chunk
tokenQueue
- Variable in class monad.mmseg.analysis.
CutLetterDigitFilter
TokenUtils
- Class in
monad.mmseg.analysis
In Lucene 3.0, getting a Token from a TokenStream is rather cumbersome.
TokenUtils()
- Constructor for class monad.mmseg.analysis.
TokenUtils
toString()
- Method in class monad.mmseg.
Chunk
toString()
- Method in class monad.mmseg.
Word
TYPE_DIGIT
- Static variable in class monad.mmseg.
Word
TYPE_DIGIT_OR_LETTER
- Static variable in class monad.mmseg.
Word
A "letter or digit" token that starts with a digit.
TYPE_LETTER
- Static variable in class monad.mmseg.
Word
TYPE_LETTER_NUMBER
- Static variable in class monad.mmseg.
Word
TYPE_LETTER_OR_DIGIT
- Static variable in class monad.mmseg.
Word
A "letter or digit" token that starts with a letter.
TYPE_OTHER_NUMBER
- Static variable in class monad.mmseg.
Word
TYPE_WORD
- Static variable in class monad.mmseg.
Word
W
Word
- Class in
monad.mmseg
Similar to a Lucene token.
Word(char[], int)
- Constructor for class monad.mmseg.
Word
Word(char[], int, String)
- Constructor for class monad.mmseg.
Word
Word(char[], int, int, int)
- Constructor for class monad.mmseg.
Word
This word consists of the len characters starting at sen[offset].
wordNum()
- Method in class monad.mmseg.
CharNode
wordsFileIsChange()
- Method in class monad.mmseg.
Dictionary
Whether the dictionary files have been modified.
Copyright © 2015-2016 Ganshane Developer Team. All Rights Reserved.