A C D E F G H I L M N P R S T W 

A

add(char[]) - Method in class monad.mmseg.CharNode.KeyTree
 
addChunk(Chunk) - Method in class monad.mmseg.rule.LargestAvgLenRule
 
addChunk(Chunk) - Method in class monad.mmseg.rule.LargestSumDegreeFreedomRule
 
addChunk(Chunk) - Method in class monad.mmseg.rule.MaxMatchRule
 
addChunk(Chunk) - Method in class monad.mmseg.rule.Rule
Adds a chunk.
addChunk(Chunk) - Method in class monad.mmseg.rule.SmallestVarianceRule
 
addChunks(List<Chunk>) - Method in class monad.mmseg.rule.Rule
 
addOffset(int) - Method in class monad.mmseg.Sentence
 
addWordTail(char[]) - Method in class monad.mmseg.CharNode
 

C

CharNode - Class in monad.mmseg
All words are recorded under the node of their first character.
CharNode() - Constructor for class monad.mmseg.CharNode
 
CharNode.KeyTree - Class in monad.mmseg
 
CharNode.KeyTree() - Constructor for class monad.mmseg.CharNode.KeyTree
 
Chunk - Class in monad.mmseg
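A key concept in the MMSeg segmentation algorithm. A Chunk holds a group of words segmented according to context, together with their attributes: length (Length), average length (Average Length), variance, i.e. the square of the standard deviation (Variance), and degree of morphemic freedom (Degree Of Morphemic Freedom).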
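
The chunk attributes described above can be illustrated with a minimal sketch. ChunkStatsSketch is a hypothetical stand-in, not the library's Chunk class, and it takes plain word lengths as input. The variance here is the population variance (mean squared deviation), which is an assumption about the exact formula. The degree of morphemic freedom is left out because it depends on dictionary frequency data.

```java
import java.util.Arrays;

/** Hypothetical stand-in for the statistics a chunk carries; not the library's Chunk. */
final class ChunkStatsSketch {

    /** Length: the sum of the lengths of the words in the chunk. */
    static int length(int[] wordLengths) {
        return Arrays.stream(wordLengths).sum();
    }

    /** Average length: total length divided by the number of words. */
    static double averageLength(int[] wordLengths) {
        return (double) length(wordLengths) / wordLengths.length;
    }

    /** Variance: mean squared deviation of the word lengths (assumed population variance). */
    static double variance(int[] wordLengths) {
        double avg = averageLength(wordLengths);
        double sum = 0;
        for (int len : wordLengths) {
            sum += (len - avg) * (len - avg);
        }
        return sum / wordLengths.length;
    }

    public static void main(String[] args) {
        int[] lengths = {2, 1, 2}; // a chunk of three words
        System.out.println(length(lengths));
        System.out.println(averageLength(lengths));
        System.out.println(variance(lengths));
    }
}
```

For a chunk whose words have lengths 2, 1 and 2, the length is 5, the average length is 5/3 and the variance is 2/9.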
Chunk() - Constructor for class monad.mmseg.Chunk
 
chunks - Variable in class monad.mmseg.rule.Rule
 
clear(String) - Static method in class monad.mmseg.Dictionary
 
clear(File) - Static method in class monad.mmseg.Dictionary
Removes the instance from the singleton cache.
close() - Method in class monad.mmseg.analysis.CutLetterDigitFilter
 
Complex - Class in monad.mmseg.example
 
Complex() - Constructor for class monad.mmseg.example.Complex
 
ComplexAnalyzer - Class in monad.mmseg.analysis
The mmseg complex analyzer.
ComplexAnalyzer() - Constructor for class monad.mmseg.analysis.ComplexAnalyzer
 
ComplexAnalyzer(String) - Constructor for class monad.mmseg.analysis.ComplexAnalyzer
 
ComplexAnalyzer(Dictionary) - Constructor for class monad.mmseg.analysis.ComplexAnalyzer
 
ComplexAnalyzer(File) - Constructor for class monad.mmseg.analysis.ComplexAnalyzer
 
ComplexSeg - Class in monad.mmseg
Segmentation by forward maximum matching plus four filtering rules.
ComplexSeg(Dictionary) - Constructor for class monad.mmseg.ComplexSeg
 
createComponents(String) - Method in class monad.mmseg.analysis.MMSegAnalyzer
 
createComponents(String) - Method in class monad.mmseg.analysis.NameAnalyzer
 
CutLetterDigitFilter - Class in monad.mmseg.analysis
A filter that splits tokens in which letters and digits are mixed. For example, mb991ch is split into "mb 991 ch".
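
The split described above can be sketched in plain Java, without Lucene. LetterDigitSplitSketch and its split method are hypothetical names for illustration. This is not the filter's actual implementation, only the boundary rule it describes: cut wherever a digit meets a non-digit.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration of the letter/digit split; not the Lucene filter itself. */
final class LetterDigitSplitSketch {

    /** Splits a token at every boundary between a digit and a non-digit character. */
    static List<String> split(String token) {
        List<String> parts = new ArrayList<>();
        int start = 0;
        for (int i = 1; i <= token.length(); i++) {
            // A boundary is the end of the token or a change between digit and non-digit.
            boolean boundary = i == token.length()
                    || Character.isDigit(token.charAt(i)) != Character.isDigit(token.charAt(i - 1));
            if (boundary) {
                parts.add(token.substring(start, i));
                start = i;
            }
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("mb991ch")); // prints [mb, 991, ch]
    }
}
```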
CutLetterDigitFilter(TokenStream) - Constructor for class monad.mmseg.analysis.CutLetterDigitFilter
 

D

dic - Variable in class monad.mmseg.analysis.MMSegAnalyzer
 
dic - Variable in class monad.mmseg.example.Complex
 
dic - Variable in class monad.mmseg.Seg
 
Dictionary - Class in monad.mmseg
The dictionary class.
Dictionary.FileLoading - Interface in monad.mmseg
 

E

end() - Method in class monad.mmseg.analysis.CutLetterDigitFilter
 

F

finalize() - Method in class monad.mmseg.Dictionary
 

G

getAvgLen() - Method in class monad.mmseg.Chunk
Largest Average Word Length
getCount() - Method in class monad.mmseg.Chunk
The number of words, at most 3.
getDefalutPath() - Static method in class monad.mmseg.Dictionary
When words.dic is loaded from a jar, the default path may not exist.
getDegree() - Method in class monad.mmseg.Word
 
getDicPath() - Method in class monad.mmseg.Dictionary
Note: when words.dic is loaded from a jar, this File may not exist.
getDict() - Method in class monad.mmseg.analysis.MMSegAnalyzer
 
getDict() - Method in class monad.mmseg.Dictionary
Only for inspecting the dictionary.
getEndOffset() - Method in class monad.mmseg.Word
 
getFreq() - Method in class monad.mmseg.CharNode
 
getInstance() - Static method in class monad.mmseg.Dictionary
Loads the dictionary files from the default directory.

Default directory lookup order: (1) the directory given by the system property mmseg.dic.path; (2) the classpath/data directory; (3) the user.dir/data directory.
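
The lookup order above can be sketched as a small fallback chain. DicPathLookupSketch, resolve and resolveDefault are hypothetical names. The sketch only illustrates the documented order and is not the library's code.

```java
import java.io.File;
import java.net.URL;

/** Hypothetical sketch of the documented lookup order; not the library's implementation. */
final class DicPathLookupSketch {

    /**
     * Picks the dictionary directory: the property value if set, else the
     * classpath "data" resource if present, else the "data" directory under userDir.
     */
    static File resolve(String propertyValue, URL classpathData, String userDir) {
        if (propertyValue != null) {
            return new File(propertyValue);
        }
        if (classpathData != null) {
            // Note: URL.getFile() does not decode percent-escaped characters.
            return new File(classpathData.getFile());
        }
        return new File(userDir, "data");
    }

    /** Applies the same order to the running JVM's property, classpath and user.dir. */
    static File resolveDefault() {
        return resolve(
                System.getProperty("mmseg.dic.path"),
                DicPathLookupSketch.class.getClassLoader().getResource("data"),
                System.getProperty("user.dir"));
    }

    public static void main(String[] args) {
        System.out.println(resolveDefault());
    }
}
```

Passing the three inputs as parameters keeps the order easy to check in isolation. Starting the JVM with -Dmmseg.dic.path=... selects the first branch.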

getInstance(String) - Static method in class monad.mmseg.Dictionary
 
getInstance(File) - Static method in class monad.mmseg.Dictionary
 
getLastLoadTime() - Method in class monad.mmseg.Dictionary
The time the dictionary was last loaded.
getLen() - Method in class monad.mmseg.Chunk
Word Length
getLength() - Method in class monad.mmseg.Word
 
getMaxLen() - Method in class monad.mmseg.CharNode
 
getOffset() - Method in class monad.mmseg.Sentence
The offset at which processing of the sentence starts.
getSeg() - Method in class monad.mmseg.example.Complex
 
getSeg() - Method in class monad.mmseg.example.MaxWord
 
getSeg() - Method in class monad.mmseg.example.Simple
 
getSen() - Method in class monad.mmseg.Word
 
getStartOffset() - Method in class monad.mmseg.Sentence
The offset of the sentence within the text.
getStartOffset() - Method in class monad.mmseg.Word
The offset of this word within the whole text.
getString() - Method in class monad.mmseg.Word
 
getSumDegree() - Method in class monad.mmseg.Chunk
Sum of Degree of Morphemic Freedom of One-Character
getText() - Method in class monad.mmseg.Sentence
 
getType() - Method in class monad.mmseg.Word
 
getVariance() - Method in class monad.mmseg.Chunk
Variance of Word Lengths, i.e. the square of the standard deviation.
getWordOffset() - Method in class monad.mmseg.Word
The offset of the word within char[] sen.
getWords() - Method in class monad.mmseg.Chunk
 

H

head(char) - Method in class monad.mmseg.Dictionary
 

I

incrementToken() - Method in class monad.mmseg.analysis.CutLetterDigitFilter
 
incrementToken() - Method in class monad.mmseg.analysis.MMSegTokenizer
 
indexOf(char[], int, int) - Method in class monad.mmseg.CharNode
 
isFinish() - Method in class monad.mmseg.Sentence
Whether processing of the sentence is finished.
isRemove(Chunk) - Method in class monad.mmseg.rule.LargestAvgLenRule
 
isRemove(Chunk) - Method in class monad.mmseg.rule.LargestSumDegreeFreedomRule
 
isRemove(Chunk) - Method in class monad.mmseg.rule.MaxMatchRule
 
isRemove(Chunk) - Method in class monad.mmseg.rule.Rule
Determines whether the chunk should be removed.
isRemove(Chunk) - Method in class monad.mmseg.rule.SmallestVarianceRule
 
isShowChunk() - Static method in class monad.mmseg.ComplexSeg
 
isUnit(Character) - Method in class monad.mmseg.Dictionary
 
isUnit(int) - Method in class monad.mmseg.Seg
 

L

LargestAvgLenRule - Class in monad.mmseg.rule
Largest Average Word Length.

Length divided by the number of words.

LargestAvgLenRule() - Constructor for class monad.mmseg.rule.LargestAvgLenRule
 
LargestSumDegreeFreedomRule - Class in monad.mmseg.rule
Largest Sum of Degree of Morphemic Freedom of One-Character.
LargestSumDegreeFreedomRule() - Constructor for class monad.mmseg.rule.LargestSumDegreeFreedomRule
 
listWordsFiles() - Method in class monad.mmseg.Dictionary
Lists only files named wordsXXX.dic.
load(InputStream, Dictionary.FileLoading) - Static method in class monad.mmseg.Dictionary
Template for loading a word file.

M

main(String[]) - Static method in class monad.mmseg.example.Complex
 
main(String[]) - Static method in class monad.mmseg.example.MaxWord
 
main(String[]) - Static method in class monad.mmseg.example.Simple
 
match(char[], int, int) - Method in class monad.mmseg.CharNode.KeyTree
 
match(String) - Method in class monad.mmseg.Dictionary
word 能否在词库里找到
maxMatch(char[], int) - Method in class monad.mmseg.CharNode.KeyTree
 
maxMatch(ArrayList<Integer>, char[], int) - Method in class monad.mmseg.CharNode.KeyTree
 
maxMatch(char[], int) - Method in class monad.mmseg.CharNode
 
maxMatch(ArrayList<Integer>, char[], int) - Method in class monad.mmseg.CharNode
 
maxMatch(char[], int) - Method in class monad.mmseg.Dictionary
 
maxMatch(CharNode, char[], int) - Method in class monad.mmseg.Dictionary
 
maxMatch(CharNode, ArrayList<Integer>, char[], int) - Method in class monad.mmseg.Dictionary
 
maxMatch(CharNode[], int, char[], int) - Method in class monad.mmseg.Seg
Maximum matching.
Matches starting at chs[offset] and stores the character node of chs[offset] in cns[cnIdx].
maxMatch(CharNode[], int, char[], int, ArrayList<Integer>[], int) - Method in class monad.mmseg.Seg
Matches and also finds the lengths.
MaxMatchRule - Class in monad.mmseg.rule
Maximum Matching.

The sum of the lengths of the words in the chunk.
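
The maximum matching criterion above can be illustrated as a filter that keeps only the candidate chunks with the greatest total length. MaxLengthFilterSketch is a hypothetical name, and each chunk is represented simply as an array of word lengths. The library's Rule classes use a different interface (addChunk, isRemove, remainChunks).

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of the maximum matching filter; not the library's MaxMatchRule. */
final class MaxLengthFilterSketch {

    /** Sum of the word lengths in one chunk. */
    static int total(int[] wordLengths) {
        int sum = 0;
        for (int len : wordLengths) {
            sum += len;
        }
        return sum;
    }

    /** Keeps the chunks whose total length equals the maximum over all chunks. */
    static List<int[]> keepLongest(List<int[]> chunks) {
        int best = 0;
        for (int[] chunk : chunks) {
            best = Math.max(best, total(chunk));
        }
        List<int[]> kept = new ArrayList<>();
        for (int[] chunk : chunks) {
            if (total(chunk) == best) {
                kept.add(chunk);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<int[]> chunks = new ArrayList<>();
        chunks.add(new int[] {1, 1, 2}); // total 4, removed
        chunks.add(new int[] {2, 1, 2}); // total 5, kept
        chunks.add(new int[] {3, 2});    // total 5, kept
        System.out.println(keepLongest(chunks).size()); // prints 2
    }
}
```

When several chunks tie, as here, a further rule is needed to choose between them.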

MaxMatchRule() - Constructor for class monad.mmseg.rule.MaxMatchRule
 
MaxWord - Class in monad.mmseg.example
 
MaxWord() - Constructor for class monad.mmseg.example.MaxWord
 
MaxWordAnalyzer - Class in monad.mmseg.analysis
The max-word segmentation mode.
MaxWordAnalyzer() - Constructor for class monad.mmseg.analysis.MaxWordAnalyzer
 
MaxWordAnalyzer(String) - Constructor for class monad.mmseg.analysis.MaxWordAnalyzer
 
MaxWordAnalyzer(Dictionary) - Constructor for class monad.mmseg.analysis.MaxWordAnalyzer
 
MaxWordAnalyzer(File) - Constructor for class monad.mmseg.analysis.MaxWordAnalyzer
 
MaxWordSeg - Class in monad.mmseg
Max-word segmentation.
MaxWordSeg(Dictionary) - Constructor for class monad.mmseg.MaxWordSeg
 
MMSeg - Class in monad.mmseg
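Segments a Reader stream (which may contain letters, digits, etc.): extracts the Chinese (in fact CJK) text into Sentence objects, then segments them with the mmseg algorithm.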

Not thread-safe.

MMSeg(Reader, Seg) - Constructor for class monad.mmseg.MMSeg
 
MMSegAnalyzer - Class in monad.mmseg.analysis
Uses max-word by default.
MMSegAnalyzer() - Constructor for class monad.mmseg.analysis.MMSegAnalyzer
 
MMSegAnalyzer(String) - Constructor for class monad.mmseg.analysis.MMSegAnalyzer
 
MMSegAnalyzer(File) - Constructor for class monad.mmseg.analysis.MMSegAnalyzer
 
MMSegAnalyzer(Dictionary) - Constructor for class monad.mmseg.analysis.MMSegAnalyzer
 
MMSegTokenizer - Class in monad.mmseg.analysis
 
MMSegTokenizer(Seg) - Constructor for class monad.mmseg.analysis.MMSegTokenizer
 
monad.mmseg - package monad.mmseg
 
monad.mmseg.analysis - package monad.mmseg.analysis
 
monad.mmseg.example - package monad.mmseg.example
 
monad.mmseg.rule - package monad.mmseg.rule
 

N

NameAnalyzer - Class in monad.mmseg.analysis
A segmentation algorithm for personal names.
NameAnalyzer() - Constructor for class monad.mmseg.analysis.NameAnalyzer
 
newSeg() - Method in class monad.mmseg.analysis.ComplexAnalyzer
 
newSeg() - Method in class monad.mmseg.analysis.MaxWordAnalyzer
 
newSeg() - Method in class monad.mmseg.analysis.MMSegAnalyzer
 
newSeg() - Method in class monad.mmseg.analysis.SimpleAnalyzer
 
next() - Method in class monad.mmseg.MMSeg
 
nextToken(TokenStream, PackedTokenAttributeImpl) - Static method in class monad.mmseg.analysis.TokenUtils
 
normalizeFile(File) - Static method in class monad.mmseg.Dictionary
 

P

printChunk(List<Chunk>) - Method in class monad.mmseg.Seg
Prints the chunks; for debugging.

R

reinit(char[], int) - Method in class monad.mmseg.Sentence
 
reload() - Method in class monad.mmseg.Dictionary
Reloads the dictionary from scratch; rolls back if loading does not succeed.

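Note: reloading requires enough memory for two copies of the dictionary tree structure; the default dictionary takes about 50M per copy. Otherwise an OOM is thrown.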

remainChunks() - Method in class monad.mmseg.rule.Rule
 
reset() - Method in class monad.mmseg.analysis.CutLetterDigitFilter
 
reset() - Method in class monad.mmseg.analysis.MMSegTokenizer
 
reset(Reader) - Method in class monad.mmseg.MMSeg
 
reset() - Method in class monad.mmseg.rule.LargestAvgLenRule
 
reset() - Method in class monad.mmseg.rule.LargestSumDegreeFreedomRule
 
reset() - Method in class monad.mmseg.rule.MaxMatchRule
 
reset() - Method in class monad.mmseg.rule.Rule
 
reset() - Method in class monad.mmseg.rule.SmallestVarianceRule
 
row(String, int) - Method in interface monad.mmseg.Dictionary.FileLoading
 
Rule - Class in monad.mmseg.rule
Abstract class for filtering rules.
Rule() - Constructor for class monad.mmseg.rule.Rule
 
run(String[]) - Method in class monad.mmseg.example.Complex
 

S

search(CharNode, char[], int, int) - Method in class monad.mmseg.Dictionary
Whether the word made of the tailLen characters following sen[offset] exists.
search(char[], int, int) - Method in class monad.mmseg.Seg
Checks whether the tailLen chars following chs[offset] form a word.
search(CharNode, char[], int, int) - Method in class monad.mmseg.Seg
Does not copy the array.
seg(Sentence) - Method in class monad.mmseg.ComplexSeg
 
seg(Sentence) - Method in class monad.mmseg.MaxWordSeg
 
Seg - Class in monad.mmseg
Abstract class for segmentation.
Seg(Dictionary) - Constructor for class monad.mmseg.Seg
 
seg(Sentence) - Method in class monad.mmseg.Seg
Segments the sentence sen.
seg(Sentence) - Method in class monad.mmseg.SimpleSeg
 
segWords(Reader, String) - Method in class monad.mmseg.example.Complex
 
segWords(String, String) - Method in class monad.mmseg.example.Complex
 
Sentence - Class in monad.mmseg
A sentence: a run of consecutive Chinese text cut out of a long string of characters.
Sentence() - Constructor for class monad.mmseg.Sentence
 
Sentence(char[], int) - Constructor for class monad.mmseg.Sentence
 
setDegree(int) - Method in class monad.mmseg.Word
 
setFreq(int) - Method in class monad.mmseg.CharNode
 
setMaxLen(int) - Method in class monad.mmseg.CharNode
 
setOffset(int) - Method in class monad.mmseg.Sentence
The offset at which processing of the sentence starts.
setShowChunk(boolean) - Static method in class monad.mmseg.ComplexSeg
 
setStartOffset(int) - Method in class monad.mmseg.Sentence
The offset of the sentence within the text.
setType(String) - Method in class monad.mmseg.Word
 
setWords(Word[]) - Method in class monad.mmseg.Chunk
 
Simple - Class in monad.mmseg.example
 
Simple() - Constructor for class monad.mmseg.example.Simple
 
SimpleAnalyzer - Class in monad.mmseg.analysis
The mmseg simple analyzer.
SimpleAnalyzer() - Constructor for class monad.mmseg.analysis.SimpleAnalyzer
 
SimpleAnalyzer(String) - Constructor for class monad.mmseg.analysis.SimpleAnalyzer
 
SimpleAnalyzer(Dictionary) - Constructor for class monad.mmseg.analysis.SimpleAnalyzer
 
SimpleAnalyzer(File) - Constructor for class monad.mmseg.analysis.SimpleAnalyzer
 
SimpleSeg - Class in monad.mmseg
Segmentation by forward maximum matching.
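
Forward maximum matching can be sketched with a plain set of words as the dictionary. ForwardMaxMatchSketch, segment and the maxWordLen parameter are hypothetical. The library itself matches against a tree of CharNode objects, so this is only an illustration of the technique.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch of forward maximum matching; not the library's SimpleSeg. */
final class ForwardMaxMatchSketch {

    /**
     * Scans left to right, taking at each position the longest dictionary word
     * of at most maxWordLen chars, or a single char if nothing matches.
     */
    static List<String> segment(String text, Set<String> dictionary, int maxWordLen) {
        if (maxWordLen < 1) {
            throw new IllegalArgumentException("maxWordLen must be at least 1");
        }
        List<String> words = new ArrayList<>();
        int pos = 0;
        while (pos < text.length()) {
            int end = Math.min(text.length(), pos + maxWordLen);
            // Shrink the candidate until it is a dictionary word or a single char.
            while (end > pos + 1 && !dictionary.contains(text.substring(pos, end))) {
                end--;
            }
            words.add(text.substring(pos, end));
            pos = end;
        }
        return words;
    }

    public static void main(String[] args) {
        Set<String> dictionary = new HashSet<>(Arrays.asList("ab", "abc", "cd", "ef"));
        System.out.println(segment("abcdef", dictionary, 3)); // prints [abc, d, ef]
    }
}
```

The greedy choice of "abc" leaves "d" stranded even though "ab cd ef" would use only dictionary words. Ambiguities of this kind are what the filtering rules of the complex mode are meant to resolve.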
SimpleSeg(Dictionary) - Constructor for class monad.mmseg.SimpleSeg
 
SmallestVarianceRule - Class in monad.mmseg.rule
Smallest Variance of Word Lengths.

The square of the standard deviation.

SmallestVarianceRule() - Constructor for class monad.mmseg.rule.SmallestVarianceRule
 
subToken(PackedTokenAttributeImpl, int, int) - Static method in class monad.mmseg.analysis.TokenUtils
 

T

toFactorString() - Method in class monad.mmseg.Chunk
 
tokenQueue - Variable in class monad.mmseg.analysis.CutLetterDigitFilter
 
TokenUtils - Class in monad.mmseg.analysis
In Lucene 3.0, getting a Token from a TokenStream is rather cumbersome.
TokenUtils() - Constructor for class monad.mmseg.analysis.TokenUtils
 
toString() - Method in class monad.mmseg.Chunk
 
toString() - Method in class monad.mmseg.Word
 
TYPE_DIGIT - Static variable in class monad.mmseg.Word
 
TYPE_DIGIT_OR_LETTER - Static variable in class monad.mmseg.Word
A "letter or digit" token that starts with a digit.
TYPE_LETTER - Static variable in class monad.mmseg.Word
 
TYPE_LETTER_NUMBER - Static variable in class monad.mmseg.Word
 
TYPE_LETTER_OR_DIGIT - Static variable in class monad.mmseg.Word
A "letter or digit" token that starts with a letter.
TYPE_OTHER_NUMBER - Static variable in class monad.mmseg.Word
 
TYPE_WORD - Static variable in class monad.mmseg.Word
 

W

Word - Class in monad.mmseg
Similar to a Lucene token.
Word(char[], int) - Constructor for class monad.mmseg.Word
 
Word(char[], int, String) - Constructor for class monad.mmseg.Word
 
Word(char[], int, int, int) - Constructor for class monad.mmseg.Word
Only the len characters starting at sen[offset] make up this word.
wordNum() - Method in class monad.mmseg.CharNode
 
wordsFileIsChange() - Method in class monad.mmseg.Dictionary
Whether the dictionary files have been modified.

Copyright © 2015-2016 Ganshane Developer Team. All Rights Reserved.