发布于 2017-01-24 03:33:07 | 129 次阅读 | 评论: 0 | 来源: 网友投递
Apache Lucene 6.3.0 发布了。
主要更新内容:
Lucene's best efforts to un-map memory mapped files with "MMapDirectory" now work with the latest Java9 early access builds
A new similarity "BooleanSimilarity" that gives terms a score that is equal to their query boost
The axiomatic family of similarities (6 in total) based on https://www.eecis.udel.edu/~hfang/pubs/sigir05-axiom.pdf
A new token filter "SynonymGraphFilter" that outputs a correct graph structure for multi-token synonyms at query time
Graph token streams, such as those produced by the "SynonymGraphFilter", are now handled accurately by query parsers
A new collector "DocValuesStatsCollector" gives the ability to compute statistics on DocValues field
It is now possible to filter "SortedDocValues" and "SortedSetDocValues" terms enum with a compiled automaton
The "UnifiedHighlighter" can now highlight fields with queries that don't necessarily refer to that field
DrillSideways can now run queries concurrently
Index sorting now supports sorting on multi-valued fields using MIN, MAX, etc. selectors
Points do not store the implicit split dimension in the 1-dimension case. This saves between 6% memory for the largest types such an InetAddressPoint to 33% for the smaller types such as HalfFloatPoint.
The BKD in-memory index for dimensional points now uses a compressed format, using substantially less RAM in some cases
The BKD writing now buffers each leaf block in heap before writing to disk, giving a small speedup in points-heavy use cases
"TermAutomatonQuery" now rewrites to more efficient queries when possible
更多内容及下载地址:http://lucene.apache.org/