Suffix Array Verification: Follow-up
- 4. ※ →
abcde → abcde, bcde, cde, de, e
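The suffix enumeration above can be sketched in a few lines. This is only an illustration: the function names `suffixes` and `build_suffix_array` are made up for this example, and the construction uses a naive sort of all suffixes rather than the 'two-stage algorithm' used in the implementation compared later in the deck.

```python
def suffixes(text):
    """Return every suffix of text, longest first."""
    return [text[i:] for i in range(len(text))]


def build_suffix_array(text):
    """Return the start offsets of all suffixes in lexicographic order.

    Naive approach (assumption for illustration): sort offsets by the
    suffix they point to. Fine for short strings, too slow for large corpora.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


print(suffixes("abcde"))             # ['abcde', 'bcde', 'cde', 'de', 'e']
print(build_suffix_array("banana"))  # [5, 3, 1, 0, 4, 2]
```

For "abcde" the suffixes are already in sorted order, so the suffix array is simply [0, 1, 2, 3, 4]; "banana" shows a case where sorting reorders the offsets.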
- 14. • Lucene
- Version: 2.4.1 (newest: 3.0.0)
- Index: n-gram index (CJKAnalyzer)
- Search algorithm: inverted index
• Suffix Array
- Version: written from scratch (Java)
- Index: suffix array built by the 'two-stage algorithm'
- Search algorithm: binary search
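As a rough sketch of the binary-search lookup on the Suffix Array side, the example below finds all occurrences of a pattern by locating the range of sorted suffixes that start with it. The names `build_suffix_array` and `find_occurrences` are hypothetical, the array is built with a naive sort (not the two-stage algorithm), and this is not the Java implementation that was benchmarked.

```python
def build_suffix_array(text):
    """Naive suffix array: offsets sorted by the suffix they start."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    """Return the sorted start offsets at which pattern occurs in text."""
    m = len(pattern)

    # Lower bound: first suffix whose first m characters are >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose first m characters are > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Every suffix in sa[start:lo] begins with pattern.
    return sorted(sa[start:lo])


text = "banana"
sa = build_suffix_array(text)
print(find_occurrences(text, sa, "ana"))  # [1, 3]
```

Each lookup takes O(m log n) character comparisons for a pattern of length m in a text of length n, with no tokenization step, whereas an n-gram inverted index has to intersect posting lists for the pattern's n-grams.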
- 15. •
- Server: Fujitsu RX200
- CPU: Intel Xeon E5540 2.53GHz (Core 2 Quad)
- Memory: 4GB
- HDD: 2TB
•
-
Suffix Array
-
- 18. ※ Unit: ms
Lucene : Suffix Array = 6:1
- 19. ※ Unit: ms
Lucene : Suffix Array = 6:1
40:1
- 25. → (´ ω `)
Suffix Array
Hadoop Suffix Array
- 30. (figure)
※ "_" represents null (0x00)
- 31. (figure)
Suffix Array
- 32. 1. Suffix Array
(figure)
- 50. Suffix Array
•
•
• Suffix Array
• Suffix Array
• Suffix Array
- 53. • Hadoop
•
GZIP or xxxxLZ, Compressed Suffix Array, FM Index
•
•