20. What is “sample_weight”?
sample_weight: “bin” → ignores how many times a feature occurs
“that language is” = “that language that is”
sample_weight: “tf” → the more often a feature occurs, the heavier (closer) it becomes
“that language is” ≪ “that language that is”
sample_weight: “log_tf” → the more often a feature occurs, the heavier (closer) it becomes,
but only moderately (log-scaled)
“that language is” < “that language that is”
A setting for how the number of occurrences of a feature (word) is weighted
Python: a programming language that lets you work more quickly
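A minimal sketch of the three sample_weight modes applied to the two phrases above. The function name sample_weights is hypothetical, and log(1 + count) is an assumed form for “log_tf”; the slides only say the weight grows logarithmically, so the library's exact formula may differ.

```python
import math
from collections import Counter


def sample_weights(text, mode):
    """Weight each word of `text` by its occurrence count (illustrative only)."""
    counts = Counter(text.split())
    if mode == "bin":
        # Presence only: every word that occurs gets the same weight.
        return {w: 1.0 for w in counts}
    if mode == "tf":
        # Raw term frequency: weight equals the occurrence count.
        return {w: float(c) for w, c in counts.items()}
    if mode == "log_tf":
        # Assumed form: log(1 + count), so repeats add less and less weight.
        return {w: math.log(1 + c) for w, c in counts.items()}
    raise ValueError(f"unknown mode: {mode}")


a = "that language is"
b = "that language that is"
for mode in ("bin", "tf", "log_tf"):
    # Compare the weight of "that" in the two phrases under each mode.
    print(mode, sample_weights(a, mode)["that"], sample_weights(b, mode)["that"])
```

Under “bin” the word “that” weighs the same in both phrases, under “tf” it doubles in the second phrase, and under “log_tf” it grows by a smaller amount.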
21. What is “global_weight”?
A setting for how features (words) shared across data are weighted
Python: a programming language that lets you work more quickly
Ruby: general purpose object oriented programming language
global_weight: “idf” → common features become lighter (farther),
distinctive (?) features become heavier (closer) (*)
global_weight: “bin” → no global weighting is applied
(*) Words such as “programming” and “language” are given a light weight
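A minimal sketch of the “idf” idea using the Python and Ruby descriptions above. The function name idf_weights is hypothetical, and log(N / df) is the textbook inverse document frequency, assumed here for illustration; the library's exact formula (for example, any smoothing) is not given in the slides.

```python
import math

docs = {
    "Python": "a programming language that lets you work more quickly",
    "Ruby": "general purpose object oriented programming language",
}


def idf_weights(docs):
    """Return an assumed idf weight, log(N / df), for every word in `docs`."""
    n = len(docs)
    df = {}
    for text in docs.values():
        # Count each word once per document (document frequency).
        for word in set(text.split()):
            df[word] = df.get(word, 0) + 1
    return {word: math.log(n / d) for word, d in df.items()}


weights = idf_weights(docs)
for word in ("programming", "language", "quickly", "object"):
    print(word, weights[word])
```

Words that appear in both descriptions (“programming”, “language”) end up with weight 0 under this formula, while words unique to one description (“quickly”, “object”) keep a positive weight.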