Stop Words a an and are as at be but by for if in into is it no not of on or such that the their then there these they this to was will with https://github.com/apache/lucene-solr/blob/master/lucene/ analysis/common/src/java/org/apache/lucene/analysis/en/ EnglishAnalyzer.java#L44-L50
Phonetic GET /_analyze { "tokenizer": "standard", "filter": [ { "type": "phonetic", "encoder": "beider_morse", "languageset": "any" } ], "text": "These are not the droids you are looking for." }
Slide 42
Phonetic ...
drDts drits drots loknk...
iou
ari
ori
Slide 43
Another Example Obi-Wan never told you what happened to your father.
Slide 44
Another Example obi
wan never told you what happen your father
Slide 45
Another Example <b>No</b>. I am your father.
Slide 46
Another Example i
am your
father
Slide 47
Inverted Index am droid father happen i look never obi told wan what you your
ID 1 0 1[4] 0 0 0 1[7] 0 0 0 0 0 1[5] 0
ID 2 0 0 1[9] 1[6] 0 0 1[2] 1[0] 1[3] 1[1] 1[5] 1[4] 1[8]
ID 3 1[2] 0 1[4] 0 1[1] 0 0 0 0 0 0 0 1[3]
Synonyms Index synonym or query time synonym_graph
Slide 53
GET /starwars/_mapping GET /starwars/_settings
Slide 54
PUT /starwars/_doc/1 { "quote": "These are <em>not</em> the droids you are looking for." } PUT /starwars/_doc/2 { "quote": "Obi-Wan never told you what happened to your father." } PUT /starwars/_doc/3 { "quote": "<b>No</b>. I am your father." }
Slide 55
GET /starwars/_doc/1 GET /starwars/_doc/1/_source
Slide 56
Search
Slide 57
POST /starwars/_search { "query": { "match_all": { } } }
Slide 58
{
"took": 1, "timed_out": false, "_shards": { "total": 5, "successful": 5, "failed": 0 }, "hits": { "total": 3, "max_score": 1, "hits": [ { "_index": "starwars", "_type": "_doc", "_id": "2", "_score": 1, "_source": { "quote": "Obi-Wan never told you what happened to your father." } }, ...
Don't do this. Seriously. Stop trying to think about your problem this way, it's not going to end well. — https://wiki.apache.org/lucene-java/ ScoresAsPercentages
Slide 106
GET /starwars/_analyze { "analyzer" : "my_analyzer", "text": "These are my father's machines." }
PUT /starwars/_doc/4 { "quote": "These are my father's machines." }
Slide 109
POST /starwars/_search { "query": { "match": { "quote": "my father machine" } } }
Slide 110
"hits": { "total": 4, "max_score": 2.92523, "hits": [ { "_index": "starwars", "_type": "_doc", "_id": "4", "_score": 2.92523, "_source": { "quote": "These are my father's machines." } }, { "_index": "starwars", "_type": "_doc", "_id": "1", "_score": 0.8617505, "_source": { "quote": "These are <em>not</em> the droids you are looking for." } }, ...
Slide 111
2.92523 == 100%
Slide 112
DELETE /starwars/_doc/4 POST /starwars/_search { "query": { "match": { "quote": "my father machine" } } }
Slide 113
"hits": { "total": 3, "max_score": 1.2499592, "hits": [ { "_index": "starwars", "_type": "_doc", "_id": "1", "_score": 1.2499592, "_source": { "quote": "These are <em>not</em> the droids you are looking for." } }, ...
Slide 114
1.2499592 == 43% or 100%?
Slide 115
PUT /starwars/_doc/4 { "quote": "These droids are my father's father's machines." } POST /starwars/_search { "query": { "match": { "quote": "my father machine" } } }
Slide 116
"hits": { "total": 4, "max_score": 3.0068164, "hits": [ { "_index": "starwars", "_type": "_doc", "_id": "4", "_score": 3.0068164, "_source": { "quote": "These droids are my father's father's machines." } }, { "_index": "starwars", "_type": "_doc", "_id": "1", "_score": 0.89701396, "_source": { "quote": "These are <em>not</em> the droids you are looking for." } }, ...