
Trouble running a basic Elasticsearch query against a dataset

Stack Overflow user
Asked 2019-03-29 01:29:59
1 answer · 34 views · 0 followers · 0 votes

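I am trying to run a bool query against an Elasticsearch cluster loaded with the basic Shakespeare dataset. I have cross-checked many resources and everything looks correct, but when I run the query, the OR operation on speech_number does not behave as expected.

I have read various tutorials and the documentation on Elasticsearch bool queries, but I still cannot find out why the logic does not work as expected.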

{
    "query": {
        "bool": {
            "must": [
                {
                    "match": {"play_name": "Henry IV"}
                },
                {
                    "bool": {
                        "should": [
                            {"range": {"speech_number": {"lte": 50}}},
                            {"range": {"speech_number": {"gte": 4}}}
                        ]
                    }
                }
            ]
        }
    }
}

A sample of the shakespeare.json file I am running the query against:

{"line_id":1658,"play_name":"Henry IV","speech_number":26,"line_number":"3.1.108","speaker":"MORTIMER","text_entry":"Yea, but"}
{"index":{"_index":"shakespeare","_type":"line","_id":1658}}
{"line_id":1659,"play_name":"Henry IV","speech_number":26,"line_number":"3.1.109","speaker":"MORTIMER","text_entry":"Mark how he bears his course, and runs me up"}
{"index":{"_index":"shakespeare","_type":"line","_id":1659}}
{"line_id":1660,"play_name":"Henry IV","speech_number":26,"line_number":"3.1.110","speaker":"MORTIMER","text_entry":"With like advantage on the other side;"}
{"index":{"_index":"shakespeare","_type":"line","_id":1660}}
{"line_id":1661,"play_name":"Henry IV","speech_number":26,"line_number":"3.1.111","speaker":"MORTIMER","text_entry":"Gelding the opposed continent as much"}
{"index":{"_index":"shakespeare","_type":"line","_id":1661}}
{"line_id":1662,"play_name":"Henry IV","speech_number":26,"line_number":"3.1.112","speaker":"MORTIMER","text_entry":"As on the other side it takes from you."}
{"index":{"_index":"shakespeare","_type":"line","_id":1662}}
{"line_id":1663,"play_name":"Henry IV","speech_number":27,"line_number":"3.1.113","speaker":"EARL OF WORCESTER","text_entry":"Yea, but a little charge will trench him here"}
{"index":{"_index":"shakespeare","_type":"line","_id":1663}}
{"line_id":1664,"play_name":"Henry IV","speech_number":27,"line_number":"3.1.114","speaker":"EARL OF WORCESTER","text_entry":"And on this north side win this cape of land;"}
{"index":{"_index":"shakespeare","_type":"line","_id":1664}}
{"line_id":1665,"play_name":"Henry IV","speech_number":27,"line_number":"3.1.115","speaker":"EARL OF WORCESTER","text_entry":"And then he runs straight and even."}
{"index":{"_index":"shakespeare","_type":"line","_id":1665}}
{"line_id":1666,"play_name":"Henry IV","speech_number":28,"line_number":"3.1.116","speaker":"HOTSPUR","text_entry":"Ill have it so: a little charge will do it."}
{"index":{"_index":"shakespeare","_type":"line","_id":1666}}
{"line_id":1667,"play_name":"Henry IV","speech_number":29,"line_number":"3.1.117","speaker":"GLENDOWER","text_entry":"Ill not have it alterd."}
{"index":{"_index":"shakespeare","_type":"line","_id":1667}}
{"line_id":1668,"play_name":"Henry IV","speech_number":30,"line_number":"3.1.118","speaker":"HOTSPUR","text_entry":"Will not you?"}
{"index":{"_index":"shakespeare","_type":"line","_id":1668}}
{"line_id":1669,"play_name":"Henry IV","speech_number":31,"line_number":"3.1.119","speaker":"GLENDOWER","text_entry":"No, nor you shall not."}
{"index":{"_index":"shakespeare","_type":"line","_id":1669}}
{"line_id":1670,"play_name":"Henry IV","speech_number":32,"line_number":"3.1.120","speaker":"HOTSPUR","text_entry":"Who shall say me nay?"}
{"index":{"_index":"shakespeare","_type":"line","_id":1670}}
{"line_id":1671,"play_name":"Henry IV","speech_number":33,"line_number":"3.1.121","speaker":"GLENDOWER","text_entry":"Why, that will I."}

Expected result: play_name AND (speech_number <= 50 OR speech_number >= 4)

What I get: play_name AND (speech_number <= 50 AND speech_number >= 4)

1 Answer

Stack Overflow user

Answered 2019-03-29 07:22:35

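You are right about the logic. The query does evaluate to play_name AND (speech_number <= 50 OR speech_number >= 4).

What Elasticsearch also does is assign a score. Everything in must has to match, and every should clause that matches raises the score of the result. Because the inner bool contains only should clauses, at least one of them has to match. Documents that satisfy both ranges therefore score higher and are ranked first, which can make the OR look like an AND.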
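The ranking effect can be reproduced with a small Python simulation. This is a simplified sketch, not Elasticsearch's actual scoring code: it assumes each matching range clause contributes a constant 1.0, it ignores the score of the match clause, and the name should_score is made up for illustration.

```python
def should_score(speech_number):
    """Sum the scores of the two should clauses; None means the document is excluded."""
    clauses = [
        speech_number <= 50,  # {"range": {"speech_number": {"lte": 50}}}
        speech_number >= 4,   # {"range": {"speech_number": {"gte": 4}}}
    ]
    matched = sum(clauses)
    # A bool query that contains only should clauses needs at least one match.
    return float(matched) if matched >= 1 else None


scores = {n: should_score(n) for n in (1, 26, 60)}
# Highest score first, as in a default search response.
ranked = sorted(scores, key=lambda n: scores[n], reverse=True)
print(scores)
print(ranked)
```

Every value satisfies at least one of the two ranges, so nothing is filtered out. Values from 4 to 50 satisfy both and sort to the top, so the first page of hits can look as if the ranges were ANDed.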

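To rank some speech_number values above others (don't actually do this), you could add more should clauses so that the preferred documents match more of them. In the example below, lower speech_number values match more clauses and therefore score higher: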

{
    "query": {
        "bool": {
            "must": [
                {
                    "match": { 
                        "play_name": "Henry IV"
                    }
                },

                {
                    "bool": {
                        "should": [
                            {
                                "range": {
                                    "speech_number": { "lte": 50 }
                                }
                            },

                            {
                                "range": {
                                    "speech_number": { "lte": 40 }
                                }
                            },

                            {
                                "range": {
                                    "speech_number": { "lte": 30 }
                                }
                            },

                            ...
                        ]
                    }
                }
            ]
        }
    }
}

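So part of the problem may be that lte: 50 also allows values below 4, and gte: 4 also allows values above 50. But I do not see any such outliers in your results. If the issue is the ordering: range also supports boost (https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-range-query.html), so instead of writing multiple ranges you could have: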

{
    "query": {
        "bool": {
            "must": [
                {
                    "match": {
                        "play_name": {
                            "query": "Henry IV",
                            "operator": "and"
                        }
                    }
                },

                {
                    "bool": {
                        "should": [
                            {
                                "range": {
                                    "speech_number": { 
                                        "gte": 25,
                                        "lte": 50,

                                        "boost": 3
                                    }
                                }
                            },

                            {
                                "range": {
                                    "speech_number": { 
                                        "gte": 4,
                                        "lte": 50
                                    }
                                }
                            }
                        ]
                    }
                }
            ]
        }
    }
}
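As a rough illustration of how the boost changes the order, here is a simplified Python sketch of the two should clauses in the query above. It assumes a matching range clause scores its boost value (1.0 when no boost is given) and that the bool query sums the matching clauses. The function name boosted_score is hypothetical, and the score of the match clause is left out.

```python
def boosted_score(speech_number):
    """Sum the boosts of the should clauses that match; 0 means no clause matched."""
    # (lower bound, upper bound, boost) for each range clause in the query
    clauses = [
        (25, 50, 3.0),  # gte 25, lte 50, boost 3
        (4, 50, 1.0),   # gte 4, lte 50, default boost
    ]
    return sum(boost for low, high, boost in clauses if low <= speech_number <= high)


for n in (2, 10, 30):
    print(n, boosted_score(n))
```

Under these assumptions a speech_number of 30 matches both clauses and outranks 10, which matches only the second. A value such as 2 matches neither clause, so the document would not be returned at all, because both ranges in this version are bounded on both sides.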

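Note: match defaults to OR between the query terms (https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-match-query.html). If your data is structured, using a term query or adding the operator and to the match is probably closer to what you need. That is not the cause of your problem, though :)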
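The difference between the two operators can be sketched in Python. This is only an approximation that assumes play_name is an analyzed text field and that analysis amounts to lowercasing and splitting on whitespace. The match function and the list of play names are illustrative, not part of any Elasticsearch client.

```python
def match(field_value, query, operator="or"):
    """Approximate a match query: compare lowercased whitespace-separated terms."""
    doc_terms = set(field_value.lower().split())
    query_terms = query.lower().split()
    hits = [term in doc_terms for term in query_terms]
    # "and" requires every query term to be present, "or" requires at least one.
    return all(hits) if operator == "and" else any(hits)


plays = ["Henry IV", "Henry VI Part 1", "Hamlet"]
with_or = [p for p in plays if match(p, "Henry IV")]
with_and = [p for p in plays if match(p, "Henry IV", operator="and")]
print(with_or)
print(with_and)
```

With the default operator, any play whose name contains "henry" matches, so other Henry plays can slip into the results. With and, only names containing both terms are returned.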

Original content provided by Stack Overflow.
Original link: https://stackoverflow.com/questions/55403670
