ES(elasticsearch) springboot+es【深分页scrollId】

创建一个springboot项目
整体架构
1、导入对应的依赖
    <dependencies>
        <!--elasticsearch需要的包-->
<!--        <dependency>-->
<!--            <groupId>org.springframework.boot</groupId>-->
<!--            <artifactId>spring-boot-starter-data-elasticsearch</artifactId>-->
<!--        </dependency>-->

        <dependency>
            <groupId>org.elasticsearch.client</groupId>
            <artifactId>elasticsearch-rest-high-level-client</artifactId>
            <version>6.4.2</version>
        </dependency>
        <dependency>
            <groupId>org.elasticsearch.client</groupId>
            <artifactId>elasticsearch-rest-client</artifactId>
            <version>6.4.2</version>
        </dependency>
        <dependency>
            <groupId>org.elasticsearch</groupId>
            <artifactId>elasticsearch</artifactId>
            <!--版本保持一致,不然缺类-->
            <version>6.4.2</version>
        </dependency>

        <dependency>
            <groupId>org.springframework.boot</groupId>
            <artifactId>spring-boot-starter-web</artifactId>
        </dependency>

        <dependency>
            <groupId>org.springframework.boot</groupId>
            <artifactId>spring-boot-devtools</artifactId>
            <scope>runtime</scope>
            <optional>true</optional>
        </dependency>
        <dependency>
            <groupId>org.projectlombok</groupId>
            <artifactId>lombok</artifactId>
            <optional>true</optional>
        </dependency>
        <dependency>
            <groupId>org.springframework.boot</groupId>
            <artifactId>spring-boot-starter-test</artifactId>
            <scope>test</scope>
        </dependency>
        <dependency>
            <groupId>com.alibaba</groupId>
            <artifactId>fastjson</artifactId>
            <version>1.2.70</version>
         </dependency>
    </dependencies>

2、实体类User
package com.example.demo.pojo;

import lombok.AllArgsConstructor;
import lombok.Data;
import lombok.NoArgsConstructor;

@Data
@NoArgsConstructor
@AllArgsConstructor
public class User {
    private String name;
    private Integer age;
}
3、ElasticSearchClientConfig来配置一个相应的类
package com.example.demo.config;

import org.apache.http.HttpHost;
import org.elasticsearch.client.RestClient;
import org.elasticsearch.client.RestHighLevelClient;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class ElasticSearchClientConfig {

    @Bean
    public RestHighLevelClient restHighLevelClient(){
        RestHighLevelClient client=new RestHighLevelClient(
                RestClient.builder(new HttpHost("127.0.0.1",9200,"http")));
        return client;
    }
}
4、写一个测试类去测试
package com.example.demo;

import org.junit.runner.RunWith;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.beans.factory.annotation.Qualifier;
import org.springframework.boot.test.context.SpringBootTest;
import org.springframework.test.context.junit4.SpringRunner;

@RunWith(SpringRunner.class)
@SpringBootTest
public class DemoApplicationTests {
    @Autowired
    @Qualifier("restHighLevelClient")
    private RestHighLevelClient client;



}

5、创建索引

详情见:https://www.elastic.co/guide/en/elasticsearch/client/java-rest/7.9/java-rest-high-create-index.html

//测试创建索引
    @Test
    public void contextLoads() throws IOException {
        //创建 索引"weifan" 请求
        CreateIndexRequest request = new CreateIndexRequest("weifan");
        //客户端执行请求,并获得相应
        CreateIndexResponse createIndexResponse
                = client.indices().create(request, RequestOptions.DEFAULT);

        System.out.println(createIndexResponse);

    }

6、获取索引

详情见:https://www.elastic.co/guide/en/elasticsearch/client/java-rest/7.9/java-rest-high-indices-exists.html

    //测试获取索引
    @Test
    public void testExistIndex() throws IOException{
//      GetRequest request=new GetRequest("weifan","","");
//      boolean exists = client.exists(request, RequestOptions.DEFAULT);
//      System.out.println(exists);
        //1、创建请求对象
        GetIndexRequest request=new GetIndexRequest();
        request.indices("weifan");
        //判断
        boolean exists
                = client.indices().exists(request, RequestOptions.DEFAULT);
        System.out.println(exists);

    }
7、删除索引

详情见:https://www.elastic.co/guide/en/elasticsearch/client/java-rest/7.9/java-rest-high-delete-index.html

    //测试删除索引
    @Test
    public void testDeleteIndex() throws IOException{
        DeleteIndexRequest request=new DeleteIndexRequest("weifan");
        DeleteIndexResponse delete = client.indices().delete(request,RequestOptions.DEFAULT);
        System.out.println(delete.isAcknowledged());
    }
8、添加文档

详情见:https://www.elastic.co/guide/en/elasticsearch/client/java-rest/7.9/java-rest-high-document-index.html

@Test
    public void testAddDocument() throws IOException {
        User user = new User("狂神说", 3);
        //创建索引请求
        IndexRequest request= new IndexRequest("kuangshen");
        //IndexRequest request1=new IndexRequest("kuangshen","User","1");
        //kibana 会用 put/kuangshen/User/1  添加数据
        //           {
        //              "name":"狂神说",
        //              "age":23
        //            }
        request.type("User");
        request.id("1");
        //将我们的数据放入请求json中(指定添加的数据)
        request.source(JSON.toJSONString(user), XContentType.JSON);

        //客户端发送请求,获取响应的结果
        IndexResponse indexResponse = client.index(request, RequestOptions.DEFAULT);

        System.out.println(indexResponse.getIndex());//kuangshen
        System.out.println(indexResponse.toString());//IndexResponse[index=kuangshen,type=User,id=1,version=1,result=created,seqNo=0,primaryTerm=1,shards={"total":2,"successful":1,"failed":0}]
        System.out.println(indexResponse.status());//对应我们命令返回状态是CREATED
    }
9、获取文档,判断是否存在
@Test
    public void testIsExists() throws IOException {
        GetRequest getRequest = new GetRequest("kuangshen","User", "1");
        boolean exists = client.exists(getRequest, RequestOptions.DEFAULT);
        System.out.println(exists);
    }
10、获取文档信息
@Test
    public void testGetDocument() throws IOException{
        GetRequest getRequest = new GetRequest("kuangshen","User", "1");
        GetResponse getResponse = client.get(getRequest, RequestOptions.DEFAULT);
        //打印文档的内容
        System.out.println(getResponse.getSourceAsString());//{"age":3,"name":"狂神说"}
        System.out.println(getResponse);//{"_index":"kuangshen","_type":"User","_id":"1","_version":1,"found":true,"_source":{"age":3,"name":"狂神说"}}
    }
11、更新文档信息
//更新文档信息
    @Test
    public void testUpdateRequest() throws IOException {
        UpdateRequest updateRequest=new UpdateRequest("kuangshen","User","1");
        updateRequest.timeout("1s");

        User user=new User("狂神说Java",18);
        updateRequest.doc(JSON.toJSONString(user),XContentType.JSON);

        UpdateResponse updateResponse = client.update(updateRequest, RequestOptions.DEFAULT);
        System.out.println(updateResponse.status());//OK
        System.out.println(updateResponse.toString());//UpdateResponse[index=kuangshen,type=User,id=1,version=2,seqNo=-2,primaryTerm=0,result=noop,shards=ShardInfo{total=0, successful=0, failures=[]}]
        System.out.println(updateResponse);
    }
12、删除文档记录
@Test
    public void testDeleteRequest() throws IOException {

        DeleteRequest request = new DeleteRequest("kuangshen","User", "1");
        request.timeout("1s");

        DeleteResponse deleteResponse = client.delete(request, RequestOptions.DEFAULT);
        System.out.println(deleteResponse.status());
    }
13、批量插入数据
@Test
    public void testBulkRequest() throws IOException {
        BulkRequest bulkRequest=new BulkRequest();
        bulkRequest.timeout("10s");

        List<User> userList=new ArrayList<>();
        userList.add(new User("weifan1",3));
        userList.add(new User("weifan2",3));
        userList.add(new User("weifan3",3));
        userList.add(new User("weifan4",3));

        //批处理
        for(int i=0;i<userList.size();i++){
            bulkRequest.add(
                            new IndexRequest("kuangshen")
                                    .type("User")
                                    .id(""+(i+1))
                                    .source(JSON.toJSONString(userList.get(i)),XContentType.JSON));
        }

        BulkResponse bulkResponse = client.bulk(bulkRequest, RequestOptions.DEFAULT);
        System.out.println(bulkResponse.hasFailures());
    }
14、查询

详情见:https://www.elastic.co/guide/en/elasticsearch/client/java-rest/7.9/java-rest-high-search.html

例14.1
    //查询
    @Test
    public void testSearch() throws IOException {
        SearchRequest searchRequest = new SearchRequest("kuangshen");
        //构建搜索条件
        SearchSourceBuilder sourceBuilder = new SearchSourceBuilder();
        //sourceBuilder.sort() 排序查询


        //查询条件,我们可以使用QueryBuilders工具来实现
        //QueryBuilders.termQuery() 精确
        //QueryBuilders.matchAllQuery() 匹配所有
        //QueryBuilders.fuzzyQuery() 模糊查询
        //QueryBuilders.rangeQuery() 范围查询
        TermQueryBuilder termQueryBuilder = QueryBuilders.termQuery("name", "weifan1");
        sourceBuilder.query(termQueryBuilder);
        sourceBuilder.timeout(new TimeValue(60, TimeUnit.SECONDS));
        
        searchRequest.source(sourceBuilder);

        SearchResponse searchResponse = client.search(searchRequest, RequestOptions.DEFAULT);
        
        System.out.println(JSON.toJSONString(searchResponse.getHits()));
        //结果:{"fragment":true,"hits":[{"fields":{},"fragment":false,"highlightFields":{},"id":"1","matchedQueries":[],"score":0.2876821,"sortValues":[],"sourceAsMap":{"name":"weifan1","age":3},"sourceAsString":"{\"age\":3,\"name\":\"weifan1\"}","sourceRef":{"childResources":[],"fragment":true},"type":"User","version":-1}],"maxScore":0.2876821,"totalHits":1}
        System.out.println("=====================");
        for (SearchHit documentFields : searchResponse.getHits().getHits()) {
            System.out.println(documentFields.getSourceAsString());//{"age":3,"name":"weifan1"}
            //System.out.println(documentFields.getSourceAsMap());
        }

    }
例14.2
/**
     * ##搜索address中包含mill的所有年龄分布和平均年龄
     * GET bank/_search
     * {
     *   "query":{
     *    "match": {
     *      "address":"mill"
     *    }
     *   },
     *   "aggs": {
     *     "ageAgg": {
     *       "terms": {
     *         "field": "age",
     *         "size": 10
     *       }
     *     },
     *     "ageAvg":{
     *       "avg": {
     *         "field": "age"
     *       }
     *     }
     *   }
     * }
     * */

    @Test
    public void searchData() throws IOException {
        SearchRequest searchRequest = new SearchRequest("bank");

        SearchSourceBuilder searchSourceBuilder = new SearchSourceBuilder();
        searchSourceBuilder.query(QueryBuilders.termQuery("address", "mill"));

        TermsAggregationBuilder aggregation = AggregationBuilders.terms("ageAgg")//Aggregations聚合
                .field("age").size(10);
        aggregation.subAggregation(AggregationBuilders.avg("ageAvg")
                .field("age"));
        searchSourceBuilder.aggregation(aggregation);

        searchRequest.source(searchSourceBuilder);
        System.out.println(searchRequest);

        SearchResponse searchResponse = client.search(searchRequest, RequestOptions.DEFAULT);
        System.out.println(searchResponse);
    }

15、分页查询(from-size 适用于浅分页;分页越深,性能越差)

from-size浅分页适合数据量不大的情况(官网推荐是数据少于10000条)
详细请见官网:https://www.elastic.co/guide/cn/elasticsearch/guide/2.x/_query_phase.html
从下图可知es中有7条数据

es中有7条数据

     //分页查询
    @Test
    public void testSearchByPage1() throws IOException {
        Integer currentPage=1;
        Integer pageSize=3;

        SearchRequest searchRequest=new SearchRequest();
        searchRequest.indices("kuangshen");
        SearchSourceBuilder searchSourceBuilder=new SearchSourceBuilder();
        /**
         * 浅分页
         * GET /_search{
         * "from":0
         * "size":3
         *}
         */
        searchSourceBuilder.from(currentPage-1);
        searchSourceBuilder.size(pageSize);
        searchRequest.source(searchSourceBuilder);

        SearchResponse response = client.search(searchRequest, RequestOptions.DEFAULT);

        System.out.println(JSON.toJSONString(response.getHits()));
        System.out.println("------------------------------------------------------");
        SearchHit[] hits = response.getHits().getHits();
        for (SearchHit hit:hits) {
            System.out.println(hit.getSourceAsString());
        }
    }
    //浅分页多次查询(from-size)
    @Test
    public void testSearchByPage2() throws IOException {
        Integer currentPage=1;
        Integer pageSize=3;

        SearchRequest searchRequest=new SearchRequest();
        searchRequest.indices("kuangshen");
        SearchSourceBuilder searchSourceBuilder=new SearchSourceBuilder();
        searchSourceBuilder.from(currentPage-1);
        searchSourceBuilder.size(pageSize);

        Boolean hasMore=true;
        while (hasMore){
            searchRequest.source(searchSourceBuilder);
            SearchResponse response = client.search(searchRequest, RequestOptions.DEFAULT);
            //System.out.println(JSON.toJSONString(response.getHits()));
            System.out.println("------------------------------------------------------"+"from "+ Integer.toString(currentPage-1));
            SearchHit[] hits = response.getHits().getHits();
            for (SearchHit hit:hits) {
                System.out.println(hit.getSourceAsString());
            }

            if(hits.length==0){//返回没值时,则表示遍历完成
                hasMore=false;
            }
            currentPage++;
            searchSourceBuilder.from((currentPage-1)*pageSize);
            searchSourceBuilder.size(pageSize);
        }
        System.out.println("全部查完");

    }

浅分页多次查询

16、深分页(scroll【还有一种search_after方法】)

 //多次分页查询(scroll)
    @Test
    public void testSearchByPage3() throws IOException {
        //Integer currentPage=1;
        Integer pageSize=3;

        SearchRequest searchRequest=new SearchRequest();
        searchRequest.indices("kuangshen");
        searchRequest.scroll(TimeValue.timeValueMinutes(1L));//设置scroll失效时间为1分钟
        SearchSourceBuilder searchSourceBuilder=new SearchSourceBuilder();
        //不需要传从第几条开始
        //searchSourceBuilder.from(currentPage-1);
        searchSourceBuilder.size(pageSize);
        searchSourceBuilder.sort("age", SortOrder.ASC);//排序,查出来的数据根据age排序
        searchRequest.source(searchSourceBuilder);
        SearchResponse response = client.search(searchRequest, RequestOptions.DEFAULT);
        //System.out.println(JSON.toJSONString(response.getHits()));
        System.out.println("------------------------首页------------------------------");
        SearchHit[] hits = response.getHits().getHits();
        for (SearchHit hit:hits) {
            System.out.println(hit.getSourceAsString());
        }

        String scrollId=response.getScrollId();
        System.out.println("scrollId为: "+response.getScrollId());

        Boolean hasMore=true;
        while (hasMore){
            SearchScrollRequest searchScrollRequest=new SearchScrollRequest();
            searchScrollRequest.scroll(TimeValue.timeValueMinutes(1L));
            searchScrollRequest.scrollId(scrollId);
            SearchResponse scrollResponse = client.scroll(searchScrollRequest, RequestOptions.DEFAULT);
            System.out.println("------------------------下一页------------------------------");
            SearchHit[] hits1 = scrollResponse.getHits().getHits();
            for (SearchHit hit:hits1) {
                System.out.println(hit.getSourceAsString());
            }
            if(hits1.length==0){//返回没值时,则表示遍历完成
                hasMore=false;
            }
            scrollId = scrollResponse.getScrollId();
            System.out.println("scrollId为: "+scrollId);
        }
        System.out.println("全部查完");
    }
深分页查询结果
16、from+size 和 scroll两种方式比较
分页方式比较

scroll用的是快照模式,有个窗口期,都是基于这个窗口期的快照来做的查询,scrollId对应的就是这个快照,scrollId是不变的

17、elasticsearch scroll查询的原理

1、https://elasticsearch.cn/question/2935
2、https://www.elastic.co/guide/cn/elasticsearch/guide/2.x/_fetch_phase.html
3、https://www.jianshu.com/p/91d03b16af77

©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 213,417评论 6 492
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 90,921评论 3 387
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 158,850评论 0 349
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 56,945评论 1 285
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 66,069评论 6 385
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 50,188评论 1 291
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 39,239评论 3 412
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 37,994评论 0 268
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 44,409评论 1 304
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 36,735评论 2 327
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 38,898评论 1 341
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 34,578评论 4 336
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 40,205评论 3 317
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 30,916评论 0 21
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 32,156评论 1 267
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 46,722评论 2 363
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 43,781评论 2 351

推荐阅读更多精彩内容