java 在Double elasticSearch中按范围计数使用聚合的Sping Boot

6l7fqoea  于 2023-06-28  发布在  Java
关注(0)|答案(2)|浏览(152)

我正在尝试从ElasticSearch中计数特定范围内的记录。我有3个范围,代表不同的值在双。

  1. low (0-4]
  2. medium (4-7]
  3. high (7-10]

这个物体就像

  1. {
  2. "company":"companyName", // string
  3. "score": 6.2 // double
  4. }

假设对于公司1,我想要获得得分值的所有计数。
返回一个像下面这样的对象{“high”:20,“medium”:10,“low”:3}
我找到了一种使用API的方法,

  1. public interface ItemRepository extends ElasticsearchRepository<Item, String> {
  2. @Query("{\"bool\":{\"must\":[{\"match\":{\"userId\":100}},{\"range\":{\"score\":{\"gte\":?0,\"lt\":?1}}}]}}")
  3. long countByScoreRangeAndUserId(int lowerBound, int upperBound);
  4. }
  5. @Service
  6. public class ItemService {
  7. @Autowired
  8. private ItemRepository itemRepository;
  9. public void getScoreCountRanges() {
  10. long range1Count = itemRepository.countByScoreRangeAndUserId(0, 40);
  11. long range2Count = itemRepository.countByScoreRangeAndUserId(40, 70);
  12. long range3Count = itemRepository.countByScoreRangeAndUserId(70, 101); // inclusive lower bound, exclusive upper bound
  13. System.out.println("Range 0-39 Count: " + range1Count);
  14. System.out.println("Range 40-69 Count: " + range2Count);
  15. System.out.println("Range 70-100 Count: " + range3Count);
  16. }
  17. }

但我想这样做,在一个单一的滑动超过数据库,而不是计数在3次。哪条路更好更快
多谢

ddrv8njm

ddrv8njm1#

尝试使用NativeSearchQueryBuilder

  1. @Service
  2. public class ItemService {
  3. @Autowired
  4. private ElasticsearchOperations elasticsearchOperations;
  5. public void getScoreCountRanges() {
  6. SearchQuery searchQuery = new NativeSearchQueryBuilder()
  7. .withQuery(QueryBuilders.matchQuery("userId", 100))
  8. .addAggregation(AggregationBuilders.range("score_ranges")
  9. .field("score")
  10. .addUnboundedTo("low", 4)
  11. .addRange("medium", 4, 7)
  12. .addUnboundedFrom("high", 7)
  13. )
  14. .build();
  15. Aggregations aggregations = elasticsearchOperations.query(searchQuery, SearchResponse::getAggregations);
  16. Range rangeAggregation = aggregations.get("score_ranges");
  17. long lowCount = rangeAggregation.getBucketByKey("low").getDocCount();
  18. long mediumCount = rangeAggregation.getBucketByKey("medium").getDocCount();
  19. long highCount = rangeAggregation.getBucketByKey("high").getDocCount();
  20. System.out.println("Low Range Count: " + lowCount);
  21. System.out.println("Medium Range Count: " + mediumCount);
  22. System.out.println("High Range Count: " + highCount);
  23. }
  24. }
展开查看全部
krcsximq

krcsximq2#

我会用工作代码来回答这个问题,如果将来有人会发现它有用的话。

  1. @Override
  2. public Scores getScoreCountRanges() {
  3. Scores scores = new Scores();
  4. String aggregationName = "score_ranges";
  5. NativeSearchQuery searchQuery = new NativeSearchQueryBuilder()
  6. .withQuery(QueryBuilders.matchQuery("userId", "DESIRED-USER-ID")
  7. .withAggregations(AggregationBuilders.range(aggregationName)
  8. .field("scores")
  9. .addRange(LOW, LOW_LOWER_BOUND,MEDIUM_LOWER_BOUND)
  10. .addRange(MEDIUM, MEDIUM_LOWER_BOUND, HIGH_LOWER_BOUND)
  11. .addRange(HIGH, HIGH_LOWER_BOUND, HIGH_UPPER_BOUND)
  12. )
  13. .build();
  14. SearchHits<?> searchHits = operations.search(searchQuery, ClassOfData.class);
  15. if (!searchHits.hasAggregations())
  16. return scores;
  17. AggregationsContainer<?> aggregationsContainer = searchHits.getAggregations();
  18. if (aggregationsContainer == null) {
  19. return scores;
  20. }
  21. Aggregations aggregations = (Aggregations) aggregationsContainer.aggregations();
  22. ParsedRange rangeAggregation = aggregations.get(aggregationName);
  23. rangeAggregation.getBuckets().forEach(bucket -> fillScores(scores, bucket.getKey().toString(), bucket.getDocCount()));
  24. return scores;
  25. }

这段代码将执行所谓的rangeAggregation,并将返回包含响应matchQuery的记录的桶,这些记录在您决定的范围内找到。
我们将使用该存储桶的文档计数来了解每个特定范围内有多少条记录符合条件。
你可以根据自己的意愿使用填充分数,或者做其他任何事情,返回一个包含键和值的map也是一个不错的选择。
希望对你有帮助。

展开查看全部

相关问题