datastax cassandra驱动程序异步结果集获取不工作

aiqt4smr  于 2021-06-10  发布在  Cassandra
关注(0)|答案(1)|浏览(398)

我想加载一行大的数据,所以我的计划是将语句分成几个部分,除以时间戳,然后异步运行它。

...
// List to save ResultSets
List<CompletableFuture<AsyncResultSet>> pending = new ArrayList<>();

for(Range range : ranges) {
    System.out.println("Asynchronous execute query will be called soon!");
    pending.add(executeQuery(session, preparedStatement, range));
}

...

private static CompletableFuture<AsyncResultSet> executeQuery(CqlSession session, 
    PreparedStatement preparedStatement, Range range) {

return session
    .executeAsync(preparedStatement.bind()
        .setInstant("startDateTime", range.getStartDateTime().toInstant())
        .setInstant("endDateTime", range.getEndDateTime().toInstant())
        .setPageSize(1000000))
    .toCompletableFuture()
    .whenCompleteAsync((asyncResultSet, throwable) -> {
        if (throwable == null) {
            System.out.println("Range " + range.getStart() + " to " + range.getEnd() + 
                " has " + asyncResultSet.remaining() + " records.");

            fetchResultSet(asyncResultSet, throwable);

            if(asyncResultSet.hasMorePages()) {
                asyncResultSet.fetchNextPage().whenComplete(LoadCassandraAsync::fetchResultSet);
            }
        } else {
            throwable.printStackTrace();
        }
    }, Executors.newFixedThreadPool(4))
    .exceptionally(throwable -> {
        throwable.printStackTrace();
        return null;
    });
}

我会得到随机退出代码0(不是从主方法),表明它关闭。或者,我会在一些获取后什么也得不到,就像有一个线程在运行,但什么也不做。
如果我评论了“行获取”部分,我得到:

...
Asynchronous execute query will be called soon!
Asynchronous execute query will be called soon!
Asynchronous execute query will be called soon!
Asynchronous execute query will be called soon!
Range 2020-02-14 00:00:00+0700 to 2020-02-14 01:00:00+0700 has 102974 records.
Range 2020-02-14 01:00:00+0700 to 2020-02-14 02:00:00+0700 has 98201 records.
Range 2020-02-14 06:00:00+0700 to 2020-02-14 07:00:00+0700 has 104529 records.
Range 2020-02-14 08:00:00+0700 to 2020-02-14 09:00:00+0700 has 105257 records.
...

我想这意味着 executeQuery() 这个方法很有效。
我做错了什么?

nnvyjq4y

nnvyjq4y1#

根据您可能会耗尽cassandra线程的查询数-并发读取(如果我没记错的话,缺省值是250)。
如果你查一下日志( /var/log/cassandra/system.log )应该有一个与问题相关的消息。要解决此问题,请添加一个人造线程。例如,在发送200个查询后等待。

相关问题