Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not able to deserialize ClickHouseBitmap for too many bitmaps(>4294967295)! #918

Closed
wzqiang1332 opened this issue May 6, 2022 · 10 comments · Fixed by #1011
Closed

Not able to deserialize ClickHouseBitmap for too many bitmaps(>4294967295)! #918

wzqiang1332 opened this issue May 6, 2022 · 10 comments · Fixed by #1011
Labels
Milestone

Comments

@wzqiang1332
Copy link
Contributor

clickhouse-server version:22.2.2.1
clickhouse-jdbc version:0.3.2-patch8

create
search

an error occurs when I fetch the bitmap of the next.
even if I adjust the order of index_val, the second data will still report an error.

jdbc sql: select index_name, index_val, length(bitmapToArray(cst_bitmap)) as bitmap_count, cst_bitmap fromindex_bitmap_mergetree orderbyindex_name,index_val
female
jdbc sql: select index_name, index_val, length(bitmapToArray(cst_bitmap)) as bitmap_count, cst_bitmap fromindex_bitmap_mergetree orderbyindex_name,index_val desc
man

test code

  public static void main(String[] args) throws SQLException {
      String url = "jdbc:clickhouse://192.168.70.131:8123";
      Properties properties = new Properties();
      ClickHouseDataSource dataSource = new ClickHouseDataSource(url, properties);
//        insert(dataSource, 2000);
      searchBitmap(dataSource);
  }

  public static void searchBitmap(ClickHouseDataSource dataSource) {
      try (Connection connection = dataSource.getConnection("default", "");
           Statement statement = connection.createStatement();
           ResultSet resultSet = statement.executeQuery("" +
                   "select " +
                   "  index_name," +
                   "  index_val," +
                   "  length(bitmapToArray(cst_bitmap)) as bitmap_count, " +
                   "  cst_bitmap " +
                   "from index_bitmap_mergetree " +
                   "order by index_name, index_val desc")) {
          while (resultSet.next()) {
              String indexName = resultSet.getString(1);
              String indexVal = resultSet.getString(2);
              int bitmapCount = resultSet.getInt(3);
              Roaring64NavigableMap roaringBitmap = (Roaring64NavigableMap) resultSet.getObject(4, ClickHouseBitmap.class).unwrap();
              System.out.printf("indexName:%s, indexVal:%s, size:%d%n", indexName, indexVal, bitmapCount);
              roaringBitmap.stream().forEach(i -> System.out.print(i + ","));
              System.out.println();
          }
      } catch (SQLException e) {
          e.printStackTrace();
      }
  }

error

Exception in thread "main" java.lang.IllegalStateException: Not able to deserialize ClickHouseBitmap for too many bitmaps(>4294967295)!
	at com.clickhouse.client.data.ClickHouseBitmap.deserialize(ClickHouseBitmap.java:372)
	at com.clickhouse.client.data.ClickHouseBitmap.deserialize(ClickHouseBitmap.java:339)
	at com.clickhouse.client.data.BinaryStreamUtils.readBitmap(BinaryStreamUtils.java:210)
	at com.clickhouse.client.data.ClickHouseRowBinaryProcessor$MappedFunctions.lambda$buildMappingsForAggregateFunctions$0(ClickHouseRowBinaryProcessor.java:178)
	at com.clickhouse.client.data.ClickHouseRowBinaryProcessor$MappedFunctions.lambda$buildMappingsForAggregateFunctions$2(ClickHouseRowBinaryProcessor.java:185)
	at com.clickhouse.client.data.ClickHouseRowBinaryProcessor$MappedFunctions.deserialize(ClickHouseRowBinaryProcessor.java:464)
	at com.clickhouse.client.data.ClickHouseRowBinaryProcessor.readAndFill(ClickHouseRowBinaryProcessor.java:496)
	at com.clickhouse.client.ClickHouseDataProcessor$1.next(ClickHouseDataProcessor.java:122)
	at com.clickhouse.client.ClickHouseDataProcessor$1.next(ClickHouseDataProcessor.java:108)
	at com.clickhouse.jdbc.ClickHouseResultSet.next(ClickHouseResultSet.java:708)
	at com.yatop.clickhouse.JdbcDemo.searchBitmap(JdbcDemo.java:39)
	at com.yatop.clickhouse.JdbcDemo.main(JdbcDemo.java:25)
@zhicwu zhicwu added the bug label May 6, 2022
@zhicwu
Copy link
Contributor

zhicwu commented May 7, 2022

Hi @wzqiang1332, will UInt32 work for you? As far as I know there's discrepancy between Java and C++ implementation of RoaringBitmap64, but I'll check this one when I'm done with patch9 release.

@wzqiang1332
Copy link
Contributor Author

@zhicwu UInt32 is ok

@wolfxyx
Copy link

wolfxyx commented May 27, 2022

was this bug solved?

@zhicwu
Copy link
Contributor

zhicwu commented May 29, 2022

was this bug solved?

You meant RoaringBitmap/RoaringBitmap#366? Perhaps you should consider multiple UInt32 bitmaps with or without flag instead. It would be great someone can help you address the issue. I'll revisit this when I'm done with new type system for Aggregatefunction - maybe a custom version of RoaringBitmap just for ClickHouse.

@wzqiang1332
Copy link
Contributor Author

wzqiang1332 commented Jun 16, 2022

Hi @zhicwu I found that the old driver can deserialize 64-bit bitmaps, but this driver can not insert 64-bit bitmaps when datasize > 1000000. The new driver can be inserted normally

<dependency>
    <groupId>ru.yandex.clickhouse</groupId>
    <artifactId>clickhouse-jdbc</artifactId>
    <version>0.3.1-patch</version>
</dependency>

@zhicwu
Copy link
Contributor

zhicwu commented Jun 16, 2022

Hi @zhicwu I found that the old driver can deserialize 64-bit bitmaps, but this driver can not insert 64-bit bitmaps when datasize > 1000000. The new driver can be inserted normally

<dependency>
    <groupId>ru.yandex.clickhouse</groupId>
    <artifactId>clickhouse-jdbc</artifactId>
    <version>0.3.1-patch</version>
</dependency>

Thanks for the hint. Will cross check to understand the discrepancy after releasing patch10.

@panda0120
Copy link

Hi @zhicwu I found that the old driver can deserialize 64-bit bitmaps, but this driver can not insert 64-bit bitmaps when datasize > 1000000. The new driver can be inserted normally

<dependency>
    <groupId>ru.yandex.clickhouse</groupId>
    <artifactId>clickhouse-jdbc</artifactId>
    <version>0.3.1-patch</version>
</dependency>

Thanks for the hint. Will cross check to understand the discrepancy after releasing patch10.

This bug will be fixed in version patch10?
About when will it be released?
Now,when I use old jar with group id "ru.yandex.clickhouse",the query opertion is fine,but the insert operation will cause “Memory limit exceed” error on the ck server if the 64 bitmap capacity is more than 1 million.

@zhicwu
Copy link
Contributor

zhicwu commented Jul 26, 2022

@wzqiang1332, thanks for the pull request. Sorry I don't have much time to add more tests for the change. Since it's been merged into develop branch, would you mind to generate a build and test again in your environment? Also is it necessary to update RoaringBitmap dependency?

@wzqiang1332
Copy link
Contributor Author

@zhicwu I have tested it in my local environment. The bug is caused by the bitmap reading 3 bytes too many, resulting in the following field reading misalignment

@zhicwu
Copy link
Contributor

zhicwu commented Jul 27, 2022

Thanks for confirming @wzqiang1332! It will be part of 0.3.2-patch11 which hopefully can be released by Friday, before I start to play Xenoblade 3 :)

@zhicwu zhicwu added this to the 0.3.2-patch11 milestone Jul 27, 2022
@zhicwu zhicwu closed this as completed Jul 29, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants